|
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning
Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie
ACM MM, 2024
ArXiv / Webpage / Dataset / Code / Model
Utilize computer vision tools to generate a large-scale, high-quality audio-language dataset.
|
|
Sound Generation Method with Timing-aligned Visual Feature Mapping
Zhifeng Xie, Luoyi Sun, Yuzhou Sun, Chunpeng Yu, Lizhang Ma
CADCG, 2022
pdf
New framework for high-quality sound generation, matching to silent videos in content and timing alignment.
|
|
Multi-Scale Graph Convolutional Interaction Network For Salient Object Detection
Wenqi Che, Luoyi Sun, Zhifeng Xie, Youdong Ding, Kanli Han
ICIP, 2021
pdf
Proposed the multi-scale graph convolutional interaction network (MGCINet), and get the SOTA on five benchmark datasets.
|
Patents
- A Speech-Driven Editable Face Reenactment Method, 2023
Jiaheng Zheng, Shiyu Xia, Luoyi Sun, Zhifeng Xie
- A Video Scene Segmentation Method Based on Multimodal Semantic Interaction, 2023
Yihui Liao, Zhiwen Jiang, Luoyi Sun, Zhifeng Xie
|
Honors
- Outstanding Graduate, Shanghai Municipal Education Commission, 2023
- National Scholarship, Ministry of Education, 2022
- The First Prize Scholarship, Shanghai University (Top 5%), 2020, 2021, 2022
- Second Class Prize, National Post-Graduate Mathematical Contest in Modeling, 2021
- The Second Prize Scholarship, Yunnan University (Top 10%), 2017, 2018
- Excellent Student Cadre, Yunnan University (Top 5%), 2017, 2018
|
Hobbies
- Swimming, Cycling, Rock Climbing, Singing, Photography, Watching Movies, Piano
|
|