avatar

Yusheng Dai

PhD Candidate
yusheng.dai@monash.edu


Hello 👋!

I am Yusheng Dai, a PhD candidate at Monash University in Australia, working with Prof. Jianfei Cai (IEEE Fellow) and Prof. Qiuhong Ke. Before that, I completed my Master’s program at University of Science and Technology of China (USTC), working with Prof. Jun Du and Prof. Chin-hui Lee (IEEE Fellow).

My research focuses on Audio-Visual Foundation World Models, working toward an interactive metaverse driven by real-time video and sound generation. Representative works include:

Note: I am looking for collaborators to do great work in audio and speech — generation or understanding, frontend or backend. I bring strong research insights and sharp storytelling. If your work is potential enough to achieve high impact, email me. I am fast enough: I have gone to help my collorators from zero context to a finished paper in one week, multiple times.

News

Selected Publications [Google Scholar]

  1. CVPR
    Yusheng Dai, Zehua Chen, Yuxuan Jiang, Baolong Gao, Qiuhong Ke, Jianfei Cai, Jun Zhu
    IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2026.


  2. ACL
    Yuxuan Jiang, Zehua Chen, Zeqian Ju, Yusheng Dai, Weibei Dou, Jun Zhu
    Annual Meeting of the Association for Computational Linguistics (ACL), 2026.

  3. ICCV
    Yusheng Dai, Chenxi Wang, Chang Li, Chen Wang, et.al.
    International Conference on Computer Vision (ICCV), 2025.

  4. CVPR
    Yusheng Dai, Hang Chen, Jun Du, Chin-hui Lee, et.al.
    IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024.


© Yusheng Dai, 2023