“论文列表”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
(以“[1] Haoran Sun, Chen Chen, Lantian Li, Dong Wang, CycleFlow: Purify Information Factors by Cycle Loss, Odyssey 2022 (Best Student Paper) [2] Haoran Sun, Dong Wang, L...”为内容创建页面)
(没有差异)

2026年2月13日 (五) 15:33的版本

[1] Haoran Sun, Chen Chen, Lantian Li, Dong Wang, CycleFlow: Purify Information Factors by Cycle Loss, Odyssey 2022 (Best Student Paper) [2] Haoran Sun, Dong Wang, Lantian Li, Chen Chen, Thomas Fang Zheng, Random Cycle Loss and Its Application to Voice Conversion, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.45, no.8, August, 2023. [3] Chen Chen, Dong Wang, Thomas Fang Zheng, CN-CVS: A Mandarin Audio-Visual Dataset for Large Vocabulary Continuous Visual to Speech Synthesis, ICASSP 2023. [4] Chen Chen, Xiaolou Li, Zehua Liu, Lantian Li, Dong Wang, Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective, ISCSLP 2024. [5] Liu Z, Li X, Chen C, et al. CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge[J]. Interspeech 2025 [6] Chen Chen, Zehua Liu, Xiaolou Li, Lantian Li, Dong Wang, CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge, Interspeech 2024 [7] Ying Shi, Lantian Li, Shi Yin, Dong Wang, Jiqing Han, Serialized Output Training by Learned Dominance, Interspeech 2024 [8] Yuan J, Shi Y, Wang D, et al. MT-HuBERT: Self-Supervised Mix-Training for Few-Shot Keyword Spotting in Mixed Speech[J]. ICASSP 2026. [9] Junming Yuan, Ying Shi, Lantian Li, Dong Wang, Askar Hamdulla, Few-Shot Keyword Spotting from Mixed Speech, Interspeech 2024 [10] Shi, Y., Wang, D.*, Li, L., Han, J., Yin, S. (2023) Spot Keywords From Very Noisy and Mixed Speech. Proc. INTERSPEECH 2023, 1488-1492 [11] Lin W, Chen J, Wang T, et al. Neural Scoring: A Refreshed End-to-End Approach for Speaker Verification in Complex Conditions[J]. IEEE Signal Processing Letters, 2025. [12] Ying Shi, Lantian Li, Dong Wang, Jiqing Han, Keyword Guided Target Speech Recognition, IEEE Signal Processing Letters, 2024 [13] Zehua Liu, Xiaolou Li, Li Guo, Lantian Li, Dong Wang, Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing, APSIPA 2025. [14] 陈琛,语音转换任务中的信息解耦优化方法研究,硕士论文,清华大学,2024. [15] Liu Z, Li X, Chen C, et al. AlignVSR: Audio-visual cross-modal alignment for visual speech recognition[C]//Proceedings of the 2025 11th International Conference on Communication and Information Processing. 2025: 161-165. [16] Cai Y, Li L, Abel A, et al. Maximum Gaussianality training for deep speaker vector normalization[J]. Pattern Recognition, 2024, 145: 109977 [17] Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fa, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng and Dong Wang. "CN-Celeb: multi-genre speaker recognition", Speech Communication, 2022. [18] Cai, Y., Li, J. & Wang, D. Fast and generalizable micromagnetic simulation with deep neural nets. Nat Mach Intell 6, 1330–1343 (2024).