Published Papers
2025
CNVSRC 2024 Challenge
CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge
Zehua Liu, Xiaolou Li, Chen Chen, Lantian Li, Dong Wang
Interspeech 2025
2024
ISCSLP
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
Chen Chen, Xiaolou Li, Zehua Liu, Lantian Li, Dong Wang
International Symposium on Chinese Spoken Language Processing (ISCSLP) 2024
CNVSRC 2023 Challenge
CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge
Chen Chen, Zehua Liu, Xiaolou Li, Lantian Li, Dong Wang
Interspeech 2024
Zero-Shot Fake Video Detection
Zero-Shot Fake Video Detection by Audio-Visual Consistency
Xiaolou Li, Zehua Liu, Chen Chen, Lantian Li, Li Guo, Dong Wang
Interspeech 2024
2023
CN-Celeb-AV
CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition
Lantian Li, Xiaolou Li, Haoyu Jiang, Chen Chen, Ruihai Hou, Dong Wang
Interspeech 2023
Preprints
2025
LLM-VSR
📋 Preprint
Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing
Zehua Liu, Xiaolou Li, Li Guo, Lantian Li, Dong Wang
arXiv preprint arXiv:2506.02012 (2025)
2024
AlignVSR
📋 Preprint
AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition
Zehua Liu, Xiaolou Li, Chen Chen, Li Guo, Lantian Li, Dong Wang
arXiv preprint arXiv:2410.16438 (2024)
2024-2025 Xiaolou Li