Yan Zhou | 周䶮

I am a second-year Ph.D. student at the Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS), where I am fortunate to be advised by Prof. Yang Feng in the ICT Natural Language Processing (ICTNLP) group.

Email   |   GitHub   |   Google Scholar

Research

My research interests lie mainly in natural language processing (NLP) and machine translation (MT). I currently focus on end-to-end speech-to-text translation (S2TT) and speech-to-speech translation (S2ST). I have also worked on large language models (LLMs) for machine translation and speech translation, and, earlier, on multilingual neural machine translation.

News

[2024/9] Our speech-language model LLaMA-Omni is released! It is a speech interaction model built upon Llama-3.1-8B-Instruct that achieves low-latency, high-quality speech interaction. Check out our paper, code, and model!

[2024/5] A paper on speech-to-speech translation that I co-authored was accepted to ACL 2024!

[2023/9] A paper on speech-to-speech translation that I co-authored was accepted to NeurIPS 2023!

[2023/9] I started my Ph.D. at the University of Chinese Academy of Sciences (UCAS), which is located in Huairou District, Beijing, near the beautiful Yanqi Lake.

[2023/6] I obtained my B.E. degree from Tsinghua University.

[2023/6] Our LLM BayLing (百聆) is released! BayLing is an instruction-following LLM with advanced language alignment and multi-turn interaction capability. Read our paper and try our online demo! Thanks to all collaborators!

[2023/5] A long paper on speech translation was accepted to the ACL 2023 main conference! Thanks to all collaborators!

Publications

CTC-based Non-autoregressive Textless Speech-to-Speech Translation
Qingkai Fang, Zhengrui Ma, Yan Zhou, Min Zhang, Yang Feng
Findings of ACL 2024
Paper / Code

CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation
Yan Zhou, Qingkai Fang, Yang Feng
ACL 2023 (CCF-A)
Paper / Code

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation
Qingkai Fang, Yan Zhou, Yang Feng
NeurIPS 2023 (CCF-A)
Paper / Code

Preprints

LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng
Paper / Code / Model

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models
Shaolei Zhang, Qingkai Fang, Zhuocheng Zhang, Zhengrui Ma, Yan Zhou, Langlin Huang, Mengyu Bu, Shangtong Gui, Yunji Chen, Xilin Chen, Yang Feng
Paper / Code / Demo / Project Page

Experience

Tsinghua University, China
2019.08 - 2023.06
Undergraduate student

Institute of Computing Technology, Chinese Academy of Sciences, China
2023.09 - now
Ph.D. student

Selected Awards

  • 2019: Second-class scholarship for freshmen, Tsinghua University


Updated in September 2024