Ru Peng (Perry)

Hi 😃! 彭儒, PhD @ Zhejiang University.

rupeng.jpg

"Only love endures the passage of time"

I’m a final-year PhD student at Computer Science Department of Zhejiang University (ZJU), advised by Professors Junbo Zhao and Gang Chen, and affiliated with DiLab-ZJU. Also, I was a research intern at Alibaba Qwen Team, working with Dayiheng Liu, Chang Zhou and Jingren Zhou on data management and synthesis for QWEN series models. Previously, I was fortunate to collaborate with Professors Tianyong Hao, Yi Fang and Kehai Chen, who ushered me into the research journey.

My research interests span across several AI fields, including LLMs (current emphasis), Machine learning, NLP and multimodal - toward building AGI to change human living:

I am open to opportunities across academia and industry — feel free to get in touch!

CV    Google Scholar Google Scholar    Twitter Twitter    Email Email    Wechat WeChat    GitHub GitHub    Huggingface Huggingface

 

🔥 News

Apr 29, 2025 Qwen3 series foundation models are released now.
Feb 11, 2025 Our paper “LLM-Enhanced Query Generation and Retrieval Preservation for Task-Oriented Dialogue” is accepted at Findings of ACL 2025!
Feb 11, 2025 Our paper “DataMan: Data Manager for Pre-training Large Language Models” is accepted at ICLR 2025!
Dec 19, 2024 Qwen2.5 technical report are released now.
Sep 20, 2024 One paper “Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation” is accepted at Findings of EMNLP 2024 and two paper “Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model”, “Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection” are accepted at EMNLP 2024!
Sep 19, 2024 Qwen2.5 series foundation models are released now.
Jul 15, 2024 Qwen2 technical report are released now.
Jul 04, 2024 Release the paper of “Dotamath” for mathematical reasoning.
Jun 17, 2024 Qwen2 series foundation models are released now.
May 16, 2024 Our paper “DORY: Deliberative Prompt Recovery for LLM” is accepted at Findings of ACL 2024!
Feb 04, 2024 Qwen1.5 series foundation models are released now.
Jan 16, 2024 Our paper “Energy-based Automated Model Evaluation” is accepted at ICLR 2024!
Oct 23, 2023 I started my internship at Alibaba Qwen Team! Ping me if you want to meet up in HangZhou :)
Jul 15, 2023 Our paper “CAME: Contrastive Automated Model Evaluation” is accepted at ICCV 2023!
Oct 06, 2022 Our paper “Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation” is accepted at EMNLP 2022 (Oral)!
Sep 10, 2022 Started my PhD’s degree at College of Computer Science and Technology of Zhejiang University!
Apr 06, 2022 Our paper “HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment” is accpeted at ICMR 2022 (Oral)!

📝 Selected Publications

  1. Qwen1.5 Blog
    qwen1_5_blog.jpeg
    Introducing qwen1. 5
    Qwen Team
    Online Blog, 2024
  2. Qwen2 Technical Report
    qwen2_technical_report.jpg
    Qwen2 technical report, 2024
    An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, and 3 more authors
    arXiv preprint arXiv:2407.10671, 2024
  3. Qwen2.5 Technical Report
    qwen2_5_technical_report.jpg
    Qwen2.5 Technical Report
    An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, and 33 more authors
    arXiv preprint arXiv:2412.15115, 2024
  4. Qwen3 Technical Report
    qwen3_technical_report.png
    Qwen3 technical report
    An Yang, Anfeng Li, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Gao, and 3 more authors
    arXiv preprint arXiv:2505.09388, 2025
  5. ICLR 2025
    ICLR2025_DataMan.jpg
    DataMan: Data Manager for Pre-training Large Language Models
    In The Thirteenth International Conference on Learning Representations, 2025

📚 Academic Services

  • Conference Reviewer: ICLR 2024, 2025; ICML 2023, 2025; NeurIPS 2022, 2023, 2024; CVPR 2025; ICCV 2023, 2025; ECCV 2024; ACL 2024; AISTATS 2025; COLM 2024.
  • Journal Reviewer: IEEE Transactions on Big Data (TBD), Transactions of Machine Learning Research (TMLR).
  • Publication Chair: International Conference on Natural Language Processing (ICNLP) 2025.

😊 Miscellaneous

  • I yearn to be a carefree guitar-playing singer👩‍🎤, strumming melodies of mirth🎵 into life.
  • I am a sports enthusiast interested in basketball🏀, football⚽, outdoor running🏃‍♂️ and so much more.
  • I love traveling around the world🗺️, gathering with friends🍻, and trying anything new — except terrible food.

Totoro Bottle