Publications

My research interests span LLMs, machine learning, NLP, and multimodal learning. My publications are listed below.

2025

  1. Qwen3 Technical Report
    Qwen3 Technical Report
    An Yang, Anfeng Li, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Gao, and 3 more authors
    arXiv preprint arXiv:2505.09388, 2025
  2. ICLR 2025
    DataMan: Data Manager for Pre-training Large Language Models
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. ICLR 2024
    Energy-based Automated Model Evaluation
    Ru Peng, Heming Zou, Haobo Wang, Yawen Zeng, Zenan Huang, and Junbo Zhao
    In The Twelfth International Conference on Learning Representations, 2024
  2. ACL Findings 2024
    DORY: Deliberative Prompt Recovery for LLM
    Lirong Gao, Ru Peng, Yiming Zhang, and Junbo Zhao
    In Findings of the Association for Computational Linguistics: ACL 2024, 2024
  3. Qwen1.5 Blog
    Introducing Qwen1.5
    Qwen Team
    Online Blog, 2024
  4. ArXiv 2024
    DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning
    Chengpeng Li, Guanting Dong, Mingfeng Xue, Ru Peng, Xiang Wang, and Dayiheng Liu
    arXiv preprint arXiv:2407.04078, 2024
  5. Qwen2 Technical Report
    Qwen2 Technical Report
    An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, and 3 more authors
    arXiv preprint arXiv:2407.10671, 2024
  6. Qwen2.5 Technical Report
    Qwen2.5 Technical Report
    An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, and 33 more authors
    arXiv preprint arXiv:2412.15115, 2024
  7. EMNLP Findings 2024
    Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
    Qin Zhu, Qinyuan Cheng, Runyu Peng, Xiaonan Li, Ru Peng, Tengxiao Liu, Xipeng Qiu, and Xuan-Jing Huang
    In Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
  8. EMNLP 2024
    Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
    Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, and Jingren Zhou
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  9. EMNLP 2024
    Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection
    Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, and Junbo Zhao
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

  1. ArXiv 2023
    Better Sign Language Translation with Monolingual Data
    Ru Peng, Yawen Zeng, and Junbo Zhao
    arXiv preprint arXiv:2304.10844, 2023
  2. ICCV 2023
    CAME: Contrastive Automated Model Evaluation
    Ru Peng, Qiuyang Duan, Haobo Wang, Jiachen Ma, Yanbo Jiang, Yongjun Tu, Xiu Jiang, and Junbo Zhao
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

  1. ICONIP 2022 Oral
    Deps-SAN: Neural Machine Translation with Dependency-Scaled Self-Attention Network
    Ru Peng, Nankai Lin, Yi Fang, Shengyi Jiang, Tianyong Hao, Boyu Chen, and Junbo Zhao
    In International Conference on Neural Information Processing, 2022
  2. ICMR 2022 Oral
    HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment
    Ru Peng, Yawen Zeng, and Junbo Zhao
    In Proceedings of the 2022 International Conference on Multimedia Retrieval, 2022
  3. EMNLP 2022 Oral
    Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
    Ru Peng, Yawen Zeng, and Junbo Zhao
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

  1. NCA 2021
    Syntax-aware neural machine translation directed by syntactic dependency degree
    Ru Peng, Tianyong Hao, and Yi Fang
    Neural Computing and Applications, 2021

2019

  1. CCMT 2019 Best Paper Candidates
    Neural machine translation with attention based on a new syntactic branch distance
    Ru Peng, Zhitao Chen, Tianyong Hao, and Yi Fang
    In Machine Translation: 15th China Conference, CCMT 2019, Nanchang, China, September 27–29, 2019, Revised Selected Papers, 2019