🔥 News | Ru Peng (Perry)

Mar 02, 2026	Our paper “W2S: Weak-to-Strong Prompt Correction for Large Language Models” is accepted at Machine Learning 2026!
Jan 26, 2026	Our paper “OptimSyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation” is accepted at ICLR 2026!
Aug 18, 2025	Ant RL technical report “Reinforcement learning with rubric anchors“(extending RLVR with 10k+ Rubric rewards) is now released.
Feb 11, 2025	Our paper “LLM-Enhanced Query Generation and Retrieval Preservation for Task-Oriented Dialogue” is accepted at Findings of ACL 2025!
Feb 11, 2025	Our paper “DataMan: Data Manager for Pre-training Large Language Models” is accepted at ICLR 2025!
Dec 19, 2024	Qwen2.5 technical report are released now.
Sep 20, 2024	One paper “Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation” is accepted at Findings of EMNLP 2024 and two paper “Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model”, “Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection” are accepted at EMNLP 2024!
Sep 19, 2024	Qwen2.5 series foundation models are released now.
Jul 15, 2024	Qwen2 technical report are released now.
Jul 04, 2024	Release the paper of “Dotamath” for mathematical reasoning.
Jun 17, 2024	Qwen2 series foundation models are released now.
May 16, 2024	Our paper “DORY: Deliberative Prompt Recovery for LLM” is accepted at Findings of ACL 2024!
Feb 04, 2024	Qwen1.5 series foundation models are released now.
Jan 16, 2024	Our paper “Energy-based Automated Model Evaluation” is accepted at ICLR 2024!
Oct 23, 2023	I started my internship at Alibaba Qwen Team! Ping me if you want to meet up in HangZhou :)
Jul 15, 2023	Our paper “CAME: Contrastive Automated Model Evaluation” is accepted at ICCV 2023!
Oct 06, 2022	Our paper “Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation” is accepted at EMNLP 2022 (Oral)!
Sep 10, 2022	Started my PhD’s degree at College of Computer Science and Technology of Zhejiang University!
Apr 06, 2022	Our paper “HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment” is accpeted at ICMR 2022 (Oral)!