🔥 News

Apr 29, 2025 Qwen3 series foundation models are released now.
Feb 11, 2025 Our paper “LLM-Enhanced Query Generation and Retrieval Preservation for Task-Oriented Dialogue” is accepted at Findings of ACL 2025!
Feb 11, 2025 Our paper “DataMan: Data Manager for Pre-training Large Language Models” is accepted at ICLR 2025!
Dec 19, 2024 Qwen2.5 technical report are released now.
Sep 20, 2024 One paper “Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation” is accepted at Findings of EMNLP 2024 and two paper “Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model”, “Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection” are accepted at EMNLP 2024!
Sep 19, 2024 Qwen2.5 series foundation models are released now.
Jul 15, 2024 Qwen2 technical report are released now.
Jul 04, 2024 Release the paper of “Dotamath” for mathematical reasoning.
Jun 17, 2024 Qwen2 series foundation models are released now.
May 16, 2024 Our paper “DORY: Deliberative Prompt Recovery for LLM” is accepted at Findings of ACL 2024!
Feb 04, 2024 Qwen1.5 series foundation models are released now.
Jan 16, 2024 Our paper “Energy-based Automated Model Evaluation” is accepted at ICLR 2024!
Oct 23, 2023 I started my internship at Alibaba Qwen Team! Ping me if you want to meet up in HangZhou :)
Jul 15, 2023 Our paper “CAME: Contrastive Automated Model Evaluation” is accepted at ICCV 2023!
Oct 06, 2022 Our paper “Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation” is accepted at EMNLP 2022 (Oral)!
Sep 10, 2022 Started my PhD’s degree at College of Computer Science and Technology of Zhejiang University!
Apr 06, 2022 Our paper “HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment” is accpeted at ICMR 2022 (Oral)!