Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Tong Zheng, Hongming Zhang, Wenhao Yu, Xiaoyang Wang, Xinyu Yang, Runpeng Dai, Rui Liu, Huiwen Bao, Chengsong Huang, Heng Huang, Dong Yu

Tencent AI Lab · University of Maryland, College Park · University of North Carolina at Chapel Hill· City University of Hong Kong · Washington University in St. Louis


[Paper] https://arxiv.org/abs/2509.07980

[Code] https://github.com/zhengkid/Parallel-R1

[Data & Models] https://huggingface.co/Parallel-R1

Parallel-R1框架图