Update README.md
README.md
CHANGED
@@ -121,6 +121,16 @@ solution = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(solution)
```

+## 🙏 Acknowledgements
+
+We are grateful to the open-source community for their invaluable contributions. Special thanks to:
+
+- **[Qwen3](https://huggingface.co/collections/Qwen/qwen3)** - for providing the foundational base models that powered our research
+- **[slime](https://github.com/THUDM/slime)** - for their innovative work on an efficient reinforcement learning framework that powered our training pipeline
+- **[verl](https://github.com/volcengine/verl)** - for the versatile reinforcement learning framework that enabled our training pipeline
+- **[sglang](https://github.com/sgl-project/sglang)** - for the efficient LLM serving and inference infrastructure
+- **[Megatron-LM](https://github.com/NVIDIA/Megatron-LM)** - for the large-scale model training framework
+
## Citation

```bibtex