Ruibin Xiong PRO
chrisxiong
AI & ML interests
LLM
Recent Activity
upvoted
a
paper
about 1 month ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward
upvoted
a
paper
about 2 months ago
Reinforcement Learning on Pre-Training Data
Organizations
None yet