Ruibin Xiong PRO

chrisxiong

https://scholar.google.com/citations?user=P3GLUqQAAAAJ&hl=en

AI & ML interests

LLM

Recent Activity

upvoted a paper about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

upvoted a paper about 2 months ago

Reinforcement Learning on Pre-Training Data

upvoted a paper 9 months ago

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Organizations

None yet

models 0

None public yet

datasets 0

None public yet