nzy's picture

1 3

nzy

Evernight

·

AI & ML interests

Code Generation

Recent Activity

upvoted a paper about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

upvoted a paper 7 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

commented on a paper over 1 year ago

On Leakage of Code Generation Evaluation Datasets

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3 • 74

upvoted a paper 7 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15 • 19

upvoted a paper over 1 year ago

On Leakage of Code Generation Evaluation Datasets

Paper • 2407.07565 • Published Jul 10, 2024 • 6