Andrew Zhao

andrewzh

https://andrewzh112.github.io/

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper about 2 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

upvoted a paper about 2 months ago

GEM: A Gym for Agentic LLMs

upvoted a paper about 2 months ago

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Organizations

None yet

upvoted 3 papers about 2 months ago

upvoted a paper 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 188

upvoted a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 81

upvoted a paper 5 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

upvoted 2 papers 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 185

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

authored a paper 6 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 186

updated a collection 6 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56

upvoted a paper 7 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 186

commented on a paper 7 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 186

updated 2 models 7 months ago

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B • Updated May 6 • 108 • 29

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B • Updated May 6 • 20 • 13

updated a collection 7 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56

published 2 models 7 months ago

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B • Updated May 6 • 108 • 29

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B • Updated May 6 • 20 • 13

updated a collection 7 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56
