wenxueru

https://github.com/wenxueru

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a collection 4 days ago

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated about 5 hours ago • 147

upvoted an article 2 months ago

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Article • Jul 29 • 198

upvoted a paper 3 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28 • 35

upvoted 2 papers 5 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic

Paper • 2408.16326 • Published Aug 29, 2024 • 1

upvoted a collection 6 months ago

Qwen3

84 items • Updated Aug 6 • 1.43k

upvoted a paper 11 months ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18