5 10 7

suu

Suu

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

upvoted a collection about 1 month ago

AEPO

upvoted a paper about 1 month ago

Agentic Entropy-Balanced Policy Optimization

View all activity

Organizations

upvoted a paper 6 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published 7 days ago • 25

upvoted a collection about 1 month ago

AEPO

Collection

The official datasets and model checkpoints of AEPO • 4 items • Updated 29 days ago • 3

upvoted a paper about 1 month ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16 • 102

upvoted a paper about 2 months ago

CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Paper • 2509.20712 • Published Sep 25 • 19

upvoted a paper 2 months ago

Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

Paper • 2502.12928 • Published Feb 18 • 1

upvoted 2 collections 3 months ago

KlearReasoner-8B

Collection

KlearReasoner-8B • 6 items • Updated Oct 18 • 4

RL+reason model

Collection

254 items • Updated 1 day ago • 21

upvoted a paper 3 months ago

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11 • 41

upvoted a paper 11 months ago

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Paper • 2410.16077 • Published Oct 21, 2024 • 1

upvoted a paper over 1 year ago

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts

Paper • 2407.09816 • Published Jul 13, 2024 • 1

suu

AI & ML interests

Recent Activity

Organizations

Suu's activity