Hanning Zhang

HanningZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

upvoted a paper 25 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

upvoted a paper 25 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

View all activity

Organizations

upvoted a paper 1 day ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published 5 days ago • 19

upvoted 2 papers 25 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published 27 days ago • 25

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published 26 days ago • 26

updated 3 models 26 days ago

published 3 models 26 days ago

HanningZhang/goedel_v2_8b_sft_5e-6_bs16_ep2

Text Generation • 8B • Updated 26 days ago • 28

HanningZhang/deepseek_v2_7b_sft_5e-6_bs16_ep3

Text Generation • 7B • Updated 26 days ago • 28

HanningZhang/goedel_v2_8b_sft_5e-6_bs16_ep3

Text Generation • 8B • Updated 26 days ago • 35

updated 3 models 26 days ago

HanningZhang/deepseek_v2_7b_sft_5e-6_bs16_ep2

Text Generation • 7B • Updated 26 days ago • 25

HanningZhang/kimina_8b_sft_5e-6_bs16_ep2

Text Generation • 8B • Updated 26 days ago • 30

HanningZhang/kimina_8b_sft_5e-6_bs16_ep3

Text Generation • 8B • Updated 26 days ago • 30

published 3 models 26 days ago

HanningZhang/kimina_8b_sft_5e-6_bs16_ep2

Text Generation • 8B • Updated 26 days ago • 30

HanningZhang/deepseek_v2_7b_sft_5e-6_bs16_ep2

Text Generation • 7B • Updated 26 days ago • 25

HanningZhang/kimina_8b_sft_5e-6_bs16_ep3

Text Generation • 8B • Updated 26 days ago • 30

updated a model 26 days ago

HanningZhang/kimina_8b_sft_5e-6_bs32

Text Generation • 8B • Updated 26 days ago • 7

updated a model about 1 month ago

HanningZhang/goedel_v2_8b_sft_5e-6_bs32

Text Generation • 8B • Updated about 1 month ago • 3

published 2 models about 1 month ago

HanningZhang/goedel_v2_8b_sft_5e-6_bs32

Text Generation • 8B • Updated about 1 month ago • 3

HanningZhang/kimina_8b_sft_5e-6_bs32

Text Generation • 8B • Updated 26 days ago • 7

updated a model about 1 month ago

HanningZhang/deepseek_v2_7b_sft_5e-6_bs32

Text Generation • 7B • Updated about 1 month ago • 4

Hanning Zhang

AI & ML interests

Recent Activity

Organizations

HanningZhang's activity