Xiaofan Zhu (Augusteinia)
AI & ML interests: VLM, RL, Robotics
Organizations: None yet
Math
- MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning — Paper • 2505.10557 • Published • 47
- AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning — Paper • 2505.16400 • Published • 35
- PhyX: Does Your Model Have the "Wits" for Physical Reasoning? — Paper • 2505.15929 • Published • 49
- VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos — Paper • 2506.05349 • Published • 24
Paradigm
- Parallel Scaling Law for Language Models — Paper • 2505.10475 • Published • 83
- Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective — Paper • 2505.15045 • Published • 54
- Scaling Diffusion Transformers Efficiently via μP — Paper • 2505.15270 • Published • 35
- Vision Transformers Don't Need Trained Registers — Paper • 2506.08010 • Published • 21
Models: 0 (none public yet)
Datasets: 0 (none public yet)