Shiyi Cao

eva98

AI & ML interests

None yet

Organizations

Efficient-Large-Model, LLaVA Internal, NovaSky

upvoted 2 collections 9 months ago

NovaSky Papers

Collection
2 items • Updated Feb 21 • 3

Sky-T1-7B

Collection
A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated Feb 14 • 7
upvoted 2 papers 9 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 40
upvoted 2 papers 10 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 71

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 285
upvoted a paper 12 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 59
upvoted a paper about 1 year ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 52
upvoted a paper over 1 year ago

VILA^2: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 41