Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Lean Wang's picture
1 1

Lean Wang

AdaHousman
KreemjayInnovation's profile picture multimodalart's profile picture Molbap's profile picture
·

AI & ML interests

None yet

Organizations

DeepSeek's profile picture

authored a paper 9 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165
authored 4 papers about 1 year ago

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8, 2024 • 13

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Paper • 2305.14160 • Published May 23, 2023 • 1

Towards Codable Watermarking for Injecting Multi-bits Information to LLMs

Paper • 2307.15992 • Published Jul 29, 2023 • 1

Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

Paper • 2408.15664 • Published Aug 28, 2024 • 15
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs