SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 141
SmolVLA: Efficient Vision-Language-Action Model trained on LeRobot Community Data Article • Published Jun 3 • 284
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 129
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 88
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper • 2402.17485 • Published Feb 27, 2024 • 195
Improving fine-grained understanding in image-text pre-training Paper • 2401.09865 • Published Jan 18, 2024 • 18
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation Paper • 2401.02117 • Published Jan 4, 2024 • 33
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts Paper • 2401.04081 • Published Jan 8, 2024 • 73
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU Paper • 2312.12456 • Published Dec 16, 2023 • 44
Extending Context Window of Large Language Models via Semantic Compression Paper • 2312.09571 • Published Dec 15, 2023 • 16
Distributed Inference and Fine-tuning of Large Language Models Over The Internet Paper • 2312.08361 • Published Dec 13, 2023 • 28