Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arXiv:2510.13998

Selected_Trending_Papers

about 11 hours ago

TradingAgents: Multi-Agents LLM Financial Trading Framework

Paper • 2412.20138 • Published Dec 28, 2024 • 7
MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 29
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 129
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 110

Paper2Web: Let's Make Your Paper Alive!

Paper • 2510.15842 • Published 22 days ago • 25
Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 111
BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published 9 days ago • 12

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 76
Robot Learning: A Tutorial

Paper • 2510.12403 • Published 26 days ago • 103
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published 25 days ago • 61
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 52

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52

kurakurai/Luth-LFM2-1.2B

Text Generation • 1B • Updated 27 days ago • 59 • 22
kurakurai/Luth-1.7B-Instruct

Text Generation • 2B • Updated 27 days ago • 107 • 13
Qwen/Qwen3-1.7B

Text Generation • 2B • Updated Jul 26 • 1.23M • • 313
Qwen/Qwen3-4B

Text Generation • 4B • Updated Jul 26 • 1.13M • • 434

Interesting Papers

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published 24 days ago • 30
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published 23 days ago • 37
BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52
GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published 18 days ago • 44

Run on CPU Optimizations

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published 11 days ago • 63

Read Later Stack

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 26 days ago • 31
Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published about 1 month ago • 9
Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6 • 22
DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published 27 days ago • 26

inference optimization

Low-Rank Adapters Meet Neural Architecture Search for LLM Compression

Paper • 2501.16372 • Published Jan 23 • 12
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published Jan 28 • 7
Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 32
Identifying Sensitive Weights via Post-quantization Integral

Paper • 2503.01901 • Published Feb 28 • 8

Selected_Trending_Papers

about 11 hours ago

TradingAgents: Multi-Agents LLM Financial Trading Framework

Paper • 2412.20138 • Published Dec 28, 2024 • 7
MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 29
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 129
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 110

Interesting Papers

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52

Paper2Web: Let's Make Your Paper Alive!

Paper • 2510.15842 • Published 22 days ago • 25
Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 111
BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published 9 days ago • 12

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published 24 days ago • 30
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published 23 days ago • 37
BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52
GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published 18 days ago • 44

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 76
Robot Learning: A Tutorial

Paper • 2510.12403 • Published 26 days ago • 103
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published 25 days ago • 61
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 52

Run on CPU Optimizations

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published 11 days ago • 63

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52

Read Later Stack

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 26 days ago • 31
Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published about 1 month ago • 9
Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6 • 22
DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published 27 days ago • 26

kurakurai/Luth-LFM2-1.2B

Text Generation • 1B • Updated 27 days ago • 59 • 22
kurakurai/Luth-1.7B-Instruct

Text Generation • 2B • Updated 27 days ago • 107 • 13
Qwen/Qwen3-1.7B

Text Generation • 2B • Updated Jul 26 • 1.23M • • 313
Qwen/Qwen3-4B

Text Generation • 4B • Updated Jul 26 • 1.13M • • 434

inference optimization

Low-Rank Adapters Meet Neural Architecture Search for LLM Compression

Paper • 2501.16372 • Published Jan 23 • 12
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published Jan 28 • 7
Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 32
Identifying Sensitive Weights via Post-quantization Integral

Paper • 2503.01901 • Published Feb 28 • 8

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs