SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity Paper • 2506.16500 • Published Jun 19 • 17
MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering Paper • 2505.07782 • Published May 12 • 19
Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild Paper • 2410.05357 • Published Oct 7, 2024
PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches Paper • 2410.10870 • Published Oct 8, 2024 • 1
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29 • 98
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models Paper • 2503.22879 • Published Mar 28 • 9
Quamba: A Post-Training Quantization Recipe for Selective State Space Models Paper • 2410.13229 • Published Oct 17, 2024 • 1
Efficient Low-rank Backpropagation for Vision Transformer Adaptation Paper • 2309.15275 • Published Sep 26, 2023 • 1