LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units Paper • 2402.04882 • Published Jan 20, 2024
AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models Paper • 2403.13269 • Published Mar 20, 2024
LAWCAT: Efficient Distillation from Quadratic to Linear Attention with Convolution across Tokens for Long Context Modeling Paper • 2509.18467 • Published Sep 22
Thinkquel: A Model Dedicated to Text-to-dbt Using Synthetic Data and a Span-Aware Objective Paper • 2510.00186 • Published Sep 30