Article Thinking Outside the Attention Box: Introducing Gated Associative Memory (GAM) By rishiraj • Sep 3 • 5
ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning Paper • 2508.10419 • Published Aug 14 • 73
Article ChatML vs Harmony: Understanding the new Format from OpenAI 🔍 By kuotient • Aug 9 • 43
Article Bridging the Gap: Making Robotics Feel Like Machine Learning By hba123 • Aug 12 • 12
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published Aug 4 • 130
Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One By rishiraj • Jun 26 • 48
Article Transformers Are Getting Old: Variants and Alternatives Exist! By ProCreations • Jul 5 • 42
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper • 2506.21551 • Published Jun 26 • 28
Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others • May 15 • 36
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11 • 102
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper • 2502.09619 • Published Feb 13 • 35
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Paper • 2502.07374 • Published Feb 11 • 40