gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. β’ 2 items β’ Updated Aug 7 β’ 376
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Paper β’ 2503.21729 β’ Published Mar 27 β’ 29
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 β’ 468
emπing series Collection crispy sentence embedding family β’ 5 items β’ Updated Oct 14, 2024 β’ 27
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit β’ 28 items β’ Updated 10 days ago β’ 89
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 298
view article Article Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) Jun 16, 2023 β’ 42
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models Paper β’ 2407.01920 β’ Published Jul 2, 2024 β’ 17
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 18 items β’ Updated Apr 30 β’ 73
Model-Based Control with Sparse Neural Dynamics Paper β’ 2312.12791 β’ Published Dec 20, 2023 β’ 6
Building Cooperative Embodied Agents Modularly with Large Language Models Paper β’ 2307.02485 β’ Published Jul 5, 2023 β’ 11