Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases 15 days ago • 48
Article The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs 4 days ago • 11
OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation Paper • 2511.13655 • Published 2 days ago • 9
SYNTH Collection Fully generalist synthetic dataset and SOTA small reasoners • 3 items • Updated 9 days ago • 9
GVE Collection Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated 17 days ago • 19
Reasoning Language Model Inference Serving Unveiled: An Empirical Study Paper • 2510.18672 • Published 29 days ago • 7
LightOnOCR Collection The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR • 7 items • Updated 6 days ago • 14
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 20 days ago • 107
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published Oct 16 • 47
AION-1: Omnimodal Foundation Model for Astronomical Sciences Paper • 2510.17960 • Published about 1 month ago • 28
When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling Paper • 2510.15346 • Published Oct 17 • 33
Robust Layerwise Scaling Rules by Proper Weight Decay Tuning Paper • 2510.15262 • Published Oct 17 • 5