PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper β’ 2510.14528 β’ Published Oct 16 β’ 94
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR 28 days ago β’ 60
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). β’ 54 items β’ Updated Sep 28 β’ 103
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper β’ 2508.18265 β’ Published Aug 25 β’ 206
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18 β’ 87
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models Aug 4 β’ 28
view article Article AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds Jul 21 β’ 22
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9 β’ 717
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model Oct 29, 2024 β’ 59
view article Article The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) Jun 24 β’ 10