Ξ¦eat: Physically-Grounded Feature Representation Paper β’ 2511.11270 β’ Published 10 days ago β’ 10
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7 β’ 250
view article Article CinePile 2.0 - making stronger datasets with adversarial refinement Oct 23, 2024 β’ 18
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model May 14, 2024 β’ 274
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann β’ 8 items β’ Updated Jun 13 β’ 171
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Paper β’ 2506.03147 β’ Published Jun 3 β’ 58
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence β’ 15 items β’ Updated May 5 β’ 55
view article Article π Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! Jan 29 β’ 20
Executable Code Actions Elicit Better LLM Agents Paper β’ 2402.01030 β’ Published Feb 1, 2024 β’ 182