RynnVLA-002: A Unified Vision-Language-Action and World Model Paper • 2511.17502 • Published 6 days ago • 21
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper • 2511.17220 • Published 6 days ago • 15
Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation Paper • 2511.10547 • Published 14 days ago • 4
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist Paper • 2511.08521 • Published 16 days ago • 37
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper • 2511.10629 • Published 14 days ago • 115
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 14 days ago • 88
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 28 days ago • 111
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper • 2510.14975 • Published Oct 16 • 83
Durian: Dual Reference-guided Portrait Animation with Attribute Transfer Paper • 2509.04434 • Published Sep 4 • 10
OpenAI-GPT 20B, 37B ,120B: Neo, reg, uncensored, ablit. Collection OpenAI's models in various sizes and formats, including NEO Imatrix, DI, Tri Matrix, Uncensored, Abliterated, and Brainstorm 20x (37B). • 9 items • Updated 11 days ago • 7
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. • 326 items • Updated 3 days ago • 369
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25 • 207
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper • 2507.05964 • Published Jul 8 • 118
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper • 2506.23918 • Published Jun 30 • 88
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published Jul 3 • 122
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper • 2506.21656 • Published Jun 26 • 15
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models Paper • 2506.21356 • Published Jun 26 • 22