IndicConformer Collection A collection of ASR models for 22 scheduled languages of India β’ 25 items β’ Updated 11 days ago β’ 22
view article Article Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with π€ Transformers Nov 15, 2021 β’ 36
view article Article Fine-Tune Whisper For Multilingual ASR with π€ Transformers Nov 3, 2022 β’ 323
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent Paper β’ 2509.15566 β’ Published Sep 19 β’ 14
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. β’ 8 items β’ Updated Nov 23, 2024 β’ 88
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch May 21 β’ 229
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! Mar 7 β’ 87
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper β’ 2504.11536 β’ Published Apr 15 β’ 63
view article Article AMD + π€: Large Language Models Out-of-the-Box Acceleration with AMD GPU Dec 5, 2023 β’ 4
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla β’ Jan 20 β’ 73
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper β’ 2412.07774 β’ Published Dec 10, 2024 β’ 30
DrawingSpinUp: 3D Animation from Single Character Drawings Paper β’ 2409.08615 β’ Published Sep 13, 2024 β’ 20
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper β’ 2410.10306 β’ Published Oct 14, 2024 β’ 57