Anthonny Olime's picture

Anthonny Olime

Aviv-anthonnyolime

·

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

BAAI/Emu3.5

liked a model 11 days ago

meituan-longcat/LongCat-Video

liked a model 11 days ago

MiniMaxAI/MiniMax-M2

View all activity

Organizations

upvoted an article about 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11

• 161

upvoted a paper 3 months ago

SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Paper • 2411.05007 • Published Nov 7, 2024 • 22

upvoted an article 3 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18

• 87

upvoted 6 papers 3 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 142

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18 • 49

TempFlow-GRPO: When Timing Matters for GRPO in Flow Models

Paper • 2508.04324 • Published Aug 6 • 11

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 60

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published Aug 13 • 53

DINOv3

Paper • 2508.10104 • Published Aug 13 • 276

upvoted 2 collections 3 months ago

Nemotron-Pre-Training-Dataset

7 items • Updated about 16 hours ago • 40

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 373

upvoted 2 articles 3 months ago

Article

TextQuests: How Good are LLMs at Text-Based Video Games?

Aug 12

• 35

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

By

and 4 others •

Aug 11

• 75

upvoted a paper 3 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 189

upvoted a collection 3 months ago

GLM-4.5V

4 items • Updated Aug 18 • 26

upvoted an article 3 months ago

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

By

•

Apr 9

• 45

upvoted 4 collections 3 months ago

Ring

Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling. • 5 items • Updated 28 days ago • 19

Parakeet

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 12 items • Updated about 16 hours ago • 46

Canary

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 5 items • Updated about 16 hours ago • 29

MM Grounding DINO

8 items • Updated Aug 1 • 5