Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix 23 days ago • 45
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published Oct 16 • 33
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 382
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30 • 59
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 349
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory. • 15 items • Updated Jul 10 • 210
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published Apr 28 • 39
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22, 2024 • 104
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12 • 36
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5 • 232
mistralai_hackathon Collection Synthetic datasets and fine-tuned Mistral models used in the MistralAI Hackathon • 21 items • Updated Feb 4 • 4