sree

srisree

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

PleIAs/Baguettotron

liked a model 9 days ago

MiniMaxAI/MiniMax-M2

liked a Space 12 days ago

linoyts/Qwen-Image-Edit-Angles

View all activity

Organizations

upvoted an article 18 days ago

Article

Provence: efficient and robust context pruning for retrieval-augmented generation

Jan 28

•

upvoted 3 papers about 1 month ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 94

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Paper • 2510.04212 • Published Oct 5 • 23

Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models

Paper • 2510.03561 • Published Oct 3 • 24

upvoted a paper about 2 months ago

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 78

upvoted an article 3 months ago

Article

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

May 17

•

upvoted 2 articles 4 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

717

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

726

upvoted 2 articles 5 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Jun 19

•

Article

Gemma 3n fully available in the open-source ecosystem!

Jun 26

•

120

upvoted an article 8 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

721

upvoted a paper 8 months ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 56

upvoted 2 articles over 1 year ago

Article

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

Apr 21, 2024

•

Article

Outpainting III - Inpaint Model

Apr 23, 2024

•

upvoted a paper over 1 year ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 69

upvoted a collection almost 2 years ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Jul 21 • 210

upvoted 2 papers almost 2 years ago

InstantID: Zero-shot Identity-Preserving Generation in Seconds

Paper • 2401.07519 • Published Jan 15, 2024 • 57

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 37

upvoted a paper about 2 years ago

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Paper • 2311.07562 • Published Nov 13, 2023 • 15

sree

AI & ML interests

Recent Activity

Organizations

srisree's activity

Provence: efficient and robust context pruning for retrieval-augmented generation

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

SmolLM3: smol, multilingual, long-context reasoner

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Gemma 3n fully available in the open-source ecosystem!

Uncensor any LLM with abliteration

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

Outpainting III - Inpaint Model