Maozhou Ge's picture

Maozhou Ge

Gmc2

·

GHGmc2

AI & ML interests

None yet

Recent Activity

upvoted a collection 9 days ago

upvoted an article 10 days ago

Finetune Stable Diffusion Models with DDPO via TRL

liked a model 12 days ago

moonshotai/Kimi-K2-Thinking

View all activity

Organizations

None yet

upvoted a collection 9 days ago

LLaDA 2.0

2 items • Updated 24 days ago • 15

upvoted an article 10 days ago

Article

Finetune Stable Diffusion Models with DDPO via TRL

Sep 29, 2023

•

19

liked a model 12 days ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated 10 days ago • 153k • • 1.27k

liked a Space 18 days ago

The Smol Training Playbook

The secrets to building world-class LLMs

upvoted a collection 21 days ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 656

upvoted an article 29 days ago

Article

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Aug 18

•

31

liked a dataset 30 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 6.19k • 600

upvoted a collection 30 days ago

InternVL3.5-Core

This collection includes only the InternVL3.5 checkpoints that have completed the full training pipeline (i.e., Pretraining, SFT, MPO, Cascade RL). • 30 items • Updated Sep 28 • 12

upvoted 2 collections about 1 month ago

Nemotron-Pre-Training-Dataset

7 items • Updated about 1 hour ago • 41

Inference Optimized Checkpoints (with Model Optimizer)

A collection of generative models quantized and optimized for inference with TensorRT Model Optimizer. • 43 items • Updated about 1 hour ago • 52

liked a dataset about 1 month ago

lmms-lab/multimodal-open-r1-8k-verified

Viewer • Updated Jan 27 • 7.69k • 5.77k • 67

upvoted an article about 1 month ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

•

63

liked a model about 1 month ago

google/siglip2-so400m-patch14-384

Zero-Shot Image Classification • 1B • Updated Feb 21 • 481k • 62

liked a dataset about 1 month ago

Salesforce/Webscale-RL

Viewer • Updated Oct 14 • 1.11M • 3.09k • 79

upvoted a paper about 1 month ago

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1 • 17

liked a model about 2 months ago

deepseek-ai/DeepSeek-V3.2-Exp-Base

Text Generation • 685B • Updated Oct 9 • 402 • 42

upvoted a collection about 2 months ago

DeepSeek-V3.2

2 items • Updated Sep 29 • 448

upvoted a paper about 2 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 115

upvoted a collection about 2 months ago

Qwen3-VL

37 items • Updated 17 days ago • 411

liked a dataset about 2 months ago

Juelg/RPD-maniskill

Updated Jul 18 • 311 • 1