7 19 28

Juyoung Suk PRO

juyoungml

https://juyoungml.github.io/

AI & ML interests

LLM

Recent Activity

upvoted a paper 12 days ago

ACG: Action Coherence Guidance for Flow-based VLA models

upvoted a paper 2 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

upvoted an article 3 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all activity

Organizations

upvoted a paper 12 days ago

ACG: Action Coherence Guidance for Flow-based VLA models

Paper • 2510.22201 • Published 15 days ago • 36

upvoted a paper 2 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11 • 48

upvoted 2 articles 3 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 247

Article

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 51

upvoted a collection 7 months ago

Trillion-7B-preview

Collection

5 items • Updated Jul 15 • 5

commented a paper 7 months ago

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 37 •

authored 3 papers 7 months ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published Dec 10, 2024 • 2

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 37

upvoted a paper 7 months ago

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 37

liked a Space 8 months ago

3.45k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in trillionlabs/Trillion-7B-preview 8 months ago

Update README.md

#1 opened 8 months ago by

juyoungml

liked a model 8 months ago

trillionlabs/Trillion-7B-preview

Text Generation • 8B • Updated Apr 25 • 227 • 86

New activity in juyoungml/Massive-Preferences-10K 11 months ago

Librarian Bot: Add language metadata for dataset

#1 opened about 1 year ago by

librarian-bot

New activity in juyoungml/DepthQA 11 months ago

[bot] Conversion to Parquet

#2 opened about 1 year ago by

parquet-converter

Librarian Bot: Add language metadata for dataset

#1 opened about 1 year ago by

librarian-bot

New activity in prometheus-eval/prometheus-7b-v2.0 11 months ago

Tokenizer chat template doesn't accept system prompt

#3 opened over 1 year ago by

gabrielmbmb

updated a model 11 months ago

juyoungml/bge-m3-ko-v1.1

upvoted a collection 11 months ago

EXAONE-3.5

Collection

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 11 items • Updated Jul 7 • 119

upvoted a paper 11 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 48

Juyoung Suk PRO

AI & ML interests

Recent Activity

Organizations

juyoungml's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

From GRPO to DAPO and GSPO: What, Why, and How

The Ultra-Scale Playbook

Update README.md

Librarian Bot: Add language metadata for dataset

[bot] Conversion to Parquet

Librarian Bot: Add language metadata for dataset

Tokenizer chat template doesn't accept system prompt