Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
marin-community
's Collections
All Marin ISOFlops
DCLM Baseline ISOFlop Models
CommonPile ISOFlop Models
Nemotron ISOFlop Models
Fantastic Optimizers Open-sourced Models
Fantastic Optimizers Open-sourced Models
updated
12 days ago
Best-tuned model for each setting for https://arxiv.org/abs/2509.02046;
Upvote
-
OptimizerStudy/adamw_1.2b_1
2B
•
Updated
17 days ago
•
13
OptimizerStudy/adamw_1.2b_2
2B
•
Updated
17 days ago
•
10
OptimizerStudy/adamw_1.2b_4
2B
•
Updated
17 days ago
•
8
OptimizerStudy/adamw_1.2b_8
2B
•
Updated
17 days ago
•
8
OptimizerStudy/adamw_130m_1
0.3B
•
Updated
17 days ago
•
14
OptimizerStudy/adamw_130m_16
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/adamw_130m_2
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/adamw_130m_4
0.3B
•
Updated
17 days ago
•
11
OptimizerStudy/adamw_130m_8
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/adamw_300m_1
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/adamw_300m_16
0.5B
•
Updated
17 days ago
•
9
OptimizerStudy/adamw_300m_2
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/adamw_300m_4
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/adamw_300m_8
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/adamw_520m_1
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/adamw_520m_2
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/adamw_520m_4
0.8B
•
Updated
17 days ago
•
6
OptimizerStudy/adamw_520m_8
0.8B
•
Updated
17 days ago
•
329
OptimizerStudy/cautious_130m_1
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/cautious_130m_2
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/cautious_130m_4
0.3B
•
Updated
17 days ago
•
12
OptimizerStudy/cautious_130m_8
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/cautious_300m_1
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/cautious_300m_2
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/cautious_300m_4
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/cautious_300m_8
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/cautious_520m_1
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/cautious_520m_2
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/cautious_520m_4
0.8B
•
Updated
17 days ago
•
13
OptimizerStudy/cautious_520m_8
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/kron_130m_1
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/kron_130m_2
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/kron_130m_4
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/kron_130m_8
0.3B
•
Updated
17 days ago
•
14
OptimizerStudy/kron_300m_1
0.5B
•
Updated
17 days ago
•
11
OptimizerStudy/kron_300m_2
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/kron_300m_4
0.5B
•
Updated
17 days ago
•
9
OptimizerStudy/kron_300m_8
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/kron_520m_1
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/kron_520m_2
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/kron_520m_4
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/kron_520m_8
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/lion_130m_1
0.3B
•
Updated
17 days ago
•
10
OptimizerStudy/lion_130m_2
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/lion_130m_4
0.3B
•
Updated
17 days ago
•
6
OptimizerStudy/lion_130m_8
0.3B
•
Updated
17 days ago
•
10
OptimizerStudy/lion_300m_1
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/lion_300m_2
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/lion_300m_4
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/lion_300m_8
0.5B
•
Updated
17 days ago
•
11
OptimizerStudy/lion_520m_1
0.8B
•
Updated
17 days ago
•
11
OptimizerStudy/lion_520m_2
0.8B
•
Updated
17 days ago
•
6
OptimizerStudy/lion_520m_4
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/lion_520m_8
0.8B
•
Updated
17 days ago
•
10
OptimizerStudy/mars_130m_1
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/mars_130m_2
0.3B
•
Updated
17 days ago
•
12
OptimizerStudy/mars_130m_4
0.3B
•
Updated
17 days ago
•
9
OptimizerStudy/mars_130m_8
0.3B
•
Updated
17 days ago
•
9
OptimizerStudy/mars_300m_1
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/mars_300m_2
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/mars_300m_4
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/mars_300m_8
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/mars_520m_1
0.8B
•
Updated
17 days ago
•
14
OptimizerStudy/mars_520m_2
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/mars_520m_4
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/mars_520m_8
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/mini_130m_1
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/mini_130m_2
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/mini_130m_4
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/mini_130m_8
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/mini_300m_1
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/mini_300m_2
0.5B
•
Updated
17 days ago
•
6
OptimizerStudy/mini_300m_4
0.5B
•
Updated
17 days ago
•
9
OptimizerStudy/mini_300m_8
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/mini_520m_1
0.8B
•
Updated
17 days ago
•
7
OptimizerStudy/mini_520m_2
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/mini_520m_4
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/mini_520m_8
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/muon_1.2b_1
2B
•
Updated
17 days ago
•
13
OptimizerStudy/muon_1.2b_2
2B
•
Updated
17 days ago
•
10
OptimizerStudy/muon_1.2b_4
2B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_1.2b_8
2B
•
Updated
17 days ago
•
14
OptimizerStudy/muon_130m_1
0.3B
•
Updated
17 days ago
•
9
OptimizerStudy/muon_130m_16
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_130m_2
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_130m_4
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_130m_8
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_300m_1
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/muon_300m_2
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_300m_4
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/muon_300m_8
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/muon_520m_1
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/muon_520m_2
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/muon_520m_4
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/muon_520m_8
0.8B
•
Updated
17 days ago
•
264
OptimizerStudy/nadamw_1.2b_1
2B
•
Updated
17 days ago
•
7
OptimizerStudy/nadamw_1.2b_2
2B
•
Updated
17 days ago
•
10
OptimizerStudy/nadamw_1.2b_4
2B
•
Updated
17 days ago
•
5
OptimizerStudy/nadamw_1.2b_8
2B
•
Updated
17 days ago
•
11
OptimizerStudy/nadamw_130m_1
0.3B
•
Updated
17 days ago
•
11
OptimizerStudy/nadamw_130m_16
0.3B
•
Updated
17 days ago
•
11
OptimizerStudy/nadamw_130m_2
0.3B
•
Updated
17 days ago
•
16
OptimizerStudy/nadamw_130m_4
0.3B
•
Updated
17 days ago
•
9
OptimizerStudy/nadamw_130m_8
0.3B
•
Updated
17 days ago
•
10
OptimizerStudy/nadamw_300m_1
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/nadamw_300m_16
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/nadamw_300m_2
0.5B
•
Updated
17 days ago
•
9
OptimizerStudy/nadamw_300m_4
0.5B
•
Updated
17 days ago
•
13
OptimizerStudy/nadamw_300m_8
0.5B
•
Updated
17 days ago
•
10
OptimizerStudy/nadamw_520m_1
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/nadamw_520m_2
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/nadamw_520m_4
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/nadamw_520m_8
0.8B
•
Updated
17 days ago
•
8
OptimizerStudy/scion_130m_1
0.3B
•
Updated
17 days ago
•
10
OptimizerStudy/scion_130m_2
0.3B
•
Updated
17 days ago
•
6
OptimizerStudy/scion_130m_4
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/scion_130m_8
0.3B
•
Updated
17 days ago
•
8
OptimizerStudy/scion_300m_1
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/scion_300m_2
0.5B
•
Updated
17 days ago
•
8
OptimizerStudy/scion_300m_4
0.5B
•
Updated
17 days ago
•
7
OptimizerStudy/scion_300m_8
0.5B
•
Updated
17 days ago
•
6
OptimizerStudy/scion_520m_1
0.8B
•
Updated
17 days ago
•
10
OptimizerStudy/scion_520m_2
0.8B
•
Updated
17 days ago
•
10
OptimizerStudy/scion_520m_4
0.8B
•
Updated
17 days ago
•
9
OptimizerStudy/scion_520m_8
0.8B
•
Updated
17 days ago
•
9
OptimizerStudy/soape_1.2b_1
2B
•
Updated
17 days ago
•
9
OptimizerStudy/soape_1.2b_2
2B
•
Updated
17 days ago
•
9
OptimizerStudy/soape_1.2b_4
2B
•
Updated
17 days ago
•
8
OptimizerStudy/soape_1.2b_8
2B
•
Updated
17 days ago
•
7
OptimizerStudy/soape_130m_1
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/soape_130m_16
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/soape_130m_2
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/soape_130m_4
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/soape_130m_8
0.3B
•
Updated
17 days ago
•
7
OptimizerStudy/soape_300m_1
0.5B
•
Updated
17 days ago
•
6
OptimizerStudy/soape_300m_16
0.5B
•
Updated
17 days ago
•
11
OptimizerStudy/soape_300m_2
0.5B
•
Updated
17 days ago
•
11
OptimizerStudy/soape_300m_4
0.5B
•
Updated
17 days ago
•
11
OptimizerStudy/soape_300m_8
0.5B
•
Updated
17 days ago
•
11
OptimizerStudy/soape_520m_1
0.8B
•
Updated
17 days ago
•
11
OptimizerStudy/soape_520m_2
0.8B
•
Updated
17 days ago
•
10
OptimizerStudy/soape_520m_4
0.8B
•
Updated
17 days ago
•
10
OptimizerStudy/soape_520m_8
0.8B
•
Updated
17 days ago
•
216
OptimizerStudy/sophia_130m_1
0.3B
•
Updated
17 days ago
•
10
OptimizerStudy/sophia_130m_2
0.3B
•
Updated
17 days ago
•
10
OptimizerStudy/sophia_130m_4
0.3B
•
Updated
17 days ago
•
12
OptimizerStudy/sophia_130m_8
0.3B
•
Updated
17 days ago
•
11
OptimizerStudy/sophia_300m_1
0.5B
•
Updated
17 days ago
•
12
OptimizerStudy/sophia_520m_1
0.8B
•
Updated
17 days ago
•
9
Upvote
-
Share collection
View history
Collection guide
Browse collections