Hugging Face
Qi
user074
AI & ML interests
None yet
Recent Activity
Updated a collection 13 days ago: SSA
Organizations
None yet
user074's models (29)
user074/SSA_SFT_RL_0.5B
Text Generation
•
0.5B
•
Updated
13 days ago
•
9
user074/SSA_SFT_RL_1.5B
Text Generation
•
2B
•
Updated
13 days ago
•
9
user074/SSA_SFT_RL_3B
Text Generation
•
3B
•
Updated
13 days ago
•
12
user074/SSA_RL_0.5B
Text Generation
•
0.5B
•
Updated
13 days ago
•
8
user074/SSA_RL_1.5B
Text Generation
•
2B
•
Updated
13 days ago
•
9
user074/SSA_RL_3B
Text Generation
•
3B
•
Updated
13 days ago
•
10
user074/llava-v1.5-7b-compression256
7B
•
Updated
20 days ago
•
9
user074/llava-v1.5-7b-compression64
7B
•
Updated
20 days ago
•
9
user074/llava-v1.5-7b-compression1
7B
•
Updated
20 days ago
•
12
user074/llava-v1.5-7b-compression16
7B
•
Updated
20 days ago
•
8
user074/llava-v1.5-7b-normWmean
7B
•
Updated
20 days ago
•
18
user074/llava-v1.5-7b-multilayerNorm
7B
•
Updated
20 days ago
•
12
user074/grpo_qwen3b_composer_randomseed_final
Text Generation
•
3B
•
Updated
Jul 31
user074/grpo_qwen3b_composer_random
Updated
Jul 28
user074/grpo_qwen3b_composer_8answers_math_gsm_fix_reward
Text Generation
•
3B
•
Updated
May 8
user074/grpo_qwen1b_composer_8answers_math_gsm_fix_reward
Text Generation
•
2B
•
Updated
May 8
user074/grpo_qwen05b_composer_8answers_math_gsm_fix_reward
Text Generation
•
0.5B
•
Updated
May 8
user074/grpo_qwen3b_composer_nothinking
Text Generation
•
3B
•
Updated
May 8
user074/grpo_qwen1b_composer_nothinking
Text Generation
•
2B
•
Updated
May 8
user074/grpo_qwen05b_composer_nothinking
Text Generation
•
0.5B
•
Updated
May 8
user074/grpo_qwen05b_composer_sft
Text Generation
•
0.5B
•
Updated
May 6
user074/sft_qwen3b_composer_5e_6
Text Generation
•
3B
•
Updated
May 5
user074/sft_qwen1b_composer_2e_5
Text Generation
•
2B
•
Updated
May 5
user074/grpo_qwen05b_composer
Text Generation
•
0.5B
•
Updated
May 5
user074/grpo_qwen05b_composer_5answers_math_gsm_fix_reward
Text Generation
•
0.5B
•
Updated
May 5
user074/selfplay_qwen3b
Text Generation
•
3B
•
Updated
May 2
user074/selfplay_qwen3b_mix_SFT
Text Generation
•
3B
•
Updated
Apr 24
user074/grpo_qwen3b
Text Generation
•
3B
•
Updated
Apr 24
•
1
user074/selfplay_qwen3b_evaluator
Text Generation
•
3B
•
Updated
Apr 23