Qi

user074

AI & ML interests

None yet

Recent Activity

updated a collection 13 days ago

Organizations

None yet

user074's models (29)

user074/SSA_SFT_RL_0.5B

Text Generation • 0.5B • Updated 13 days ago • 9

user074/SSA_SFT_RL_1.5B

Text Generation • 2B • Updated 13 days ago • 9

user074/SSA_SFT_RL_3B

Text Generation • 3B • Updated 13 days ago • 12

user074/SSA_RL_0.5B

Text Generation • 0.5B • Updated 13 days ago • 8

user074/SSA_RL_1.5B

Text Generation • 2B • Updated 13 days ago • 9

user074/SSA_RL_3B

Text Generation • 3B • Updated 13 days ago • 10

user074/llava-v1.5-7b-compression256

7B • Updated 20 days ago • 9

user074/llava-v1.5-7b-compression64

7B • Updated 20 days ago • 9

user074/llava-v1.5-7b-compression1

7B • Updated 20 days ago • 12

user074/llava-v1.5-7b-compression16

7B • Updated 20 days ago • 8

user074/llava-v1.5-7b-normWmean

7B • Updated 20 days ago • 18

user074/llava-v1.5-7b-multilayerNorm

7B • Updated 20 days ago • 12

user074/grpo_qwen3b_composer_randomseed_final

Text Generation • 3B • Updated Jul 31

user074/grpo_qwen3b_composer_random

user074/grpo_qwen3b_composer_8answers_math_gsm_fix_reward

Text Generation • 3B • Updated May 8

user074/grpo_qwen1b_composer_8answers_math_gsm_fix_reward

Text Generation • 2B • Updated May 8

user074/grpo_qwen05b_composer_8answers_math_gsm_fix_reward

Text Generation • 0.5B • Updated May 8

user074/grpo_qwen3b_composer_nothinking

Text Generation • 3B • Updated May 8

user074/grpo_qwen1b_composer_nothinking

Text Generation • 2B • Updated May 8

user074/grpo_qwen05b_composer_nothinking

Text Generation • 0.5B • Updated May 8

user074/grpo_qwen05b_composer_sft

Text Generation • 0.5B • Updated May 6

user074/sft_qwen3b_composer_5e_6

Text Generation • 3B • Updated May 5

user074/sft_qwen1b_composer_2e_5

Text Generation • 2B • Updated May 5

user074/grpo_qwen05b_composer

Text Generation • 0.5B • Updated May 5

user074/grpo_qwen05b_composer_5answers_math_gsm_fix_reward

Text Generation • 0.5B • Updated May 5

user074/selfplay_qwen3b

Text Generation • 3B • Updated May 2

user074/selfplay_qwen3b_mix_SFT

Text Generation • 3B • Updated Apr 24

user074/grpo_qwen3b

Text Generation • 3B • Updated Apr 24 • 1

user074/selfplay_qwen3b_evaluator

Text Generation • 3B • Updated Apr 23