agurung/v1ff_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset Text Generation • 8B • Updated 26 days ago • 35
agurung/v2ff_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset Text Generation • 8B • Updated 26 days ago • 108
agurung/v3ff_savebestearly_sft_qwen7B_25percent_lr_1e4_bptt_offset_newprompt Text Generation • 8B • Updated 26 days ago • 139
hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4_plus Text Generation • 2B • Updated 23 days ago • 75
Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.07 Text Generation • 266k • Updated 12 days ago • 56
Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.09 Text Generation • 266k • Updated 9 days ago • 98
Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.08 Text Generation • 266k • Updated 8 days ago • 154
Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.11 Text Generation • 266k • Updated 2 days ago • 84