atrost
/

math_sft_40K_trl_think_SFT_Regularized-0.5_Normalize-True

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

math_sft_40K_trl_think_SFT_Regularized-0.5_Normalize-True

3.46 GB

1 contributor

History: 2 commits

atrost's picture

Model save

fa4613d verified about 2 months ago