Gemma 3 12B IT SFT

Gemma 3 12B SFT model trained on STEM dataset as introduced in EDUMATH: Generating Standards-aligned Educational Math Word Problems. See our project repo for usage and our paper for training details/metrics.

Citation

@misc{christ2025edumathgeneratingstandardsalignededucational,
      title={EDUMATH: Generating Standards-aligned Educational Math Word Problems}, 
      author={Bryan R. Christ and Penelope Molitz and Jonathan Kropko and Thomas Hartvigsen},
      year={2025},
      eprint={2510.06965},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2510.06965}, 
}
Downloads last month
3
Safetensors
Model size
12B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for bryanchrist/gemma3_12b_sft

Finetuned
(122)
this model
Finetunes
1 model

Dataset used to train bryanchrist/gemma3_12b_sft

Collection including bryanchrist/gemma3_12b_sft