arxiv:2406.06608
michael ilie PRO
skdrx
models
8
skdrx/gemma2-2b-it-falsereject • 3B • Updated • 2 • 1
skdrx/gemma-2-2b-finemath-finetune • 3B • Updated
skdrx/ds_coder_6.7_inst_rlsf_varname • 7B • Updated • 29
skdrx/amd135m_reasoning_finetune • 0.1B • Updated • 44
skdrx/rslf_dscoder1.3b-inst-varname-gguf • 1B • Updated • 9
skdrx/rlsf_ds_1.3b_instruct_varname • Text Generation • 1B • Updated • 1
skdrx/rlstarfmodel_ds_inst • Updated
skdrx/Replete-LLM-Qwen2-7b_Beta-Preview-Q4_K_S-GGUF • 8B • Updated • 12
datasets
11
skdrx/python-dpo-dataset-complete-just-formatting • Viewer • Updated • 37k • 9
skdrx/python-dpo-dataset-varname • Viewer • Updated • 2k • 10
skdrx/python-dpo-dataset-formatted • Viewer • Updated • 2k • 7
skdrx/python-dpo-dataset-varname-formatted-combined-ONLYSYSTEMPROMPT • Viewer • Updated • 1k • 7
skdrx/python-dpo-dataset-varname-formatted-combined-NOSYSTEMPROMPT • Viewer • Updated • 1k • 8
skdrx/python-dpo-dataset-varname-formatted-ONLYSYSTEMPROMPT • Viewer • Updated • 1k • 10
skdrx/python-dpo-dataset-varname-formatted-NOSYSTEMPROMPT • Viewer • Updated • 1k • 5
skdrx/python-dpo-dataset-varname-formatted-combined • Viewer • Updated • 2k • 8
skdrx/python-dpo-dataset-varname-formatted • Viewer • Updated • 2k • 7
skdrx/rlsf_dpo • Viewer • Updated • 10k • 10 • 1