# Fine-tuned on AQuA-RAT

This repo contains a fine-tuned version of Apertus, trained on the [AQuA-RAT dataset](https://huggingface.co/datasets/deepmind/aqua_rat).

The fine-tuning was performed using Unsloth on a single GPU (RTX A6000, 48 GB) with the following parameters (a sketch of the corresponding setup follows the list):

- per_device_train_batch_size: 8
- gradient_accumulation_steps: 4 (effective batch size: 32)
- warmup_steps: 10
- …
- eval_strategy: steps
- eval_steps: 150
- packing: True
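
For reference, a training setup with these values might look like the sketch below. It is illustrative rather than the exact script used for this checkpoint: the base-model id, prompt template, `max_seq_length`, and `output_dir` are assumptions, and the parameters elided above are omitted.

```python
# Illustrative sketch only, not the exact training script for this checkpoint.
# Assumptions: base-model id, prompt template, max_seq_length, output_dir.
from unsloth import FastLanguageModel
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="swiss-ai/Apertus-8B-Instruct-2509",  # assumed base checkpoint
    max_seq_length=2048,                             # assumed
)

def to_text(example):
    # Assumed prompt template: question + options as input, rationale + answer as target.
    options = "\n".join(example["options"])
    example["text"] = (
        f"Question: {example['question']}\nOptions:\n{options}\n"
        f"Rationale: {example['rationale']}\nAnswer: {example['correct']}"
    )
    return example

train_ds = load_dataset("deepmind/aqua_rat", split="train").map(to_text)
eval_ds = load_dataset("deepmind/aqua_rat", split="validation").map(to_text)

trainer = SFTTrainer(
    model=model,
    processing_class=tokenizer,  # `tokenizer=` on older TRL versions
    train_dataset=train_ds,
    eval_dataset=eval_ds,
    args=SFTConfig(
        dataset_text_field="text",
        per_device_train_batch_size=8,
        gradient_accumulation_steps=4,  # effective batch size: 8 * 4 = 32
        warmup_steps=10,
        eval_strategy="steps",
        eval_steps=150,
        packing=True,
        output_dir="outputs",           # assumed
    ),
)
trainer.train()
```

With packing enabled, short AQuA-RAT examples are concatenated into full-length sequences, and gradient accumulation raises the effective batch size to 32 while keeping per-step memory within a single 48 GB card.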
## How to use

You can run this fine-tuned version using the instructions below:
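
As a minimal sketch, assuming this checkpoint loads like any standard Hugging Face causal LM; the repo id and prompt below are placeholders, not the original instructions:

```python
# Minimal sketch, assuming the checkpoint loads as a standard causal LM.
# The repo id and prompt below are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "<this-repo-id>"  # placeholder: replace with this repository's id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = (
    "Question: A train covers 120 km in 2 hours. What is its average speed?\n"
    "Options:\nA) 40 km/h\nB) 50 km/h\nC) 60 km/h\nD) 70 km/h\nE) 80 km/h\n"
    "Rationale:"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```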
|