Kurosawama's picture
Update README.md
8cccc16 verified
---
library_name: transformers
tags:
- trl
- dpo
- first-order-logic
datasets:
- yale-nlp/FOLIO
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---
# Model Card for Model ID
Aligned version of meta-llama/Llama-3.1-8B-Instruct for Logical Reasoning
### Model Description
This is an aligned model using DPO in order to improve the base model's performance in formal reasoning in first-order logic.
- **Developed by:** [Grupo de Ingeniería Lingüística]
- **Language(s) (NLP):** [English]
- **License:** [Whichever one Llama 3 uses]
- **Finetuned from model [meta-llama/Llama-3.1-8B-Instruct]:**
### Model Sources [optional]
- **Repository:** [[Github](https://github.com/Kurocaguama/Into-The-Limits-of-Logic)]
- **Paper:** Into The Limits of Logic: Alignment Methods for Formal Logic Reasoning
## Evaluation
## Citation [optional]
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
[More Information Needed]