---
library_name: transformers
tags:
- trl
- dpo
- first-order-logic
datasets:
- yale-nlp/FOLIO
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---

# Model Card for Model ID

A version of meta-llama/Llama-3.1-8B-Instruct aligned for logical reasoning.

### Model Description

This model was aligned with Direct Preference Optimization (DPO) to improve the base model's formal reasoning in first-order logic.

- **Developed by:** Grupo de Ingeniería Lingüística
- **Language(s) (NLP):** English
- **License:** Same license as the base model, meta-llama/Llama-3.1-8B-Instruct (Llama 3.1 Community License)
- **Finetuned from model:** meta-llama/Llama-3.1-8B-Instruct

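For readers unfamiliar with DPO, the sketch below computes the standard DPO loss for a single preference pair in plain Python. It is illustrative only: the `trl` tag suggests training used that library, but this card does not state the exact setup, so the function name `dpo_loss`, the default `beta=0.1`, and the example log-probabilities are all assumptions rather than the authors' configuration.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    `beta` is a hypothetical value, not taken from this model card.
    """
    # Implicit rewards: how far the policy has moved from the reference
    # on each response, scaled by beta.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # written in a form that avoids overflow for large |margin|.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Made-up log-probabilities: the policy favors the chosen answer more
# strongly than the reference does, so the loss falls below log(2).
example_loss = dpo_loss(-12.0, -15.0, -13.0, -14.0)
```

When the policy and the reference assign identical log-probabilities, the margin is zero and the loss equals log 2. Training lowers the loss by raising the policy's relative preference for the chosen response over the rejected one.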
### Model Sources

- **Repository:** [GitHub](https://github.com/Kurocaguama/Into-The-Limits-of-Logic)
- **Paper:** Into The Limits of Logic: Alignment Methods for Formal Logic Reasoning

## Evaluation

## Citation

**BibTeX:**

[More Information Needed]