Kurosawama
/

Llama-3.1-8B-Instruct-Full-align

first-order-logic

Model card Files Files and versions

Llama-3.1-8B-Instruct-Full-align / README.md

Kurosawama's picture

Update README.md

8cccc16 verified about 2 months ago

|

history blame contribute delete

1.03 kB

	---
	library_name: transformers
	tags:
	- trl
	- dpo
	- first-order-logic
	datasets:
	- yale-nlp/FOLIO
	language:
	- en
	base_model:
	- meta-llama/Llama-3.1-8B-Instruct
	---

	# Model Card for Model ID

	Aligned version of meta-llama/Llama-3.1-8B-Instruct for Logical Reasoning

	### Model Description

	This is an aligned model using DPO in order to improve the base model's performance in formal reasoning in first-order logic.

	- Developed by: [Grupo de Ingeniería Lingüística]
	- Language(s) (NLP): [English]
	- License: [Whichever one Llama 3 uses]
	- Finetuned from model [meta-llama/Llama-3.1-8B-Instruct]:

	### Model Sources [optional]

	- Repository: [[Github](https://github.com/Kurocaguama/Into-The-Limits-of-Logic)]
	- Paper: Into The Limits of Logic: Alignment Methods for Formal Logic Reasoning


	## Evaluation


	## Citation [optional]

	<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->

	BibTeX:

	[More Information Needed]