Teapot LLM is fine-tuned from [flan-t5-base](https://huggingface.co/google/flan-t5-base).

- [Hyperparameters] The model was trained with various learning rates and monitored to ensure task-specific performance was learned without catastrophic forgetting.

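The actual training scripts and data are not part of this README, but a learning-rate sweep with validation monitoring along these lines might look like the sketch below; the data files, the `prompt`/`target` column names, and the hyperparameter values are placeholders, not the published configuration.

```python
# Minimal sketch of a learning-rate sweep with validation monitoring.
# Placeholder data files and hyperparameters; not the published training setup.
from datasets import load_dataset
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")

def preprocess(batch):
    # Hypothetical "prompt"/"target" columns in the placeholder training data.
    inputs = tokenizer(batch["prompt"], truncation=True, max_length=512)
    labels = tokenizer(text_target=batch["target"], truncation=True, max_length=128)
    inputs["labels"] = labels["input_ids"]
    return inputs

raw = load_dataset("json", data_files={"train": "train.jsonl", "validation": "val.jsonl"})
tokenized = raw.map(preprocess, batched=True, remove_columns=raw["train"].column_names)
collator = DataCollatorForSeq2Seq(tokenizer)

for lr in (1e-4, 3e-4, 1e-3):  # sweep a few candidate learning rates
    model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")
    trainer = Seq2SeqTrainer(
        model=model,
        args=Seq2SeqTrainingArguments(
            output_dir=f"teapot-ft-lr{lr}",
            learning_rate=lr,
            per_device_train_batch_size=8,
            num_train_epochs=3,
            logging_steps=50,
        ),
        train_dataset=tokenized["train"],
        eval_dataset=tokenized["validation"],
        data_collator=collator,
    )
    trainer.train()
    # Validation loss is the signal to watch for regressions such as catastrophic forgetting.
    print(lr, trainer.evaluate()["eval_loss"])
```
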
### Evaluation

TeapotLLM is focused on in-context reasoning tasks, and therefore most standard benchmarks are not suitable for evaluation. We want TeapotLLM to be a practical tool for QnA and information extraction, so we have developed custom datasets to benchmark performance.

#### Synthqa Evaluation
[Synthqa](https://huggingface.co/datasets/teapotai/synthqa) is a dataset focused on in-context QnA and information extraction tasks. We use its validation set to benchmark TeapotLLM against other models of similar size. All benchmarks were run in a Google Colab notebook on CPU with high RAM. TeapotLLM significantly outperforms these models, with low-latency CPU inference and improved accuracy.

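For illustration, a CPU benchmark run over the synthqa validation split might look like the sketch below; the `context`/`question`/`answer` column names and the exact-match scoring are assumptions, not the official benchmark script.

```python
# Rough sketch of running TeapotLLM over the synthqa validation split on CPU.
# Column names and the exact-match metric are assumptions.
from datasets import load_dataset
from transformers import pipeline

generator = pipeline(
    "text2text-generation",
    model="teapotai/teapotllm",
    device=-1,  # CPU, matching the Colab CPU benchmark setup
)

synthqa = load_dataset("teapotai/synthqa", split="validation")

correct = 0
for example in synthqa:
    # "context", "question", and "answer" are assumed field names.
    prompt = f"{example['context']}\n\nQuestion: {example['question']}"
    prediction = generator(prompt, max_new_tokens=64)[0]["generated_text"]
    correct += int(prediction.strip().lower() == example["answer"].strip().lower())

print(f"Exact match: {correct / len(synthqa):.2%}")
```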