Update README.md
README.md
CHANGED
@@ -11,11 +11,11 @@ widget:
---
## Model description

-

A Lithuanian GPT-2 model trained on a Wikipedia corpus dataset and based on the GPT-2 small model.

+This is only the first version of the model; over time, the model will be improved using a more extensive dataset and better data preparation.

## Training data
This model was pre-trained on 180 MB of Lithuanian Wikipedia text. The texts are tokenized using a byte-level version of Byte Pair Encoding (BPE).
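To illustrate what byte-level BPE means for Lithuanian text, here is a minimal sketch in plain Python. It is not the tokenizer shipped with this model: the function names `train_byte_bpe`, `apply_merge`, and `encode` are hypothetical, and the sketch omits the pre-tokenization and byte-to-character mapping that the GPT-2 tokenizer performs. It only shows the core idea: start from UTF-8 bytes, so that letters such as `č` or `ū` (two bytes each) are always representable, then repeatedly merge the most frequent adjacent pair.

```python
from collections import Counter


def apply_merge(tokens, left, right):
    """Replace every adjacent (left, right) pair with the merged token."""
    out = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and tokens[i] == left and tokens[i + 1] == right:
            out.append(left + right)
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


def train_byte_bpe(text, num_merges):
    """Learn up to `num_merges` merges over the UTF-8 bytes of `text`.

    Assumption: a toy trainer with no pre-tokenization, for illustration only.
    """
    # Start from single bytes, so no character is ever out of vocabulary.
    tokens = [bytes([b]) for b in text.encode("utf-8")]
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (left, right), count = pairs.most_common(1)[0]
        if count < 2:
            break  # nothing repeats, so further merges would not help
        merges.append((left, right))
        tokens = apply_merge(tokens, left, right)
    return merges, tokens


def encode(text, merges):
    """Tokenize `text` by replaying the learned merges in order."""
    tokens = [bytes([b]) for b in text.encode("utf-8")]
    for left, right in merges:
        tokens = apply_merge(tokens, left, right)
    return tokens


if __name__ == "__main__":
    merges, _ = train_byte_bpe("ačiū ačiū ačiū", num_merges=10)
    print(encode("ačiū", merges))
```

Because the base vocabulary is the 256 possible byte values, concatenating the encoded tokens always reproduces the original UTF-8 bytes, whatever the input text.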