cgus commited on
Commit
79daa35
·
verified ·
1 Parent(s): 964f255

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -2
README.md CHANGED
@@ -1,9 +1,30 @@
1
  ---
2
  license: apache-2.0
3
  base_model:
4
- - mistralai/Mistral-Nemo-Base-2407
5
- library_name: transformers
6
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  # silly-v0.2
8
 
9
  Finetune of [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) designed to emulate the writing style of character.ai models.
 
1
  ---
2
  license: apache-2.0
3
  base_model:
4
+ - wave-on-discord/silly-v0.2
5
+ library_name: exllamav2
6
  ---
7
+ # silly-v0.2-exl2
8
+ Original model: [silly-v0.2](https://huggingface.co/wave-on-discord/silly-v0.2) by [wave-on-discord](https://huggingface.co/wave-on-discord)
9
+ Based on: [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) by [Mistral AI](https://huggingface.co/mistralai)
10
+
11
+ ## Quants
12
+ [4bpw h6 (main)](https://huggingface.co/cgus/silly-v0.2-exl2/tree/main)
13
+ [4.5bpw h6](https://huggingface.co/cgus/silly-v0.2-exl2/tree/4.5bpw-h6)
14
+ [5bpw h6](https://huggingface.co/cgus/silly-v0.2-exl2/tree/5bpw-h6)
15
+ [6bpw h6](https://huggingface.co/cgus/silly-v0.2-exl2/tree/6bpw-h6)
16
+ [8bpw h8](https://huggingface.co/cgus/silly-v0.2-exl2/tree/8bpw-h8)
17
+
18
+ ## Quantization notes
19
+ Made with Exllamav2 0.3.1 with default dataset.
20
+ The model can be used with Nvidia RTX GPUs on Windows or RTX/AMD ROCm on Linux with TabbyAPI or Text-Generation-WebUI.
21
+ Should be usable at 6bpw/16k context with something like RTX3060/12GB or 6bpw/32k with RTX4060Ti/16GB, both Q8 cache.
22
+
23
+ In my brief testing the model had interesting writing style but it's very fragile and easily starts looping or repeating.
24
+ I guess it should be used with DRY sampler to avoid repetition/loops. Both TabbyAPI and TGW have it.
25
+ I don't recommend using repetition_penalty or frequency_penalty samplers for this as they are far far more destructive than DRY.
26
+
27
+ # Original model card
28
  # silly-v0.2
29
 
30
  Finetune of [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) designed to emulate the writing style of character.ai models.