Update README.md
README.md
CHANGED
@@ -31,7 +31,7 @@ license: cc-by-nc-4.0
 - quants done with an importance matrix for improved quantization loss
 - 0, K & IQ quants in basically all variants from Q8 down to IQ1_S
 - Quantized with [llama.cpp](https://github.com/ggerganov/llama.cpp) commit [04976db7a819fcf8bfefbfc09a3344210b79dd27](https://github.com/ggerganov/llama.cpp/commit/04976db7a819fcf8bfefbfc09a3344210b79dd27) (master from 2024-05-07)
--
+- Imatrix generated with [this](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384) dataset.
 ```
 ./imatrix -c 512 -m $model_name-f16.gguf -f $llama_cpp_path/groups_merged.txt -o $out_path/imat-f16-gmerged.dat
 ```