Ling-1T-GGUF UD-Q8_K_XL context size is limited to 32k rather than 128k?

#17 opened by superciliousdude

Hi,

In my (admittedly limited) testing, the Ling-1T-GGUF model appears to be limited to 32k of context rather than the 128k it is supposed to support according to the model card of the unquantised repo. I was astonished to find this model performing so spectacularly poorly compared to Kimi K2 (at the same quant), but it turns out it's not a fair fight, since I cannot configure this one to use more than 32k of context.
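For anyone who wants to verify, here is a minimal sketch that dumps the `context_length` field from the GGUF header, assuming the `gguf` Python package that ships with llama.cpp (`pip install gguf`); the shard filename below is a placeholder:

```python
from gguf import GGUFReader

# For a multi-shard GGUF, the metadata lives in the first shard.
# Placeholder filename - substitute the actual first shard of the quant.
reader = GGUFReader("Ling-1T-UD-Q8_K_XL-00001-of-000NN.gguf")

for key, field in reader.fields.items():
    # The key is architecture-specific ("<arch>.context_length"),
    # so match on the suffix rather than hard-coding the arch name.
    if key.endswith(".context_length"):
        print(key, "=", int(field.parts[-1][0]))
```

If this prints 32768, the limit is baked into the quant's metadata; if it prints 131072, the 32k ceiling is presumably coming from the runtime instead (llama.cpp, for instance, allocates whatever `--ctx-size`/`-c` is set to rather than the model's maximum, so passing `-c 131072` explicitly may be worth trying).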

Can someone please confirm whether this is indeed the case, and if so, whether it is intentional or a bug?

Thanks in advance.
