game

goodgame

AI & ML interests

None yet

Recent Activity

new activity 12 days ago

cturan/MiniMax-M2-GGUF: Actual tests show it works well. The Q4K quantized model maintains a decoding speed of around 27 tokens per second after multiple turns of casual conversation.

liked a model 12 days ago

cturan/MiniMax-M2-GGUF

upvoted a collection 15 days ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 14 items • Updated 4 days ago • 35

upvoted a collection 3 months ago

GLM-4.5-THIREUS-SPECIAL_SPLIT

These model shards are meant to be used with Thireus' GGUF Tool Suite - https://gguf.thireus.com/ • 56 items • Updated Oct 5 • 2

upvoted 2 collections 4 months ago

EXL3 models

36 items • Updated about 17 hours ago • 38

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26, 2024 • 17

upvoted a collection about 1 year ago

DeepSeek-V2.5

2 items • Updated Dec 10, 2024 • 43