view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19, 2024 β’ 187
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! β’ 44 items β’ Updated Oct 17, 2024 β’ 76
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15, 2024 β’ 190
π Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets β’ 8 items β’ Updated Jun 12, 2024 β’ 41
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation Paper β’ 2310.16656 β’ Published Oct 25, 2023 β’ 50
Sparse Foundational Llama 2 Models Collection Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras β’ 27 items β’ Updated Apr 16 β’ 9
DeepSparse Sparse LLMs Collection Useful LLMs for DeepSparse where we've pruned at least 50% of the weights! β’ 10 items β’ Updated Sep 26, 2024 β’ 5
Open LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 65 items β’ Updated Mar 20 β’ 649
Sparse Finetuning for Inference Acceleration of Large Language Models Paper β’ 2310.06927 β’ Published Oct 10, 2023 β’ 15
Sparse Finetuning MPT Collection Explore our breakthrough in sparse fine-tuning LLMs! Our novel method maintains downstream accuracy even with >70% sparsity. β’ 13 items β’ Updated Sep 26, 2024 β’ 3
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper β’ 2309.14717 β’ Published Sep 26, 2023 β’ 45