Derrick Mwiti's picture

Derrick Mwiti

mwitiderrick

·

AI & ML interests

None yet

Organizations

upvoted 5 articles over 1 year ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19, 2024

•

187

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

•

68

Article

Putting RL back in RLHF

Jun 12, 2024

•

107

Article

Welcome Gemma 2 - Google’s new open LLM

Jun 27, 2024

•

132

Article

Proximal Policy Optimization (PPO)

Aug 5, 2022

•

66

upvoted a collection over 1 year ago

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 76

upvoted 3 articles over 1 year ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

•

190

Article

A Dive into Vision-Language Models

Feb 3, 2023

•

78

Article

Vision Language Models Explained

Apr 11, 2024

•

492

upvoted 2 collections over 1 year ago

OpenELM Instruct Models

4 items • Updated Aug 25 • 123

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 41

upvoted a paper over 1 year ago

A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation

Paper • 2310.16656 • Published Oct 25, 2023 • 50

upvoted a collection over 1 year ago

Sparse Foundational Llama 2 Models

Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated Apr 16 • 9

upvoted a collection almost 2 years ago

DeepSparse Sparse LLMs

Useful LLMs for DeepSparse where we've pruned at least 50% of the weights! • 10 items • Updated Sep 26, 2024 • 5

upvoted a collection about 2 years ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 649

upvoted 2 papers about 2 years ago

VeRA: Vector-based Random Matrix Adaptation

Paper • 2310.11454 • Published Oct 17, 2023 • 30

Sparse Finetuning for Inference Acceleration of Large Language Models

Paper • 2310.06927 • Published Oct 10, 2023 • 15

upvoted a collection about 2 years ago

Sparse Finetuning MPT

Explore our breakthrough in sparse fine-tuning LLMs! Our novel method maintains downstream accuracy even with >70% sparsity. • 13 items • Updated Sep 26, 2024 • 3

upvoted a paper about 2 years ago

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 45