When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper • 2510.04849 • Published Oct 6 • 111
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published 24 days ago • 15
view article Article Granite Embedding R2: Setting New Standards for Enterprise Retrieval By hansolosan • 26 days ago • 15
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning Paper • 2509.06888 • Published Sep 8 • 12
Medical and Scientific Literature Models Collection Models for working with medical and scientific literature. • 10 items • Updated Sep 18 • 8
Hallucination detection Collection Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated May 18 • 19
view article Article Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications By nmmursit and 5 others • Aug 29 • 27
TinyLettuce Collection This Collection contains our small, Ettin-encoder (https://arxiv.org/abs/2507.11412) based models trained on synthetic and RagTruth data. • 6 items • Updated Aug 31 • 3
view article Article 🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders By adaamko and 1 other • Aug 31 • 15
Splade Models Collection The collection includes Splade models from different authors that can be load thanks to the Sparse Encoder modules of Sentence Transformers • 16 items • Updated Jul 30 • 8
view article Article 🥬 LettuceDetect Goes Multilingual: Fine-tuning EuroBERT on Synthetic Translations By adaamko and 1 other • May 19 • 9
Multilingual Hallucination Detection Collection These are our EuroBERT fine-tunes on our translated RAGTruth datasets. • 13 items • Updated May 18 • 5
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 170