Neither Valid nor Reliable? Investigating the Use of LLMs as Judges Paper • 2508.18076 • Published Aug 25 • 6
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Paper • 2504.19838 • Published Apr 28 • 22
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Paper • 2409.07314 • Published Sep 11, 2024 • 56
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs Paper • 2409.14988 • Published Sep 23, 2024 • 23
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation Paper • 2409.12941 • Published Sep 19, 2024 • 24
view article Article Retrieval Augmented Generation with Huggingface Transformers and Ray Feb 10, 2021 • 6
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 140
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion Paper • 2409.11406 • Published Sep 17, 2024 • 27
CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published Aug 29, 2024 • 57
view article Article dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified Aug 22, 2024 • 13