Yedidia AGNIMO

YedsonUQ

AI & ML interests

[Uncertainty Quantification, "Hallucinations"] in LLMs, Federated Learning

Recent Activity

updated a collection about 1 month ago

Reinforcement Learning (RL)

updated a collection about 1 month ago

Foundational Deep Learning - Architecture

updated a collection 3 months ago

Benchmark and Evaluation

Organizations

None yet

upvoted a paper 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 192

upvoted a paper 5 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 126

upvoted 5 papers 6 months ago

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2 • 16

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

Paper • 2505.07591 • Published May 12 • 11

upvoted 3 papers 7 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 186

Cost-of-Pass: An Economic Framework for Evaluating Language Models

Paper • 2504.13359 • Published Apr 17 • 4

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23 • 29

upvoted a paper 8 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 171

upvoted 7 papers 9 months ago

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 21

How to Steer LLM Latents for Hallucination Detection?

Paper • 2503.01917 • Published Mar 1 • 11

Language Models' Factuality Depends on the Language of Inquiry

Paper • 2502.17955 • Published Feb 25 • 33

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 50

An Overview of Large Language Models for Statisticians

Paper • 2502.17814 • Published Feb 25 • 4

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 126

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 100

upvoted 2 papers 10 months ago

Linear Correlation in LM's Compositional Generalization and Hallucination

Paper • 2502.04520 • Published Feb 6 • 11

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 222
