arXiv:2504.18919
Adam Mahdi
ammaox
AI & ML interests
LLMs, multimodal AI
Recent Activity
upvoted a paper about 11 hours ago
Measuring what Matters: Construct Validity in Large Language Model Benchmarks
upvoted a paper 6 months ago
Clinical knowledge in LLMs does not translate to human interactions
authored a paper 7 months ago
Clinical knowledge in LLMs does not translate to human interactions