1 44 9

Brun

JM-Brun

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

SurveyBench: How Well Can LLM(-Agents) Write Academic Surveys?

updated a collection about 1 month ago

Research Tool

updated a collection about 1 month ago

Agents

View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

SurveyBench: How Well Can LLM(-Agents) Write Academic Surveys?

Paper • 2510.03120 • Published Oct 3 • 6

CoDA: Agentic Systems for Collaborative Data Visualization

Paper • 2510.03194 • Published Oct 3 • 28

upvoted a paper 3 months ago

T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables

Paper • 2508.19813 • Published Aug 27 • 25

upvoted an article 3 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

Aug 8

•

102

upvoted a paper 3 months ago

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Paper • 2508.20453 • Published Aug 28 • 63

upvoted 3 papers 4 months ago

Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2 • 21

Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models

Paper • 2507.14241 • Published Jul 17 • 17

Mitigating Object Hallucinations via Sentence-Level Early Intervention

Paper • 2507.12455 • Published Jul 16 • 7

upvoted 10 papers 5 months ago

In-Context Learning Strategies Emerge Rationally

Paper • 2506.17859 • Published Jun 21 • 10

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

Paper • 2507.01001 • Published Jul 1 • 47

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Paper • 2410.04587 • Published Oct 6, 2024 • 2

Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving

Paper • 2506.17104 • Published Jun 20 • 1

LLMs Will Always Hallucinate, and We Need to Live With This

Paper • 2409.05746 • Published Sep 9, 2024 • 6

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published Mar 16 • 35

upvoted 2 papers 6 months ago

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 94

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21 • 54

Brun

AI & ML interests

Recent Activity

Organizations

JM-Brun's activity

Introducing AI Sheets: a tool to work with datasets using open AI models!