gaia-eval-l1-20

Running

App Files Files

gaia-eval-l1-20 / README.md

kengboon

Update with AgentWorkflow

cc6c3da 7 months ago

preview code

raw

history blame

489 Bytes

metadata

title: AI Agent Evaluator - GAIA Benchmark
emoji: 🕵🏻‍♂️
colorFrom: purple
colorTo: gray
sdk: gradio
python_version: 3.12.9
sdk_version: 5.29.1
app_file: main.py
short_description: Evaluate an AI agent on a subset of GAIA benchmark
datasets:
  - gaia-benchmark/GAIA
pinned: false
tags:
  - agent
  - ai-agent
  - gaia-benchmark
  - langchain
  - langgraph
  - agent-course

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference