---
title: AI Agent Evaluator - GAIA Benchmark
emoji: 🕵🏻‍♂️
colorFrom: purple
colorTo: gray
sdk: gradio
python_version: 3.12.9
sdk_version: 5.29.1
app_file: main.py
short_description: Evaluate an AI agent on a subset of the GAIA benchmark
datasets:
- gaia-benchmark/GAIA
pinned: false
tags:
- agent
- ai-agent
- gaia-benchmark
- langchain
- langgraph
- agent-course
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference