VisArena / app.py

Commit History

Fix Gradio bindings: remove gr.Request from inputs; rely on implicit request injection. Docs already updated earlier.
12a36fb

Peiran committed on

Login required + per-session batch (20), min raters filter (20), periodic reload, spaced repeats, and metrics logging (duration, flat scores); add hidden states and CSV fields; enforce HF-auth annotator id
6109248

Peiran committed on

Annotator ID from HF user: remove manual input, extract from gr.Request (X-Forwarded-User/username); wire through handlers; keep per-user dedup + global balancing
1688aaf

Peiran committed on
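The annotator-ID extraction described in 1688aaf could look roughly like the sketch below. It is not the code from app.py: the function name and the header-before-username precedence are assumptions, and the request is typed loosely so the snippet runs without Gradio installed. It relies only on the request object exposing `headers` and `username`, as `gr.Request` does.

```python
from typing import Any, Optional


def annotator_id_from_request(request: Any) -> Optional[str]:
    """Return a non-empty annotator ID, or None if the user is not identified.

    Hypothetical helper: checks the X-Forwarded-User header first, then
    falls back to request.username (the order is an assumption).
    """
    if request is None:
        return None
    headers = getattr(request, "headers", None) or {}
    # Header lookups may or may not be case-insensitive, so try both spellings.
    forwarded = headers.get("x-forwarded-user") or headers.get("X-Forwarded-User")
    candidate = forwarded or getattr(request, "username", None)
    if not candidate:
        return None
    candidate = str(candidate).strip()
    return candidate or None
```

A handler that receives `None` here would presumably refuse the submission rather than record an anonymous rating.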

Per-user scheduling + global balancing: add Annotator ID, per-user dedup, count-based prioritization, CSV adds annotator_id; update UI bindings accordingly
64125ec

Peiran committed on
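As an illustration of the per-user dedup and count-based prioritization named in 64125ec, here is a minimal sketch. The function name, the `(annotator_id, pair_key)` record shape, and the `limit` parameter are all hypothetical; the real app reads its records from a CSV whose layout is not shown in this log.

```python
from collections import Counter
from typing import Hashable, Iterable, List, Tuple


def next_pairs_for(
    annotator_id: str,
    all_pairs: Iterable[Hashable],
    records: Iterable[Tuple[str, Hashable]],
    limit: int = 20,
) -> List[Hashable]:
    """Pick pairs this annotator has not rated, least-rated pairs first."""
    records = list(records)  # iterated twice below
    # Global balancing: how many ratings each pair has received from anyone.
    counts = Counter(pair for _, pair in records)
    # Per-user dedup: pairs this annotator has already rated.
    seen = {pair for who, pair in records if who == annotator_id}
    pending = [p for p in all_pairs if p not in seen]
    # Stable sort, so ties keep their original order.
    pending.sort(key=lambda p: counts[p])
    return pending[:limit]
```

Sorting by the global count steers new ratings toward under-covered pairs, while the `seen` set keeps any one annotator from rating the same pair twice.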

Fix indentation inside Blocks: move event bindings and controls under gr.Blocks context; py_compile passes
bf7288d

Peiran committed on

Ensure no duplicate evaluations: idempotent submit, update pair_state to remove evaluated pair, and clear UI on completion; add graceful returns when no pairs
579bdeb

Peiran committed on

Fix: robust empty-pairs handling, CSV trimming, and Python 3.8+ type hints; improve slider bounds and graceful no-op when no pending pairs
6a51e6d

Peiran committed on

problem fix
6803948

Peiran committed on

UI: switch to English-only labels/messages; minor cleanup. No logic changes.
78b57a9

Peiran committed on

Validation: skip pairs with missing image paths; docs: add Updates section describing pairing/scheduling, storage, fairness, and config (AGENTS.md updated but intentionally ignored)
9f6abf4

Peiran committed on

Pairing improvements: filter already-evaluated pairs from /data, round-robin schedule across test_ids, alternate A/B order per pair; ensure submit maps scores to correct model columns and auto-advance
88f2a10

Peiran committed on
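The pairing changes in 88f2a10 (filter evaluated pairs, round-robin across test_ids, alternate A/B order) can be sketched as follows. This is one possible reading, not the code in app.py: the `(test_id, model_1, model_2)` tuple shape and the function name are assumptions, and "alternate A/B order" is interpreted here as swapping the two models on every second scheduled pair.

```python
from collections import defaultdict, deque
from typing import Deque, Dict, Iterable, List, Set, Tuple

Pair = Tuple[str, str, str]  # (test_id, model_1, model_2), a hypothetical shape


def schedule_pairs(pairs: Iterable[Pair], evaluated: Set[Pair]) -> List[Pair]:
    """Order pending pairs round-robin by test_id, alternating A/B placement."""
    # Drop already-evaluated pairs; keep one FIFO queue per test_id.
    queues: Dict[str, Deque[Pair]] = defaultdict(deque)
    for pair in pairs:
        if pair not in evaluated:
            queues[pair[0]].append(pair)

    ordered: List[Pair] = []
    # Round-robin: take one pair from each test_id in turn until all queues drain.
    while queues:
        for test_id in list(queues):
            ordered.append(queues[test_id].popleft())
            if not queues[test_id]:
                del queues[test_id]

    # Swap which model is shown as A on every second scheduled pair.
    return [
        (t, m2, m1) if i % 2 else (t, m1, m2)
        for i, (t, m1, m2) in enumerate(ordered)
    ]
```

Because the display order can be swapped, the submit step has to map the A and B scores back to the correct model columns, which is the second half of what this commit message describes.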

Improve Hub upload robustness: return detailed error messages to UI and print exceptions to logs
b25a877

Peiran committed on

Persist evals to /data CSV and upload per-submission JSONL to dataset repo (peiranli0930/VisEval); add UI feedback
43656b3

Peiran committed on

Persist evaluations: write CSV locally and optionally push to Space repo when HF_TOKEN/SPACE_ID available
591d755

Peiran committed on

UI update: mask model info, new layout (original on top, A/B bottom), per-image 4 scores, and CSV schema update
ed54e20

Peiran committed on

Add Scene Composition & Object Insertion evaluation UI
7d7268b

Peiran committed on