Commits · peiranli0930/VisArena

Fix Gradio bindings: remove gr.Request from inputs; rely on implicit request injection. Docs already updated earlier.

12a36fb

Peiran committed on Oct 23

Login required + per-session batch (20), min raters filter (20), periodic reload, spaced repeats, and metrics logging (duration, flat scores); add hidden states and CSV fields; enforce HF-auth annotator id

6109248

Peiran committed on Oct 23

Annotator ID from HF user: remove manual input, extract from gr.Request (X-Forwarded-User/username); wire through handlers; keep per-user dedup + global balancing

1688aaf

Peiran committed on Oct 23

Per-user scheduling + global balancing: add Annotator ID, per-user dedup, count-based prioritization, CSV adds annotator_id; update UI bindings accordingly

64125ec

Peiran committed on Oct 23

Fix indentation inside Blocks: move event bindings and controls under gr.Blocks context; py_compile passes

bf7288d

Peiran committed on Oct 23

Ensure no duplicate evaluations: idempotent submit, update pair_state to remove evaluated pair, and clear UI on completion; add graceful returns when no pairs

579bdeb

Peiran committed on Oct 23

Fix: robust empty-pairs handling, CSV trimming, and Python 3.8+ type hints; improve slider bounds and graceful no-op when no pending pairs

6a51e6d

Peiran committed on Oct 23

problem fix

6803948

Peiran committed on Oct 23

UI: switch to English-only labels/messages; minor cleanup. No logic changes.

78b57a9

Peiran committed on Oct 22

Validation: skip pairs with missing image paths; docs: add Updates section describing pairing/scheduling, storage, fairness, and config (AGENTS.md updated but intentionally ignored)

9f6abf4

Peiran committed on Oct 22

Pairing improvements: filter already-evaluated pairs from /data, round-robin schedule across test_ids, alternate A/B order per pair; ensure submit maps scores to correct model columns and auto-advance

88f2a10

Peiran committed on Oct 22

Improve Hub upload robustness: return detailed error messages to UI and print exceptions to logs

b25a877

Peiran committed on Oct 22

Persist evals to /data CSV and upload per-submission JSONL to dataset repo (peiranli0930/VisEval); add UI feedback

43656b3

Peiran committed on Oct 22

Persist evaluations: write CSV locally and optionally push to Space repo when HF_TOKEN/SPACE_ID available

591d755

Peiran committed on Oct 22

UI update: mask model info, new layout (original on top, A/B bottom), per-image 4 scores, and CSV schema update

ed54e20

Peiran committed on Oct 22

Add Scene Composition & Object Insertion evaluation UI

7d7268b

Peiran committed on Oct 22
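
Note on commit 88f2a10: the pairing behavior it describes (filter already-evaluated pairs, round-robin across test_ids, alternate A/B order) could look roughly like the sketch below. This is not the repository's code. The names `pair_key` and `schedule_pairs`, the tuple layout, and the choice to swap A/B on every other scheduled pair are all assumptions made for illustration.

```python
from collections import defaultdict, deque
from typing import Deque, Dict, Iterable, List, Set, Tuple

# Hypothetical layout: (test_id, model shown as A, model shown as B).
Pair = Tuple[str, str, str]


def pair_key(test_id: str, model_x: str, model_y: str) -> Pair:
    """Order-independent key, so (m1, m2) and (m2, m1) count as the same pair."""
    first, second = sorted((model_x, model_y))
    return (test_id, first, second)


def schedule_pairs(candidates: Iterable[Pair], evaluated: Set[Pair]) -> List[Pair]:
    """Drop evaluated pairs, then interleave the rest round-robin by test_id."""
    queues: Dict[str, Deque[Tuple[str, str]]] = defaultdict(deque)
    for test_id, m1, m2 in candidates:
        if pair_key(test_id, m1, m2) not in evaluated:
            queues[test_id].append((m1, m2))

    ordered: List[Pair] = []
    while queues:
        # Iterate over a snapshot of the keys so exhausted queues can be deleted.
        for test_id in list(queues):
            m1, m2 = queues[test_id].popleft()
            # Assumed A/B alternation: swap sides on every other scheduled pair.
            if len(ordered) % 2 == 1:
                m1, m2 = m2, m1
            ordered.append((test_id, m1, m2))
            if not queues[test_id]:
                del queues[test_id]
    return ordered


candidates = [
    ("t1", "m1", "m2"),
    ("t1", "m1", "m3"),
    ("t2", "m1", "m2"),
    ("t2", "m2", "m3"),
]
# Pretend ("t1", m1 vs m2) was already rated, recorded in the opposite order.
evaluated = {pair_key("t1", "m2", "m1")}
schedule = schedule_pairs(candidates, evaluated)
```

With an order-independent key, a pair rated as (m2, m1) is still recognized when it reappears as (m1, m2), which matters once A/B sides are swapped between presentations.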
