Spaces: JunsWan/HardcoreLogic

JunsWan committed on Oct 13

Commit 375b998 (verified) · 1 Parent(s): 1e754f8

Update README.md


Files changed (1)

README.md +23 -47


@@ -1,8 +1,5 @@
 ---
-title: Zebra Logic Bench
-emoji: 🦓
-colorFrom: blue
-colorTo: yellow
 sdk: gradio
 sdk_version: 4.19.2
 app_file: app.py
@@ -13,49 +10,28 @@ api: false
 tags:
     - leaderboard
 datasets:
-    - allenai/ZebraLogicBench
-    - WildEval/ZebraLogic
 models:
-    - Qwen/Qwen2-72B-Instruct
-    - Qwen/Qwen1.5-72B-Chat
-    - Qwen/Qwen1.5-7B-Chat
-    - meta-llama/Meta-Llama-3-8B-Instruct
-    - meta-llama/Meta-Llama-3-70B-Instruct
-    - meta-llama/Llama-2-13b-chat-hf
-    - meta-llama/Llama-2-70b-chat-hf
-    - meta-llama/Llama-2-7b-chat-hf
-    - mistralai/Mistral-7B-Instruct-v0.1
-    - mistralai/Mistral-7B-Instruct-v0.2
-    - mistralai/Mixtral-8x7B-Instruct-v0.1
-    - microsoft/Phi-3-medium-128k-instruct
-    - microsoft/Phi-3-mini-128k-instruct
-    - NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
-    - NousResearch/Hermes-2-Theta-Llama-3-8B
-    - 01-ai/Yi-1.5-34B-Chat
-    - 01-ai/Yi-1.5-9B-Chat
-    - 01-ai/Yi-1.5-6B-Chat
-    - google/gemma-7b-it
-    - google/gemma-2b-it
-    - allenai/tulu-2-dpo-70b
-    - HuggingFaceH4/zephyr-7b-beta
-    - Nexusflow/Starling-LM-7B-beta
-    - databricks/dbrx-instruct
-    - princeton-nlp/Llama-3-Instruct-8B-SimPO
-    - chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO
-    - chujiezheng/Starling-LM-7B-beta-ExPO
-    - ZhangShenao/SELM-Zephyr-7B-iter-3
-    - deepseek-ai/DeepSeek-V2-Chat
-    - m-a-p/neo_7b_instruct_v0.1
-    - 01-ai/Yi-34B-chat
-    - lmsys/vicuna-13b-v1.5
-    - HuggingFaceH4/zephyr-7b-gemma-v0.1
-    - deepseek-ai/DeepSeek-Coder-V2
-    - THUDM/glm-4-9b-chat
-    - chujiezheng/neo_7b_instruct_v0.1-ExPO
-    - ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
-Paper: arxiv.org/abs/2406.04770
-Paper: arxiv.org/abs/2502.01100

 ---
+title: HardcoreLogic Bench
 sdk: gradio
 sdk_version: 4.19.2
 app_file: app.py
 tags:
     - leaderboard
 datasets:
+    - xhWu-fd/HardcoreLogic
 models:
+    - Qwen/Qwen3-8B
+    - Qwen/Qwen3-30B-A3B-Thinking-2507
+    - Qwen/Qwen3-32B
+    - Qwen/Qwen3-Next-80B-A3B-Thinking
+    - Qwen/Qwen3-235B-A22B-Thinking-2507
+    - MiniMaxAI/MiniMax-M1-40k
+    - deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
+    - deepseek-ai/DeepSeek-V3.1
+    - deepseek-ai/DeepSeek-R1-0528
+    - zai-org/GLM-4.5
+    - moonshotai/Kimi-K2-Instruct
+    - ByteDance-Seed/Seed-OSS-36B-Instruct
+    - openai/gpt-oss-120b
+    - gpt-5
+    - gpt-5-mini
+    - o4-mini
+    - grok-4
+    - gemini-2.5-pro
+    - grok-3-mini
+    - claude-sonnet-4-thinking
+    - gemini-2.5-flash
 ---
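
The updated `models` list mixes Hub-style `org/name` identifiers (for example `Qwen/Qwen3-8B`) with bare names such as `gpt-5` and `o4-mini`. A minimal sketch, assuming one wants to separate the two groups before linking entries to model pages: the `FRONT_MATTER` excerpt, the `REPO_ID` pattern, and the helper names `list_items` and `split_models` are all hypothetical and not part of this commit, and the hand-rolled parser only handles the flat list layout shown in the diff.

```python
import re

# Hypothetical excerpt of the new front matter; only a few entries are reproduced.
FRONT_MATTER = """\
---
title: HardcoreLogic Bench
sdk: gradio
datasets:
    - xhWu-fd/HardcoreLogic
models:
    - Qwen/Qwen3-8B
    - deepseek-ai/DeepSeek-V3.1
    - gpt-5
    - o4-mini
---
"""

# Assumed shape of a Hub-style identifier: "<owner>/<name>".
REPO_ID = re.compile(r"^[\w.-]+/[\w.-]+$")


def list_items(text, key):
    """Collect the '- item' lines that directly follow `key:` in a flat YAML block."""
    items, inside = [], False
    for line in text.splitlines():
        stripped = line.strip()
        if stripped == f"{key}:":
            inside = True
        elif inside and stripped.startswith("- "):
            items.append(stripped[2:].strip())
        elif inside:
            break  # any other line ends the list
    return items


def split_models(text):
    """Return (hub_style_ids, bare_names) from the `models` list."""
    models = list_items(text, "models")
    hub = [m for m in models if REPO_ID.match(m)]
    bare = [m for m in models if not REPO_ID.match(m)]
    return hub, bare


hub_ids, bare_names = split_models(FRONT_MATTER)
print("Hub-style IDs:", hub_ids)
print("Bare names:", bare_names)
```

For anything beyond this flat layout, a real YAML parser would be a safer choice than the line-based helper above.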