Yi Cui

onekq
https://onekq.ai
  • onekq_ai
  • onekq
  • yicui

AI & ML interests

Benchmark, Code Generation Model

Recent Activity

updated a Space about 11 hours ago
onekq-ai/WebApp1K-models-leaderboard
posted an update about 20 hours ago
If RAG (by which I mean vectors and embeddings) transitions from QA to agents, is scalability (from Wikipedia to personal memory) still an issue? What will the new challenges be? Would anyone care to share their experience?
posted an update 2 days ago
No SOTA from gpt5 codex https://huggingface.co/spaces/onekq-ai/WebApp1K-models-leaderboard

Organizations

MLX Community, ONEKQ AI

authored a paper 6 months ago

Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation

Paper • 2505.09027 • Published May 13
authored 3 papers about 1 year ago

A Case Study of Web App Coding with OpenAI Reasoning Models

Paper • 2409.13773 • Published Sep 19, 2024 • 7

WebApp1K: A Practical Code-Generation Benchmark for Web App Development

Paper • 2408.00019 • Published Jul 30, 2024 • 2

Insights from Benchmarking Frontier Language Models on Web App Code Generation

Paper • 2409.05177 • Published Sep 8, 2024 • 8