Hugging Face Party @ PyTorch Conference

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

mbrack authored a paper 25 days ago

UniFusion: Vision-Language Model as Unified Encoder in Image Generation

xiyang99 authored a paper about 1 month ago

CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning

xiyang99 authored a paper about 1 month ago

HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation

View all activity

csabakecskemeti

posted an update 22 days ago

Post

2526

Christmas came early this year

3 replies

·

mbrack

authored a paper 25 days ago

UniFusion: Vision-Language Model as Unified Encoder in Image Generation

Paper • 2510.12789 • Published 25 days ago • 16

osanseviero

authored a paper about 1 month ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24 • 39

xiyang99

authored 13 papers about 1 month ago

CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning

Paper • 2401.14011 • Published Jan 25, 2024

HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation

Paper • 2406.07070 • Published Jun 11, 2024

MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding

Paper • 2406.04264 • Published Jun 6, 2024 • 2

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 95

CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition

Paper • 2502.18913 • Published Feb 26

SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors

Paper • 2503.16578 • Published Mar 20

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Paper • 2505.11842 • Published May 17 • 1

EmotionTalk: An Interactive Chinese Multimodal Emotion Dataset With Rich Annotations

Paper • 2505.23018 • Published May 29

RoboBrain 2.0 Technical Report

Paper • 2507.02029 • Published Jul 2 • 33

Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information

Paper • 2508.11252 • Published Aug 15 • 3

RealTalk-CN: A Realistic Chinese Speech-Text Dialogue Benchmark With Cross-Modal Interaction Analysis

Paper • 2508.10015 • Published Aug 6

Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning

Paper • 2508.02178 • Published Aug 4

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21 • 13

lysandre

posted an update about 2 months ago

Post

6443

We're kick-starting the process of Transformers v5, with @ArthurZ and @cyrilvallez !

v5 should be significant: we're using it as a milestone for performance optimizations, saner defaults, and a much cleaner code base worthy of 2025.

Fun fact: v4.0.0-rc-1 came out on Nov 19, 2020, nearly five years ago!

6 replies

·

clem

posted an update 3 months ago

Post

4311

Thread to gossip during the

openai GPT-5 livestream: https://www.youtube.com/watch?v=0Uu_VJeVVfo. Feel free to post your impressions below!

29 replies

·

JingzeShi

authored a paper 3 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 17

wubingheng

authored a paper 3 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 17