45 55 180

Di Zhang

di-zhang-fdu

https://scholar.google.com/citations?user=vxAO250AAAAJ&hl=en

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

posted an update 5 days ago

Let-BERT-SPEAK: Training-Free Block Diffusion Language Model with BERT Code: https://github.com/trotsky1997/Let-BERT-SPEAK/blob/main/generate.py Blog: https://trotsky1997.notion.site/Let-BERT-SPEAK-Training-Free-Block-Diffusion-Language-Model-with-BERT-2a2bbfcc4cdf802aa67dcba6a02a0c9f

authored a paper 7 days ago

Chem-R: Learning to Reason as a Chemist

updated a dataset 13 days ago

di-zhang-fdu/chemvlm-sft-datasets

View all activity

Organizations

upvoted a paper 3 months ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25 • 48

upvoted an article 3 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

Aug 11

• 75

upvoted 3 papers 3 months ago

upvoted a collection 3 months ago

VLM-R1

Collection

Multimodal Reasoning Dataset for Large Scale Training with DeepSeek-R1 thoughts style • 18 items • Updated Apr 14 • 2

upvoted 2 collections 4 months ago

RefGPT Datasets

Collection

A large-scale dialogue dataset with references. • 6 items • Updated May 17, 2024 • 4

RLVR

Collection

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated Mar 31 • 13

upvoted an article 4 months ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

and 11 others •

Jun 27

• 29

upvoted 3 papers 5 months ago

Control-R: Towards controllable test-time scaling

Paper • 2506.00189 • Published May 30 • 6

AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

Paper • 2506.05328 • Published Jun 5 • 20

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 139

upvoted 6 papers 6 months ago

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Paper • 2505.17873 • Published May 23 • 30

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 56

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 37

A Preliminary Study for GPT-4o on Image Restoration

Paper • 2505.05621 • Published May 8 • 11

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 41

AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG

Paper • 2504.14858 • Published Apr 21 • 4

upvoted 2 papers 7 months ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21 • 67

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Paper • 2504.10368 • Published Apr 14 • 21

Di Zhang

AI & ML interests

Recent Activity

Organizations

di-zhang-fdu's activity

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub