2 20

zhijie deng PRO

zhijie3

https://thudzj.github.io/

thudzj

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

upvoted a paper 3 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

updated a Space 3 months ago

zhijie3/D2F-LLaDA-Instruct-8B

View all activity

Organizations

upvoted a paper 28 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published about 1 month ago • 118

upvoted a paper 3 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 142

updated a Space 3 months ago

D2F LLaDA Instruct 8B

👁

Diffusion LLMs Can Do Faster-Than-AR Inference via Discret

upvoted a paper 3 months ago

Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing

Paper • 2508.09192 • Published Aug 8 • 30

published a Space 3 months ago

D2F LLaDA Instruct 8B

👁

Diffusion LLMs Can Do Faster-Than-AR Inference via Discret

upvoted a paper 4 months ago

Scaling Speculative Decoding with Lookahead Reasoning

Paper • 2506.19830 • Published Jun 24 • 12

upvoted 3 papers 6 months ago

LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks

Paper • 2506.00411 • Published May 31 • 31

Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Paper • 2505.19949 • Published May 26 • 16

Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition

Paper • 2505.19788 • Published May 26 • 13

upvoted a paper 7 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21 • 47

authored 3 papers 8 months ago

commented a paper 8 months ago

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1 • 66 •

upvoted a paper 8 months ago

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1 • 66

commented a paper 8 months ago

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1 • 66 •

upvoted 2 papers 9 months ago

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published Feb 19 • 32

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 193

authored 2 papers 9 months ago

Learning Neural Eigenfunctions for Unsupervised Semantic Segmentation

Paper • 2304.02841 • Published Apr 6, 2023

Online Speculative Decoding

Paper • 2310.07177 • Published Oct 11, 2023 • 3

zhijie deng PRO

AI & ML interests

Recent Activity

Organizations

zhijie3's activity

D2F LLaDA Instruct 8B

D2F LLaDA Instruct 8B