ltl's picture

ltl

ltl

·

2793145003

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

upvoted a paper 8 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

upvoted a paper 8 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published 3 days ago • 90

upvoted 2 papers 8 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 13 days ago • 191

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published 12 days ago • 50

upvoted 2 papers about 1 month ago

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published Oct 15 • 70

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 531

upvoted 2 papers 2 months ago

Set Block Decoding is a Language Model Inference Accelerator

Paper • 2509.04185 • Published Sep 4 • 52

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 192

upvoted 2 papers 4 months ago

PixNerd: Pixel Neural Field Diffusion

Paper • 2507.23268 • Published Jul 31 • 51

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 132

upvoted 3 papers 5 months ago

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19 • 88

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 126

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16 • 26

upvoted 8 papers 6 months ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 109

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 97

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 29

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published May 12 • 30

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Paper • 2505.03733 • Published May 6 • 17

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12 • 45

Learning Dynamics in Continual Pre-Training for Large Language Models

Paper • 2505.07796 • Published May 12 • 19

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82