The second version of the omnimodal large model Uni-MoE
Papers
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Organization Card
Text Machine Group (TMG) from Harbin Institute of Technology (Shenzhen). 🔥
Large-Model-Based Multimodal Agents for Long Video Generation: https://github.com/HITsz-TMG/Anim-Director; https://github.com/HITsz-TMG/FilmAgent
- Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
  Paper • 2408.09787 • Published • 10
- AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
  Paper • 2506.10540 • Published • 37
- FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces
  Paper • 2501.12909 • Published • 71
Models (28)
HIT-TMG/Uni-MoE-2.0-Image • 33B • Updated • 67
HIT-TMG/Uni-MoE-2.0-Omni • 33B • Updated • 211
HIT-TMG/Uni-MoE-2.0-Thinking • 28B • Updated • 7
HIT-TMG/Uni-MoE-2.0-Base • 28B • Updated • 13 • 1
HIT-TMG/Uni-MoE-TTS • Updated
HIT-TMG/UniMoE-Audio-Preview • 7B • Updated • 95 • 8
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v2 • Feature Extraction • 0.5B • Updated • 805 • 30
HIT-TMG/EviOmni-nq_train-1.5B • Question Answering • 2B • Updated • 17 • 5
HIT-TMG/EviOmni-nq_train-7B • Question Answering • 8B • Updated • 8 • 2
HIT-TMG/CIGEval-Qwen2.5-VL-7B-Instruct-sft • 8B • Updated • 6
Datasets (6)
HIT-TMG/KaLM-embedding-pretrain-data • Viewer • Updated • 23.7M • 880 • 12
HIT-TMG/CIGEval_sft_data • Viewer • Updated • 6.63k • 108
HIT-TMG/YiZhao • Viewer • Updated • 47.5M • 498 • 6
HIT-TMG/MultiSkill • Viewer • Updated • 1k • 28
HIT-TMG/TruthReader_RAG_train • Viewer • Updated • 7.16k • 37 • 6
HIT-TMG/Hansel • Viewer • Updated • 7.81M • 6.72k • 8