Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published 11 days ago • 43
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published 25 days ago • 26
DocReward: A Document Reward Model for Structuring and Stylizing Paper • 2510.11391 • Published Oct 13 • 27
Language Is Not All You Need: Aligning Perception with Language Models Paper • 2302.14045 • Published Feb 27, 2023
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation Paper • 2303.08518 • Published Mar 15, 2023
Adapting Large Language Models via Reading Comprehension Paper • 2309.09530 • Published Sep 18, 2023 • 81
Kosmos-G: Generating Images in Context with Multimodal Large Language Models Paper • 2310.02992 • Published Oct 4, 2023 • 4
LayoutLM: Pre-training of Text and Layout for Document Image Understanding Paper • 1912.13318 • Published Dec 31, 2019 • 4
Democratizing Reasoning Ability: Tailored Learning from Large Language Model Paper • 2310.13332 • Published Oct 20, 2023 • 16
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers Paper • 2012.15828 • Published Dec 31, 2020 • 1
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment Paper • 2106.06381 • Published Jun 11, 2021
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders Paper • 2106.13736 • Published Jun 25, 2021
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training Paper • 2109.07306 • Published Sep 15, 2021