To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing Paper • 2310.07715 • Published Oct 11, 2023
Scalable Data Ablation Approximations for Language Models through Modular Training and Merging Paper • 2410.15661 • Published Oct 21, 2024 • 1