FlashPack β a new, high-throughput file format and loading mechanism for PyTorch that makes model checkpoint I/O blazingly fast, even on systems without access to GPU Direct Storage (GDS).
With FlashPack, loading any model can be 3β6Γ faster than with the current state-of-the-art methods like accelerate or the standard load_state_dict() and to() flow β all wrapped in a lightweight, pure-Python package that works anywhere.