ZeroGPU AoTI
community
AI & ML interests
AoT compilation, ZeroGPU inference optimization
Recent Activity
View all activity
Enlists the resources to serialize and load compiled graph modules from the Hub to skip compilation time π₯
Creative applications and accelerated demos with QwenImageEdit
optimized demo for Flux kontext [dev], using FP8 quantization and AoT compilation
Enlists the resources to serialize and load compiled graph modules from the Hub to skip compilation time π₯
optimized demos for Wan 2.2 14B models, using FP8 quantization + AoT compilation & community LoRAs for fast & high quality inference on ZeroGPU π¨
Creative applications and accelerated demos with QwenImageEdit
Compare AoTI vs. base version
optimized demo for Flux kontext [dev], using FP8 quantization and AoT compilation