view article Article Running Large Transformer Models on Mobile and Edge Devices By tugrulkaya • 5 days ago • 10
view article Article Sensitivity Aware Mixed Precision Quantization V1 By badaoui and 1 other • Jun 13 • 25