HunyuanVideo-Foley / fp8info.txt
Inspecting tensors in: hunyuanvideo_foley_fp8_e5m2.safetensors
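The two size columns in the table below appear to be related only by per-element byte width: an F8_E5M2 tensor stores 1 byte per element and a BF16 tensor 2 bytes, against 4 bytes for FP32. The following is a minimal sketch of that arithmetic, not the script that produced this listing. The helper names (`fp32_equivalent_mb`, `parse_row`, `row_is_consistent`) and the rounding tolerance are hypothetical, chosen for illustration.

```python
# Bytes per element for the dtype labels used in the table.
BYTES_PER_ELEMENT = {"F32": 4, "BF16": 2, "F16": 2, "F8_E5M2": 1, "F8_E4M3": 1}


def fp32_equivalent_mb(actual_mb, dtype):
    """Size the same tensor would occupy if every element were stored as FP32."""
    return actual_mb * BYTES_PER_ELEMENT["F32"] / BYTES_PER_ELEMENT[dtype]


def parse_row(line):
    """Split one 'name | dtype | actual | fp32' row into typed fields."""
    name, dtype, actual, fp32 = (field.strip() for field in line.split("|"))
    return name, dtype, float(actual), float(fp32)


def row_is_consistent(line, tolerance=0.003):
    """Check the reported FP32 size against the value implied by the dtype.

    The tolerance is an assumption: it absorbs the three-decimal rounding
    of both columns, which matters for small tensors such as biases.
    """
    _, dtype, actual, reported = parse_row(line)
    return abs(fp32_equivalent_mb(actual, dtype) - reported) <= tolerance


# Three rows copied from the table.
rows = [
    "single_blocks.0.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000",
    "cond_in.linear_2.weight | BF16 | 4.500 | 9.000",
    "single_blocks.0.modulation.linear.bias | BF16 | 0.018 | 0.035",
]
for row in rows:
    print(parse_row(row)[0], row_is_consistent(row))
```

Rows that list 0.000 in both columns are tensors too small to register at three decimal places, so the check passes trivially for them.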
Tensor Name | Precision (dtype) | Actual Size (MB) | FP32 Equiv. (MB)
--------------------------------------------|-------------------|------------------|-------------------
audio_embedder.proj.bias | BF16 | 0.003 | 0.006
audio_embedder.proj.weight | BF16 | 0.375 | 0.750
cond_in.linear_1.bias | BF16 | 0.003 | 0.006
cond_in.linear_1.weight | BF16 | 2.250 | 4.500
cond_in.linear_2.bias | BF16 | 0.003 | 0.006
cond_in.linear_2.weight | BF16 | 4.500 | 9.000
empty_clip_feat | BF16 | 0.001 | 0.003
empty_sync_feat | BF16 | 0.001 | 0.003
final_layer.adaLN_modulation.1.bias | BF16 | 0.006 | 0.012
final_layer.adaLN_modulation.1.weight | BF16 | 9.000 | 18.000
final_layer.linear.bias | BF16 | 0.000 | 0.000
final_layer.linear.weight | BF16 | 0.375 | 0.750
single_blocks.0.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.0.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.0.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.0.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.0.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.0.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.0.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.0.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.0.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.0.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.0.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.1.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.1.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.1.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.1.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.1.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.1.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.1.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.1.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.1.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.1.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.1.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.10.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.10.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.10.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.10.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.10.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.10.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.10.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.10.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.10.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.10.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.10.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.11.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.11.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.11.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.11.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.11.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.11.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.11.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.11.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.11.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.11.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.11.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.12.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.12.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.12.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.12.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.12.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.12.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.12.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.12.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.12.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.12.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.12.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.13.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.13.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.13.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.13.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.13.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.13.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.13.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.13.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.13.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.13.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.13.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.14.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.14.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.14.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.14.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.14.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.14.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.14.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.14.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.14.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.14.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.14.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.15.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.15.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.15.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.15.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.15.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.15.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.15.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.15.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.15.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.15.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.15.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.16.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.16.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.16.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.16.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.16.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.16.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.16.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.16.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.16.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.16.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.16.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.17.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.17.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.17.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.17.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.17.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.17.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.17.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.17.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.17.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.17.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.17.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.18.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.18.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.18.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.18.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.18.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.18.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.18.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.18.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.18.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.18.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.18.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.19.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.19.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.19.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.19.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.19.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.19.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.19.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.19.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.19.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.19.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.19.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.2.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.2.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.2.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.2.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.2.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.2.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.2.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.2.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.2.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.2.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.2.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.20.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.20.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.20.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.20.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.20.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.20.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.20.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.20.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.20.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.20.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.20.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.21.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.21.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.21.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.21.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.21.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.21.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.21.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.21.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.21.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.21.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.21.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.22.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.22.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.22.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.22.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.22.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.22.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.22.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.22.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.22.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.22.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.22.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.23.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.23.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.23.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.23.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.23.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.23.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.23.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.23.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.23.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.23.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.23.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.24.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.24.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.24.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.24.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.24.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.24.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.24.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.24.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.24.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.24.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.24.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.25.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.25.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.25.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.25.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.25.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.25.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.25.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.25.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.25.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.25.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.25.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.26.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.26.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.26.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.26.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.26.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.26.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.26.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.26.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.26.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.26.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.26.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.27.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.27.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.27.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.27.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.27.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.27.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.27.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.27.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.27.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.27.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.27.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.28.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.28.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.28.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.28.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.28.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.28.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.28.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.28.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.28.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.28.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.28.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.29.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.29.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.29.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.29.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.29.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.29.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.29.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.29.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.29.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.29.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.29.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.3.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.3.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.3.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.3.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.3.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.3.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.3.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.3.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.3.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.3.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.3.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.30.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.30.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.30.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.30.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.30.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.30.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.30.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.30.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.30.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.30.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.30.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.31.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.31.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.31.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.31.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.31.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.31.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.31.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.31.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.31.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.31.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.31.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.32.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.32.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.32.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.32.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.32.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.32.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.32.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.32.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.32.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.32.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.32.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.33.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.33.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.33.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.33.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.33.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.33.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.33.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.33.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.33.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.33.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.33.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.34.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.34.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.34.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.34.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.34.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.34.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.34.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.34.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.34.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.34.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.34.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.35.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.35.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.35.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.35.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.35.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.35.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.35.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.35.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.35.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.35.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.35.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.4.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.4.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.4.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.4.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.4.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.4.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.4.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.4.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.4.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.4.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.4.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.5.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.5.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.5.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.5.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.5.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.5.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.5.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.5.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.5.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.5.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.5.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.6.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.6.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.6.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.6.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.6.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.6.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.6.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.6.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.6.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.6.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.6.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.7.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.7.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.7.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.7.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.7.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.7.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.7.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.7.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.7.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.7.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.7.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.8.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.8.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.8.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.8.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.8.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.8.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.8.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.8.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.8.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.8.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.8.q_norm.weight | BF16 | 0.000 | 0.000
single_blocks.9.k_norm.weight | BF16 | 0.000 | 0.000
single_blocks.9.linear1.bias | BF16 | 0.003 | 0.006
single_blocks.9.linear1.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.9.linear2.w1.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.9.linear2.w2.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.9.linear2.w3.weight | F8_E5M2 | 18.000 | 72.000
single_blocks.9.linear_qkv.bias | BF16 | 0.009 | 0.018
single_blocks.9.linear_qkv.weight | F8_E5M2 | 6.750 | 27.000
single_blocks.9.modulation.linear.bias | BF16 | 0.018 | 0.035
single_blocks.9.modulation.linear.weight | F8_E5M2 | 13.500 | 54.000
single_blocks.9.q_norm.weight | BF16 | 0.000 | 0.000
sync_in.0.bias | BF16 | 0.003 | 0.006
sync_in.0.weight | BF16 | 2.250 | 4.500
sync_in.2.w1.weight | BF16 | 12.000 | 24.000
sync_in.2.w2.weight | BF16 | 12.000 | 24.000
sync_in.2.w3.weight | BF16 | 12.000 | 24.000
sync_pos_emb | BF16 | 0.012 | 0.023
time_in.mlp.0.bias | BF16 | 0.003 | 0.006
time_in.mlp.0.weight | BF16 | 0.750 | 1.500
time_in.mlp.2.bias | BF16 | 0.003 | 0.006
time_in.mlp.2.weight | BF16 | 4.500 | 9.000
triple_blocks.0.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.0.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.0.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.0.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.0.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.0.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.0.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.0.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.0.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.0.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.0.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.0.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.0.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.0.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.0.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.0.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.0.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.0.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.0.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.0.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.0.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.0.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.0.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.0.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.0.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.0.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.0.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.0.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.0.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.0.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.0.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.1.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.1.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.1.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.1.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.1.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.1.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.1.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.1.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.1.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.1.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.1.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.1.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.1.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.1.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.1.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.1.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.1.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.1.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.1.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.1.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.1.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.1.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.1.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.1.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.1.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.1.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.1.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.1.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.1.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.1.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.1.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.10.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.10.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.10.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.10.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.10.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.10.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.10.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.10.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.10.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.10.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.10.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.10.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.10.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.10.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.10.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.10.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.10.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.10.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.10.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.10.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.10.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.10.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.10.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.10.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.10.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.10.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.10.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.10.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.10.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.10.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.10.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.11.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.11.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.11.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.11.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.11.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.11.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.11.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.11.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.11.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.11.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.11.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.11.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.11.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.11.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.11.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.11.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.11.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.11.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.11.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.11.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.11.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.11.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.11.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.11.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.11.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.11.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.11.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.11.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.11.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.11.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.11.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.12.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.12.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.12.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.12.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.12.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.12.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.12.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.12.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.12.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.12.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.12.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.12.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.12.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.12.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.12.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.12.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.12.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.12.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.12.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.12.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.12.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.12.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.12.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.12.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.12.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.12.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.12.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.12.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.12.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.12.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.12.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.13.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.13.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.13.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.13.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.13.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.13.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.13.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.13.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.13.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.13.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.13.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.13.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.13.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.13.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.13.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.13.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.13.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.13.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.13.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.13.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.13.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.13.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.13.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.13.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.13.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.13.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.13.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.13.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.13.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.13.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.13.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.14.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.14.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.14.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.14.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.14.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.14.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.14.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.14.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.14.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.14.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.14.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.14.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.14.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.14.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.14.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.14.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.14.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.14.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.14.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.14.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.14.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.14.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.14.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.14.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.14.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.14.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.14.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.14.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.14.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.14.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.14.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.15.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.15.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.15.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.15.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.15.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.15.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.15.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.15.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.15.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.15.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.15.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.15.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.15.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.15.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.15.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.15.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.15.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.15.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.15.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.15.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.15.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.15.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.15.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.15.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.15.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.15.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.15.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.15.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.15.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.15.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.15.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.16.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.16.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.16.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.16.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.16.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.16.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.16.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.16.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.16.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.16.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.16.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.16.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.16.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.16.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.16.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.16.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.16.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.16.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.16.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.16.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.16.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.16.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.16.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.16.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.16.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.16.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.16.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.16.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.16.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.16.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.16.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.17.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.17.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.17.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.17.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.17.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.17.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.17.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.17.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.17.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.17.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.17.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.17.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.17.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.17.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.17.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.17.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.17.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.17.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.17.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.17.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.17.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.17.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.17.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.17.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.17.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.17.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.17.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.17.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.17.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.17.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.17.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.2.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.2.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.2.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.2.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.2.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.2.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.2.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.2.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.2.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.2.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.2.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.2.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.2.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.2.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.2.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.2.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.2.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.2.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.2.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.2.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.2.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.2.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.2.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.2.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.2.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.2.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.2.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.2.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.2.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.2.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.2.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.3.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.3.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.3.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.3.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.3.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.3.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.3.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.3.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.3.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.3.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.3.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.3.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.3.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.3.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.3.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.3.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.3.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.3.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.3.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.3.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.3.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.3.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.3.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.3.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.3.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.3.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.3.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.3.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.3.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.3.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.3.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.4.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.4.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.4.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.4.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.4.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.4.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.4.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.4.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.4.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.4.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.4.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.4.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.4.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.4.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.4.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.4.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.4.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.4.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.4.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.4.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.4.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.4.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.4.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.4.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.4.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.4.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.4.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.4.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.4.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.4.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.4.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.5.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.5.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.5.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.5.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.5.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.5.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.5.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.5.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.5.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.5.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.5.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.5.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.5.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.5.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.5.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.5.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.5.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.5.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.5.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.5.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.5.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.5.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.5.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.5.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.5.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.5.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.5.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.5.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.5.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.5.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.5.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.6.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.6.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.6.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.6.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.6.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.6.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.6.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.6.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.6.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.6.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.6.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.6.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.6.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.6.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.6.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.6.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.6.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.6.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.6.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.6.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.6.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.6.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.6.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.6.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.6.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.6.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.6.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.6.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.6.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.6.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.6.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.7.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.7.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.7.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.7.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.7.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.7.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.7.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.7.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.7.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.7.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.7.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.7.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.7.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.7.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.7.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.7.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.7.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.7.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.7.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.7.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.7.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.7.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.7.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.7.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.7.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.7.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.7.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.7.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.7.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.7.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.7.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.8.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.8.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.8.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.8.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.8.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.8.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.8.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.8.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.8.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.8.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.8.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.8.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.8.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.8.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.8.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.8.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.8.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.8.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.8.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.8.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.8.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.8.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.8.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.8.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.8.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.8.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.8.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.8.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.8.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.8.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.8.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.9.audio_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.9.audio_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.9.audio_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.9.audio_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.9.audio_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.audio_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.9.audio_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.9.audio_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.9.audio_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.9.audio_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.9.audio_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.9.audio_self_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.9.audio_self_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.9.audio_self_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.audio_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.9.audio_self_proj.weight | F8_E5M2 | 2.250 | 9.000
triple_blocks.9.audio_self_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.text_cross_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.text_cross_kv.bias | BF16 | 0.006 | 0.012
triple_blocks.9.text_cross_kv.weight | F8_E5M2 | 4.500 | 18.000
triple_blocks.9.v_cond_attn_k_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.v_cond_attn_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.v_cond_attn_qkv.bias | BF16 | 0.009 | 0.018
triple_blocks.9.v_cond_attn_qkv.weight | F8_E5M2 | 6.750 | 27.000
triple_blocks.9.v_cond_cross_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.9.v_cond_cross_proj.weight | BF16 | 4.500 | 9.000
triple_blocks.9.v_cond_cross_q.bias | BF16 | 0.003 | 0.006
triple_blocks.9.v_cond_cross_q.weight | BF16 | 4.500 | 9.000
triple_blocks.9.v_cond_cross_q_norm.weight | BF16 | 0.000 | 0.000
triple_blocks.9.v_cond_mlp.fc1.bias | BF16 | 0.012 | 0.023
triple_blocks.9.v_cond_mlp.fc1.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.9.v_cond_mlp.fc2.bias | BF16 | 0.003 | 0.006
triple_blocks.9.v_cond_mlp.fc2.weight | F8_E5M2 | 9.000 | 36.000
triple_blocks.9.v_cond_mod.linear.bias | BF16 | 0.026 | 0.053
triple_blocks.9.v_cond_mod.linear.weight | F8_E5M2 | 20.250 | 81.000
triple_blocks.9.v_cond_self_proj.bias | BF16 | 0.003 | 0.006
triple_blocks.9.v_cond_self_proj.weight | F8_E5M2 | 2.250 | 9.000
visual_proj.w1.weight | BF16 | 2.250 | 4.500
visual_proj.w2.weight | BF16 | 4.500 | 9.000
visual_proj.w3.weight | BF16 | 2.250 | 4.500
--- Summary ---
Total tensors found: 1087
Precision distribution:
- BF16 : 673 tensor(s)
- F8_E5M2 : 414 tensor(s)
Total actual size of all tensors: 5094.356 MB
Total theoretical FP32 size of all tensors: 19584.712 MB
Overall size reduction compared to full FP32: 73.99%
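
The figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the inspection script that produced this listing: the function names and the `BYTES_PER_ELEMENT` table are hypothetical, and it assumes the FP32 column is simply the actual size scaled by the ratio of element widths (1 byte for F8_E5M2, 2 for BF16, 4 for FP32).

```python
# Hypothetical helpers for sanity-checking the table; not part of the original script.
# Assumed element widths in bytes for the dtypes that appear in the listing.
BYTES_PER_ELEMENT = {"F8_E5M2": 1, "BF16": 2, "F32": 4}


def fp32_equivalent_mb(actual_mb: float, dtype: str) -> float:
    """Scale a tensor's stored size to what it would occupy in FP32."""
    return actual_mb * BYTES_PER_ELEMENT["F32"] / BYTES_PER_ELEMENT[dtype]


def size_reduction_percent(actual_mb: float, fp32_mb: float) -> float:
    """Percentage saved relative to storing everything in FP32."""
    return (1.0 - actual_mb / fp32_mb) * 100.0


if __name__ == "__main__":
    # triple_blocks.9.v_cond_mod.linear.weight: 20.250 MB stored as F8_E5M2
    print(fp32_equivalent_mb(20.250, "F8_E5M2"))
    # visual_proj.w2.weight: 4.500 MB stored as BF16
    print(fp32_equivalent_mb(4.500, "BF16"))
    # Totals from the summary
    print(round(size_reduction_percent(5094.356, 19584.712), 2))
    # Tensor counts from the precision distribution
    print(673 + 414)
```

Under those assumptions the per-row FP32 values (81.000 and 9.000), the 73.99% reduction, and the 1087 tensor total all agree with the listing.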