Public / onnxruntime / 155e22d1ab7


Tracy Sharpe authored and GitHub committed 155e22d1ab7 on 13 Jun 2020

MLAS: fuse float output into quantized GEMM (#4215)

Add more variants of MlasGemm that perform a u8x8 GEMM and produce float output. This fuses the common sequence MatMulInteger + Cast + Mul(OutputScale) + optional Add(BiasVector) into the GEMM itself.
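To illustrate what the fusion saves, here is a minimal pure-Python sketch of the operator sequence named in the commit message, next to a single-pass version that scales and biases each accumulator as it is produced. This is not the MLAS implementation or its API: the function names (`matmul_integer`, `unfused_sequence`, `fused_gemm_float`) are hypothetical, zero points are assumed to be zero, the scale is assumed to be a single scalar, and the bias is assumed to be one value per output column.

```python
def matmul_integer(a, b, m, n, k):
    """Integer GEMM: row-major M x K times K x N, returning integer accumulators."""
    c = [0] * (m * n)
    for i in range(m):
        for j in range(n):
            acc = 0
            for p in range(k):
                acc += a[i * k + p] * b[p * n + j]
            c[i * n + j] = acc
    return c


def unfused_sequence(a, b, m, n, k, scale, bias=None):
    """MatMulInteger -> Cast -> Mul(scale) -> optional Add(bias) as separate passes."""
    acc = matmul_integer(a, b, m, n, k)      # MatMulInteger
    out = [float(v) for v in acc]            # Cast to float
    out = [v * scale for v in out]           # Mul(OutputScale)
    if bias is not None:                     # optional Add(BiasVector)
        out = [out[i * n + j] + bias[j] for i in range(m) for j in range(n)]
    return out


def fused_gemm_float(a, b, m, n, k, scale, bias=None):
    """Same arithmetic, but each accumulator is scaled and biased once, in place,
    so no intermediate integer or float tensors are materialized."""
    out = [0.0] * (m * n)
    for i in range(m):
        for j in range(n):
            acc = 0
            for p in range(k):
                acc += a[i * k + p] * b[p * n + j]
            value = float(acc) * scale
            if bias is not None:
                value += bias[j]  # assumption: bias is indexed by output column
            out[i * n + j] = value
    return out
```

Both functions perform the same per-element operations in the same order, so they return identical results; the fused form simply avoids three extra passes over the output and the intermediate buffers between them.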