Commits


Tracy Sharpe authored and GitHub committed 155e22d1ab7
MLAS: fuse float output into quantized GEMM (#4215) Add more variants of MlasGemm that do a u8x8 GEMM with the output type as float. This fuses the common sequence of MatMulInteger + Cast + Mul(OutputScale) + optional Add(BiasVector).