Public / onnxruntime / 4880f1da46e

Patrice Vignola authored and GitHub committed 4880f1da46e on 30 Aug 2023

Fix attention fusion for UNet onnx model export when using LoRA weights (#17249)

### Description
Tested with Stable Diffusion UNet models exported by both PyTorch 2.1.0
(nightly) and PyTorch 1.13.1, with and without LoRA weights.



### Motivation and Context
LoRA weights modify the UNet model by adding matmul and scale
operations to each of the q/k/v/out tensors, which breaks the current
MHA (multi-head attention) pattern recognition.
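
To illustrate why the extra operations get in the way of pattern matching, here is a minimal pure-Python sketch of a single attention projection with and without a LoRA adapter. It is not onnxruntime code. The helper names, the shapes, and the final addition of the low-rank update to the base projection are assumptions taken from the standard LoRA formulation rather than from this commit.

```python
# Illustrative sketch only: how a LoRA adapter changes one q/k/v/out projection.
# All names and shapes are hypothetical, not taken from the fusion code.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def plain_projection(x, w):
    # Without LoRA: a single matmul produces the projected tensor.
    return matmul(x, w)

def lora_projection(x, w, down, up, scale):
    # With LoRA (assumed standard form): two extra matmuls and a scale
    # are computed alongside the base projection and added to it.
    base = matmul(x, w)
    delta = matmul(matmul(x, down), up)
    return [[b + scale * d for b, d in zip(base_row, delta_row)]
            for base_row, delta_row in zip(base, delta)]

x = [[1.0, 2.0]]                # one token, hidden size 2
w = [[1.0, 0.0], [0.0, 1.0]]    # base projection weight (identity for clarity)
down = [[1.0], [1.0]]           # rank-1 down-projection
up = [[0.5, 0.5]]               # rank-1 up-projection

q_plain = plain_projection(x, w)
q_lora = lora_projection(x, w, down, up, scale=2.0)
```

In the plain case the projected tensor comes straight out of one matmul. In the LoRA case it comes out of a small subgraph, so a matcher that expects a single matmul feeding each of q/k/v/out would no longer recognize the pattern.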