Commits


Jiajia Qin authored and GitHub committed 9799c3fbd26
[webgpu] Enable FlashAttention for GQA (#23761) ### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->