Commits


Jiajia Qin authored and GitHub committed 785e2b1eaec
[js/webgpu] Optimize softmax by vector (#18153) ### Description This PR enables `softmax` outputs max supported components instead of scalar for each thread. Softmax with input[0]: [12,4096,4096] becomes 47.86 ms from 55.11 ms