Learn more about cloning repositories
You have read-only access
Fix attention perf regression (#8682) * undo change in attention cpu * fix perf regression * disable persistent softmax by default