Learn more about cloning repositories
You have read-only access
[MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953) Implement ARM NEON SQNBitGemm kernel that first block quantizes A to int8 and then does int8 multiplication.