Learn more about cloning repositories
You have read-only access
Ability to fuse non-square (pruned) attention weights for BERT-like models (#6850)