Update Attention operator for GPT2 (#3474)

Add unidirectional mask for Attention operator. Update the mask_index-to-mask broadcast from B->BxS->BxNxSxS to B->BxSxS->BxNxSxS.
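The broadcast described above can be illustrated with a minimal pure-Python sketch. It is not the operator's actual kernel. It assumes B is batch size, S is sequence length, N is the number of heads, and that mask_index holds the count of valid (non-padding) tokens per batch entry; the commit message states none of this explicitly. The function name build_attention_mask and its parameters are hypothetical.

```python
def build_attention_mask(mask_index, seq_len, num_heads, unidirectional=True):
    """Expand per-batch valid lengths (B) into a B x N x S x S mask of 0/1.

    Assumption: mask_index[b] is the number of valid tokens in batch entry b,
    with padding on the right. This is an illustration, not the real kernel.
    """
    batch_mask = []
    for valid_len in mask_index:
        # B -> S x S: query position i may attend to key position j when j is
        # not padding and, for a unidirectional mask, j is not in the future.
        per_seq = [
            [
                1 if j < valid_len and (not unidirectional or j <= i) else 0
                for j in range(seq_len)
            ]
            for i in range(seq_len)
        ]
        # S x S -> N x S x S: every head shares the same mask.
        batch_mask.append([[row[:] for row in per_seq] for _ in range(num_heads)])
    return batch_mask


# Two sequences padded to length 3, with 2 and 3 valid tokens respectively.
mask = build_attention_mask([2, 3], seq_len=3, num_heads=2)
for row in mask[0][0]:
    print(row)
```

Going through BxSxS rather than BxS is what lets each query position carry its own row, which a unidirectional (lower-triangular) mask needs; a BxS mask can only express padding that is identical for every query position.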