Release a new version of flash attention with causal prefix cross-attention.
Since only the forward pass is implemented, this should be released on its own branch. We should probably also block backprop, so that attempting a backward pass fails explicitly instead of running.
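
One possible way to block backprop, sketched here as an assumption rather than a description of the actual code: wrap the forward kernel in a `torch.autograd.Function` whose `backward` raises. The class name `ForwardOnlyAttention` is hypothetical, and plain softmax attention stands in for the real kernel only so the sketch runs.

```python
import torch


class ForwardOnlyAttention(torch.autograd.Function):
    """Hypothetical wrapper: forward works, backward refuses to run."""

    @staticmethod
    def forward(ctx, q, k, v):
        # Stand-in for the real forward-only kernel (assumption):
        # ordinary scaled dot-product attention, no masking.
        scale = q.shape[-1] ** -0.5
        scores = torch.matmul(q, k.transpose(-2, -1)) * scale
        return torch.matmul(torch.softmax(scores, dim=-1), v)

    @staticmethod
    def backward(ctx, grad_out):
        # Fail loudly instead of returning missing or wrong gradients.
        raise NotImplementedError(
            "backward is not implemented for this forward-only kernel"
        )


# Inference is unaffected.
q = torch.randn(2, 4, 8, requires_grad=True)
k = torch.randn(2, 6, 8)
v = torch.randn(2, 6, 8)
out = ForwardOnlyAttention.apply(q, k, v)

# Any attempt to backprop through the op raises.
try:
    out.sum().backward()
except NotImplementedError as err:
    print("blocked:", err)
```

With this approach the error only fires when some input requires grad and `backward()` is actually called, so inference under `torch.no_grad()` is never affected.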