Skip to content

[1] Release causal prefix flashattn #1

Closed
@timt51

Description

@timt51

Release a new version of flash attention with causal prefix cross-attention.

  • Since only the forward pass is implemented, this should be released on its own branch. We should maybe also block backprop from running or something...
  • Include tests

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions