Pull requests: apple/axlearn
Pull requests list
Speed up FA Backward pass in GPU via parallelizing sequence dimension
#818
by kelvin-zou
was merged Nov 12, 2024
[Bug fix] Update gpu flash attention after syntax change and fixed unit tests for flash attention
#809
by kelvin-zou
was merged Nov 5, 2024
Bump up circle CI container size for build-and-test.
#794
by kelvin-zou
was merged Oct 29, 2024
[Bug fix] better GPU flash attention compatibility
#744
by kelvin-zou
was merged Oct 14, 2024
Support customized mesh rules to support different HWs
#696
by kelvin-zou
was merged Sep 13, 2024