-
Notifications
You must be signed in to change notification settings - Fork 280
Pull requests: apple/axlearn
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Expose segment_ids to _compute_attention in FlashAttention
#714
by changlan
was merged Sep 28, 2024
Loading…
Add --megascale_abort_on_hangs flag for multi-slice TPU jobs
#716
by mugithi
was closed Oct 3, 2024
Loading…
Support customized mesh rules to support different HWs
#696
by kelvin-zou
was merged Sep 13, 2024
Loading…
No-op when garbage collecting non-existent checkpoint dir.
#703
by markblee
was merged Sep 21, 2024
Loading…
Add support for tensor including UTF-8 string in inference_output
#23
by mialsy
was merged Aug 3, 2023
Loading…
Minor cleanup and add batch_size to fake text source
#11
by SnehaNB
was merged Jul 26, 2023
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-12-18.