-
Notifications
You must be signed in to change notification settings - Fork 296
Pull requests: apple/axlearn
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
New DataPartitionType DATA
#567
by apoorvtintin
was closed Dec 11, 2024
Loading…
updated Dec 11, 2024
Add meshes and config for TRN2/1 for Fuji models
#885
by apoorvtintin
was closed Jan 10, 2025
Loading…
updated Jan 10, 2025
Neuron support in Axlearn
#566
by apoorvtintin
was closed Jan 12, 2025
Loading…
updated Jan 12, 2025
[DO-NOT-MERGE] PR encompassing all changes needed to support neuron on Axlearn
#886
by apoorvtintin
was closed Jan 13, 2025
Loading…
updated Jan 13, 2025
[DO-NOT-MERGE] PR encompassing all changes needed to support neuron on Axlearn
#919
opened Jan 13, 2025 by
apoorvtintin
Loading…
updated Jan 17, 2025
Flash Attention for Neuron
#883
by apoorvtintin
was closed Jan 21, 2025
Loading…
updated Jan 21, 2025
Input batch sharding strategy BATCH
#884
by apoorvtintin
was closed Feb 7, 2025
Loading…
updated Feb 7, 2025
TRN2 Meshes and Configurations
#916
by apoorvtintin
was merged Feb 13, 2025
Loading…
updated Feb 13, 2025
Flash Attention for Neuron
#939
by apoorvtintin
was merged Feb 13, 2025
Loading…
updated Feb 13, 2025
Improvements and fixes to gradient accumulation
#993
opened Feb 14, 2025 by
apoorvtintin
Loading…
updated Feb 28, 2025
ProTip!
Filter pull requests by the default branch with base:main.