Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Gradient Accumulation in Axlearn
#465 by apoorvtintin was closed Jul 31, 2024 updated Jul 31, 2024
New DataPartitionType DATA
#567 by apoorvtintin was closed Dec 11, 2024 Loading… updated Dec 11, 2024
Add meshes and config for TRN2/1 for Fuji models
#885 by apoorvtintin was closed Jan 10, 2025 Loading… updated Jan 10, 2025
Neuron support in Axlearn
#566 by apoorvtintin was closed Jan 12, 2025 Loading… updated Jan 12, 2025
[DO-NOT-MERGE] PR encompassing all changes needed to support neuron on Axlearn
#886 by apoorvtintin was closed Jan 13, 2025 Loading… updated Jan 13, 2025
Special remat for Neuron
#898 by apoorvtintin was merged Jan 14, 2025 Loading… updated Jan 14, 2025
[DO-NOT-MERGE] PR encompassing all changes needed to support neuron on Axlearn
#919 opened Jan 13, 2025 by apoorvtintin Loading… updated Jan 17, 2025
Flash Attention for Neuron
#883 by apoorvtintin was closed Jan 21, 2025 Loading… updated Jan 21, 2025
Input batch sharding strategy BATCH
#884 by apoorvtintin was closed Feb 7, 2025 Loading… updated Feb 7, 2025
TRN2 Meshes and Configurations
#916 by apoorvtintin was merged Feb 13, 2025 Loading… updated Feb 13, 2025
Flash Attention for Neuron
#939 by apoorvtintin was merged Feb 13, 2025 Loading… updated Feb 13, 2025
Improvements and fixes to gradient accumulation
#993 opened Feb 14, 2025 by apoorvtintin Loading… updated Feb 28, 2025
ProTip! Filter pull requests by the default branch with base:main.