Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Flash Attention for Neuron
#939 opened Jan 21, 2025 by apoorvtintin Loading…
TRN2 Meshes and Configurations
#916 opened Jan 10, 2025 by apoorvtintin Loading…
Special remat for Neuron
#898 by apoorvtintin was merged Jan 14, 2025 Loading…
Add meshes and config for TRN2/1 for Fuji models
#885 by apoorvtintin was closed Jan 10, 2025 Loading…
Input batch sharding strategy BATCH
#884 opened Dec 11, 2024 by apoorvtintin Loading…
Flash Attention for Neuron
#883 by apoorvtintin was closed Jan 21, 2025 Loading…
New DataPartitionType DATA
#567 by apoorvtintin was closed Dec 11, 2024 Loading…
Neuron support in Axlearn
#566 by apoorvtintin was closed Jan 12, 2025 Loading…
Gradient Accumulation in Axlearn
#465 by apoorvtintin was closed Jul 31, 2024 Loading…
ProTip! Add no:assignee to see everything that’s not assigned.