Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Remove version pin of typing-extensions
#904 by wangkuiyi was closed Jan 7, 2025 Loading…
Adds @apple/axlearn-admins to CODEOWNERS.
#896 by ruomingp was closed Dec 17, 2024 Loading…
add merge_group to action workflows
#870 by madrob was merged Dec 6, 2024 Loading…
Enable GCP Workload Monitoring
#868 by Perseus14 was closed Dec 4, 2024 Loading…
[Bug fix] Jax rollback for TPU
#855 by kelvin-zou was closed Nov 23, 2024 Loading…
[DO NOT CHECK IN, TEST ONLY]Test CI OOM issue
#792 by kelvin-zou was closed Oct 29, 2024 Loading…
Weight only offload
#789 by hanzhi713 was closed Nov 1, 2024 Draft
Support more types of groupnorm
#772 by berlino was closed Oct 28, 2024 Loading…
Add neuron attention with tests
#769 by lipovsek-aws was closed Oct 22, 2024 Loading…
support ssd/mamba2
#747 by berlino was closed Oct 11, 2024 Loading…
Upgrade jax to 0.4.33 for A3 Mega
#730 by samos123 was closed Oct 29, 2024 Draft
Adds initial lm training inputs with grain.
#727 by markblee was merged Oct 3, 2024 Loading…
Adds ASR WER calculator.
#726 by markblee was merged Oct 3, 2024 Loading…
add bert 768
#709 by YXSIO was closed Sep 23, 2024 Loading…
Fix weighted scalar division by zero.
#676 by markblee was merged Aug 27, 2024 Loading…
Style changes with py39+ as target.
#672 by miaojingang was closed Aug 23, 2024 Draft
Simplify flatten_items.
#666 by markblee was merged Aug 27, 2024 Loading…
Adds support for private worker pools.
#665 by markblee was merged Aug 27, 2024 Loading…
ProTip! Follow long discussions with comments:>50.