Pull requests: apple/axlearn

Pull requests list

Enable remat checkpoints to host instead of TPU memory
#643 by samos123 was merged Aug 14, 2024
Print step time for each step
#361 opened Mar 9, 2024 by samos123
increase timeout from 300s to 900s
#362 by samos123 was closed Apr 8, 2024
GKE GPU A3 with TCPX support
#517 by samos123 was merged Jun 29, 2024
Add github action support
#542 by samos123 was closed Jan 9, 2025
allow using on-demand instead of spot only
#622 opened Aug 3, 2024 by samos123
Enable support for Kueue for GKETPUJob
#623 by samos123 was merged Aug 13, 2024
Set remat_spec.policy None for fuji v2 70B
#638 by samos123 was closed Aug 8, 2024
WIP add fuji 65B and GCP GPU support
#357 by samos123 was closed Aug 2, 2024 Draft
set hostNetwork to True for TPUGKEJob
#641 opened Aug 8, 2024 by samos123
Set TF_FORCE_GPU_ALLOW_GROWTH=true by default
#712 opened Sep 24, 2024 by samos123
Upgrade jax to 0.4.33 for A3 Mega
#730 by samos123 was closed Oct 29, 2024 Draft
improve GCS perf: Change resource limit to request
#851 by samos123 was merged Jan 17, 2025
Docker: Upgrade Jax to 0.4.37
#880 opened Dec 10, 2024 by samos123 Draft
Add default compiler options for v6e
#887 by samos123 was merged Dec 12, 2024
use "true" and "false" instead of 0 and 1
#890 opened Dec 12, 2024 by samos123
fix broken apt install google-perftools
#917 by samos123 was merged Jan 14, 2025