Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

upgrade transformers version
#121 by gyin94 was closed Oct 14, 2023 Loading…
Update README.md
#142 by FarukhS52 was closed Oct 25, 2023 Loading…
Check for 'WAITING_FOR_RESOURCES' state.
#144 by markblee was merged Oct 25, 2023 Loading…
Raises a ValueError if the TPU version is unknown.
#153 by ruomingp was merged Oct 29, 2023 Loading…
More robust handling of unknown states.
#150 by markblee was merged Oct 29, 2023 Loading…
Adds GroupedQueryAttention.
#154 by markblee was merged Oct 30, 2023 Loading…
adding watchdog
#159 by kartikperisetla was closed Nov 3, 2023 Draft
Moves conformer layers to axlearn/audio.
#165 by markblee was closed Nov 5, 2023 Loading…
WIP add fuji 65B and GCP GPU support
#357 by samos123 was closed Aug 2, 2024 Draft
Cleanup output_dim handling in subsampler.
#173 by markblee was merged Nov 11, 2023 Loading…
Add ASR encoder layers.
#174 by markblee was merged Nov 13, 2023 Loading…
Adds the axlearn gcp bastion history command.
#176 by ruomingp was merged Nov 13, 2023 Loading…
Adds a bastion preemption command.
#182 by markblee was merged Nov 16, 2023 Loading…
Update trainer config test_utils.
#17 by markblee was merged Jul 29, 2023 Loading…
Evaler policies.
#15 by markblee was merged Jul 28, 2023 Loading…
torch: attention softmax as float32 for bf16
#18 by gyin94 was closed Aug 3, 2023 Loading…
Adds the CLI.
#7 by markblee was merged Jul 21, 2023 Loading…
Update README.md
#9 by TanayShukla was closed Aug 5, 2023 Loading…
ProTip! Updated in the last three days: updated:>2025-01-15.