Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Repeat KV heads in Flash Attention
#938 by changlan was merged Jan 21, 2025 Loading…
Explicitly pass module outputs to metrics.
#953 by markblee was merged Jan 28, 2025 Loading…
Add v6e special meshes
#952 by hanzhi713 was merged Jan 27, 2025 Loading…
Workaround module outputs being dropped.
#951 by markblee was merged Jan 26, 2025 Loading…
Update LoraFusedQKVLinear
#949 by qdavid1 was merged Jan 27, 2025 Loading…
update jax to 0.4.37
#948 by matthew-e-hopkins was merged Jan 27, 2025 Loading…
Add link to github issue regarding kubernetes-32.0.0
#947 by Ethanlm was merged Jan 24, 2025 Loading…
Forward input keys to decoder.
#944 by markblee was merged Jan 23, 2025 Loading…
Legacy flash remat fix
#943 by hanzhi713 was merged Jan 23, 2025 Loading…
Some fixes for flash remat
#942 by hanzhi713 was merged Jan 22, 2025 Loading…
Add GKE A3 Ultra support
#940 by samos123 was closed Apr 4, 2025 Loading…
Flash Attention for Neuron
#939 by apoorvtintin was merged Feb 13, 2025 Loading…
AOT compilation for v6e
#937 by changlan was merged Jan 21, 2025 Loading…
Adds mesh rule for a3-megagpu-8g.
#936 by markblee was merged Jan 23, 2025 Loading…
Avoid a top-level import of tokenizers.
#935 by markblee was merged Jan 19, 2025 Loading…
Makes causal lm metrics configurable.
#934 by markblee was merged Jan 21, 2025 Loading…
Supports flexible input partition specs.
#933 by markblee was merged Jan 19, 2025 Loading…
Enable GCP Workload Monitoring
#932 by Perseus14 was closed Mar 18, 2025 Loading…
Add ReadOptions args to _make_autoregressive_inputs
#931 by RsEnts was merged Jan 17, 2025 Loading…
Fix aot compilation with grain inputs.
#929 by markblee was merged Jan 16, 2025 Loading…
Add prefill hidden states as module outputs.
#928 by markblee was merged Jan 16, 2025 Loading…
Cache AoT compilation result
#927 by hanzhi713 was merged Jan 16, 2025 Loading…
ProTip! Filter pull requests by the default branch with base:main.