Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Enabled running Pallas Flash Attention on CPU.
#922 opened Jan 14, 2025 by ds-hwang Loading…
TRN2 Meshes and Configurations
#916 opened Jan 10, 2025 by apoorvtintin Loading…
Enable cudnn attention dropout
#913 opened Jan 8, 2025 by hanzhi713 Loading…
use "true" and "false" instead of 0 and 1
#890 opened Dec 12, 2024 by samos123 Loading…
Input batch sharding strategy BATCH
#884 opened Dec 11, 2024 by apoorvtintin Loading…
Flash Attention for Neuron
#883 opened Dec 11, 2024 by apoorvtintin Loading…
Docker: Upgrade Jax to 0.4.37
#880 opened Dec 10, 2024 by samos123 Draft
improve GCS perf: Change resource limit to request
#851 opened Nov 19, 2024 by samos123 Loading…
Add llama 3 tokenizer
#850 opened Nov 19, 2024 by sychen52 Loading…
Add Mamab2 and its Jamba variant
#839 opened Nov 14, 2024 by berlino Loading…
Add Goodput & Badput recording and monitoring support.
#783 opened Oct 25, 2024 by dipannita08 Loading…
5 tasks done
Use regex for parsing step_dir
#739 opened Oct 7, 2024 by nlusskin Loading…
Set TF_FORCE_GPU_ALLOW_GROWTH=true by default
#712 opened Sep 24, 2024 by samos123 Loading…
Fix submission of Dataflow jobs
#711 opened Sep 24, 2024 by damccorm Loading…
ProTip! Adding no:label will show everything without a label.