Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Use an async checkpointer to initialize JDI in tests
#1170 opened Jan 16, 2025 by gobbleturk Loading…
4 tasks done
Add gemma2 configs for trillium
#1169 opened Jan 16, 2025 by wenxindongwork Loading…
4 tasks done
jit entire prefill and insert for packed sequences
#1168 opened Jan 15, 2025 by sixiang-google Loading…
4 tasks done
add GCP workload observability feature pull ready
#1167 opened Jan 14, 2025 by jcyang43 Loading…
4 tasks done
MMLU Benchmarks
#1163 opened Jan 13, 2025 by gagika Loading…
4 tasks done
Allow nsys profile saved to local directory
#1160 opened Jan 10, 2025 by hengtaoguo Draft
3 of 4 tasks
Add environment variable to change Jetstream version
#1159 opened Jan 10, 2025 by vivianrwu Loading…
4 tasks done
[draft] equivalent mixtral mlperf data pipeline
#1157 opened Jan 10, 2025 by ZhiyuLi-goog Loading…
5 tasks done
Incorporate Orbax emergency replicator checkpoint manager pull ready
#1153 opened Jan 8, 2025 by xuefgu Loading…
4 tasks done
Anisha seq parallel
#1147 opened Jan 6, 2025 by A9isha Draft
4 tasks
Fix typo
#1141 opened Jan 4, 2025 by kislaykishore Loading…
Raymondzou context parallelism pull ready
#1135 opened Jan 2, 2025 by A9isha Draft
4 tasks
Add Context Parallelism support to cudnn Flash Attention
#1133 opened Jan 1, 2025 by kocchop Loading…
4 tasks done
Get all tests to pass locally with no special configuration
#1108 opened Dec 19, 2024 by SamuelMarks Loading…
4 tasks done
Anisha ckpt2hf1
#1106 opened Dec 19, 2024 by A9isha Draft
4 tasks
Add a Ray trainer for MaxText
#1098 opened Dec 13, 2024 by richardsliu Loading…
Add llama-405b configuration for v5p
#1095 opened Dec 10, 2024 by suexu1025 Loading…
4 tasks done
Add mixtral 8x7b config for gpu
#1090 opened Dec 9, 2024 by michelle-yooh Loading…
4 tasks done
Add Pallas GPU decode attention in Maxtext inference
#1066 opened Nov 26, 2024 by tohaowu Loading…
4 tasks done
[DO NOT MERGE] verify fix
#1045 opened Nov 16, 2024 by RissyRan Draft
Add Llama2-70b sparsecore collective model to trillium configs
#1042 opened Nov 15, 2024 by Obliviour Loading…
4 tasks done
Enable pathways workloads for v6e benchmarks
#1040 opened Nov 15, 2024 by sadikneipp Loading…
ProTip! Exclude everything labeled bug with -label:bug.