Add on-device runner and v5e benchmark model configs #1199

raymondzouu · 2025-01-24T21:12:01Z

Description

Adding on-device runner to benchmark_runner.py. This will run the model benchmark on the current device rather than launching it as an xpk workload. The benchmark script must be ran on all workers. This will allow our XLML perf tests to run our latest benchmark model configs.

This change introduces a new command-line flag that specifies which runner to use (xpk, on-device). Usage is as follows:

xpk runner:

python3 benchmarks/benchmark_runner.py xpk --project=<project> --zone=<zone> --device_type=v5litepod-256 --num_slices=1  --cluster_name=<cluster> --base_output_directory=gs://<your-bucket> --model_name="default_16b_v5e_256" --base_docker_image maxtext_base_image

on-device runner:

python3 benchmarks/benchmark_runner.py on-device --base_output_directory=gs://<your-bucket> --model_name="default_16b_v5e_256" --run_name=test-run

Tests

http://shortn/_grjxE2vPHD

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed.

benchmarks/benchmark_runner.py

Obliviour

Really great improvements here! Adding v5e configs and a maxtext direct way of running these commands is awesome.

Can you add details on this way of directly running maxtext_xpk_runner.py in the readme file in the bnechmarks directory?

benchmarks/benchmark_runner.py

raymondzouu requested review from gobbleturk, khatwanimohit, bvandermoon, vipannalla and RissyRan as code owners January 24, 2025 21:12

raymondzouu requested a review from Obliviour January 24, 2025 21:13

raymondzouu assigned Obliviour Jan 24, 2025

Obliviour reviewed Jan 27, 2025

View reviewed changes

benchmarks/benchmark_runner.py Outdated Show resolved Hide resolved

Obliviour reviewed Jan 27, 2025

View reviewed changes

benchmarks/benchmark_runner.py Outdated Show resolved Hide resolved

raymondzouu force-pushed the raymondzou-on-device-runner-updated branch 2 times, most recently from 9192249 to 8032ce2 Compare January 27, 2025 22:08

raymondzouu requested a review from Obliviour January 27, 2025 22:08

Obliviour approved these changes Jan 27, 2025

View reviewed changes

raymondzouu assigned gobbleturk and unassigned Obliviour Jan 27, 2025

gobbleturk approved these changes Jan 28, 2025

View reviewed changes

github-actions bot added the pull ready label Jan 28, 2025

Add on-device runner and pythonic v5e model configs

f64c51a

raymondzouu force-pushed the raymondzou-on-device-runner-updated branch from 8032ce2 to f64c51a Compare January 28, 2025 20:56

copybara-service bot merged commit ce4cd52 into main Jan 28, 2025
15 checks passed

copybara-service bot deleted the raymondzou-on-device-runner-updated branch January 28, 2025 21:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add on-device runner and v5e benchmark model configs #1199

Add on-device runner and v5e benchmark model configs #1199

raymondzouu commented Jan 24, 2025 •

edited

Loading

Obliviour left a comment

Add on-device runner and v5e benchmark model configs #1199

Add on-device runner and v5e benchmark model configs #1199

Conversation

raymondzouu commented Jan 24, 2025 • edited Loading

Description

Tests

Checklist

Obliviour left a comment

Choose a reason for hiding this comment

raymondzouu commented Jan 24, 2025 •

edited

Loading