-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[v4.0] feat: add new tasks for image classification v2 #722
Conversation
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅ |
version name scheme: should we call it Classification V2 (after model name) or V3.1 (align with the app version)? Let's discuss this in main mobile working group. |
we can move the two mobilenet edgetpu models to the end of the list later |
Let's call it V2 (not V2.0). |
Screenshot updated in the first comment. |
The accuracy test of @freedomtan Do you have an idea what can be the reason for this? |
Yes, I ran into the same problem on Pixel 7 before. It seems you have to reduce the batch size to a much smaller one. I guess it’s a bug or limitation of EdgeTPU. |
Thank you. Reducing the batch size worked. |
let's rebase this one and create an 4.0 branch. |
Kudos, SonarCloud Quality Gate passed! |
Let's rebase this to current master branch and make the model path local (local://) and without checksum, so that it's easier to test new models (we may get more than one model to test). |
a03cd6c
to
dd6663b
Compare
Done
Each vendor should add a new benchmark_setting {
benchmark_id: "image_classification_v2"
framework: "TFLite"
delegate_choice: {
delegate_name: "CPU"
accelerator_name: "cpu"
accelerator_desc: "CPU"
model_path: "https://github.com/mlcommons/mobile_models/raw/anh/add-MNEdgeTPU-V2-L/v3_1/tflite/mobilenet-edgetpu-v2-L_fp32.tflite"
model_checksum: "4b7ce0a82f8eaa542404d2ab2ee4e575"
}
}
benchmark_setting {
benchmark_id: "image_classification_offline_v2"
framework: "TFLite"
delegate_choice: {
delegate_name: "CPU"
accelerator_name: "cpu"
accelerator_desc: "CPU"
batch_size: 2
model_path: "https://github.com/mlcommons/mobile_models/raw/anh/add-MNEdgeTPU-V2-L/v3_1/tflite/mobilenet-edgetpu-v2-L_fp32.tflite"
model_checksum: "4b7ce0a82f8eaa542404d2ab2ee4e575"
}
} Or should I remove the checksum and set the path to local for |
we need diff --git a/flutter/assets/tasks.pbtxt b/flutter/assets/tasks.pbtxt
index 5a113ca..d7cf0be 100644
--- a/flutter/assets/tasks.pbtxt
+++ b/flutter/assets/tasks.pbtxt
@@ -221,6 +221,7 @@ task {
name: "Image Classification v2"
min_query_count: 1024
min_duration: 60
+ max_duration: 600
max_throughput: 1000
max_accuracy: 1.0
scenario: "SingleStream"
@@ -257,6 +258,7 @@ task {
name: "Image Classification v2 (Offline)"
min_query_count: 24576
min_duration: 0
+ max_duration: 600
max_throughput: 2000
max_accuracy: 1.0
scenario: "Offline" |
Done. |
@mohitmundhragithub and @AhmedTElthakeb for some new models, we need different preprocessing, I have a hack based on this one, 1d9e1e9 |
this is for which model? |
@freedomtan to send PRs for different input sizes for new models (with different preprocessing). 224x224, 256x256, 384x384 |
* Update S3 provider in android-build-test.yml * Update env var name --------- Co-authored-by: Anh <[email protected]>
* hacks for model small * small -> medium * medium -> large * for quantized models * make float model work again fixed a stupid bug * fix pixel and neuron settings * change preprocessing and label offset * Adding changes for calibration dataset preprocessing * hacks * clean up * fix pixel backend settings * fix pixel settings & icons for classificaton v2 * fix lint issue * Revert "Adding changes for calibration dataset preprocessing" This reverts commit c259333. New model doesn't need these * revert imagenet width and height we'll use #853 to handle different width and height * fix offline for Pixel backend * Undo update for Windows build * update MobilenetV4 settings for TFLite, Neuron, and Pixel * update windows setting for Mobilenet V4 * Use MobileNetV4 model in Apple devices * Fix tflite_settings_windows.pbtxt * Update expected_accuracy * Update expected_accuracy and expected_throughput for Apple devices * Use fp32 model for Windows * Update expected_throughput for Windows * Update expected_throughput for Android * Update S3 provider in android-build-test.yml (#855) * Update S3 provider in android-build-test.yml * Update env var name --------- Co-authored-by: Anh <[email protected]> * Reduce batch_size for image_classification_offline_v2 in Pixel 6 --------- Co-authored-by: Mohit Mundhra <[email protected]> Co-authored-by: Anh <[email protected]> Co-authored-by: Nathan Wasson <[email protected]>
* Remove InfiniteProgressCircle * Use GitHub vars for --num-flaky-test-attempts flag * Format Dart code * Add DottedProgressCircle * Add setting for progress animation style * Revert "Add setting for progress animation style" This reverts commit c14cc63. * Revert "Add DottedProgressCircle" This reverts commit ef1c427. * Use ProgressCircle with no animation * Format Dart code
* Use GitHub vars for --num-flaky-test-attempts flag * Use Pixel 6 for test-android-apk-tflite * Add expected_throughput for _kTFLiteBackend -> _kPixel6 * Update expected_throughput for _kTFLiteBackend -> _kPixel6 * Replace obsolete Windows image * Update expected_throughput for _kDN2103 * Use Pixel 6 - API level 31 for higher device capacity
Replace obsolete Windows image
…/new-edge-tpu-model
* Use dev version of loadgen * Set enforce_max_duration to false * Add min_duration_ms and max_duration_ms to dev Makefile * update enforce_max_duration logic * use updated loadgen * Show different colors for performance result based on multiple conditions. * Fix Dart linter issue * Use GitHub vars for --num-flaky-test-attempts flag * Update expected_throughput for _kS22Ultra * Use latest commit from inference/mobile_update branch --------- Co-authored-by: Koan-Sin Tan <[email protected]>
Should we merge this to |
I am fine with either way. Let's discuss it in the meeting. |
|
Quality Gate passedIssues Measures |
Closes #719
This is how the app looks with 2 new tasks: