include inference time in performance report only when not-mocked #1572

dafnapension · 2025-02-03T20:51:27Z

because inference is mocked, and evaluation is not indicative if runs over mocked predictions

…cause inference is mocked, and evaluation is not indicative if runs over mocked predictions Signed-off-by: dafnapension <[email protected]>

… mocked Signed-off-by: dafnapension <[email protected]>

* minor bug fixes * adding anls * Added LMMS_eval template, exact_match_mm metric, RGB image augmentor. * Added qa.with_context_multiple choice lmms_eval template * Added relaxed_correctness metric, edited info_vqa card, edit lmms_eval template * minor script changes * minor run changes * Added default template * Added lmms-lab version of chartqa with default template and relaxed correctness metric and with_type task * Added lmms-lab card of info_vqa. added default template to info_vqa and docvqa * Wevsrc new with_domain task, new metric. WIP * websrc working * discarding not important changes from main * workaround for llama_vision with WML * WML vs local UNITXT reproductions * Added llama vision benchmark * Changes for commit * fix unitxt typo * ruff * Update inference engine model and adjust expected targets in tests Signed-off-by: elronbandel <[email protected]> * Fix WML Inference Engine tests for images Signed-off-by: elronbandel <[email protected]> * Update tests Signed-off-by: elronbandel <[email protected]> * Enhance CSV loader with low_memory option and update inference engine tests for set equality Signed-off-by: elronbandel <[email protected]> * Add error handling in Loader class to raise UnitxtError on load_iterables failure Signed-off-by: elronbandel <[email protected]> --------- Signed-off-by: elronbandel <[email protected]>

Signed-off-by: dafnapension <[email protected]>

dafnapension force-pushed the polish_performance branch from c988f5e to fe4dd35 Compare February 4, 2025 14:50

dafnapension changed the title ~~remove inference time and evaluation time from performance report~~ include inference time in performance report only when not-mocked Feb 4, 2025

dafnapension force-pushed the polish_performance branch 2 times, most recently from 1734509 to 94bc4ec Compare February 10, 2025 08:50

dafnapension and others added 4 commits February 10, 2025 10:57

remove inference time and evaluation time from performance report, be…

382ef82

…cause inference is mocked, and evaluation is not indicative if runs over mocked predictions Signed-off-by: dafnapension <[email protected]>

returned evaluate to performance, and inference only in case that not…

7bd64d6

… mocked Signed-off-by: dafnapension <[email protected]>

match current mains version of loaders

7ca3758

Signed-off-by: dafnapension <[email protected]>

dafnapension force-pushed the polish_performance branch from 94bc4ec to 7ca3758 Compare February 10, 2025 08:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

include inference time in performance report only when not-mocked #1572

include inference time in performance report only when not-mocked #1572

dafnapension commented Feb 3, 2025

include inference time in performance report only when not-mocked #1572

Are you sure you want to change the base?

include inference time in performance report only when not-mocked #1572

Conversation

dafnapension commented Feb 3, 2025