Commit
[WIP][Experimental] Kleidi add i8mm op level tests
Still debugging i8mm gemm tests with multiple output tiles.

[----------] 59 tests from test_linear_8bit_act_xbit_weight
[ RUN      ] test_linear_8bit_act_xbit_weight.Standard
[       OK ] test_linear_8bit_act_xbit_weight.Standard (8 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.HasWeightZeros
[       OK ] test_linear_8bit_act_xbit_weight.HasWeightZeros (2 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.HasBias
[       OK ] test_linear_8bit_act_xbit_weight.HasBias (2 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.HasClamp
[       OK ] test_linear_8bit_act_xbit_weight.HasClamp (2 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.SmallDimension
[       OK ] test_linear_8bit_act_xbit_weight.SmallDimension (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KNotDivisibleByGroupSize
[       OK ] test_linear_8bit_act_xbit_weight.KNotDivisibleByGroupSize (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.GroupSizeNotDivisibleBy16
[       OK ] test_linear_8bit_act_xbit_weight.GroupSizeNotDivisibleBy16 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_dotprod_1x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_dotprod_1x4x32 (2 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_dotprod_1x4x32 (205 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_dotprod_1x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_dotprod_1x4x32 (1 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_dotprod_1x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_dotprod_1x8x32 (1 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_dotprod_1x8x32 (197 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_dotprod_1x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_dotprod_1x8x32 (1 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_i8mm_4x8x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_i8mm_4x8x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_i8mm_4x8x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_i8mm_4x8x32 (16 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_i8mm_4x8x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_i8mm_4x8x32 (55 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_i8mm_4x8x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_i8mm_4x8x32 (13 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_i8mm_4x8x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_i8mm_4x8x32 (16 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_i8mm_4x8x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_i8mm_4x8x32 (271 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_i8mm_4x8x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_i8mm_4x8x32 (45 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMV_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMV_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMV_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiWithBiasGEMV_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMV_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_i8mm_8x4x32
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_i8mm_8x4x32 (0 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_i8mm_8x4x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_i8mm_8x4x32 (9 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_i8mm_8x4x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiStandardGEMM_i8mm_8x4x32 (55 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_i8mm_8x4x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiHasClampGEMM_i8mm_8x4x32 (7 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_i8mm_8x4x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiHasBiasGEMM_i8mm_8x4x32 (16 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_i8mm_8x4x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiLargeGEMM_i8mm_8x4x32 (270 ms)
[ RUN      ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_i8mm_8x4x32
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiLargerGroupGEMM_i8mm_8x4x32 (42 ms)
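
For triage, it can help to group the results in this log by operation kind (GEMV or GEMM) and kernel variant. The Python sketch below is one possible way to do that and is not part of the commit. The helper name `tally_results` and the naming pattern `Kleidi<Case><GEMV|GEMM>_<kernel>` are assumptions inferred from the test names in the log.

```python
import re
from collections import Counter

# Matches gtest result lines such as
# "[  FAILED  ] suite.KleidiSmallGEMM_i8mm_4x8x32 (16 ms)".
# Whitespace inside the brackets is optional, so flattened logs also match.
RESULT_RE = re.compile(r"\[\s*(OK|FAILED)\s*\]\s+(\w+)\.(\w+)\s+\((\d+) ms\)")

# Assumed naming pattern: Kleidi<Case><GEMV|GEMM>_<kernel>
NAME_RE = re.compile(r"Kleidi\w+?(GEMV|GEMM)_(\w+)$")


def tally_results(log_text):
    """Count OK/FAILED results per (op kind, kernel variant)."""
    counts = Counter()
    for status, _suite, name, _ms in RESULT_RE.findall(log_text):
        m = NAME_RE.match(name)
        # Tests that do not follow the Kleidi pattern go into one bucket.
        key = (m.group(1), m.group(2)) if m else ("other", "non-Kleidi")
        counts[(key, status)] += 1
    return counts


# A short excerpt of the log above, used only for illustration.
SAMPLE = """
[       OK ] test_linear_8bit_act_xbit_weight.Standard (8 ms)
[       OK ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMV_i8mm_4x8x32 (0 ms)
[       OK ] test_linear_8bit_act_xbit_weight.KleidiTinyGEMM_i8mm_4x8x32 (0 ms)
[  FAILED  ] test_linear_8bit_act_xbit_weight.KleidiSmallGEMM_i8mm_4x8x32 (16 ms)
"""

if __name__ == "__main__":
    for ((kind, kernel), status), n in sorted(tally_results(SAMPLE).items()):
        print(f"{kind:5s} {kernel:16s} {status:6s} {n}")
```

In the log above, every failure is a GEMM test on the i8mm_4x8x32 or i8mm_8x4x32 kernel. For each of those two kernels, all GEMV tests and the Tiny GEMM test pass, while the other six GEMM tests fail.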