In-Place Dense Matrix Transposition #2199
Conversation
Very cool, thanks for the PR, and for finding bugs in my in-place transpose.
I have only a few comments.
Resolved review threads (now outdated):
- src/main/java/org/apache/sysds/runtime/matrix/data/LibMatrixReorg.java
- src/main/java/org/apache/sysds/runtime/matrix/data/LibMatrixReorg.java
- .../java/org/apache/sysds/test/component/matrix/libMatrixReorg/TransposeInPlaceBrennerTest.java
Codecov Report
Attention: Patch coverage is …

@@             Coverage Diff              @@
##               main    #2199      +/-   ##
============================================
+ Coverage     71.88%   72.30%    +0.42%
- Complexity    44701    45023      +322
============================================
  Files          1449     1452        +3
  Lines        169182   169417      +235
  Branches      32980    33059       +79
============================================
+ Hits         121617   122498      +881
+ Misses        38237    37602      -635
+ Partials       9328     9317       -11
LGTM - thanks for the new kernel @jessicapriebe. I'll merge it in.
DIA WiSe 24/25 project.
Closes apache#2199.
Added a new kernel for In-Place Dense Matrix Transposition, based on Algorithm 467 by Brenner (DOI: 10.1145/355611.362542).
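The PR's kernel itself is not reproduced here; as a point of reference, below is a minimal, self-contained sketch of the cycle-following idea that underlies in-place transposition of a row-major m x n matrix: the value at linear index i moves to (i*m) mod (m*n-1). Brenner's Algorithm 467 refines this by exploiting number-theoretic structure (the divisors mentioned under Future Work below) to find cycle leaders without the auxiliary visited bitset used in this sketch. Class and method names are illustrative only.

```java
import java.util.Arrays;
import java.util.BitSet;

public class InPlaceTransposeSketch {

    /** Transposes a row-major m x n matrix held in 'a' (length m*n) in place. */
    public static void transpose(double[] a, int m, int n) {
        final long size = (long) m * n - 1;
        if (size <= 0)
            return; // 1x1 or empty: nothing to do
        BitSet visited = new BitSet(m * n);
        // indices 0 and m*n-1 are fixed points of the transposition permutation
        for (int start = 1; start < size; start++) {
            if (visited.get(start))
                continue; // already moved as part of an earlier cycle
            int i = start;
            double carried = a[start];
            do {
                // destination of index i = r*n+c is c*m+r = (i*m) mod (m*n-1)
                int next = (int) (((long) i * m) % size);
                double tmp = a[next];
                a[next] = carried;
                carried = tmp;
                visited.set(next);
                i = next;
            } while (i != start);
        }
    }

    public static void main(String[] args) {
        // 2x3 matrix [[1,2,3],[4,5,6]] becomes 3x2 [[1,4],[2,5],[3,6]]
        double[] a = {1, 2, 3, 4, 5, 6};
        transpose(a, 2, 3);
        System.out.println(Arrays.toString(a)); // [1.0, 4.0, 2.0, 5.0, 3.0, 6.0]
    }
}
```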
Performance:
Compared to the existing kernel, the added method provides significant performance benefits in a single-threaded context.
Note:
Performance measurements were restricted to cases where the existing kernel yields correct results. Similar or even better performance was observed across all cases.
Future Work:
The divisors operate on disjoint sets of array indices, so the corresponding cycles can be processed in parallel, offering additional performance improvements in multi-threaded scenarios (see the sketch below).
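One possible shape for that parallelization, purely as an illustration and not the PR's actual scheme: if the cycle-leader indices are pre-partitioned into groups that touch disjoint array positions (the role the divisors play in the note above), each group can be followed on its own thread without synchronization. The leader enumeration is assumed to be given; followCycles reuses the index mapping from the earlier sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelTransposeSketch {

    /** Follows every cycle whose leader is in 'leaders' for a row-major m x n matrix in 'a'. */
    static void followCycles(double[] a, int m, int n, int[] leaders) {
        final long size = (long) m * n - 1;
        for (int start : leaders) {
            int i = start;
            double carried = a[start];
            do {
                int next = (int) (((long) i * m) % size); // destination of index i
                double tmp = a[next];
                a[next] = carried;
                carried = tmp;
                i = next;
            } while (i != start);
        }
    }

    /** Submits one task per leader group; groups must cover disjoint cycles exactly once. */
    static void transposeParallel(double[] a, int m, int n, List<int[]> leaderGroups, int threads)
            throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<?>> pending = new ArrayList<>();
            for (int[] group : leaderGroups)
                pending.add(pool.submit(() -> followCycles(a, m, n, group)));
            for (Future<?> f : pending)
                f.get(); // wait for all groups and propagate failures
        } finally {
            pool.shutdown();
        }
    }
}
```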
@mboehm7