mdtest size > first == last hang #506

jschwartz-cray · 2025-01-23T23:50:37Z

The -f/-l options are used to restrict the number of tasks mdtest runs to smaller subsets of the total size specified via the job MPI parameters.

In this mode a subset of the ranks does not participate in the test, and those ranks have to be managed properly so that they rejoin the participating ranks at the end.

The recently refactored logic fixed one issue but created another in the corner case of size > first == last. In this scenario only one rank participates in the test, but all ranks are duplicating MPI_COMM_WORLD, and the barrier behavior is not correct for this scenario, resulting in a hang.

The relevant code is here:

        if(i < last){
          MPI_Group testgroup;
          range.last = i - 1;
          MPI_Group_range_incl(worldgroup, 1, (void *)&range, &testgroup);
          MPI_Comm_create(world_com, testgroup, &testComm);
          MPI_Group_free(&testgroup);
          if(testComm == MPI_COMM_NULL){
            continue;
          }
        }else{
          MPI_Comm_dup(world_com, & testComm);
        }
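
To make the corner case concrete, here is a minimal model of the branch quoted above. It uses no MPI and only counts how many ranks end up holding a non-null test communicator. It assumes that `i` is the task count of the current iteration and that the subgroup always starts at rank 0; the function names are hypothetical and do not exist in mdtest.

```c
#include <assert.h>

/* Model of the quoted branch (assumption: i is the task count for this
 * iteration and the subgroup covers ranks 0..i-1).
 * i < last : only ranks 0..i-1 get a communicator, the rest get MPI_COMM_NULL.
 * otherwise: every rank duplicates the world communicator. */
static int members_quoted_logic(int size, int i, int last)
{
    assert(size > 0 && i > 0 && i <= size);
    return (i < last) ? i : size;
}

/* Model of the common logic: a subgroup of ranks 0..i-1 is always created,
 * whether or not it covers all ranks. */
static int members_common_logic(int size, int i)
{
    assert(size > 0 && i > 0 && i <= size);
    return i;
}

/* Returns 1 if `rank` would be left out of the subgroup 0..i-1, i.e. the
 * rank that has to take the MPI_COMM_NULL path and rejoin at the end. */
static int rank_is_excluded(int rank, int i)
{
    return rank >= i;
}
```

With size = 4 and first == last == 2, `members_quoted_logic(4, 2, 2)` returns 4 while `members_common_logic(4, 2)` returns 2: under the quoted branch the two ranks that should sit out still receive a communicator, which is the mismatch described in this issue.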

One solution to this involves making the logic common and ensuring that any ranks which aren't participating are handled in the same manner as they are in ior/src/ior.c (line 117 in 9f97b10):

        if (params->testComm == MPI_COMM_NULL) {

I will be submitting a PR which implements this fix and some minor error handling improvements as a separate commit.


The -f/-l options are used to restrict the number of tasks mdtest runs to smaller subsets of the total size specified via the job MPI parameters. In this mode a subset of the ranks does not participate in the test, and those ranks have to be managed properly so that they rejoin the participating ranks at the end. The recently refactored logic fixed one issue but created another in the corner case of size > first == last. In this scenario only one rank participates in the test, but all ranks were duplicating MPI_COMM_WORLD, and the barrier behavior was not correct for this scenario, resulting in a hang. This solves the problem by making the logic common (a new group and communicator will always be created for the test, whether it is for all ranks or a subset) and ensuring that any ranks which aren't participating are handled in the same manner as in ior.c:117.

jschwartz-cray · 2025-01-24T00:04:08Z

#507
