Clang UB sanitizer CI test: increase coverage #5597

lucafedeli88 · 2025-01-23T11:08:11Z

In the CI test based on the Clang UB sanitizer, most of the time (~1h30) is spent compiling the code, while only a few minutes are spent actually running simulations. This means that we can increase the coverage of the test by adding more simulations, with a negligible increase in total runtime.
This PR does just that: most of the cases in Examples/Physics_applications are now tested with the UB sanitizer.

Note that some cases cannot run in single precision (see below). For this reason, the PR also splits the UB sanitizer test into a single-precision test and a double-precision test (the double-precision test covers only the cases that cannot run in single precision).

Updates:

Issue found while running inputs_test_3d_beam_beam_collision --> We need to run this case in double precision

The tool has found an issue while running mpirun -n 2 ./build/bin/warpx.3d Examples/Physics_applications/beam_beam_collision/inputs_test_3d_beam_beam_collision :

STEP 1 starts ...
/home/runner/work/WarpX/WarpX/build/_deps/fetchedpicsar-src/multi_physics/QED/include/picsar_qed/containers/picsar_tables.hpp:310:17: runtime error: -nan is outside the range of representable values of type 'int'
/home/runner/work/WarpX/WarpX/build/_deps/fetchedpicsar-src/multi_physics/QED/include/picsar_qed/containers/picsar_tables.hpp:310:17: runtime error: -nan is outside the range of representable values of type 'int'
SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior /home/runner/work/WarpX/WarpX/build/_deps/fetchedpicsar-src/multi_physics/QED/include/picsar_qed/containers/picsar_tables.hpp:310:17 in 
SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior /home/runner/work/WarpX/WarpX/build/_deps/fetchedpicsar-src/multi_physics/QED/include/picsar_qed/containers/picsar_tables.hpp:310:17 in

I've temporarily commented out this case while I investigate the cause. ~~For the moment, I am not able to reproduce the issue on my local machine~~. The cause is the use of single precision for this specific test case: momenta end up being NaN, and the sanitizer detects the attempt to convert a floating-point NaN to an integer.
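To illustrate the kind of conversion the sanitizer flags, here is a minimal sketch of a guarded float-to-int cast. The helper `checked_to_int` is hypothetical and is not part of PICSAR or WarpX; it only shows that a NaN or out-of-range value can be caught before the cast, assuming the input is a `float` or a `double`.

```cpp
#include <cmath>
#include <limits>
#include <optional>

// Hypothetical helper for illustration only (not PICSAR or WarpX code).
// Converts a float or double to int only when the result is well defined.
template <typename RealType>
std::optional<int> checked_to_int (RealType x)
{
    // NaN and +/-inf have no integer representation: casting them is
    // undefined behavior, which is what the sanitizer reports.
    if (!std::isfinite(x)) { return std::nullopt; }

    // The cast truncates toward zero, so the truncated value must lie
    // inside [INT_MIN, INT_MAX]. Both bounds are exact in double.
    constexpr auto lo = static_cast<double>(std::numeric_limits<int>::min());
    constexpr auto hi = static_cast<double>(std::numeric_limits<int>::max());
    const auto xd = static_cast<double>(x);
    if (xd <= lo - 1.0 || xd >= hi + 1.0) { return std::nullopt; }

    return static_cast<int>(x);
}
```

A caller would test the returned `std::optional` and handle the empty case explicitly. Such a guard only reports the NaN; it does not address why the momenta become NaN in single precision.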

Issue found while running inputs_test_2d_background_mcc --> We need to run this case in double precision

MLMG does not converge in single precision for this simulation case. We need to run it in double precision.

Issue found while running free_electron_laser --> We need to run this case in double precision

I have observed this issue:

 STEP 444 starts ...
0::1::Assertion `m_current_z_lab[i_buffer] >= m_buffer_domain_lab[i_buffer].lo(m_moving_window_dir) and m_current_z_lab[i_buffer] <= m_buffer_domain_lab[i_buffer].hi(m_moving_window_dir)' failed, file "/home/runner/work/WarpX/WarpX/Source/Diagnostics/BTDiagnostics.cpp", line 870, Msg: 
 ### ERROR   : z-slice in lab-frame (0.299976) is outside the buffer domain
#            physical extent (0.299976 to 0.299988).
 !!!

which seems to be related to using single precision instead of double precision. Therefore, we need to run this case in double precision.

Issue found while running inputs_test_2d_laser_ion_acc --> bugfix in WarpX

The inputs_test_2d_laser_ion_acc case has the following issue in single precision:

--- INFO    : Writing openPMD file diags/openPMDbw000000
terminate called after throwing an instance of 'std::runtime_error'
  what():  Datatypes of chunk data (FLOAT) and record component (DOUBLE) do not match.
SIGABRT
See Backtrace.0.0 file for details

This comes from the fact that the datatype of this dataset in ParticleHistogram2D.cpp is hard-coded as double:

    auto dataset = io::Dataset(
            io::determineDatatype<double>(),
            {static_cast<unsigned long>(m_bin_num_ord), static_cast<unsigned long>(m_bin_num_abs)});

This PR modifies these lines as follows:

    auto dataset = io::Dataset(
            io::determineDatatype<amrex::Real>(),
            {static_cast<unsigned long>(m_bin_num_ord), static_cast<unsigned long>(m_bin_num_abs)});
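The mismatch can be reproduced in isolation with a small sketch. Everything below is a stand-in written for illustration: `Datatype`, `determine_datatype`, `chunk_matches`, and the `Real` alias are hypothetical and are not the openPMD-api or AMReX definitions. The sketch assumes a single-precision build, where the floating-point alias resolves to `float`.

```cpp
#include <type_traits>

// Stand-ins for illustration only; NOT the openPMD-api or AMReX types.
enum class Datatype { FLOAT, DOUBLE };

// Maps a C++ floating-point type to the stand-in datatype tag.
template <typename T>
constexpr Datatype determine_datatype ()
{
    static_assert(std::is_same_v<T, float> || std::is_same_v<T, double>,
                  "this sketch handles float and double only");
    return std::is_same_v<T, float> ? Datatype::FLOAT : Datatype::DOUBLE;
}

// Returns true when a buffer with element type T can be stored in a
// dataset that was declared with the datatype `declared`.
template <typename T>
constexpr bool chunk_matches (Datatype declared)
{
    return determine_datatype<T>() == declared;
}

// Assumption: single-precision build, so the alias is float.
using Real = float;
```

With these stand-ins, `chunk_matches<Real>(determine_datatype<double>())` is false, which corresponds to the FLOAT versus DOUBLE error above, while `chunk_matches<Real>(determine_datatype<Real>())` is true in either precision.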


ax3l · 2025-01-24T22:51:42Z

Source/Diagnostics/ReducedDiags/ParticleHistogram2D.cpp

@@ -298,7 +298,7 @@ void ParticleHistogram2D::WriteToFile (int step) const
    data.setPosition<amrex::Real>({0.5, 0.5});

    auto dataset = io::Dataset(
-            io::determineDatatype<double>(),
+            io::determineDatatype<amrex::Real>(),


Wouldn't the properties be in ParticleReal?

Suggested change

-            io::determineDatatype<amrex::Real>(),
+            io::determineDatatype<amrex::ParticleReal>(),

add new tests

19333ab

lucafedeli88 added the component: tests Tests and CI label Jan 23, 2025

fix bug

47041c6

lucafedeli88 mentioned this pull request Jan 23, 2025

[WIP] Increase coverage of clang sanitizer tests #5280

Closed

lucafedeli88 changed the title ~~Clang UB sanitizer CI test: increase coverage~~ [WIP] Clang UB sanitizer CI test: increase coverage Jan 23, 2025

lucafedeli88 added 7 commits January 23, 2025 13:50

temporary workaround to test more cases

1f564e7

split clang UB sanitizer into single- and double- precision tests

ee64375

add echo to ease debugging

97cedfd

Move test case to double precision

82bb0cf

add even more test cases

62be62c

mv free_electron_laser to DP tests

d234bc2

dataset in ParticleHistogram2D must be of type amrex::Real, rather than double

bedd665

lucafedeli88 changed the title ~~[WIP] Clang UB sanitizer CI test: increase coverage~~ Clang UB sanitizer CI test: increase coverage Jan 24, 2025

lucafedeli88 requested review from EZoni and ax3l January 24, 2025 09:32

ax3l reviewed Jan 24, 2025
