'NaN*' and '+Inf' #42

gcorrall · 2024-02-08T19:43:21Z

I've been using powerjoular to log power consumption on a few machines; this is run via systemd, and the data is stored in a csv file (and then processed by collectd)..

Recently I noticed one machine (Ubuntu 20.04, Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz) was regularly logging "NaN***" and "+Inf**" in the CSV file for the CPU utilisation, which in turn was causing the collectd plugin to log parsing errors ('Ignoring trailing garbage').

After some experimentation I think the problem originates in this line in powerjoular.adb:

CPU_Utilization := (Float (CPU_CCI_After.cbusy) - Float (CPU_CCI_Before.cbusy)) / (Float (CPU_CCI_After.ctotal) - Float (CPU_CCI_Before.ctotal));

If the before and after numbers are close enough together the Float conversion results in either CPU_CCI_After.cbusy and CPU_CCI_Before.cbusy being the same, or CPU_CCI_After.ctotal and CPU_CCI_Before.ctotal being the same; this would cause there to be a zero in the division and give a "NaN***" or "+Inf**".

The actual Long_Integers aren't the same; the Float conversion approximates them to the same value. E.g:

Put(Float(Long_Integer(15534323370)), Exp => 0, Fore => 0);
Put(Float(Long_Integer(15534324162)), Exp => 0, Fore => 0);

Both give the same answer (15534323712.00000), and caused the CPU_Utilization calculation to give me an +Inf**.

I think this case needs to be checked for, so that the division is avoided if a zero is involved. I assume this is also an issue with other similar calculations in the code.

Alternatively maybe greater precision could be used in the type conversions - Long_Float instead of Float. I crudely fixed this by changing all the Floats to Long_Floats (by running "sed -i 's/Float/Long_Float/g' src/*"). It did stop the problem.

The text was updated successfully, but these errors were encountered:

adelnoureddine · 2024-02-08T20:26:57Z

Thanks @gcorrall for pointing out this issue.
Switching from Float to Long_Float would indeed solve the problem. Though we need to check all calculations and platforms (PID, RPi).

If you wish, you can submit a PR with the first float conversions, and I'll go next week from there to verify the remaining calculations and do some testing on RPi too.

gcorrall · 2024-02-09T10:45:22Z

No problem - I've now submitted a PR.

adelnoureddine self-assigned this Feb 8, 2024

adelnoureddine added the bug Something isn't working label Feb 8, 2024

adelnoureddine linked a pull request Feb 9, 2024 that will close this issue

Change Floats to Long_Floats #43

Merged

adelnoureddine closed this as completed in #43 Feb 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

'NaN*' and '+Inf' #42

'NaN*' and '+Inf' #42

gcorrall commented Feb 8, 2024

adelnoureddine commented Feb 8, 2024

gcorrall commented Feb 9, 2024

'NaN***' and '+Inf**' #42

'NaN***' and '+Inf**' #42

Comments

gcorrall commented Feb 8, 2024

adelnoureddine commented Feb 8, 2024

gcorrall commented Feb 9, 2024

'NaN*' and '+Inf' #42

'NaN*' and '+Inf' #42