PFind out of memory #22
Comments
I suggest we reproduce this in the main pwalk GitHub repository using one of their examples, like pdu, and then file the issue there.
For me it does not seem to crash, but when I increase the number of nodes it takes forever and I cannot finish; the find did not finish even in 80 minutes. Decreasing the number of files does not help much.
Seems I found the issue on my system. In io500_fixed.sh, around line 211, the pfind step is:
myrun "$command" $result_file
This command was never finishing, and I was thinking that it was just too slow. I tried an interactive job, and only then did I get an MPI error, but not through sbatch. Then I noticed that I was executing the srun from the root folder (io-500-dev) and not from the bin folder; entering the bin folder solved the issue. So if I do this:
cd bin
myrun "$command" $result_file
matches=$( grep MATCHED $result_file )
cd ..
pfind works (I have not tried it on more than 2 nodes). However, something must have changed, because a few days ago it was working without any issue.
When it runs, there should be an output line like [Exec] or something. What does it say? I thought we were passing the full path to pfind, so the 'cd' shouldn't make a difference. Unless the problem is that pfind can't find its pwalk library?
So, do you have a result for us? :)
Maybe we need Io500.sh to set PYTHONPATH to where pwalk is?
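As a rough sketch of the same idea (not the actual pfind source), the script itself could prepend its own directory to the module search path before importing, which would make the lib.parallelwalk import independent of the directory the job is launched from:

# Hypothetical sketch only: put the directory containing the pfind script first
# on sys.path so "from lib.parallelwalk import ParallelWalk" resolves no matter
# where srun/sbatch starts the process. The real pfind may handle this differently.
import os
import sys

sys.path.insert(0, os.path.dirname(os.path.abspath(__file__)))

from lib.parallelwalk import ParallelWalk  # expects bin/lib/ to sit next to the script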
It is the full path to pfind all the time, but a few days ago it was working fine and yesterday it wasn't; the only difference is where I execute the pfind command from. Yes, it could just be an environment variable, but then why was it working before? That is the confusing part, and we probably cannot figure it out. I was trying to get the results yesterday but pfind was not working, so I will check now.
I am tuning and I have the following issue: I have created 8 million files, which is not that many, but my mdtest_easy takes just 30 seconds while mdtest_hard_write takes 700 seconds, so I have to adjust. In any case, I will not end up with many fewer files. pfind looks for all the files and takes too much time; I have been in this test for more than half an hour. My point is that if my system is slow with pfind but fast at creating files, then fine, pfind should give me a bad result, but it should still finish in a reasonable time. We should have an approach that does not search all the files. Initially I also thought of using one MPI process per node for pfind, since I am running 4 processes per node overall.
Great question. I don't know what to do here.
Yeah, that reminds me of the testing we did in the beginning...
I think this is now fixed by the stonewalling John added to the scripts? Some kind of stonewalling is also available inside the C version of pfind now. So this is probably fixed here?
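For context, "stonewalling" here means capping the traversal by wall-clock time and reporting whatever has been matched so far, instead of insisting on scanning every file. A minimal single-process sketch of that idea (purely illustrative; the real pfind is MPI-parallel and this is not its code):

# Illustrative sketch of a stonewalled find: stop scanning once a wall-clock
# budget is exceeded and report partial counts. Not the pfind implementation.
import os
import time

def stonewalled_find(root, matches, budget_seconds=300):
    deadline = time.time() + budget_seconds
    found = scanned = 0
    complete = True
    for dirpath, dirnames, filenames in os.walk(root):
        for name in filenames:
            scanned += 1
            if matches(name):
                found += 1
        if time.time() >= deadline:   # stonewall reached: stop the traversal early
            complete = False
            break
    return found, scanned, complete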
This was fixed by commit 462ace9, which enables the stonewall by default, so the issue should probably be closed?
I'm not convinced that the pfind out-of-memory problem is resolved (even with stonewalling). I still have to check the following theoretical setting: an extremely big directory combined with extremely slow stat() operations. One thread runs readdir() and creates jobs for stat(), and those jobs start to queue up. I'm not quite sure whether libcircle handles this case and stalls the creation of new jobs. The reason we do not see the problem is that, even at 1 kB per filename, 1 GB of memory corresponds to 1 million filenames, which is not the number of files we created. It is on my list to check when replacing libcircle.
We have a producer/consumer model for LFSCK traversal and repair of Lustre filesystems. The producer keeps track of how many items are in the queue, and if the queue gets too large it stops scanning until the consumer has reduced the backlog by some amount.
Cheers, Andreas
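A minimal threaded sketch of the throttling Andreas describes, assuming a bounded queue between a single readdir() producer and a pool of stat() consumers; the blocking put() is what stalls the producer until the backlog shrinks again (illustrative only, not libcircle or LFSCK code):

# Sketch of producer/consumer backpressure: the bounded queue blocks the
# readdir() producer once max_backlog paths are waiting for stat(), so the
# backlog (and its memory use) cannot grow without limit.
import os
import queue
import threading

def walk_with_backpressure(root, num_workers=4, max_backlog=10000):
    work = queue.Queue(maxsize=max_backlog)   # bounded queue: put() blocks when full
    sizes = []

    def consumer():
        while True:
            path = work.get()
            if path is None:                  # sentinel: shut this worker down
                return
            try:
                sizes.append(os.stat(path).st_size)
            except OSError:
                pass                          # file vanished or stat() failed

    workers = [threading.Thread(target=consumer) for _ in range(num_workers)]
    for w in workers:
        w.start()

    for dirpath, dirnames, filenames in os.walk(root):   # single producer thread
        for name in filenames:
            work.put(os.path.join(dirpath, name))        # stalls here while the backlog is full

    for _ in workers:                                    # one sentinel per worker
        work.put(None)
    for w in workers:
        w.join()
    return sizes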
Running with 200 nodes and 5 processes produces an error; with 100 nodes and 10 processes it does work.
The error was:
Traceback (most recent call last):
File "/home/dkrz/k202079/work/io-500/io-500-dev/bin/pfind", line 16, in
from lib.parallelwalk import ParallelWalk
File "", line 969, in _find_and_load
File "", line 954, in _find_and_load_unlocked
File "", line 896, in _find_spec
File "", line 1139, in find_spec
File "", line 1113, in _get_spec
File "", line 1225, in find_spec
File "", line 1264, in _fill_cache
OSError: [Errno 12] Cannot allocate memory: '/mnt/lustre01/work/k20200/k202079/io-500/io-500-dev/bin/lib'