
no error reported by Hadoop in case of immediate failure #4

Open
klbostee opened this issue Feb 21, 2010 · 4 comments

@klbostee
Owner

As originally reported by Elias Pampalk:

The following script demonstrates a failure to fail when executed on a Hadoop cluster (it fails as expected when executed locally):

import dumbo

def mapper(k, v):
    yield 1, 1

if __name__ == "__main__":
    dumbo.run(mapper, dumbo.sumsreducer, combiner=dumbo.sumsreducer)

The test uses dumbo.sumsreducer where dumbo.sumreducer should be used. A TypeError should therefore be thrown by dumbo.sumsreducer (in the combiner). Instead, Hadoop reports no error and zero output from the mapper.
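For reference, the distinction that likely triggers the TypeError, sketched with simplified stand-ins (these are assumptions based on the reducers' names and the reported error, not dumbo's actual source):

```python
def sumreducer(key, values):
    # Sums scalar values per key, e.g. [1, 1, 1] -> 3.
    yield key, sum(values)

def sumsreducer(key, values):
    # Sums tuple values elementwise per key, e.g. [(1, 2), (3, 4)] -> (4, 6).
    # Scalar values are not iterable, so zip(*values) raises TypeError.
    yield key, tuple(map(sum, zip(*values)))

list(sumreducer("k", [1, 1, 1]))            # -> [("k", 3)]
list(sumsreducer("k", [(1, 2), (3, 4)]))    # -> [("k", (4, 6))]
```

Feeding the scalar counts emitted by the mapper above into a sums-style reducer would raise TypeError locally, which is the failure that the cluster run silently swallows.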

@cap

cap commented Jun 21, 2010

I encounter this non-failure whenever the script fails before calling dumbo.run. This happens most frequently when a top-level import statement fails because of unfulfilled dependencies.

@klbostee
Owner Author

@cap: Think that's actually a slightly different problem. When a dumbo script fails very quickly (i.e. basically immediately, instead of after having run for a while) it often happens that Hadoop Streaming's stderr catching mechanism hasn't been set up properly yet to catch the error. In this case you indeed won't see the error, but unfortunately there's not much that can be done about this in Dumbo itself.

@klbostee
Owner Author

On second thought, this actually is the same problem. It does fail fine, but it happens so quickly that the Hadoop Streaming logging doesn't catch it (presumably because it hasn't been initialized properly yet). Can't immediately think of a clean Dumbo-side fix for this problem, but I'll leave the ticket open for future reference...

@dangra
Contributor

dangra commented Jun 24, 2011

I was bitten by this bug. A possible workaround is to delay the propagation of the exception by a sensible time, so Hadoop has a chance to set up streaming logging before the Python interpreter terminates. At least the problem would occur less often, making dumbo more reliable at a low price.
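That workaround might look roughly like the sketch below (illustrative only: `run_safely` and the 5-second delay are assumptions, not part of dumbo's API):

```python
import sys
import time
import traceback

def run_safely(main, delay=5):
    """Run main(), but on failure print the traceback and wait before
    exiting, so Hadoop Streaming's stderr capture has a chance to be
    initialized. Illustrative wrapper, not dumbo's actual code."""
    try:
        main()
    except Exception:
        traceback.print_exc()  # ensure the traceback reaches stderr
        time.sleep(delay)      # delay propagation, per the workaround
        sys.exit(1)
```

A script would then wrap its entry point, e.g. `run_safely(lambda: dumbo.run(mapper, dumbo.sumreducer))`, instead of calling dumbo.run directly.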
