I am seeing that as the size of the input file increases, failed shuffles increase and job completion time grows non-linearly.
e.g.:
75GB took 1h
86GB took 5h
I also see the average shuffle time increase roughly tenfold,
e.g.:
75GB 4min
85GB 41min
Can someone point me in a direction to debug this?
If you are sure your algorithms are correct, disk-volume partitioning or fragmentation problems may be appearing somewhere past that 75 GB threshold, since you are probably using the same filesystem for caching the intermediate results.
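If that turns out to be the case, spreading the intermediate shuffle output across separate physical volumes and reducing spills to disk can help. Here is a minimal sketch assuming Hadoop MapReduce (your shuffle terminology suggests it); the property names are the Hadoop 2.x ones (older releases use mapred.local.dir and io.sort.mb), and the disk paths are placeholders for your own mount points, so treat it as illustrative rather than a drop-in fix:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class ShuffleTuning {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();

        // Spill map output to several independent disks instead of one
        // filesystem, so intermediate shuffle data does not compete with
        // everything else on a single volume. Paths here are placeholders.
        conf.set("mapreduce.cluster.local.dir",
                 "/disk1/mapred/local,/disk2/mapred/local");

        // A larger in-memory sort buffer (in MB) means fewer spill files
        // per map task, and therefore less merge work during the shuffle.
        conf.setInt("mapreduce.task.io.sort.mb", 256);

        Job job = Job.getInstance(conf, "shuffle-tuning-example");
        // ... set mapper/reducer/input/output as usual, then submit:
        // System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

It is also worth comparing the job counters (failed shuffle fetches, spilled records) between the 75 GB and 86 GB runs before changing anything, to confirm that spills or local-disk contention are actually what grows at the larger size.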