I use Hadoop total order partitioner and random sampler as input sampler.
But when I increase my slave nodes and reduce tasks to 8, I get following error:
Caused by: java.io.IOException: Split points are out of order
I don't know the reason for this error.
How can I set the number of three parameters on inputsampler.randomsampler
function?
Are you sure you are generating enough keys? From the javadoc: TotalOrderPartitioner
The input file must be sorted with the same comparator and contain