0

I'm using the new Hadoop API and as there is no way to explicitly specify the number of mappers (unlike the old API), I need to change the size of the data chunks so that I can control the number of mappers. How to change the default size of the data split in the code?

HHH
  • 5,485
  • 17
  • 84
  • 154

1 Answers1

0

Number of Mappers is determined by (File) Splits
The split size is determined by the InputFormat being used.
mapred.max.split.size parameter will let you define the split size.

Jasper
  • 7,942
  • 27
  • 90
  • 131