It's a well-known optimization to replace GroupByKey with ReduceByKey, since the latter combines values within each partition before the shuffle and therefore moves less data across the network. Are there reverse cases, in which code using GroupByKey is faster than the equivalent code using ReduceByKey?
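To make the comparison concrete, here is a toy model in plain Python of the usual reasoning. It is not Spark code: the function names, the `partitions` data, and the idea of counting "records that cross the shuffle" are all assumptions made for illustration, and the model ignores serialization, memory, and combiner overhead.

```python
from collections import defaultdict


def shuffled_records_group(partitions):
    """GroupByKey-style: every (key, value) pair crosses the shuffle."""
    return sum(len(p) for p in partitions)


def shuffled_records_reduce(partitions):
    """ReduceByKey-style: values are combined per key inside each partition
    first, so at most one record per distinct key leaves each partition."""
    return sum(len({k for k, _ in p}) for p in partitions)


def sum_by_key(partitions):
    """The final per-key sum, which is the same under either strategy."""
    totals = defaultdict(int)
    for p in partitions:
        for k, v in p:
            totals[k] += v
    return dict(totals)


# Hypothetical input: two partitions of (key, value) pairs.
partitions = [
    [("a", 1), ("a", 2), ("b", 3)],
    [("a", 4), ("b", 5), ("b", 6)],
]

print(shuffled_records_group(partitions))   # 6
print(shuffled_records_reduce(partitions))  # 4
print(sum_by_key(partitions))               # {'a': 7, 'b': 14}
```

In this model the saving comes entirely from repeated keys inside a partition. If every key in a partition is distinct, both functions return the same count, so the model alone does not say which operation would be faster in that situation.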

Joel
alexgbelov
