tomer-ben-david
1/5/2018 - 11:03 AM

spark reduce by top words

spark reduce by top words

cleanedMobyDick.filter(!_.isEmpty)
  .map(_.toLowerCase())
  .map((_, 1))
  .reduceByKey(_ + _)
  .takeOrdered(10)(Ordering[Int].reverse.on(_._2))
  .foreach(println)