Guillaume Eynard-Bontemps, CNES (Centre National d’Etudes Spatiales - French Space Agency)
2020-11-15
From the Spark Research homepage:
Our goal was to design a programming model that supports a much wider class of applications than MapReduce, while maintaining its automatic fault tolerance. In particular, MapReduce is inefficient for multi-pass applications (…).
What’s the main difference between Spark and Hadoop MapReduce?
Answer link Key: hg
Transformations |
---|
map* |
filter |
groupByKey |

Actions |
---|
reduce |
collect |
count |
first |
take |
saveAs… |
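
The split between the two tables above can be illustrated with a toy model. The sketch below is plain Python, not Spark: `ToyRDD` is a hypothetical single-machine class that only borrows method names from the tables (`map`, `filter`, `groupByKey`, `reduce`, `collect`, `count`, `first`, `take`). It assumes nothing about Spark's internals and leaves out `saveAs…` and the other `map*` variants. Its purpose is to show that a transformation only describes a computation, while an action runs it and returns a value.

```python
from functools import reduce as _reduce
from itertools import islice


class ToyRDD:
    """Hypothetical single-machine stand-in for an RDD (not Spark's implementation)."""

    def __init__(self, source):
        # `source` is a zero-argument function returning a fresh iterator,
        # so the whole pipeline can be re-run for every action.
        self._source = source

    @classmethod
    def parallelize(cls, data):
        items = list(data)
        return cls(lambda: iter(items))

    # --- Transformations: return a new ToyRDD, compute nothing yet ---
    def map(self, f):
        return ToyRDD(lambda: (f(x) for x in self._source()))

    def filter(self, pred):
        return ToyRDD(lambda: (x for x in self._source() if pred(x)))

    def groupByKey(self):
        def grouped():
            groups = {}
            for key, value in self._source():
                groups.setdefault(key, []).append(value)
            return iter(groups.items())
        return ToyRDD(grouped)

    # --- Actions: run the pipeline and return a plain Python value ---
    def collect(self):
        return list(self._source())

    def count(self):
        return sum(1 for _ in self._source())

    def reduce(self, f):
        return _reduce(f, self._source())

    def take(self, n):
        return list(islice(self._source(), n))

    def first(self):
        return next(self._source())


calls = []  # records every time the mapped function actually runs


def square(x):
    calls.append(x)
    return x * x


rdd = ToyRDD.parallelize(range(1, 6))
squares = rdd.map(square).filter(lambda x: x % 2 == 1)
calls_before_action = len(calls)  # 0: the transformations above were lazy

result = squares.collect()                  # action: [1, 9, 25]
total = squares.reduce(lambda a, b: a + b)  # action: 35
pairs = ToyRDD.parallelize([("a", 1), ("b", 2), ("a", 3)]).groupByKey().collect()
```

In this toy, `square` is not called until `collect()` runs, and it runs again for `reduce()`, because each action replays the chain of transformations from the source. Real Spark behaves the same way at this level: transformations are lazy, and an RDD that is not cached is recomputed for each action.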
What’s the main API of Spark?
Answer link Key: op
Easiest: