Guillaume Eynard-Bontemps, CNES (Centre National d’Etudes Spatiales - French Space Agency)
2020-11-15
From Spark Research homepage:
Our goal was to design a programming model that supports a much wider class of applications than MapReduce, while maintaining its automatic fault tolerance. In particular, MapReduce is inefficient for multi-pass applications (…).




What’s the main difference between Spark and Hadoop Map Reduce?
Answer link Key: hg


| Transformations |
|---|
| map* |
| filter |
| groupByKey |
| Actions |
|---|
| reduce |
| collect |
| count |
| first |
| take |
| saveAs… |






What’s the main API of Spark?
Answer link Key: op
Easiest: