Guillaume Eynard-Bontemps, CNES (Centre National d’Etudes Spatiales - French Space Agency)
2020-11-15
1 ZB
1,000,000 PB
1,000,000,000,000 GB
1,000,000,000,000,000,000,000 B
Volume, variety, multiple sources, internal, external…
Store, Compute, Analyse: Calculators, Cloud, Hadoop, Spark, Dask
Visualize, Use: Applications, Web interfaces
Big data is a field that treats ways to analyze, systematically extract information from, or otherwise deal with data sets that are too large or complex to be dealt with by traditional data-processing application software.
Big data is where parallel computing tools are needed to handle data.
Not a technology.
What is the estimated size of the global data sphere?
Answer link Key: yi
Cite some V’s of Big Data (multiple choices):
Answer link Key: rf
Which technology is the most representative of the Big Data world?
Answer link Key: dy
Huge amount of small objects:
Think of:
Use cases:
Extract new knowledge and value from the data:
Cross analysis of internal and external data, correlations:
Data production or scientific exploration:
What is the typical volumes of scientific Datasets (multiple choices)?
Answer link Key: ri
https://blog.dataiku.com/when-and-when-not-to-use-deep-learning
Is Big Data and Machine Learning the same?
Answer link Key: fj