4. ahmed.rebai@esprit.tn, Lotfi.ncib@esprit.tn
The Data History
Since the dawn of time… up until 2005,
humans had created 130 exabytes of data.
2005: 130 exabytes
2010: 1,200 exabytes
2015: 7,900 exabytes
2020: 40,900 exabytes
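The growth implied by these figures can be checked with a short calculation. The sketch below is only illustrative: the dictionary holds the four values from the slide, and the names `data_eb` and `growth_factors` are hypothetical, introduced here for the example.

```python
# Figures copied from the slide: exabytes of data created, by year.
data_eb = {2005: 130, 2010: 1200, 2015: 7900, 2020: 40900}

def growth_factors(series):
    """Return {(start, end): factor} for each pair of consecutive years."""
    years = sorted(series)
    return {(a, b): series[b] / series[a] for a, b in zip(years, years[1:])}

factors = growth_factors(data_eb)
for (start, end), factor in factors.items():
    print(f"{start} -> {end}: x{factor:.1f}")
```

Under these numbers, the volume grows roughly ninefold from 2005 to 2010, then about five to seven times in each following five-year period.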
Byte (B)
Kilobyte (kB): 1,000 = 10^3 bytes
Megabyte (MB): 1,000,000 = 10^6 bytes
Gigabyte (GB): 1,000,000,000 = 10^9 bytes
Terabyte (TB): 1,000,000,000,000 = 10^12 bytes
Petabyte (PB): 1,000,000,000,000,000 = 10^15 bytes
Exabyte (EB): 1,000,000,000,000,000,000 = 10^18 bytes
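As a small illustration of this scale, the following sketch converts a raw byte count into the largest fitting decimal unit, up to the exabyte (EB). It assumes the power-of-1000 definitions listed above (not the binary 1024-based ones); `UNITS` and `human_readable` are hypothetical names chosen for the example.

```python
# Decimal units, each 1000 times larger than the previous one.
UNITS = ["B", "kB", "MB", "GB", "TB", "PB", "EB"]

def human_readable(n_bytes):
    """Format a byte count using decimal (power-of-1000) units."""
    value = float(n_bytes)
    for unit in UNITS:
        # Stop at the first unit where the value drops below 1000,
        # or at the largest unit we know about.
        if value < 1000 or unit == UNITS[-1]:
            return f"{value:g} {unit}"
        value /= 1000

print(human_readable(725_000_000))  # 725 MB
print(human_readable(10**18))       # 1 EB
```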
The Data History
Start with 1 byte of space.
If we zoom out 1000 times, we get a page of letters (1 kB), about 500 characters.
Now let's zoom out another 1000 times and we get a book: about 500 pages take 1 MB.
Now let's zoom out another 1000 times and we get 1 GB. 1 GB is enough to hold the entire human genome once encoded (it usually takes about 725 MB).
If we zoom out another 1000 times, we get to a TB, enough to hold someone's life recorded for 8 years (everything they do, every minute or second).
If we zoom out another 1000 times, we get to a PB. The Amazon rainforest covers 1.4 billion acres with about 500 trees per acre, or about 700 billion trees. If you chop all these trees down, turn them into paper, and fill the paper with letters on both sides, you get close to 1 PB.
If we zoom out another 1000 times, we get to an EB (1000 PB).
Why Data Science?
Salary trends have followed the impact of data science. With a national
average salary of $118,000 (which increases to $126,000 in Silicon Valley), data
science has become a lucrative career path where you can solve hard
problems and drive social impact.
Data scientist is the sexiest career of
the 21st century
Statistical analysis and data mining were the
hottest skills that got recruiters' attention
every year from 2014 through 2018.
The US alone faces a shortage of more than
150,000 data analysts and an additional 1.5
million data-savvy managers.
What is Data Science?
“The ability to take data — to be able to understand it, to process it, to extract value
from it, to visualize it, to communicate it — that’s going to be a hugely important
skill in the next decades.”
- Hal Varian, chief economist at Google and UC Berkeley professor of information
sciences, business, and economics
DATA SCIENCE is the area of study which involves extracting insights from vast
amounts of data by the use of various scientific methods, algorithms, and processes.
Data science is the science which uses computer science, statistics, machine
learning, visualization, and human-computer interaction to collect, clean, integrate,
analyze, visualize, and interact with data in order to create data products.