SlideShare a Scribd company logo
1 of 23
Download to read offline
Big Data



           Eufris 2012
Why should I care?
McKinsey:
•$250 billions annual savings in EU alone by enhancing public sector
•$600 billions annual consumer surplus from using personal location data globally

•Annual growth of data is remarcable
•Data is the most valuable thing most companies have
•Data is massively underutilized




                                                                                    Eufris 2012
Forecast
There will be a shortage of talent necessary for
organizations to take advantage of big data. By 2018, the
United States alone could face a shortage of 140,000 to
190,000 people with deep analytical skills as well as 1.5
million managers and analysts with the know-how to use
the analysis of big data to make effective decisions.




                                                      Eufris 2012
What is Big Data?
"Big data technologies describe a new generation of technologies and architectures, designed to
economically extract value from very large volumes of a wide variety of data, by enabling high-velocity
capture, discovery, and/or analysis"
IDC



"Big Data is a technlogy that helps extract value from the digital universe.”
IDC



"Techniques and technologies that make handling data at extreme scale economical."
Forrester




                                                                                                    Eufris 2012
ABC of Big Data
     Analy&cs
       •making	
  sense	
  of	
  your	
  data,	
  in	
  real-­‐5me,	
  in	
  easy	
  way


     Bandwidth
       •inges5ng,	
  prosessing	
  and	
  delivering	
  large	
  amounts	
  of	
  data


     Content
       •storing,	
  managing	
  and	
  retaining	
  large	
  amounts	
  of	
  data



www.netapp.com                                                                             Eufris 2012
3 V’s of Big Data
Variety
 • Big	
  Data	
  extends	
  beyond	
  structured	
  data,	
  including	
  unstructured	
  data	
  of	
  all	
  varie5es:	
  
   text,	
  audio,	
  video,	
  click	
  streams,	
  log	
  files	
  and	
  more


Velocity
 • o@en	
  5me	
  sensi5ve,	
  Big	
  Data	
  must	
  be	
  used	
  as	
  it	
  is	
  streaming	
  in	
  to	
  the	
  enterprise	
  in	
  order	
  
   to	
  maximize	
  its	
  value	
  to	
  the	
  business


Volume
 • Big	
  Data	
  comes	
  in	
  one	
  size:	
  large.	
  Enterprises	
  are	
  awash	
  with	
  data,	
  easily	
  amassing	
  
   terabytes	
  and	
  even	
  petabytes	
  of	
  informa5on




                                                                                                                                           Eufris 2012
Few core concepts



                    Eufris 2012
Hadoop
•The	
  Apache	
  Hadoop	
  so.ware	
  library	
  is	
  a	
  framework	
  that	
  
 allows	
  for	
  the	
  distributed	
  processing	
  of	
  large	
  data	
  sets	
  across	
  
 clusters	
  of	
  computers	
  using	
  a	
  simple	
  programming	
  model.

•Three	
  subprojects
  •Hadoop	
  Common
  •Hadoop	
  Distributed	
  Filesystem	
  (HDFS)
  •Hadoop	
  MapReduce




                                                                                          Eufris 2012
MapReduce
•Introduced	
  by	
  Google	
  in	
  2004

                                            2
                                            2

                 Map                        2   Reduce   3
                                                         4
                                            1
                                                         5
                                            2
                                            3
                                                             Eufris 2012
MapReduce on App Engine
 • Mapreduce	
  is	
  an	
  experimental,	
  innovaNve,	
  and	
  rapidly	
  changing	
  new	
  
   feature	
  for	
  App	
  Engine




                                                                                                   Eufris 2012
NoSQL
•DefiniNon	
  1

 “Next Generation Databases mostly addressing some of the points: being
 non-relational, distributed, open-source and horizontally scalable. The
 original intention has been modern web-scale databases. The movement
 began early 2009 and is growing rapidly. Often more characteristics apply as:
 schema-free, easy replication support, simple API, eventually consistent, a
 huge data amount, and more.”
 nosql-database.org




                                                                           Eufris 2012
NoSQL
•DefiniNon	
  2

 “In computing, NoSQL (sometimes expanded to "not only SQL") is a broad
 class of database management systems that differ from the classic model of
 the relational database management system (RDBMS) in some significant
 ways. These data stores may not require fixed table schemas, usually avoid
 join operations, and typically scale horizontally.”
 Wikipedia




                                                                         Eufris 2012
From ACID to BASE
ACID:
Atomicity,	
  Consistency,	
  Isola&on,	
  Durability




BASE:
Basically	
  available,	
  So?	
  state,	
  Eventually	
  consistent




                                                                       Eufris 2012
Big Data and cloud



                     Eufris 2012
Big Data on AWS




                  Eufris 2012
MapReduce on AWS
• Not	
  yet	
  Hadoop	
  1.0.0




                                  Eufris 2012
MapReduce on AWS
                   EC2
                   S3
                   + DynamoDB




                          Eufris 2012
Google BigQuery
 Features
• Speed - Analyze billions of rows(!) in seconds
• Scale - Terabytes of data, trillions of records
• Simplicity - SQL-like query language, hosted on
  Google infrastructure
• Sharing - Powerful group- and user-based permissions
  using Google accounts
• Security - Secure SSL access
• Multiple access methods - Can be used by REST
  API, a command-line tool, a browser-based graphical
  interface, and Google Apps Script

                                                   Eufris 2012
BigQuery example




                   Eufris 2012
Big Data outside of cloud



                            Eufris 2012
Oracle Big Data Appliance

About 500 000 $


18 Oracle Sun Servers
 • 864 GB main memory;
 • 216 CPU cores;
 • 648 TB of raw disk storage;
 • 40 Gb/s InfiniBand connectivity between nodes and engineered systems;
 • 10 Gb/s Ethernet connectivity.




                                                                    Eufris 2012
Autonomy IDOL 10



"For far too long, organizations have confined structured data to
relational databases and unstructured data to simplistic keyword
matching technologies..."

“IDOL 10 brings these worlds together, allowing organizations to
automatically process, understand, and act on 100 percent of
their data, in real-time. The results will be dramatic, as
businesses can develop entirely new applications that explore
the richness and color of Human Information that live in
unstructured, semi-structured, and structured forms.”

Price?

                                                                    Eufris 2012
Thank you!



             Eufris 2012

More Related Content

What's hot

Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabatinabati
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataHaluan Irsad
 
Big Data - A brief introduction
Big Data - A brief introductionBig Data - A brief introduction
Big Data - A brief introductionFrans van Noort
 
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012Gigaom
 
Cloud computing and big data analytics
Cloud computing and big data analyticsCloud computing and big data analytics
Cloud computing and big data analyticshanish93
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry PerspectiveCloudera, Inc.
 
Big data introduction, Hadoop in details
Big data introduction, Hadoop in detailsBig data introduction, Hadoop in details
Big data introduction, Hadoop in detailsMahmoud Yassin
 
Getting Started with Big Data in the Cloud
Getting Started with Big Data in the CloudGetting Started with Big Data in the Cloud
Getting Started with Big Data in the CloudRightScale
 
Big Data: An Overview
Big Data: An OverviewBig Data: An Overview
Big Data: An OverviewC. Scyphers
 
Big Data Taiwan 2014 Track2-2: Informatica Big Data Solution
Big Data Taiwan 2014 Track2-2: Informatica Big Data SolutionBig Data Taiwan 2014 Track2-2: Informatica Big Data Solution
Big Data Taiwan 2014 Track2-2: Informatica Big Data SolutionEtu Solution
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop IntroductionJayant Mukherjee
 
Big Data - An Overview
Big Data -  An OverviewBig Data -  An Overview
Big Data - An OverviewArvind Kalyan
 
Core concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data AnalyticsCore concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data AnalyticsKaniska Mandal
 
Big Data Final Presentation
Big Data Final PresentationBig Data Final Presentation
Big Data Final Presentation17aroumougamh
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataKaran Desai
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingMinhazul Arefin
 
Big Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsBig Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsRichard McDougall
 
Hadoop and BigData - July 2016
Hadoop and BigData - July 2016Hadoop and BigData - July 2016
Hadoop and BigData - July 2016Ranjith Sekar
 

What's hot (20)

Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabati
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big Data - A brief introduction
Big Data - A brief introductionBig Data - A brief introduction
Big Data - A brief introduction
 
Big Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning GuruBig Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning Guru
 
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
THE 3V's OF BIG DATA: VARIETY, VELOCITY, AND VOLUME from Structure:Data 2012
 
Cloud computing and big data analytics
Cloud computing and big data analyticsCloud computing and big data analytics
Cloud computing and big data analytics
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry Perspective
 
Big data introduction, Hadoop in details
Big data introduction, Hadoop in detailsBig data introduction, Hadoop in details
Big data introduction, Hadoop in details
 
Getting Started with Big Data in the Cloud
Getting Started with Big Data in the CloudGetting Started with Big Data in the Cloud
Getting Started with Big Data in the Cloud
 
Big Data: An Overview
Big Data: An OverviewBig Data: An Overview
Big Data: An Overview
 
Big Data Taiwan 2014 Track2-2: Informatica Big Data Solution
Big Data Taiwan 2014 Track2-2: Informatica Big Data SolutionBig Data Taiwan 2014 Track2-2: Informatica Big Data Solution
Big Data Taiwan 2014 Track2-2: Informatica Big Data Solution
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
 
Big Data - An Overview
Big Data -  An OverviewBig Data -  An Overview
Big Data - An Overview
 
Core concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data AnalyticsCore concepts and Key technologies - Big Data Analytics
Core concepts and Key technologies - Big Data Analytics
 
Big Data Final Presentation
Big Data Final PresentationBig Data Final Presentation
Big Data Final Presentation
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
 
Big Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsBig Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure Considerations
 
Big Data: an introduction
Big Data: an introductionBig Data: an introduction
Big Data: an introduction
 
Hadoop and BigData - July 2016
Hadoop and BigData - July 2016Hadoop and BigData - July 2016
Hadoop and BigData - July 2016
 

Viewers also liked

Smart Machines Sep 2014
Smart Machines Sep 2014Smart Machines Sep 2014
Smart Machines Sep 2014Immo Salo
 
Mitä on big data, Aamiaistilaisuus 13.03.2012
Mitä on big data, Aamiaistilaisuus 13.03.2012Mitä on big data, Aamiaistilaisuus 13.03.2012
Mitä on big data, Aamiaistilaisuus 13.03.2012Immo Salo
 
Big Data -esitys, Arcada ammattikorkeakoulu
Big Data -esitys, Arcada ammattikorkeakouluBig Data -esitys, Arcada ammattikorkeakoulu
Big Data -esitys, Arcada ammattikorkeakouluImmo Salo
 
Tiedolla johtamisen tulevaisuus ja avoin data, Mikko Babitzin
Tiedolla johtamisen tulevaisuus ja avoin data, Mikko BabitzinTiedolla johtamisen tulevaisuus ja avoin data, Mikko Babitzin
Tiedolla johtamisen tulevaisuus ja avoin data, Mikko BabitzinTilastokeskus
 
Pilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo Salo
Pilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo SaloPilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo Salo
Pilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo SaloImmo Salo
 
Smart machines -presentation, November 2014
Smart machines -presentation, November 2014Smart machines -presentation, November 2014
Smart machines -presentation, November 2014Immo Salo
 

Viewers also liked (6)

Smart Machines Sep 2014
Smart Machines Sep 2014Smart Machines Sep 2014
Smart Machines Sep 2014
 
Mitä on big data, Aamiaistilaisuus 13.03.2012
Mitä on big data, Aamiaistilaisuus 13.03.2012Mitä on big data, Aamiaistilaisuus 13.03.2012
Mitä on big data, Aamiaistilaisuus 13.03.2012
 
Big Data -esitys, Arcada ammattikorkeakoulu
Big Data -esitys, Arcada ammattikorkeakouluBig Data -esitys, Arcada ammattikorkeakoulu
Big Data -esitys, Arcada ammattikorkeakoulu
 
Tiedolla johtamisen tulevaisuus ja avoin data, Mikko Babitzin
Tiedolla johtamisen tulevaisuus ja avoin data, Mikko BabitzinTiedolla johtamisen tulevaisuus ja avoin data, Mikko Babitzin
Tiedolla johtamisen tulevaisuus ja avoin data, Mikko Babitzin
 
Pilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo Salo
Pilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo SaloPilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo Salo
Pilvipalvelut, Tampere 25.10.2013, Eufris Oy, Immo Salo
 
Smart machines -presentation, November 2014
Smart machines -presentation, November 2014Smart machines -presentation, November 2014
Smart machines -presentation, November 2014
 

Similar to Big Data

From open data to API-driven business
From open data to API-driven businessFrom open data to API-driven business
From open data to API-driven businessOpenDataSoft
 
Big data presentation (2014)
Big data presentation (2014)Big data presentation (2014)
Big data presentation (2014)Xavier Constant
 
Simple, Modular and Extensible Big Data Platform Concept
Simple, Modular and Extensible Big Data Platform ConceptSimple, Modular and Extensible Big Data Platform Concept
Simple, Modular and Extensible Big Data Platform ConceptSatish Mohan
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...BigDataEverywhere
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data AnalyticsAmazon Web Services
 
1.demystifying big data & hadoop
1.demystifying big data & hadoop1.demystifying big data & hadoop
1.demystifying big data & hadoopdatabloginfo
 
Gis - open source potentials
Gis  - open source potentialsGis  - open source potentials
Gis - open source potentialsTim Willoughby
 
Big data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-finalBig data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-finalramazan fırın
 
Big Data Introduction
Big Data IntroductionBig Data Introduction
Big Data Introductionyalla4u
 
New Enterprise Cloud Database Options for 2019
New Enterprise Cloud Database Options for 2019New Enterprise Cloud Database Options for 2019
New Enterprise Cloud Database Options for 2019EDB
 
Accelerate Migration to the Cloud using Data Virtualization (APAC)
Accelerate Migration to the Cloud using Data Virtualization (APAC)Accelerate Migration to the Cloud using Data Virtualization (APAC)
Accelerate Migration to the Cloud using Data Virtualization (APAC)Denodo
 
Apache hadoop for windows server and windwos azure
Apache hadoop for windows server and windwos azureApache hadoop for windows server and windwos azure
Apache hadoop for windows server and windwos azureBrad Sarsfield
 
Hadoop as data refinery
Hadoop as data refineryHadoop as data refinery
Hadoop as data refinerySteve Loughran
 
Hadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranHadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranJAX London
 
The elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudThe elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudKhazret Sapenov
 
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part20812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2Raul Chong
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit MumbaiAnand Haridass
 

Similar to Big Data (20)

From open data to API-driven business
From open data to API-driven businessFrom open data to API-driven business
From open data to API-driven business
 
Ibm db2update2019 icp4 data
Ibm db2update2019   icp4 dataIbm db2update2019   icp4 data
Ibm db2update2019 icp4 data
 
Big data presentation (2014)
Big data presentation (2014)Big data presentation (2014)
Big data presentation (2014)
 
Simple, Modular and Extensible Big Data Platform Concept
Simple, Modular and Extensible Big Data Platform ConceptSimple, Modular and Extensible Big Data Platform Concept
Simple, Modular and Extensible Big Data Platform Concept
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
1.demystifying big data & hadoop
1.demystifying big data & hadoop1.demystifying big data & hadoop
1.demystifying big data & hadoop
 
Gis - open source potentials
Gis  - open source potentialsGis  - open source potentials
Gis - open source potentials
 
Big data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-finalBig data hadoop-no sql and graph db-final
Big data hadoop-no sql and graph db-final
 
Big Data Introduction
Big Data IntroductionBig Data Introduction
Big Data Introduction
 
New Enterprise Cloud Database Options for 2019
New Enterprise Cloud Database Options for 2019New Enterprise Cloud Database Options for 2019
New Enterprise Cloud Database Options for 2019
 
Accelerate Migration to the Cloud using Data Virtualization (APAC)
Accelerate Migration to the Cloud using Data Virtualization (APAC)Accelerate Migration to the Cloud using Data Virtualization (APAC)
Accelerate Migration to the Cloud using Data Virtualization (APAC)
 
Apache hadoop for windows server and windwos azure
Apache hadoop for windows server and windwos azureApache hadoop for windows server and windwos azure
Apache hadoop for windows server and windwos azure
 
Hadoop as data refinery
Hadoop as data refineryHadoop as data refinery
Hadoop as data refinery
 
Hadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve LoughranHadoop as Data Refinery - Steve Loughran
Hadoop as Data Refinery - Steve Loughran
 
The elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudThe elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloud
 
Hadoop in the Cloud
Hadoop in the CloudHadoop in the Cloud
Hadoop in the Cloud
 
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part20812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
 
IBM - Introduction to Cloudant
IBM - Introduction to CloudantIBM - Introduction to Cloudant
IBM - Introduction to Cloudant
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai
 

More from Immo Salo

Webinar: Quantum Revolution Is Here (2022)
Webinar: Quantum Revolution Is Here (2022)Webinar: Quantum Revolution Is Here (2022)
Webinar: Quantum Revolution Is Here (2022)Immo Salo
 
Webinaari: Kvanttivallankumous 03.02.2021
Webinaari: Kvanttivallankumous 03.02.2021Webinaari: Kvanttivallankumous 03.02.2021
Webinaari: Kvanttivallankumous 03.02.2021Immo Salo
 
Smart machines -esitys Tampereella 02/2016
Smart machines -esitys Tampereella 02/2016Smart machines -esitys Tampereella 02/2016
Smart machines -esitys Tampereella 02/2016Immo Salo
 
Smart Machines -presentation May 2015
Smart Machines -presentation May 2015Smart Machines -presentation May 2015
Smart Machines -presentation May 2015Immo Salo
 
Smart machines -presentation, April 2015
Smart machines  -presentation, April 2015Smart machines  -presentation, April 2015
Smart machines -presentation, April 2015Immo Salo
 
Try out Hadoop
Try out HadoopTry out Hadoop
Try out HadoopImmo Salo
 
Smart machines -presentation, Feb 2015
Smart machines -presentation, Feb 2015Smart machines -presentation, Feb 2015
Smart machines -presentation, Feb 2015Immo Salo
 
Smart machines -presentation, January 2015
Smart machines -presentation, January 2015Smart machines -presentation, January 2015
Smart machines -presentation, January 2015Immo Salo
 
Smart Machines -presentation, Dec 2014
Smart Machines -presentation, Dec 2014Smart Machines -presentation, Dec 2014
Smart Machines -presentation, Dec 2014Immo Salo
 
Smart machines presentation, Oct 2014
Smart machines presentation, Oct 2014Smart machines presentation, Oct 2014
Smart machines presentation, Oct 2014Immo Salo
 
Smart machines, Strategic Technology Trend of 2015
Smart machines, Strategic Technology Trend of 2015Smart machines, Strategic Technology Trend of 2015
Smart machines, Strategic Technology Trend of 2015Immo Salo
 
Smart Machines Oct 2014
Smart Machines Oct 2014Smart Machines Oct 2014
Smart Machines Oct 2014Immo Salo
 
Smart machines - The most disruptive change in the history of IT?
Smart machines - The most disruptive change in the history of IT?Smart machines - The most disruptive change in the history of IT?
Smart machines - The most disruptive change in the history of IT?Immo Salo
 
Smart machines - The Next Hype
Smart machines - The Next HypeSmart machines - The Next Hype
Smart machines - The Next HypeImmo Salo
 
Smart machines - The Hype of 2015
Smart machines - The Hype of 2015Smart machines - The Hype of 2015
Smart machines - The Hype of 2015Immo Salo
 
Smart machines - THe Future Is Here
Smart machines - THe Future Is HereSmart machines - THe Future Is Here
Smart machines - THe Future Is HereImmo Salo
 
Cloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo Salo
Cloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo SaloCloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo Salo
Cloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo SaloImmo Salo
 
Pilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo Salo
Pilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo SaloPilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo Salo
Pilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo SaloImmo Salo
 

More from Immo Salo (20)

Webinar: Quantum Revolution Is Here (2022)
Webinar: Quantum Revolution Is Here (2022)Webinar: Quantum Revolution Is Here (2022)
Webinar: Quantum Revolution Is Here (2022)
 
Webinaari: Kvanttivallankumous 03.02.2021
Webinaari: Kvanttivallankumous 03.02.2021Webinaari: Kvanttivallankumous 03.02.2021
Webinaari: Kvanttivallankumous 03.02.2021
 
Smart machines -esitys Tampereella 02/2016
Smart machines -esitys Tampereella 02/2016Smart machines -esitys Tampereella 02/2016
Smart machines -esitys Tampereella 02/2016
 
Smart Machines -presentation May 2015
Smart Machines -presentation May 2015Smart Machines -presentation May 2015
Smart Machines -presentation May 2015
 
Hadoop
HadoopHadoop
Hadoop
 
Smart machines -presentation, April 2015
Smart machines  -presentation, April 2015Smart machines  -presentation, April 2015
Smart machines -presentation, April 2015
 
Try out Hadoop
Try out HadoopTry out Hadoop
Try out Hadoop
 
Smart machines -presentation, Feb 2015
Smart machines -presentation, Feb 2015Smart machines -presentation, Feb 2015
Smart machines -presentation, Feb 2015
 
Smart machines -presentation, January 2015
Smart machines -presentation, January 2015Smart machines -presentation, January 2015
Smart machines -presentation, January 2015
 
Smart Machines -presentation, Dec 2014
Smart Machines -presentation, Dec 2014Smart Machines -presentation, Dec 2014
Smart Machines -presentation, Dec 2014
 
Haiku Deck
Haiku DeckHaiku Deck
Haiku Deck
 
Smart machines presentation, Oct 2014
Smart machines presentation, Oct 2014Smart machines presentation, Oct 2014
Smart machines presentation, Oct 2014
 
Smart machines, Strategic Technology Trend of 2015
Smart machines, Strategic Technology Trend of 2015Smart machines, Strategic Technology Trend of 2015
Smart machines, Strategic Technology Trend of 2015
 
Smart Machines Oct 2014
Smart Machines Oct 2014Smart Machines Oct 2014
Smart Machines Oct 2014
 
Smart machines - The most disruptive change in the history of IT?
Smart machines - The most disruptive change in the history of IT?Smart machines - The most disruptive change in the history of IT?
Smart machines - The most disruptive change in the history of IT?
 
Smart machines - The Next Hype
Smart machines - The Next HypeSmart machines - The Next Hype
Smart machines - The Next Hype
 
Smart machines - The Hype of 2015
Smart machines - The Hype of 2015Smart machines - The Hype of 2015
Smart machines - The Hype of 2015
 
Smart machines - THe Future Is Here
Smart machines - THe Future Is HereSmart machines - THe Future Is Here
Smart machines - THe Future Is Here
 
Cloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo Salo
Cloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo SaloCloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo Salo
Cloud computing - palvelut verkossa, Espoo 27.11.2013, Eufris Oy, Immo Salo
 
Pilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo Salo
Pilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo SaloPilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo Salo
Pilvipalvelut tietoisku, Helsinki 21.11.2013, Eufris Oy, Immo Salo
 

Recently uploaded

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 

Recently uploaded (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 

Big Data

  • 1. Big Data Eufris 2012
  • 2. Why should I care? McKinsey: •$250 billions annual savings in EU alone by enhancing public sector •$600 billions annual consumer surplus from using personal location data globally •Annual growth of data is remarcable •Data is the most valuable thing most companies have •Data is massively underutilized Eufris 2012
  • 3. Forecast There will be a shortage of talent necessary for organizations to take advantage of big data. By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions. Eufris 2012
  • 4. What is Big Data? "Big data technologies describe a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis" IDC "Big Data is a technlogy that helps extract value from the digital universe.” IDC "Techniques and technologies that make handling data at extreme scale economical." Forrester Eufris 2012
  • 5. ABC of Big Data Analy&cs •making  sense  of  your  data,  in  real-­‐5me,  in  easy  way Bandwidth •inges5ng,  prosessing  and  delivering  large  amounts  of  data Content •storing,  managing  and  retaining  large  amounts  of  data www.netapp.com Eufris 2012
  • 6. 3 V’s of Big Data Variety • Big  Data  extends  beyond  structured  data,  including  unstructured  data  of  all  varie5es:   text,  audio,  video,  click  streams,  log  files  and  more Velocity • o@en  5me  sensi5ve,  Big  Data  must  be  used  as  it  is  streaming  in  to  the  enterprise  in  order   to  maximize  its  value  to  the  business Volume • Big  Data  comes  in  one  size:  large.  Enterprises  are  awash  with  data,  easily  amassing   terabytes  and  even  petabytes  of  informa5on Eufris 2012
  • 7. Few core concepts Eufris 2012
  • 8. Hadoop •The  Apache  Hadoop  so.ware  library  is  a  framework  that   allows  for  the  distributed  processing  of  large  data  sets  across   clusters  of  computers  using  a  simple  programming  model. •Three  subprojects •Hadoop  Common •Hadoop  Distributed  Filesystem  (HDFS) •Hadoop  MapReduce Eufris 2012
  • 9. MapReduce •Introduced  by  Google  in  2004 2 2 Map 2 Reduce 3 4 1 5 2 3 Eufris 2012
  • 10. MapReduce on App Engine • Mapreduce  is  an  experimental,  innovaNve,  and  rapidly  changing  new   feature  for  App  Engine Eufris 2012
  • 11. NoSQL •DefiniNon  1 “Next Generation Databases mostly addressing some of the points: being non-relational, distributed, open-source and horizontally scalable. The original intention has been modern web-scale databases. The movement began early 2009 and is growing rapidly. Often more characteristics apply as: schema-free, easy replication support, simple API, eventually consistent, a huge data amount, and more.” nosql-database.org Eufris 2012
  • 12. NoSQL •DefiniNon  2 “In computing, NoSQL (sometimes expanded to "not only SQL") is a broad class of database management systems that differ from the classic model of the relational database management system (RDBMS) in some significant ways. These data stores may not require fixed table schemas, usually avoid join operations, and typically scale horizontally.” Wikipedia Eufris 2012
  • 13. From ACID to BASE ACID: Atomicity,  Consistency,  Isola&on,  Durability BASE: Basically  available,  So?  state,  Eventually  consistent Eufris 2012
  • 14. Big Data and cloud Eufris 2012
  • 15. Big Data on AWS Eufris 2012
  • 16. MapReduce on AWS • Not  yet  Hadoop  1.0.0 Eufris 2012
  • 17. MapReduce on AWS EC2 S3 + DynamoDB Eufris 2012
  • 18. Google BigQuery Features • Speed - Analyze billions of rows(!) in seconds • Scale - Terabytes of data, trillions of records • Simplicity - SQL-like query language, hosted on Google infrastructure • Sharing - Powerful group- and user-based permissions using Google accounts • Security - Secure SSL access • Multiple access methods - Can be used by REST API, a command-line tool, a browser-based graphical interface, and Google Apps Script Eufris 2012
  • 19. BigQuery example Eufris 2012
  • 20. Big Data outside of cloud Eufris 2012
  • 21. Oracle Big Data Appliance About 500 000 $ 18 Oracle Sun Servers • 864 GB main memory; • 216 CPU cores; • 648 TB of raw disk storage; • 40 Gb/s InfiniBand connectivity between nodes and engineered systems; • 10 Gb/s Ethernet connectivity. Eufris 2012
  • 22. Autonomy IDOL 10 "For far too long, organizations have confined structured data to relational databases and unstructured data to simplistic keyword matching technologies..." “IDOL 10 brings these worlds together, allowing organizations to automatically process, understand, and act on 100 percent of their data, in real-time. The results will be dramatic, as businesses can develop entirely new applications that explore the richness and color of Human Information that live in unstructured, semi-structured, and structured forms.” Price? Eufris 2012
  • 23. Thank you! Eufris 2012