SlideShare a Scribd company logo
„Big Data Science in the Cloud“
Markus Schmidberger
Big Data Analyst & Cloud Engineer
@cloudHPC
markus@mongosoup.de
Big Data gets Political
●

New coalition agreement in Germany:
–

“Wir wollen die Informations- und KommunikationsStrategie (IKT-Strategie) für die digitale Wirtschaft
weiterentwickeln. ...

–

... Wir werden die Forschungs- und Innovationsförderung
für „Big Data“ auf die Entwicklung von Methoden und
Werkzeugen zur Datenanalyse ausrichten ... “
“We change the rules!”
Curios, playful, agile, experienced, goal-oriented, love to
detail, thinking differently ...

Continuos Software delivery

Big data &
polyglot persistence
3. December 2013 - 3

Lean & agile
Customer and Partners

3. December 2013 - 4
Big Data

3. December 2013 - 5
Big Data Science

●

Data science seeks to use all available and
relevant data to effectively tell a story that can
be easily understood by non-practitioners.
3. December 2013 - 6
Cloud Computing
●

Wikipedia: “... describes a variety of
computing concepts that involve a large
number of computers connected through
a real-time communication network such
as the Internet. ...”

3. December 2013 - 7
1) Put Apps & Data to best Place

3. December 2013 - 8
AWS Zones at the right Place

3. December 2013 - 9
Example: R and RStudio Server
●

R: open-source
statistical Software
–

●

www.r-project.org

RStudio IDE
–
–

www.rstudio.org
IDE + web / server
version
3. December 2013 - 10
2) Choose Cloud Resources carefully
●

●

●

Instance type
EBS optimized
EBS provisioned
IOPS

●

Load Balancer

●

Availability Zones
http://media.amazonwebservices.com/AWS_NoSQL_MongoDB.pdf
3. December 2013 - 11
MongoSoup is the first German-based MongoDB cloud
hosting solution!
Supported by a team of experts from MongoDB Inc.
first German partner comSysto. You can have a running
MongoDB database in virtually no time.

●

MongoDB hosting on Amazon EC2 (eu-west-1) and in Munich

●

24x7 monitoring and support

●

Dedicated instances and shared hosting available

●

Replica Sets and Sharding available

●

SSL-enabled MongoDB

3. December 2013 - 12
Performance <-> Costs
●

scale up & out

●

scale down ?

●

monitor your resources
from the beginning

3. December 2013 - 13
3) Use full Cloud Technology Stack

3. December 2013 - 14
Example: AWS EMR with mapR
●

Speed

●

Compression
–

●

reduces disk and
network I/O and
increases
performance

Snapshots
–

data protection
3. December 2013 - 15
4) Data Protection
●

●

talk to the experts
(e.g. Bitkom)
use available
mechanisms &
services
–
–

●

EMR in VPC
Mongosoup.de

be aware of the topic

3. December 2013 - 16
More Big Data Events
●

“Map-Reducing
Everywhere”
–

●

https://hadoopsummit.uservoice.co
m

Forum Big Data und
Verantwortung u.a. mit
Frank Schirrmacher
–

3. December 2013 - 17

Di, 03.12. 19:00; Große Aula LMU
„Big Data Science in the Cloud“
- Yes We Can @cloudHPC
markus@mongosoup.de
http://comsysto.com/events

3. December 2013 - 18

More Related Content

What's hot

Sentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics PlatformSentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics Platform
KNIMESlides
 
NetApp Flash Storage Facts
NetApp Flash Storage FactsNetApp Flash Storage Facts
NetApp Flash Storage Facts
NetApp Insight
 
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVABDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BigData_Europe
 
Sentiment Analysis with Deep Learning, Machine Learning or Lexicon based
Sentiment Analysis with Deep Learning, Machine Learning or Lexicon basedSentiment Analysis with Deep Learning, Machine Learning or Lexicon based
Sentiment Analysis with Deep Learning, Machine Learning or Lexicon based
KNIMESlides
 
Peter Marsh: The new forces facing the project environment
Peter Marsh: The new forces facing the project environmentPeter Marsh: The new forces facing the project environment
Peter Marsh: The new forces facing the project environment
Association for Project Management
 
KNIME Software Overview
KNIME Software OverviewKNIME Software Overview
KNIME Software Overview
KNIMESlides
 
What can the cloud do for you?
What can the cloud do for you?What can the cloud do for you?
What can the cloud do for you?
Mind the Byte
 
Fraud detection using Deep learning and TensorFow on DSX
Fraud detection using Deep learning and TensorFow on DSXFraud detection using Deep learning and TensorFow on DSX
Fraud detection using Deep learning and TensorFow on DSX
Tuhin Mahmud
 
Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...
Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...
Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...
SigortaTatbikatcilariDernegi
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
Hong-Linh Truong
 
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
BigData_Europe
 
Big Data Maturity and its Evolution
Big Data Maturity and its EvolutionBig Data Maturity and its Evolution
Big Data Maturity and its Evolution
Sriram Murali K J
 
This week in Neo4j - 3rd February 2018
This week in Neo4j - 3rd February 2018This week in Neo4j - 3rd February 2018
This week in Neo4j - 3rd February 2018
Mark Needham
 
IC-SDV 2019: Deep SEARCH 9
IC-SDV 2019: Deep SEARCH 9 IC-SDV 2019: Deep SEARCH 9
IC-SDV 2019: Deep SEARCH 9
Dr. Haxel Consult
 
AI-SDV 2020: Seea SEARCH 9
AI-SDV 2020: Seea SEARCH 9AI-SDV 2020: Seea SEARCH 9
AI-SDV 2020: Seea SEARCH 9
Dr. Haxel Consult
 
Microsoft in the Cloud
Microsoft in the CloudMicrosoft in the Cloud
Microsoft in the Cloud
hitsurume
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance Professional
Martin Kaltenböck
 
Microservices Applications: Challenges and Best Practices When Deploying SQL-...
Microservices Applications: Challenges and Best Practices When Deploying SQL-...Microservices Applications: Challenges and Best Practices When Deploying SQL-...
Microservices Applications: Challenges and Best Practices When Deploying SQL-...
NuoDB
 
Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!
Matt Stubbs
 
Helix Nebula Initiative
Helix Nebula InitiativeHelix Nebula Initiative
Helix Nebula Initiative
Helix Nebula The Science Cloud
 

What's hot (20)

Sentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics PlatformSentiment Analysis with KNIME Analytics Platform
Sentiment Analysis with KNIME Analytics Platform
 
NetApp Flash Storage Facts
NetApp Flash Storage FactsNetApp Flash Storage Facts
NetApp Flash Storage Facts
 
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVABDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVA
 
Sentiment Analysis with Deep Learning, Machine Learning or Lexicon based
Sentiment Analysis with Deep Learning, Machine Learning or Lexicon basedSentiment Analysis with Deep Learning, Machine Learning or Lexicon based
Sentiment Analysis with Deep Learning, Machine Learning or Lexicon based
 
Peter Marsh: The new forces facing the project environment
Peter Marsh: The new forces facing the project environmentPeter Marsh: The new forces facing the project environment
Peter Marsh: The new forces facing the project environment
 
KNIME Software Overview
KNIME Software OverviewKNIME Software Overview
KNIME Software Overview
 
What can the cloud do for you?
What can the cloud do for you?What can the cloud do for you?
What can the cloud do for you?
 
Fraud detection using Deep learning and TensorFow on DSX
Fraud detection using Deep learning and TensorFow on DSXFraud detection using Deep learning and TensorFow on DSX
Fraud detection using Deep learning and TensorFow on DSX
 
Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...
Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...
Big Data Analytics @ Munich Re - VIII. International Istanbul Insurance Confe...
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
 
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
SC7 Webinar 4 04/05/2017 NCSR Demokritos Presentation "Event Detection"
 
Big Data Maturity and its Evolution
Big Data Maturity and its EvolutionBig Data Maturity and its Evolution
Big Data Maturity and its Evolution
 
This week in Neo4j - 3rd February 2018
This week in Neo4j - 3rd February 2018This week in Neo4j - 3rd February 2018
This week in Neo4j - 3rd February 2018
 
IC-SDV 2019: Deep SEARCH 9
IC-SDV 2019: Deep SEARCH 9 IC-SDV 2019: Deep SEARCH 9
IC-SDV 2019: Deep SEARCH 9
 
AI-SDV 2020: Seea SEARCH 9
AI-SDV 2020: Seea SEARCH 9AI-SDV 2020: Seea SEARCH 9
AI-SDV 2020: Seea SEARCH 9
 
Microsoft in the Cloud
Microsoft in the CloudMicrosoft in the Cloud
Microsoft in the Cloud
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance Professional
 
Microservices Applications: Challenges and Best Practices When Deploying SQL-...
Microservices Applications: Challenges and Best Practices When Deploying SQL-...Microservices Applications: Challenges and Best Practices When Deploying SQL-...
Microservices Applications: Challenges and Best Practices When Deploying SQL-...
 
Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!Big Data LDN 2017: Your flight is boarding now!
Big Data LDN 2017: Your flight is boarding now!
 
Helix Nebula Initiative
Helix Nebula InitiativeHelix Nebula Initiative
Helix Nebula Initiative
 

Viewers also liked

Carbon footprinting feb 2008
Carbon footprinting feb 2008Carbon footprinting feb 2008
Carbon footprinting feb 2008Jane Bevis
 
On-Pack Recycling Label scheme
On-Pack Recycling Label schemeOn-Pack Recycling Label scheme
On-Pack Recycling Label schemeJane Bevis
 
From debate to action SHIFT07 Oct 07
From debate to action SHIFT07 Oct 07From debate to action SHIFT07 Oct 07
From debate to action SHIFT07 Oct 07Jane Bevis
 
(New) final exam for acc 561 all correct answers 100%
(New) final exam for acc 561 all correct answers 100%(New) final exam for acc 561 all correct answers 100%
(New) final exam for acc 561 all correct answers 100%quikly11
 
Design a Course the Adler Online Way
Design a Course the Adler Online WayDesign a Course the Adler Online Way
Design a Course the Adler Online Way
colleenfleming
 
(New) final exam for law 531 all correct answers 100%
(New) final exam for law 531 all correct answers 100%(New) final exam for law 531 all correct answers 100%
(New) final exam for law 531 all correct answers 100%quikly11
 
Carbon Reduction Commitment Scheme as at April 2009
Carbon Reduction Commitment Scheme as at April 2009Carbon Reduction Commitment Scheme as at April 2009
Carbon Reduction Commitment Scheme as at April 2009
Jane Bevis
 
Retail Forum April 2009
Retail Forum April 2009Retail Forum April 2009
Retail Forum April 2009
Jane Bevis
 
(New) final exam for acc 460 all correct answers 100%
(New) final exam for acc 460 all correct answers 100%(New) final exam for acc 460 all correct answers 100%
(New) final exam for acc 460 all correct answers 100%quikly11
 
(New) final exam for mgt 527 all correct answers 100%
(New) final exam for mgt 527 all correct answers 100%(New) final exam for mgt 527 all correct answers 100%
(New) final exam for mgt 527 all correct answers 100%quikly11
 
Recycling MRW feb 2008
Recycling MRW feb 2008Recycling MRW feb 2008
Recycling MRW feb 2008Jane Bevis
 
(New) final exam for bcom 275 all correct answers 100%
(New) final exam for bcom 275 all correct answers 100%(New) final exam for bcom 275 all correct answers 100%
(New) final exam for bcom 275 all correct answers 100%quikly11
 
Uk retail prospects Jan 2012
Uk retail prospects Jan 2012Uk retail prospects Jan 2012
Uk retail prospects Jan 2012Jane Bevis
 
Please don’t leave me cast
Please don’t leave me castPlease don’t leave me cast
Please don’t leave me castMarielly13
 
Coastal flood risk ABI conference nov06
Coastal flood risk ABI conference nov06Coastal flood risk ABI conference nov06
Coastal flood risk ABI conference nov06Jane Bevis
 
(New) final exam for acc 422 all correct answers 100%
(New) final exam for acc 422 all correct answers 100%(New) final exam for acc 422 all correct answers 100%
(New) final exam for acc 422 all correct answers 100%quikly11
 
About EasyLifeApp
About EasyLifeApp About EasyLifeApp
About EasyLifeApp
Easylifeapp
 
(New) final exam for qnt 351 all correct answers 100%
(New) final exam for qnt 351 all correct answers 100%(New) final exam for qnt 351 all correct answers 100%
(New) final exam for qnt 351 all correct answers 100%quikly11
 

Viewers also liked (19)

Carbon footprinting feb 2008
Carbon footprinting feb 2008Carbon footprinting feb 2008
Carbon footprinting feb 2008
 
On-Pack Recycling Label scheme
On-Pack Recycling Label schemeOn-Pack Recycling Label scheme
On-Pack Recycling Label scheme
 
From debate to action SHIFT07 Oct 07
From debate to action SHIFT07 Oct 07From debate to action SHIFT07 Oct 07
From debate to action SHIFT07 Oct 07
 
(New) final exam for acc 561 all correct answers 100%
(New) final exam for acc 561 all correct answers 100%(New) final exam for acc 561 all correct answers 100%
(New) final exam for acc 561 all correct answers 100%
 
Design a Course the Adler Online Way
Design a Course the Adler Online WayDesign a Course the Adler Online Way
Design a Course the Adler Online Way
 
(New) final exam for law 531 all correct answers 100%
(New) final exam for law 531 all correct answers 100%(New) final exam for law 531 all correct answers 100%
(New) final exam for law 531 all correct answers 100%
 
Carbon Reduction Commitment Scheme as at April 2009
Carbon Reduction Commitment Scheme as at April 2009Carbon Reduction Commitment Scheme as at April 2009
Carbon Reduction Commitment Scheme as at April 2009
 
Retail Forum April 2009
Retail Forum April 2009Retail Forum April 2009
Retail Forum April 2009
 
(New) final exam for acc 460 all correct answers 100%
(New) final exam for acc 460 all correct answers 100%(New) final exam for acc 460 all correct answers 100%
(New) final exam for acc 460 all correct answers 100%
 
(New) final exam for mgt 527 all correct answers 100%
(New) final exam for mgt 527 all correct answers 100%(New) final exam for mgt 527 all correct answers 100%
(New) final exam for mgt 527 all correct answers 100%
 
Recycling MRW feb 2008
Recycling MRW feb 2008Recycling MRW feb 2008
Recycling MRW feb 2008
 
(New) final exam for bcom 275 all correct answers 100%
(New) final exam for bcom 275 all correct answers 100%(New) final exam for bcom 275 all correct answers 100%
(New) final exam for bcom 275 all correct answers 100%
 
Uk retail prospects Jan 2012
Uk retail prospects Jan 2012Uk retail prospects Jan 2012
Uk retail prospects Jan 2012
 
State cah 2013
State cah   2013State cah   2013
State cah 2013
 
Please don’t leave me cast
Please don’t leave me castPlease don’t leave me cast
Please don’t leave me cast
 
Coastal flood risk ABI conference nov06
Coastal flood risk ABI conference nov06Coastal flood risk ABI conference nov06
Coastal flood risk ABI conference nov06
 
(New) final exam for acc 422 all correct answers 100%
(New) final exam for acc 422 all correct answers 100%(New) final exam for acc 422 all correct answers 100%
(New) final exam for acc 422 all correct answers 100%
 
About EasyLifeApp
About EasyLifeApp About EasyLifeApp
About EasyLifeApp
 
(New) final exam for qnt 351 all correct answers 100%
(New) final exam for qnt 351 all correct answers 100%(New) final exam for qnt 351 all correct answers 100%
(New) final exam for qnt 351 all correct answers 100%
 

Similar to Big Data Science in the Cloud from Big Data World Conference 2013

Cloud-based Energy Efficient Software
Cloud-based Energy Efficient SoftwareCloud-based Energy Efficient Software
Cloud-based Energy Efficient Software
Fotis Stamatelopoulos
 
"Engineering implications of the cloud when applied to the Media" - Mesclado'...
"Engineering implications of the cloud when applied to the Media" - Mesclado'..."Engineering implications of the cloud when applied to the Media" - Mesclado'...
"Engineering implications of the cloud when applied to the Media" - Mesclado'...
Mesclado
 
SecureCloud Project
SecureCloud ProjectSecureCloud Project
SecureCloud Project
EUBrasilCloudFORUM .
 
2017 Cloud Computing Primer
2017 Cloud Computing Primer2017 Cloud Computing Primer
2017 Cloud Computing Primer
Rajesh Math
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
Minhazul Arefin
 
Cloud Computing - An Introduction
Cloud Computing - An IntroductionCloud Computing - An Introduction
Cloud Computing - An Introduction
Ravindra Dastikop
 
Business Track: Building a Private Cloud to Empower the Business at Goldman ...
Business Track: Building a Private Cloud  to Empower the Business at Goldman ...Business Track: Building a Private Cloud  to Empower the Business at Goldman ...
Business Track: Building a Private Cloud to Empower the Business at Goldman ...MongoDB
 
BigData Hadoop
BigData Hadoop BigData Hadoop
BigData Hadoop
Kumari Surabhi
 
Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...
Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...
Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...
tkharrat
 
2018 19 Cloudcomputing
2018 19 Cloudcomputing2018 19 Cloudcomputing
2018 19 Cloudcomputing
Rajesh Math
 
data_engineering_basics.pdf
data_engineering_basics.pdfdata_engineering_basics.pdf
data_engineering_basics.pdf
Ketan Patil
 
HPC as a Service
HPC as a ServiceHPC as a Service
HPC as a Service
EUBrasilCloudFORUM .
 
C cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_sing
C cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_singC cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_sing
C cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_sing
John Sing
 
Final_CloudEventFrankfurt2017 (1).pdf
Final_CloudEventFrankfurt2017 (1).pdfFinal_CloudEventFrankfurt2017 (1).pdf
Final_CloudEventFrankfurt2017 (1).pdfMongoDB
 
Simply Business' Data Platform
Simply Business' Data PlatformSimply Business' Data Platform
Simply Business' Data Platform
Dani Solà Lagares
 
From Zero to Cloud and Back
From Zero to Cloud and BackFrom Zero to Cloud and Back
From Zero to Cloud and Back
BATbern
 
Object Storage: How Can it Work for You
Object Storage: How Can it Work for YouObject Storage: How Can it Work for You
Object Storage: How Can it Work for You
Cloudian
 
[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes
[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes
[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes
Gerd Prüßmann
 
[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud
[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud
[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud
Jeff Hung
 

Similar to Big Data Science in the Cloud from Big Data World Conference 2013 (20)

Cloud-based Energy Efficient Software
Cloud-based Energy Efficient SoftwareCloud-based Energy Efficient Software
Cloud-based Energy Efficient Software
 
"Engineering implications of the cloud when applied to the Media" - Mesclado'...
"Engineering implications of the cloud when applied to the Media" - Mesclado'..."Engineering implications of the cloud when applied to the Media" - Mesclado'...
"Engineering implications of the cloud when applied to the Media" - Mesclado'...
 
SecureCloud Project
SecureCloud ProjectSecureCloud Project
SecureCloud Project
 
2017 Cloud Computing Primer
2017 Cloud Computing Primer2017 Cloud Computing Primer
2017 Cloud Computing Primer
 
The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
 
Cloud Computing - An Introduction
Cloud Computing - An IntroductionCloud Computing - An Introduction
Cloud Computing - An Introduction
 
Mongo bbmw
Mongo bbmwMongo bbmw
Mongo bbmw
 
Business Track: Building a Private Cloud to Empower the Business at Goldman ...
Business Track: Building a Private Cloud  to Empower the Business at Goldman ...Business Track: Building a Private Cloud  to Empower the Business at Goldman ...
Business Track: Building a Private Cloud to Empower the Business at Goldman ...
 
BigData Hadoop
BigData Hadoop BigData Hadoop
BigData Hadoop
 
Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...
Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...
Storage and The Cloud 1. What is driving IT / Businesses to Cloud 2. Traditio...
 
2018 19 Cloudcomputing
2018 19 Cloudcomputing2018 19 Cloudcomputing
2018 19 Cloudcomputing
 
data_engineering_basics.pdf
data_engineering_basics.pdfdata_engineering_basics.pdf
data_engineering_basics.pdf
 
HPC as a Service
HPC as a ServiceHPC as a Service
HPC as a Service
 
C cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_sing
C cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_singC cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_sing
C cloud organizational_impacts_big_data_on-prem_vs_off-premise_john_sing
 
Final_CloudEventFrankfurt2017 (1).pdf
Final_CloudEventFrankfurt2017 (1).pdfFinal_CloudEventFrankfurt2017 (1).pdf
Final_CloudEventFrankfurt2017 (1).pdf
 
Simply Business' Data Platform
Simply Business' Data PlatformSimply Business' Data Platform
Simply Business' Data Platform
 
From Zero to Cloud and Back
From Zero to Cloud and BackFrom Zero to Cloud and Back
From Zero to Cloud and Back
 
Object Storage: How Can it Work for You
Object Storage: How Can it Work for YouObject Storage: How Can it Work for You
Object Storage: How Can it Work for You
 
[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes
[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes
[DOST] OpenStack & the Enterprise Hybrid Cloud - Tech, People, Processes
 
[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud
[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud
[DataCon.TW 2017] Data Lake: centralize in on-prem vs. decentralize on cloud
 

Recently uploaded

De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
ViralQR
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 

Recently uploaded (20)

De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.Welocme to ViralQR, your best QR code generator.
Welocme to ViralQR, your best QR code generator.
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 

Big Data Science in the Cloud from Big Data World Conference 2013

  • 1. „Big Data Science in the Cloud“ Markus Schmidberger Big Data Analyst & Cloud Engineer @cloudHPC markus@mongosoup.de
  • 2. Big Data gets Political ● New coalition agreement in Germany: – “Wir wollen die Informations- und KommunikationsStrategie (IKT-Strategie) für die digitale Wirtschaft weiterentwickeln. ... – ... Wir werden die Forschungs- und Innovationsförderung für „Big Data“ auf die Entwicklung von Methoden und Werkzeugen zur Datenanalyse ausrichten ... “
  • 3. “We change the rules!” Curios, playful, agile, experienced, goal-oriented, love to detail, thinking differently ... Continuos Software delivery Big data & polyglot persistence 3. December 2013 - 3 Lean & agile
  • 4. Customer and Partners 3. December 2013 - 4
  • 6. Big Data Science ● Data science seeks to use all available and relevant data to effectively tell a story that can be easily understood by non-practitioners. 3. December 2013 - 6
  • 7. Cloud Computing ● Wikipedia: “... describes a variety of computing concepts that involve a large number of computers connected through a real-time communication network such as the Internet. ...” 3. December 2013 - 7
  • 8. 1) Put Apps & Data to best Place 3. December 2013 - 8
  • 9. AWS Zones at the right Place 3. December 2013 - 9
  • 10. Example: R and RStudio Server ● R: open-source statistical Software – ● www.r-project.org RStudio IDE – – www.rstudio.org IDE + web / server version 3. December 2013 - 10
  • 11. 2) Choose Cloud Resources carefully ● ● ● Instance type EBS optimized EBS provisioned IOPS ● Load Balancer ● Availability Zones http://media.amazonwebservices.com/AWS_NoSQL_MongoDB.pdf 3. December 2013 - 11
  • 12. MongoSoup is the first German-based MongoDB cloud hosting solution! Supported by a team of experts from MongoDB Inc. first German partner comSysto. You can have a running MongoDB database in virtually no time. ● MongoDB hosting on Amazon EC2 (eu-west-1) and in Munich ● 24x7 monitoring and support ● Dedicated instances and shared hosting available ● Replica Sets and Sharding available ● SSL-enabled MongoDB 3. December 2013 - 12
  • 13. Performance <-> Costs ● scale up & out ● scale down ? ● monitor your resources from the beginning 3. December 2013 - 13
  • 14. 3) Use full Cloud Technology Stack 3. December 2013 - 14
  • 15. Example: AWS EMR with mapR ● Speed ● Compression – ● reduces disk and network I/O and increases performance Snapshots – data protection 3. December 2013 - 15
  • 16. 4) Data Protection ● ● talk to the experts (e.g. Bitkom) use available mechanisms & services – – ● EMR in VPC Mongosoup.de be aware of the topic 3. December 2013 - 16
  • 17. More Big Data Events ● “Map-Reducing Everywhere” – ● https://hadoopsummit.uservoice.co m Forum Big Data und Verantwortung u.a. mit Frank Schirrmacher – 3. December 2013 - 17 Di, 03.12. 19:00; Große Aula LMU
  • 18. „Big Data Science in the Cloud“ - Yes We Can @cloudHPC markus@mongosoup.de http://comsysto.com/events 3. December 2013 - 18