SlideShare a Scribd company logo
1 of 28
Download to read offline
An Experiment on Bare-Metal
BigData Provisioning
Ata Turk, Ravi S. Gudimetla, Emine Ugur Kaynar, Jason Hennessey,
Sahil Tikale, Peter Desnoyers, Orran Krieger
1
BigData Analytics on the Cloud
• BigData deployments are
moving to the cloud
• On-demand usage (Cost), Elasticity,
Agility, Simplicity, …
• Virtualized IaaS solutions: Amazon
EMR, Azure HDInsight, …
• Virtualization drawbacks
• Overhead, unpredictability, security
concerns, device functionality, …
• Bare-metal cloud solutions: IBM,
Rackspace, and Internap, …
2
Bare-Metal BigData
Cloud Solutions
• Bare-Metal cloud provisioning
• Automated provisioning: Ironic,
MaaS, …
• Image copy to local disk => long
waits => loss of agility & elasticity
• OS streaming*, Lazy copy &
de-virtualization**
• What about network booting?
• incur an ongoing unacceptable
overhead during runtime
** Y. Omote, T. Shinagawa, and K. Kato, “Improving Agility and Elasticity in Bare-metal Clouds,” in ASPLOS’15, pp. 145–159, 2015.
3
* David Clerc, “OS Streaming Deployment”, in IPCCC’10, pp. 169–179, 2010.
• Large parts of the HPC community has been
doing it for the last 20 years.
• Virtualized IaaS is doing it all the time.
• Why not bare-metal cloud?
4
Network-Mounted BigData
System
• Clients access kernel and init
ramdisk via PXE
• Mount OS & BigData apps
from a remote iSCSI volume
• Use local disk for ephemeral
storage (HDFS, /swap, /tmp,
…)
5
Bare-Metal BigData
Provisioning Prototype
6
Bare-Metal BigData
Provisioning Prototype
Hardware Isolation Layer:
A service to allocate bare-
metal nodes out of a shared
pool and isolate network
7
Bare-Metal BigData
Provisioning Prototype
CEPH: Central image
store hosting user images
with BigData applications
8
Bare-Metal BigData
Provisioning Prototype
Provisioning VM:
Gateway between
isolated servers
and image store
9
Bare-Metal BigData
Provisioning Prototype
10
Bare-Metal BigData
Provisioning Prototype
11
Bare-Metal BigData
Provisioning Prototype
12
Bare-Metal BigData
Provisioning Prototype
13
Provisioning Time
Local Disk iSCSI
0
200
400
600
800
1000
1200
1400
ElapsedTime(Secs)
Bigdata Configuration
Bigdata Installation
OS Reboot
Firmware Initialization
Post Setup Software Installation
Package Installation
OS Boot(inc. Kernel+Initrd Download)
DHCP request
Firmware Initialization
Haas Power Cycle
Ceph Cloning
Haas Initilization
14
Firmware initialization
OS Boot (inc. kernel+initrd download)
OS Installation
OS Reboot
BigData App Installation
Post Config
DHCP Request
Local disk
installation
iSCSI boot
Provisioning Time
Local Disk iSCSI
0
200
400
600
800
1000
1200
1400
ElapsedTime(Secs)
Bigdata Configuration
Bigdata Installation
OS Reboot
Firmware Initialization
Post Setup Software Installation
Package Installation
OS Boot(inc. Kernel+Initrd Download)
DHCP request
Firmware Initialization
Haas Power Cycle
Ceph Cloning
Haas Initilization
15
Firmware initialization
OS Boot (inc. kernel+initrd download)
OS Installation
OS Reboot
BigData App Installation
Post Config
DHCP Request
Local disk
installation
iSCSI boot
Provisioning Time Scaling
2 Node 4 Node 8 Node
0
50
100
150
200
250
300
ElapsedTime(Secs)
Bigdata Post Script
Booting
Ceph Cloning
Haas Initilization
16
Read Traffic over Boot Drive
Initial
Provisioning
Data
Generation1
Sort1
Data
Generation2
Sort2
Data
Generation3
Sort3
Data
Generation4
Sort4
Data
Generation5
Sort5
0
100
200
300
CumulativeiSCSIreadspernode(MB)
iSCSI Reads: Runs with 256GB Data
iSCSI Reads: Runs with 128GB Data
17
Read Traffic over Boot Drive
Initial
Provisioning
Data
Generation1
Sort1
Data
Generation2
Sort2
Data
Generation3
Sort3
Data
Generation4
Sort4
Data
Generation5
Sort5
0
100
200
300
CumulativeiSCSIreadspernode(MB)
iSCSI Reads: Runs with 256GB Data
iSCSI Reads: Runs with 128GB Data
~170MB / 8GB Boot Image => 2%
18
Read Traffic over Boot Drive
Initial
Provisioning
Data
Generation1
Sort1
Data
Generation2
Sort2
Data
Generation3
Sort3
Data
Generation4
Sort4
Data
Generation5
Sort5
0
100
200
300
CumulativeiSCSIreadspernode(MB)
iSCSI Reads: Runs with 256GB Data
iSCSI Reads: Runs with 128GB Data
3KB/s
read
after
initial
boot
19
Write Traffic over Boot Drive
Initial
Provisioning
Data
Generation1
Sort1
Data
Generation2
Sort2
Data
Generation3
Sort3
Data
Generation4
Sort4
Data
Generation5
Sort5
0
100
200
300
400
500
600
700
CumulativeiSCSIwritespernode(MB)
iSCSI Writes - Runs with 256GB Data
iSCSI Writes - Runs with 128GB Data
20
Write Traffic over Boot Drive
Initial
Provisioning
Data
Generation1
Sort1
Data
Generation2
Sort2
Data
Generation3
Sort3
Data
Generation4
Sort4
Data
Generation5
Sort5
0
100
200
300
400
500
600
700
CumulativeiSCSIwritespernode(MB)
iSCSI Writes - Runs with 256GB Data
iSCSI Writes - Runs with 128GB Data 14KB/s
write
21
Runtime Performance of
Network-Mounted Boot Drive171
319
616
1187
2314
171
318
617
1176
2281
64
115
300
542
1073
69
120
238
555
1361
60
75
76
118
199
52
63
86
125
201
Data Size (GB)
0
400
800
1200
1600
2000
2400
2800
ElapsedTime(secs)
WordCount - Local Disk
WordCount - iSCSI Mounted
Sort - Local Disk
Sort - iSCSI Mounted
Grep - Local Disk
Grep - iSCSI Mounted
8GB 16GB 32GB 64GB 128GB
22
Runtime Performance of
Network-Mounted Boot Drive171
319
616
1187
2314
171
318
617
1176
2281
64
115
300
542
1073
69
120
238
555
1361
60
75
76
118
199
52
63
86
125
201
Data Size (GB)
0
400
800
1200
1600
2000
2400
2800
ElapsedTime(secs)
WordCount - Local Disk
WordCount - iSCSI Mounted
Sort - Local Disk
Sort - iSCSI Mounted
Grep - Local Disk
Grep - iSCSI Mounted
8GB 16GB 32GB 64GB 128GB
23
Runtime Performance of
Network-Mounted Boot Drive171
319
616
1187
2314
171
318
617
1176
2281
64
115
300
542
1073
69
120
238
555
1361
60
75
76
118
199
52
63
86
125
201
Data Size (GB)
0
400
800
1200
1600
2000
2400
2800
ElapsedTime(secs)
WordCount - Local Disk
WordCount - iSCSI Mounted
Sort - Local Disk
Sort - iSCSI Mounted
Grep - Local Disk
Grep - iSCSI Mounted
8GB 16GB 32GB 64GB 128GB
24
Take-aways
• Network booting the OS for bare-metal BigData
• uses only a fraction of boot disk during start-up
• improves provisioning time with no runtime degradation
• provisioning time < 5 mins, boot disk reads: ~3KB/s, writes: ~14KB/s
• Enormous effort on bare-metal provisioning on local disks
may be unnecessary, especially for BigData deployments
• We are building a new Bare Metal Imaging Service using
remote network boot mechanisms
• enable capabilities available on virtualized platforms (e.g.
snapshotting, cloning, …) to bare metal cloud solutions
25
Questions
26
Provisioning Time
Local Disk iSCSI
0
200
400
600
800
1000
1200
1400
ElapsedTime(Secs)
Bigdata Configuration
Bigdata Installation
OS Reboot
Firmware Initialization
Post Setup Software Installation
Package Installation
OS Boot(inc. Kernel+Initrd Download)
DHCP request
Firmware Initialization
Haas Power Cycle
Ceph Cloning
Haas Initilization
27
Local disk
installation
iSCSI boot Emulab* Ironic*
* A. Chandrasekar and G. Gibson, “A comparative study of baremetal provisioning frameworks,” Parallel Data Laboratory,
Carnegie Mellon University, Tech. Rep. CMU-PDL-14-109, 2014.
Provisioning Time
Local Disk iSCSI
0
200
400
600
800
1000
1200
1400
ElapsedTime(Secs)
Bigdata Configuration
Bigdata Installation
OS Reboot
Firmware Initialization
Post Setup Software Installation
Package Installation
OS Boot(inc. Kernel+Initrd Download)
DHCP request
Firmware Initialization
Haas Power Cycle
Ceph Cloning
Haas Initilization
28
Local disk
installation
iSCSI boot Emulab* Ironic*
* A. Chandrasekar and G. Gibson, “A comparative study of baremetal provisioning frameworks,” Parallel Data Laboratory,
Carnegie Mellon University, Tech. Rep. CMU-PDL-14-109, 2014.
copy (rather than install)
an image to local disk

More Related Content

What's hot

Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Community
 
Build an affordable Cloud Stroage
Build an affordable Cloud StroageBuild an affordable Cloud Stroage
Build an affordable Cloud StroageAlex Lau
 
PG-Strom v2.0 Technical Brief (17-Apr-2018)
PG-Strom v2.0 Technical Brief (17-Apr-2018)PG-Strom v2.0 Technical Brief (17-Apr-2018)
PG-Strom v2.0 Technical Brief (17-Apr-2018)Kohei KaiGai
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red_Hat_Storage
 
FDW-based Sharding Update and Future
FDW-based Sharding Update and FutureFDW-based Sharding Update and Future
FDW-based Sharding Update and FutureMasahiko Sawada
 
Introduction to TrioNAS LX U300
Introduction to TrioNAS LX U300Introduction to TrioNAS LX U300
Introduction to TrioNAS LX U300qsantechnology
 
Data relay introduction to big data clusters
Data relay introduction to big data clustersData relay introduction to big data clusters
Data relay introduction to big data clustersChris Adkin
 
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEO
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEOClickHouse on Kubernetes! By Robert Hodges, Altinity CEO
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEOAltinity Ltd
 
Data weekender deploying prod grade sql 2019 big data clusters
Data weekender deploying prod grade sql 2019 big data clustersData weekender deploying prod grade sql 2019 big data clusters
Data weekender deploying prod grade sql 2019 big data clustersChris Adkin
 
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and ContributionsCeph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and ContributionsColleen Corrice
 
openSUSE storage workshop 2016
openSUSE storage workshop 2016openSUSE storage workshop 2016
openSUSE storage workshop 2016Alex Lau
 
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...Masahiko Sawada
 
Aerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analyticsAerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analyticsAerospike
 
Bloat and Fragmentation in PostgreSQL
Bloat and Fragmentation in PostgreSQLBloat and Fragmentation in PostgreSQL
Bloat and Fragmentation in PostgreSQLMasahiko Sawada
 
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and ChefDevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and ChefGaurav "GP" Pal
 

What's hot (18)

Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
 
Build an affordable Cloud Stroage
Build an affordable Cloud StroageBuild an affordable Cloud Stroage
Build an affordable Cloud Stroage
 
PG-Strom v2.0 Technical Brief (17-Apr-2018)
PG-Strom v2.0 Technical Brief (17-Apr-2018)PG-Strom v2.0 Technical Brief (17-Apr-2018)
PG-Strom v2.0 Technical Brief (17-Apr-2018)
 
JAWS
JAWSJAWS
JAWS
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
 
FDW-based Sharding Update and Future
FDW-based Sharding Update and FutureFDW-based Sharding Update and Future
FDW-based Sharding Update and Future
 
Introduction to TrioNAS LX U300
Introduction to TrioNAS LX U300Introduction to TrioNAS LX U300
Introduction to TrioNAS LX U300
 
Data relay introduction to big data clusters
Data relay introduction to big data clustersData relay introduction to big data clusters
Data relay introduction to big data clusters
 
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEO
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEOClickHouse on Kubernetes! By Robert Hodges, Altinity CEO
ClickHouse on Kubernetes! By Robert Hodges, Altinity CEO
 
optimizing_ceph_flash
optimizing_ceph_flashoptimizing_ceph_flash
optimizing_ceph_flash
 
Data weekender deploying prod grade sql 2019 big data clusters
Data weekender deploying prod grade sql 2019 big data clustersData weekender deploying prod grade sql 2019 big data clusters
Data weekender deploying prod grade sql 2019 big data clusters
 
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and ContributionsCeph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
 
openSUSE storage workshop 2016
openSUSE storage workshop 2016openSUSE storage workshop 2016
openSUSE storage workshop 2016
 
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...Transparent Data Encryption in PostgreSQL and Integration with Key Management...
Transparent Data Encryption in PostgreSQL and Integration with Key Management...
 
Aerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analyticsAerospike DB and Storm for real-time analytics
Aerospike DB and Storm for real-time analytics
 
Bloat and Fragmentation in PostgreSQL
Bloat and Fragmentation in PostgreSQLBloat and Fragmentation in PostgreSQL
Bloat and Fragmentation in PostgreSQL
 
Workshop actualización SVG CESGA 2012
Workshop actualización SVG CESGA 2012 Workshop actualización SVG CESGA 2012
Workshop actualización SVG CESGA 2012
 
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and ChefDevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
 

Similar to Hot Cloud'16: An Experiment on Bare-Metal BigData Provisioning

An introduction and evaluations of a wide area distributed storage system
An introduction and evaluations of  a wide area distributed storage systemAn introduction and evaluations of  a wide area distributed storage system
An introduction and evaluations of a wide area distributed storage systemHiroki Kashiwazaki
 
Nimble Storage Series A presentation 2007
Nimble Storage Series A presentation 2007Nimble Storage Series A presentation 2007
Nimble Storage Series A presentation 2007Wing Venture Capital
 
Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013
Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013
Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013Amazon Web Services
 
Introducción a Microsoft Azure 2023
Introducción a Microsoft Azure 2023Introducción a Microsoft Azure 2023
Introducción a Microsoft Azure 2023Mariano Kovo
 
Vizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users ConferenceVizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users ConferenceIsaac Christoffersen
 
High Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionHigh Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionMattBethel1
 
Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco ObinuAzure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco ObinuMarco Obinu
 
Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...
Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...
Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...Intel® Software
 
Sun Oracle Exadata V2 For OLTP And DWH
Sun Oracle Exadata V2 For OLTP And DWHSun Oracle Exadata V2 For OLTP And DWH
Sun Oracle Exadata V2 For OLTP And DWHMark Rabne
 
High Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal CloudHigh Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal CloudMongoDB
 
Optimizing FUSE for Cloud Storage
Optimizing FUSE for Cloud StorageOptimizing FUSE for Cloud Storage
Optimizing FUSE for Cloud StorageOpenVZ
 
High Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal CloudHigh Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal CloudMongoDB
 
14 scaleabilty wics
14 scaleabilty wics14 scaleabilty wics
14 scaleabilty wicsashish61_scs
 
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlStorage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlITCamp
 
20130522 preparando el soporte de aplicaciones empresariales con windows azure
20130522 preparando el soporte de aplicaciones empresariales con windows azure20130522 preparando el soporte de aplicaciones empresariales con windows azure
20130522 preparando el soporte de aplicaciones empresariales con windows azureRicardo González
 
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...Chester Chen
 
Exploiting Your File System to Build Robust & Efficient Workflows
Exploiting Your File System to Build Robust & Efficient WorkflowsExploiting Your File System to Build Robust & Efficient Workflows
Exploiting Your File System to Build Robust & Efficient Workflowsjasonajohnson
 

Similar to Hot Cloud'16: An Experiment on Bare-Metal BigData Provisioning (20)

An introduction and evaluations of a wide area distributed storage system
An introduction and evaluations of  a wide area distributed storage systemAn introduction and evaluations of  a wide area distributed storage system
An introduction and evaluations of a wide area distributed storage system
 
Nimble Storage Series A presentation 2007
Nimble Storage Series A presentation 2007Nimble Storage Series A presentation 2007
Nimble Storage Series A presentation 2007
 
CCNA Icnd110 s06l02
CCNA Icnd110 s06l02CCNA Icnd110 s06l02
CCNA Icnd110 s06l02
 
Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013
Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013
Empowering Congress with Data-Driven Analytics (BDT304) | AWS re:Invent 2013
 
Introducción a Microsoft Azure 2023
Introducción a Microsoft Azure 2023Introducción a Microsoft Azure 2023
Introducción a Microsoft Azure 2023
 
Vizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users ConferenceVizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users Conference
 
Galaxy Big Data with MariaDB
Galaxy Big Data with MariaDBGalaxy Big Data with MariaDB
Galaxy Big Data with MariaDB
 
High Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionHigh Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data Production
 
Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco ObinuAzure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
Azure VM 101 - HomeGen by CloudGen Verona - Marco Obinu
 
Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...
Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...
Accelerate Game Development and Enhance Game Experience with Intel® Optane™ T...
 
Sun Oracle Exadata V2 For OLTP And DWH
Sun Oracle Exadata V2 For OLTP And DWHSun Oracle Exadata V2 For OLTP And DWH
Sun Oracle Exadata V2 For OLTP And DWH
 
High Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal CloudHigh Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal Cloud
 
CLFS 2010
CLFS 2010CLFS 2010
CLFS 2010
 
Optimizing FUSE for Cloud Storage
Optimizing FUSE for Cloud StorageOptimizing FUSE for Cloud Storage
Optimizing FUSE for Cloud Storage
 
High Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal CloudHigh Performance, Scalable MongoDB in a Bare Metal Cloud
High Performance, Scalable MongoDB in a Bare Metal Cloud
 
14 scaleabilty wics
14 scaleabilty wics14 scaleabilty wics
14 scaleabilty wics
 
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlStorage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
 
20130522 preparando el soporte de aplicaciones empresariales con windows azure
20130522 preparando el soporte de aplicaciones empresariales con windows azure20130522 preparando el soporte de aplicaciones empresariales con windows azure
20130522 preparando el soporte de aplicaciones empresariales con windows azure
 
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
SF Big Analytics & SF Machine Learning Meetup: Machine Learning at the Limit ...
 
Exploiting Your File System to Build Robust & Efficient Workflows
Exploiting Your File System to Build Robust & Efficient WorkflowsExploiting Your File System to Build Robust & Efficient Workflows
Exploiting Your File System to Build Robust & Efficient Workflows
 

Recently uploaded

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 

Recently uploaded (20)

E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 

Hot Cloud'16: An Experiment on Bare-Metal BigData Provisioning

  • 1. An Experiment on Bare-Metal BigData Provisioning Ata Turk, Ravi S. Gudimetla, Emine Ugur Kaynar, Jason Hennessey, Sahil Tikale, Peter Desnoyers, Orran Krieger 1
  • 2. BigData Analytics on the Cloud • BigData deployments are moving to the cloud • On-demand usage (Cost), Elasticity, Agility, Simplicity, … • Virtualized IaaS solutions: Amazon EMR, Azure HDInsight, … • Virtualization drawbacks • Overhead, unpredictability, security concerns, device functionality, … • Bare-metal cloud solutions: IBM, Rackspace, and Internap, … 2
  • 3. Bare-Metal BigData Cloud Solutions • Bare-Metal cloud provisioning • Automated provisioning: Ironic, MaaS, … • Image copy to local disk => long waits => loss of agility & elasticity • OS streaming*, Lazy copy & de-virtualization** • What about network booting? • incur an ongoing unacceptable overhead during runtime ** Y. Omote, T. Shinagawa, and K. Kato, “Improving Agility and Elasticity in Bare-metal Clouds,” in ASPLOS’15, pp. 145–159, 2015. 3 * David Clerc, “OS Streaming Deployment”, in IPCCC’10, pp. 169–179, 2010.
  • 4. • Large parts of the HPC community has been doing it for the last 20 years. • Virtualized IaaS is doing it all the time. • Why not bare-metal cloud? 4
  • 5. Network-Mounted BigData System • Clients access kernel and init ramdisk via PXE • Mount OS & BigData apps from a remote iSCSI volume • Use local disk for ephemeral storage (HDFS, /swap, /tmp, …) 5
  • 7. Bare-Metal BigData Provisioning Prototype Hardware Isolation Layer: A service to allocate bare- metal nodes out of a shared pool and isolate network 7
  • 8. Bare-Metal BigData Provisioning Prototype CEPH: Central image store hosting user images with BigData applications 8
  • 9. Bare-Metal BigData Provisioning Prototype Provisioning VM: Gateway between isolated servers and image store 9
  • 14. Provisioning Time Local Disk iSCSI 0 200 400 600 800 1000 1200 1400 ElapsedTime(Secs) Bigdata Configuration Bigdata Installation OS Reboot Firmware Initialization Post Setup Software Installation Package Installation OS Boot(inc. Kernel+Initrd Download) DHCP request Firmware Initialization Haas Power Cycle Ceph Cloning Haas Initilization 14 Firmware initialization OS Boot (inc. kernel+initrd download) OS Installation OS Reboot BigData App Installation Post Config DHCP Request Local disk installation iSCSI boot
  • 15. Provisioning Time Local Disk iSCSI 0 200 400 600 800 1000 1200 1400 ElapsedTime(Secs) Bigdata Configuration Bigdata Installation OS Reboot Firmware Initialization Post Setup Software Installation Package Installation OS Boot(inc. Kernel+Initrd Download) DHCP request Firmware Initialization Haas Power Cycle Ceph Cloning Haas Initilization 15 Firmware initialization OS Boot (inc. kernel+initrd download) OS Installation OS Reboot BigData App Installation Post Config DHCP Request Local disk installation iSCSI boot
  • 16. Provisioning Time Scaling 2 Node 4 Node 8 Node 0 50 100 150 200 250 300 ElapsedTime(Secs) Bigdata Post Script Booting Ceph Cloning Haas Initilization 16
  • 17. Read Traffic over Boot Drive Initial Provisioning Data Generation1 Sort1 Data Generation2 Sort2 Data Generation3 Sort3 Data Generation4 Sort4 Data Generation5 Sort5 0 100 200 300 CumulativeiSCSIreadspernode(MB) iSCSI Reads: Runs with 256GB Data iSCSI Reads: Runs with 128GB Data 17
  • 18. Read Traffic over Boot Drive Initial Provisioning Data Generation1 Sort1 Data Generation2 Sort2 Data Generation3 Sort3 Data Generation4 Sort4 Data Generation5 Sort5 0 100 200 300 CumulativeiSCSIreadspernode(MB) iSCSI Reads: Runs with 256GB Data iSCSI Reads: Runs with 128GB Data ~170MB / 8GB Boot Image => 2% 18
  • 19. Read Traffic over Boot Drive Initial Provisioning Data Generation1 Sort1 Data Generation2 Sort2 Data Generation3 Sort3 Data Generation4 Sort4 Data Generation5 Sort5 0 100 200 300 CumulativeiSCSIreadspernode(MB) iSCSI Reads: Runs with 256GB Data iSCSI Reads: Runs with 128GB Data 3KB/s read after initial boot 19
  • 20. Write Traffic over Boot Drive Initial Provisioning Data Generation1 Sort1 Data Generation2 Sort2 Data Generation3 Sort3 Data Generation4 Sort4 Data Generation5 Sort5 0 100 200 300 400 500 600 700 CumulativeiSCSIwritespernode(MB) iSCSI Writes - Runs with 256GB Data iSCSI Writes - Runs with 128GB Data 20
  • 21. Write Traffic over Boot Drive Initial Provisioning Data Generation1 Sort1 Data Generation2 Sort2 Data Generation3 Sort3 Data Generation4 Sort4 Data Generation5 Sort5 0 100 200 300 400 500 600 700 CumulativeiSCSIwritespernode(MB) iSCSI Writes - Runs with 256GB Data iSCSI Writes - Runs with 128GB Data 14KB/s write 21
  • 22. Runtime Performance of Network-Mounted Boot Drive171 319 616 1187 2314 171 318 617 1176 2281 64 115 300 542 1073 69 120 238 555 1361 60 75 76 118 199 52 63 86 125 201 Data Size (GB) 0 400 800 1200 1600 2000 2400 2800 ElapsedTime(secs) WordCount - Local Disk WordCount - iSCSI Mounted Sort - Local Disk Sort - iSCSI Mounted Grep - Local Disk Grep - iSCSI Mounted 8GB 16GB 32GB 64GB 128GB 22
  • 23. Runtime Performance of Network-Mounted Boot Drive171 319 616 1187 2314 171 318 617 1176 2281 64 115 300 542 1073 69 120 238 555 1361 60 75 76 118 199 52 63 86 125 201 Data Size (GB) 0 400 800 1200 1600 2000 2400 2800 ElapsedTime(secs) WordCount - Local Disk WordCount - iSCSI Mounted Sort - Local Disk Sort - iSCSI Mounted Grep - Local Disk Grep - iSCSI Mounted 8GB 16GB 32GB 64GB 128GB 23
  • 24. Runtime Performance of Network-Mounted Boot Drive171 319 616 1187 2314 171 318 617 1176 2281 64 115 300 542 1073 69 120 238 555 1361 60 75 76 118 199 52 63 86 125 201 Data Size (GB) 0 400 800 1200 1600 2000 2400 2800 ElapsedTime(secs) WordCount - Local Disk WordCount - iSCSI Mounted Sort - Local Disk Sort - iSCSI Mounted Grep - Local Disk Grep - iSCSI Mounted 8GB 16GB 32GB 64GB 128GB 24
  • 25. Take-aways • Network booting the OS for bare-metal BigData • uses only a fraction of boot disk during start-up • improves provisioning time with no runtime degradation • provisioning time < 5 mins, boot disk reads: ~3KB/s, writes: ~14KB/s • Enormous effort on bare-metal provisioning on local disks may be unnecessary, especially for BigData deployments • We are building a new Bare Metal Imaging Service using remote network boot mechanisms • enable capabilities available on virtualized platforms (e.g. snapshotting, cloning, …) to bare metal cloud solutions 25
  • 27. Provisioning Time Local Disk iSCSI 0 200 400 600 800 1000 1200 1400 ElapsedTime(Secs) Bigdata Configuration Bigdata Installation OS Reboot Firmware Initialization Post Setup Software Installation Package Installation OS Boot(inc. Kernel+Initrd Download) DHCP request Firmware Initialization Haas Power Cycle Ceph Cloning Haas Initilization 27 Local disk installation iSCSI boot Emulab* Ironic* * A. Chandrasekar and G. Gibson, “A comparative study of baremetal provisioning frameworks,” Parallel Data Laboratory, Carnegie Mellon University, Tech. Rep. CMU-PDL-14-109, 2014.
  • 28. Provisioning Time Local Disk iSCSI 0 200 400 600 800 1000 1200 1400 ElapsedTime(Secs) Bigdata Configuration Bigdata Installation OS Reboot Firmware Initialization Post Setup Software Installation Package Installation OS Boot(inc. Kernel+Initrd Download) DHCP request Firmware Initialization Haas Power Cycle Ceph Cloning Haas Initilization 28 Local disk installation iSCSI boot Emulab* Ironic* * A. Chandrasekar and G. Gibson, “A comparative study of baremetal provisioning frameworks,” Parallel Data Laboratory, Carnegie Mellon University, Tech. Rep. CMU-PDL-14-109, 2014. copy (rather than install) an image to local disk