An Overview of Cloud Computing:
My Other Computer is a Data Center
Robert Grossman
Open Data Group &
University of Illinois at Chicago
IEEE New Technologies Conference
August 6, 2009
Part 1
What is a Cloud?
What is a Cloud?
Software as a Service
What Else is a Cloud?
Platform as a Service
Is Anything Else a Cloud?
Infrastructure as a Service
Are There Other Types of Clouds?
Large Data Cloud Services
ad targeting
One Definition
 Clouds provide on-demand resources or
services over a network, often the Internet,
with the scale and reliability of a data center.
 No standard definition.
 Cloud architectures are not new.
 What is new:
– Scale
– Ease of use
– Pricing model.
Scale is new.
Elastic, Usage Based Pricing Is New
1 computer in a rack for 120 hours
costs the same as
120 computers in three racks for 1 hour
 Elastic, usage based pricing turns capex into opex.
 Clouds can be used to manage surges in computing needs.
Simplicity Offered By the Cloud is New
+ .. and you have a computer
ready to work.
A new programmer can develop a
program to process a container full of
data with less than a day of training
using MapReduce.
Part 2
Varieties of Clouds
Varieties of Clouds
 Architectural Model
– On-demand computing instances
vs large data cloud services
 Payment Model
– Elastic, usage based pricing,
lease/own, …
 Management Model
– Private vs Public; Single vs
Multiple Tenant; …
 Programming Model
– Queue Service, MPI,
MapReduce, Distributed UDF
Computing instances
vs large data cloud
services
Private internal vs
public external
Elastic, usage-
based pricing or not
All combinations
occur.
Architectural Models:
How Do You Fill a Data Center?
[Diagram: applications (App) running on large data cloud services, contrasted with applications (App) running on on-demand computing instances.]
Large data cloud services shown in the diagram:
– Cloud Storage Services
– Cloud Compute Services (MapReduce & Generalizations)
– Cloud Data Services (BigTable, etc.)
– Quasi-relational Data Services
Payment Models
 Buying racks, containers and data centers
 Leasing racks, containers and data centers
 Utility based computing (pay as you go)
– Moves capex to opex
– Handles surge requirements (use 1000 servers for 1
hour vs 1 server for 1000 hours)
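The surge arithmetic can be checked with a small sketch. The price below is a hypothetical figure chosen only for illustration, and the model assumes purely usage-based billing with no minimum charge or volume discount.

```python
def usage_cost_cents(servers: int, hours: int, cents_per_server_hour: int) -> int:
    """Cost under pure usage-based pricing: pay only for server-hours consumed."""
    return servers * hours * cents_per_server_hour


# Hypothetical price, not taken from any provider's rate card.
RATE_CENTS = 10

# Surge: many servers for a short time.
surge = usage_cost_cents(servers=1000, hours=1, cents_per_server_hour=RATE_CENTS)
# Steady: one server for a long time.
steady = usage_cost_cents(servers=1, hours=1000, cents_per_server_hour=RATE_CENTS)

print("surge cost (cents):", surge)
print("steady cost (cents):", steady)
```

Under this assumed pricing model the two bills are identical, because both runs consume the same number of server-hours; only the elapsed time differs.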
Management Models
 Public, private and hybrid models
 Single tenant vs multiple tenant (shared vs
non-shared hardware)
 Owned vs leased
 Manage yourself vs outsource management
 All combinations are possible
Programming Models
 Amazon’s Simple
Queue Service
 MPI, sockets, FIFO
 MapReduce
 Distributed UDF
on-demand
computing
instances
large data
cloud services
 DryadLINQ
 Azure services
Part 3. Cloud Computing Industry
 “Cloud computing has become the center of
investment and innovation.”
Nicholas Carr, 2009 IDC Directions
Cloud computing is
approaching the top of
the Gartner hype cycle.
IaaS, PaaS and SaaS Point of View
SaaS
PaaS
IaaS
Infrastructure as a Service
PRODUCT: Compute power, storage
and networking infrastructure over the
internet, provided as a virtual machine
image
USERS: Developers
Platform as a Service
PRODUCT: storage, compute and
other services to simplify application
development, especially of web
applications.
USERS: Application Developers
Software as a Service
PRODUCT: Finished
application available on
demand to end user
USERS: Software consumer
Building Data Centers
 Sun’s Modular
Data Center (MD)
 Formerly Project
Blackbox
 Containers used by
Google, Microsoft
& others
 Data center
consists of 10-60+
containers.
Data Center Operating Systems
 Data center services include: VM management
services, business continuity services, security
services, power management services, etc.
workstation
VM 1 VM 5
…
VM 1 VM 50,000
…
Data Center Operating System
Berkeley View of Cloud Computing
Providers of Cloud Services
Consumers of Cloud Services
Providers of Software as a Service
Consumers of Software as a Service
 Berkeley Report on cloud computing divides industry
into these layers & concentrates on public clouds.
Data Centers
Transition Taking Place
 A handful of players are building multiple data
centers a year and improving with each one.
 This includes Google, Microsoft, Yahoo, …
 A data center today costs $200 M – $400+ M
 Berkeley RAD Report points out analogy with
semiconductor industry, where companies stopped
building their own Fabs and started leasing
Fabs from others as Fabs approached $1B
Mindmeister Map of Cloud Computing
 Dupont’s Mindmeister Map divides the industry:
– IaaS, PaaS, Management, Community
 http://www.mindmeister.com/maps/show_public/15936058
Part 4
Virtualization
Virtualization
 Virtualization separates logical infrastructure
from the underlying physical resources to
decrease time to make changes, improve
flexibility, improve utilization and reduce costs
 Example - server virtualization. Use one
physical server to support multiple logical
virtual machines (VMs), which are sometimes
called logical partitions.
 Technology pioneered by IBM in 1960s to
better utilize mainframes
Idea Dates Back to the 1960s
IBM Mainframe
IBM VM/370
CMS
App
Native (Full) Virtualization
Examples: VMware ESX
MVS
App
CMS
App
Two Types of Virtualization
 Using the hypervisor, each guest OS sees its own
independent copy of the CPU, memory, IO, etc.
Physical Hardware
Hypervisor
Unmodified
Guest OS 1
Unmodified
Guest OS 2
Native (Full) Virtualization
Examples: VMware ESX
Apps
Physical Hardware
Hypervisor
Modified
Guest OS 1
Modified
Guest OS 2
Para Virtualization
Examples: Xen
Apps
Four Key Properties
1. Partitioning: run multiple VMs on one
physical server; one VM doesn’t know about
the others
2. Isolation: security isolation is at the hardware
level.
3. Encapsulation: entire state of the machine
can be copied to files and moved around
4. Hardware abstraction: provision and migrate
VM to another server
Managing Virtual Machines
 Provision VM
 Schedule VM
 Monitor VM
 Self-service portal for VM
Part 5
Large Data Clouds
The Google Data Stack
 The Google File System (2003)
 MapReduce: Simplified Data Processing… (2004)
 BigTable: A Distributed Storage System… (2006)
Map-Reduce Example
 Input is file with one document per record
 User specifies map function
– key = document URL
– Value = terms that document contains
(“doc cdickens”,
“it was the best of times”)
“it”, 1
“was”, 1
“the”, 1
“best”, 1
map
Example (cont’d)
 MapReduce library gathers together all pairs
with the same key value (shuffle/sort phase)
 The user-defined reduce function combines all
the values associated with the same key
key = “it”
values = 1, 1
key = “was”
values = 1, 1
key = “best”
values = 1
key = “worst”
values = 1
“it”, 2
“was”, 2
“best”, 1
“worst”, 1
reduce
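The map, shuffle/sort and reduce steps above can be sketched in a few lines of single-process Python. This is an illustration of the programming model only, not Google's or Hadoop's API; the function names are hypothetical, and the second half of the input sentence ("it was the worst of times") is assumed from the reduce output shown on the slide.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_fn(doc_key: str, text: str) -> Iterator[Tuple[str, int]]:
    """User-defined map: emit (term, 1) for every term in the document."""
    for term in text.split():
        yield term, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Library step: gather together all values that share the same key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_fn(term: str, counts: List[int]) -> Tuple[str, int]:
    """User-defined reduce: combine all values associated with one key."""
    return term, sum(counts)


def word_count(records: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Run map over every record, shuffle the pairs, then reduce each group."""
    pairs = (pair for key, text in records for pair in map_fn(key, text))
    return dict(reduce_fn(term, counts) for term, counts in shuffle(pairs).items())


# One record per document: key = document identifier, value = its text.
records = [("doc cdickens", "it was the best of times it was the worst of times")]
counts = word_count(records)
print(counts)
```

In a real deployment the map and reduce calls run in parallel on many nodes and the shuffle moves data across the network; the sketch keeps everything in one process so the data flow is easy to follow.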
Generalization: Apply User Defined
Functions (UDF) to Files in Storage Cloud
map/shuffle reduce
UDF UDF
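One way to picture the generalization is a single driver that applies an arbitrary user-defined function to each file and collects the output into buckets, so that one UDF plays the role of map/shuffle and a second UDF plays the role of reduce. The sketch below is an assumption-laden illustration: `apply_udf`, `by_length` and `count_records` are hypothetical names, the "files" are in-memory lists, and none of this is Sphere's actual API.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

Bucketed = Tuple[int, str]  # (bucket id, output record)
Udf = Callable[[str, List[str]], Iterable[Bucketed]]


def apply_udf(files: Dict[str, List[str]], udf: Udf) -> Dict[int, List[str]]:
    """Run the UDF on every file and collect its output records by bucket id."""
    buckets: Dict[int, List[str]] = defaultdict(list)
    for name in sorted(files):
        for bucket_id, record in udf(name, files[name]):
            buckets[bucket_id].append(record)
    return dict(buckets)


def by_length(name: str, records: List[str]) -> Iterable[Bucketed]:
    """First UDF (map/shuffle role): route each record to a bucket by its length."""
    for record in records:
        yield len(record), record


def count_records(name: str, records: List[str]) -> Iterable[Bucketed]:
    """Second UDF (reduce role): summarize one bucket as a single record."""
    yield 0, f"{name}:{len(records)}"


# Stand-ins for files held in a storage cloud.
files = {"part-0": ["it", "was", "the"], "part-1": ["best", "of", "times"]}

stage1 = apply_udf(files, by_length)
# The buckets from stage 1 become the input "files" of stage 2.
stage2 = apply_udf({f"bucket-{b}": recs for b, recs in stage1.items()}, count_records)
print(stage1)
print(stage2)
```

Because the driver does not care what the UDF computes, MapReduce becomes one particular pair of UDFs rather than the only available pattern.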
Google’s Layered Cloud Services
Storage Services
Table Services
Compute Services
Google’s Stack
Applications
Google File System (GFS)
Google’s MapReduce
Google’s BigTable
Hadoop’s Layered Cloud Services
Storage Services
Table Services
Compute Services
Hadoop’s Stack
Applications
Hadoop Distributed File
System (HDFS)
Hadoop’s MapReduce
Sector’s Layered Cloud Services
Storage Services
Table Services
Compute Services
Sector’s Stack
Applications
Sector’s Distributed File
System (SDFS)
Sphere’s UDF
Routing &
Transport Services
UDP-based Data Transport
Protocol (UDT)
Hadoop & Sector
                     Hadoop                     Sector
Storage Cloud        Block-based file system    File-based
Programming Model    MapReduce                  UDF & MapReduce
Protocol             TCP                        UDP-based protocol (UDT)
Replication          At time of writing         Periodically
Security             Not yet                    HIPAA capable
Language             Java                       C++
MalStone Benchmark
 Benchmark developed by Open Cloud
Consortium for clouds supporting data
intensive computing.
 Code to generate synthetic data required is
available from code.google.com/p/malgen
 Stylized analytic computation that is easy to
implement in MapReduce and its
generalizations.
MalStone B
time
dk-2 dk-1 dk
sites entities
MalStone B Benchmark
MalStone B
Hadoop v0.18.3 799 min
Hadoop Streaming v0.18.3 142 min
Sector v1.19 44 min
# Nodes 20 nodes
# Records 10 Billion
Size of Dataset 1 TB
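The relative running times follow directly from the figures above. The short sketch below only divides the reported minutes by the Sector time; it adds no measurements of its own.

```python
# Running times in minutes, copied from the MalStone B results above.
runtimes_min = {
    "Hadoop v0.18.3": 799,
    "Hadoop Streaming v0.18.3": 142,
    "Sector v1.19": 44,
}

baseline = runtimes_min["Sector v1.19"]

# Ratio of each system's running time to the Sector running time.
ratios = {name: minutes / baseline for name, minutes in runtimes_min.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}x the Sector running time")
```

On this benchmark configuration (20 nodes, 10 billion records) Hadoop took roughly 18 times as long as Sector, and Hadoop Streaming roughly 3 times as long.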
Trading Functionality for Scalability
Scalability
– Databases: 100s of TB
– Data Clouds: 100s of PB
Functionality
– Databases: Full SQL-based queries, including joins
– Data Clouds: Optimized access to sorted tables (tables with single keys)
Optimized
– Databases: Optimized for safe writes
– Data Clouds: Optimized for efficient reads
Consistency model
– Databases: ACID (Atomicity, Consistency, Isolation & Durability); database is always consistent
– Data Clouds: Eventual consistency; updates eventually propagate through system
Parallelism
– Databases: Difficult because of ACID model; shared nothing is possible (Graywolf)
– Data Clouds: Basic design incorporates parallelism over commodity components
Scale
– Databases: Racks
– Data Clouds: Data center
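Eventual consistency can be illustrated with a toy model of two replicas. The sketch assumes a last-writer-wins rule based on version numbers and a single synchronous propagation pass; both are simplifying assumptions made here for illustration, and all names are hypothetical.

```python
from typing import Dict, List, Optional, Tuple


class Replica:
    """One copy of the data; each key stores a (version, value) pair."""

    def __init__(self) -> None:
        self.store: Dict[str, Tuple[int, str]] = {}

    def write(self, key: str, value: str, version: int) -> None:
        # Last-writer-wins: keep the entry with the highest version number.
        current = self.store.get(key)
        if current is None or version > current[0]:
            self.store[key] = (version, value)

    def read(self, key: str) -> Optional[str]:
        entry = self.store.get(key)
        return None if entry is None else entry[1]


def propagate(replicas: List[Replica]) -> None:
    """One propagation pass: every replica receives every replica's entries."""
    snapshot = [dict(replica.store) for replica in replicas]
    for target in replicas:
        for entries in snapshot:
            for key, (version, value) in entries.items():
                target.write(key, value, version)


a, b = Replica(), Replica()
a.write("x", "v1", version=1)

stale = b.read("x")   # the update has not reached replica b yet
propagate([a, b])
fresh = b.read("x")   # after propagation both replicas agree
print(stale, fresh)
```

A reader that hits replica `b` before propagation sees no value at all, which is the behavior an ACID database would not allow; the trade is that writes never wait for the other replica.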
Not Everyone Agrees
 David J. DeWitt and Michael Stonebraker,
MapReduce: A Major Step Backwards,
Database Column, January 17, 2008
Part 6. Standards Efforts
Change of gauge at Ussuriisk (near
Vladivostok) at the Chinese-Russian border
Train gauge
in China is
1435 mm
Train gauge
in Russia is
1520 mm
How can a
cloud
application
move from
one cloud
storage
service to
another?
Standards Efforts for Clouds
 Cloud Computing Interoperability Forum (CCIF)
 Open Cloud Consortium (OCC)
 Open Grid Forum (OGF)
 Distributed Management Task Force (DMTF)
 Storage Networking Industry Association (SNIA)
 Plus several others…
www.opencloudconsortium.org
1. Supports the development of standards.
2. Supports reference implementations for
cloud computing, preferably open source.
3. Manages a testbed for cloud computing
called the Open Cloud Testbed.
4. Supports the development of benchmarks.
5. Sponsors workshops and other events related
to cloud computing.
Activities Currently Focused Around
Five Use Cases
1. Moving an existing cloud application from Cloud
1 to Cloud 2 without changing the application.
2. Providing surge capacity for an application on
Cloud 1 using any of the Clouds 2, 3, … (without
changing the application).
Cloud 1 Cloud 2
1. Migrate / port
2. Surge / burst
Large Data Cloud Use Cases
3. Moving a large data cloud application from
one large data cloud storage service to
another.
4. Moving a large data cloud application from
one large data cloud compute service to
another.
Large Data Cloud Storage Services
Large Data Cloud Compute Services
App 1 App 2
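Use cases 3 and 4 amount to coding the application against a common interface rather than against one provider. The sketch below shows the idea for storage, assuming a minimal put/get interface; `BlobStore`, `CloudOneStore`, `CloudTwoStore` and `archive_report` are hypothetical names, and the in-memory classes stand in for real services. No actual standard or vendor API is represented.

```python
from typing import Dict, Protocol


class BlobStore(Protocol):
    """Hypothetical common interface that a storage standard might define."""

    def put(self, key: str, data: bytes) -> None: ...

    def get(self, key: str) -> bytes: ...


class CloudOneStore:
    """Stand-in for one provider: a flat key space."""

    def __init__(self) -> None:
        self._objects: Dict[str, bytes] = {}

    def put(self, key: str, data: bytes) -> None:
        self._objects[key] = data

    def get(self, key: str) -> bytes:
        return self._objects[key]


class CloudTwoStore:
    """Stand-in for a second provider with a different internal layout."""

    def __init__(self, container: str = "default") -> None:
        self._container = container
        self._containers: Dict[str, Dict[str, bytes]] = {container: {}}

    def put(self, key: str, data: bytes) -> None:
        self._containers[self._container][key] = data

    def get(self, key: str) -> bytes:
        return self._containers[self._container][key]


def archive_report(store: BlobStore, name: str, body: str) -> int:
    """Application code: depends only on the interface, not on a provider."""
    store.put(name, body.encode("utf-8"))
    return len(store.get(name))


size_on_cloud_1 = archive_report(CloudOneStore(), "report.txt", "hello")
size_on_cloud_2 = archive_report(CloudTwoStore(), "report.txt", "hello")
print(size_on_cloud_1, size_on_cloud_2)
```

The application function is unchanged when the backing service changes, which is the property the use cases ask a standard to guarantee across real providers.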
Inter-Cloud Use Case
5. Inter-cloud communication between two
HIPAA compliant clouds.
Cloud 1 Cloud 2
OCC Welcomes New Members
 Companies and organizations are welcome to
join the Open Cloud Consortium (OCC)
www.opencloudconsortium.org/membership.html
 Join one of our working groups
– Large Data Clouds Working Group
– Standard Cloud Performance Measurement
(SCPM) Working Group
– Information Sharing & Security Working Group
For More Information
 Contact information:
Robert Grossman
rlg@opendatagroup.com
blog.rgrossman.com
 Web sites
– www.opendatagroup.com
– www.ncdm.uic.edu
– www.opencloudconsortium.org
Adversarial Analytics - 2013 Strata & Hadoop World Talk
 


An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)

  • 1. An Overview of Cloud Computing: My Other Computer is a Data Center Robert Grossman Open Data Group & University of Illinois at Chicago IEEE New Technologies Conference August 6, 2009
  • 2. Part 1 What is a Cloud?
  • 3. What is a Cloud? Software as a Service
  • 4. What Else is a Cloud? Platform as a Service
  • 5. Is Anything Else a Cloud? Infrastructure as a Service
  • 6. Are There Other Types of Clouds? Large Data Cloud Services; ad targeting
  • 7. One Definition
      ▪ Clouds provide on-demand resources or services over a network, often the Internet, with the scale and reliability of a data center.
      ▪ No standard definition.
      ▪ Cloud architectures are not new.
      ▪ What is new:
          – Scale
          – Ease of use
          – Pricing model
  • 9. Elastic, Usage Based Pricing Is New
      ▪ 1 computer in a rack for 120 hours costs the same as 120 computers in three racks for 1 hour.
      ▪ Elastic, usage-based pricing turns capex into opex.
      ▪ Clouds can be used to manage surges in computing needs.
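The pricing claim on slide 9 is simple arithmetic: under pure usage-based pricing the bill depends only on machine-hours consumed. A minimal sketch follows; the function name and the rate of 10 cents per machine-hour are assumptions made for illustration and do not come from the slides.

```python
def usage_cost_cents(machines, hours, rate_cents_per_machine_hour):
    """Cost under pure usage-based pricing: pay only for machine-hours used."""
    return machines * hours * rate_cents_per_machine_hour


# Hypothetical rate, chosen only for illustration (not from the talk).
RATE_CENTS = 10

one_machine_long = usage_cost_cents(1, 120, RATE_CENTS)     # 1 computer for 120 hours
many_machines_short = usage_cost_cents(120, 1, RATE_CENTS)  # 120 computers for 1 hour

print(one_machine_long, many_machines_short)
print("same cost:", one_machine_long == many_machines_short)
```

Both configurations consume 120 machine-hours, so the cost is identical while the second finishes in 1 hour instead of 120. This is the sense in which elastic pricing helps with surges.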
  • 10. Simplicity Offered By the Cloud is New + .. and you have a computer ready to work. A new programmer can develop a program to process a container full of data with less than a day of training using MapReduce.
  • 11. Part 2 Varieties of Clouds
  • 12. Varieties of Clouds
      ▪ Architectural Model
          – On-demand computing instances vs large data cloud services
      ▪ Payment Model
          – Elastic, usage based pricing, lease/own, …
      ▪ Management Model
          – Private vs Public; Single vs Multiple Tenant; …
      ▪ Programming Model
          – Queue Service, MPI, MapReduce, Distributed UDF
      Diagram labels: computing instances vs large data cloud services; private internal vs public external; elastic, usage-based pricing or not. All combinations occur.
  • 13. Architectural Models: How Do You Fill a Data Center?
      Diagram labels: apps on large data cloud services (Cloud Storage Services; Cloud Compute Services (MapReduce & Generalizations); Cloud Data Services (BigTable, etc.); Quasi-relational Data Services) vs apps on on-demand computing instances.
  • 14. Payment Models
      ▪ Buying racks, containers and data centers
      ▪ Leasing racks, containers and data centers
      ▪ Utility based computing (pay as you go)
          – Moves capex to opex
          – Handles surge requirements (use 1000 servers for 1 hour vs 1 server for 1000 hours)
  • 15. Management Models
      ▪ Public, private and hybrid models
      ▪ Single tenant vs multiple tenant (shared vs non-shared hardware)
      ▪ Owned vs leased
      ▪ Manage yourself vs outsource management
      ▪ All combinations are possible
  • 16. Programming Models
      ▪ Amazon’s Simple Queue Service
      ▪ MPI, sockets, FIFO
      ▪ MapReduce
      ▪ Distributed UDF
      ▪ DryadLINQ
      ▪ Azure services
      Diagram labels: on-demand computing instances; large data cloud services
  • 17. Part 3. Cloud Computing Industry
      ▪ “Cloud computing has become the center of investment and innovation.” Nicholas Carr, 2009 IDC Directions
      ▪ Cloud computing is approaching the top of the Gartner hype cycle.
  • 18. IaaS, PaaS and SaaS Point of View
      ▪ Infrastructure as a Service (IaaS)
          – PRODUCT: Compute power, storage and networking infrastructure over the internet, provided as a virtual machine image
          – USERS: Developers
      ▪ Platform as a Service (PaaS)
          – PRODUCT: Storage, compute and other services to simplify application development, especially of web applications
          – USERS: Application developers
      ▪ Software as a Service (SaaS)
          – PRODUCT: Finished application available on demand to end user
          – USERS: Software consumer
  • 19. Building Data Centers
      ▪ Sun’s Modular Data Center (MD)
      ▪ Formerly Project Blackbox
      ▪ Containers used by Google, Microsoft & others
      ▪ Data center consists of 10-60+ containers.
  • 20. Data Center Operating Systems
      ▪ Data center services include: VM management services, business continuity services, security services, power management services, etc.
      Diagram labels: workstation (VM 1 … VM 5); Data Center Operating System (VM 1 … VM 50,000)
  • 21. Berkeley View of Cloud Computing
      ▪ Berkeley Report on cloud computing divides industry into these layers & concentrates on public clouds.
      Diagram labels: Data Centers; Providers of Cloud Services; Consumers of Cloud Services; Providers of Software as a Service; Consumers of Software as a Service
  • 22. Transition Taking Place
      ▪ A handful of players are building multiple data centers a year and improving with each one.
      ▪ This includes Google, Microsoft, Yahoo, …
      ▪ A data center today costs $200 M – $400+ M
      ▪ Berkeley RAD Report points out the analogy with the semiconductor industry, where companies stopped building their own Fabs and started leasing Fabs from others as Fabs approached $1B
  • 23. Mindmeister Map of Cloud Computing
      ▪ Dupont’s Mindmeister Map divides the industry:
          – IaaS, PaaS, Management, Community
      ▪ http://www.mindmeister.com/maps/show_public/15936058
  • 25. Virtualization
      ▪ Virtualization separates logical infrastructure from the underlying physical resources to decrease time to make changes, improve flexibility, improve utilization and reduce costs
      ▪ Example: server virtualization. Use one physical server to support multiple logical virtual machines (VMs), which are sometimes called logical partitions.
      ▪ Technology pioneered by IBM in the 1960s to better utilize mainframes
  • 26. Idea Dates Back to the 1960s
      Diagram labels: IBM Mainframe; IBM VM/370; CMS, MVS and CMS guests, each running an App. Native (Full) Virtualization. Examples: VMware ESX
  • 27. Two Types of Virtualization
      ▪ Using the hypervisor, each guest OS sees its own independent copy of the CPU, memory, IO, etc.
      ▪ Native (Full) Virtualization: Physical Hardware; Hypervisor; Unmodified Guest OS 1, Unmodified Guest OS 2; Apps. Examples: VMware ESX
      ▪ Para Virtualization: Physical Hardware; Hypervisor; Modified Guest OS 1, Modified Guest OS 2; Apps. Examples: Xen
  • 28. Four Key Properties
      1. Partitioning: run multiple VMs on one physical server; one VM doesn’t know about the others
      2. Isolation: security isolation is at the hardware level.
      3. Encapsulation: entire state of the machine can be copied to files and moved around
      4. Hardware abstraction: provision and migrate VM to another server
  • 29. Managing Virtual Machines
      ▪ Provision VM
      ▪ Schedule VM
      ▪ Monitor VM
      ▪ Self-service portal for VM
  • 31. The Google Data Stack
      ▪ The Google File System (2003)
      ▪ MapReduce: Simplified Data Processing… (2004)
      ▪ BigTable: A Distributed Storage System… (2006)
  • 32. Map-Reduce Example
      ▪ Input is a file with one document per record
      ▪ User specifies map function
          – key = document URL
          – value = terms that document contains
      Example: map takes (“doc cdickens”, “it was the best of times”) and emits (“it”, 1), (“was”, 1), (“the”, 1), (“best”, 1)
  • 33. Example (cont’d)
      ▪ MapReduce library gathers together all pairs with the same key value (shuffle/sort phase)
      ▪ The user-defined reduce function combines all the values associated with the same key
      Example: reduce takes key = “it”, values = 1, 1; key = “was”, values = 1, 1; key = “best”, values = 1; key = “worst”, values = 1 and emits (“it”, 2), (“was”, 2), (“best”, 1), (“worst”, 1)
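The map, shuffle/sort and reduce steps on slides 32 and 33 can be sketched in a few lines of single-process Python. This is only an illustration of the data flow, not Google's or Hadoop's implementation. The function names are hypothetical, and the input text is extended to include "worst" (an assumption, since slide 32 shows only the first clause) so that the reduce output matches slide 33.

```python
from collections import defaultdict


def map_fn(doc_key, text):
    """Map: emit a (term, 1) pair for every term in the document."""
    for term in text.split():
        yield term, 1


def shuffle(pairs):
    """Shuffle/sort: gather all values that share the same key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_fn(term, counts):
    """Reduce: combine all values for one key into a single count."""
    return term, sum(counts)


def word_count(records):
    """Run map over every record, shuffle, then reduce each key group."""
    pairs = [pair for key, text in records for pair in map_fn(key, text)]
    return dict(reduce_fn(term, counts) for term, counts in shuffle(pairs).items())


# One document per record; the text is assumed, extended from the slide's example.
records = [("doc cdickens", "it was the best of times it was the worst of times")]
counts = word_count(records)
print(counts["it"], counts["was"], counts["best"], counts["worst"])
```

In a real MapReduce system the map and reduce calls run in parallel on many nodes and the shuffle moves data across the network. Here all three run in one process so the flow is easy to follow.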
  • 34. Generalization: Apply User Defined Functions (UDF) to Files in Storage Cloud
      Diagram labels: map/shuffle; reduce; UDF; UDF
  • 35. Google’s Layered Cloud Services
      ▪ Layers: Storage Services, Table Services, Compute Services
      ▪ Google’s Stack: Applications; Google File System (GFS); Google’s MapReduce; Google’s BigTable
  • 36. Hadoop’s Layered Cloud Services
      ▪ Layers: Storage Services, Table Services, Compute Services
      ▪ Hadoop’s Stack: Applications; Hadoop Distributed File System (HDFS); Hadoop’s MapReduce
  • 37. Sector’s Layered Cloud Services
      ▪ Layers: Storage Services, Table Services, Compute Services, Routing & Transport Services
      ▪ Sector’s Stack: Applications; Sector’s Distributed File System (SDFS); Sphere’s UDF; UDP-based Data Transport Protocol (UDT)
  • 38. Hadoop & Sector
      ▪ Storage cloud. Hadoop: block-based file system. Sector: file-based.
      ▪ Programming model. Hadoop: MapReduce. Sector: UDF & MapReduce.
      ▪ Protocol. Hadoop: TCP. Sector: UDP-based protocol (UDT).
      ▪ Replication. Hadoop: at time of writing. Sector: periodically.
      ▪ Security. Hadoop: not yet. Sector: HIPAA capable.
      ▪ Language. Hadoop: Java. Sector: C++.
  • 39. MalStone Benchmark
      ▪ Benchmark developed by Open Cloud Consortium for clouds supporting data intensive computing.
      ▪ Code to generate the synthetic data required is available from code.google.com/p/malgen
      ▪ Stylized analytic computation that is easy to implement in MapReduce and its generalizations.
  • 40. MalStone B
      Diagram labels: sites; entities; time; dk-2, dk-1, dk
  • 41. MalStone B Benchmark
      ▪ MalStone B running times
          – Hadoop v0.18.3: 799 min
          – Hadoop Streaming v0.18.3: 142 min
          – Sector v1.19: 44 min
      ▪ # Nodes: 20 nodes
      ▪ # Records: 10 Billion
      ▪ Size of Dataset: 1 TB
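To put the three running times on slide 41 on a common scale, the short sketch below divides each by the fastest one. Only the three figures come from the slide. The variable names are hypothetical, and the ratios are plain arithmetic on those figures, not additional measurements.

```python
# Running times in minutes, copied from slide 41 (20 nodes, 10 billion records, 1 TB).
runtimes_min = {
    "Hadoop v0.18.3": 799,
    "Hadoop Streaming v0.18.3": 142,
    "Sector v1.19": 44,
}

# Use the fastest reported time as the baseline.
baseline = min(runtimes_min.values())

# Ratio of each running time to the baseline (1.0 means fastest).
ratios = {name: minutes / baseline for name, minutes in runtimes_min.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}x the fastest running time")
```

On these figures, Hadoop takes roughly 18 times as long as Sector and Hadoop Streaming roughly 3 times as long, for this benchmark and these software versions only.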
  • 42. Trading Functionality for Scalability
      ▪ Scalability. Databases: 100’s TB. Data clouds: 100’s PB.
      ▪ Functionality. Databases: full SQL-based queries, including joins. Data clouds: optimized access to sorted tables (tables with single keys).
      ▪ Optimized. Databases are optimized for safe writes. Clouds are optimized for efficient reads.
      ▪ Consistency model. Databases: ACID (Atomicity, Consistency, Isolation & Durability); database is always consistent. Data clouds: eventual consistency; updates eventually propagate through the system.
      ▪ Parallelism. Databases: difficult because of ACID model; shared nothing is possible (Graywolf). Data clouds: basic design incorporates parallelism over commodity components.
      ▪ Scale. Databases: racks. Data clouds: data center.
  • 43. Not Everyone Agrees
      ▪ David J. DeWitt and Michael Stonebraker, MapReduce: A Major Step Backwards, Database Column, January 17, 2008
  • 44. Part 6. Standards Efforts
      ▪ Change of gauge at Ussuriisk (near Vladivostok) at the Chinese–Russian border
      ▪ Train gauge in China is 1435 mm
      ▪ Train gauge in Russia is 1520 mm
      ▪ How can a cloud application move from one cloud storage service to another?
  • 45. Standards Efforts for Clouds
      ▪ Cloud Computing Interoperability Forum (CCIF)
      ▪ Open Cloud Consortium (OCC)
      ▪ Open Grid Forum (OGF)
      ▪ Distributed Management Task Force (DMTF)
      ▪ Storage Networking Industry Association (SNIA)
      ▪ Plus several others…
  • 46. www.opencloudconsortium.org
      1. Supports the development of standards.
      2. Supports reference implementations for cloud computing, preferably open source.
      3. Manages a testbed for cloud computing called the Open Cloud Testbed.
      4. Supports the development of benchmarks.
      5. Sponsors workshops and other events related to cloud computing.
  • 47. Activities Currently Focused Around Five Use Cases
      1. Moving an existing cloud application from Cloud 1 to Cloud 2 without changing the application.
      2. Providing surge capacity for an application on Cloud 1 using any of the Clouds 2, 3, … (without changing the application).
      Diagram labels: Cloud 1; Cloud 2; 1. Migrate / port; 2. Surge / burst
  • 48. Large Data Cloud Use Cases
      3. Moving a large data cloud application from one large data cloud storage service to another.
      4. Moving a large data cloud application from one large data cloud compute service to another.
      Diagram labels: Large Data Cloud Storage Services; Large Data Cloud Compute Services; App 1; App 2
  • 49. Inter-Cloud Use Case
      5. Inter-cloud communication between two HIPAA compliant clouds.
      Diagram labels: Cloud 1; Cloud 2
  • 50. OCC Welcomes New Members
      ▪ Companies and organizations are welcome to join the Open Cloud Consortium (OCC): www.opencloudconsortium.org/membership.html
      ▪ Join one of our working groups
          – Large Data Clouds Working Group
          – Standard Cloud Performance Measurement (SCPM) Working Group
          – Information Sharing & Security Working Group
  • 51. For More Information
      ▪ Contact information: Robert Grossman, rlg@opendatagroup.com, blog.rgrossman.com
      ▪ Web sites
          – www.opendatagroup.com
          – www.ncdm.uic.edu
          – www.opencloudconsortium.org