4. Agenda
• Cloud Computing and Google Cloud
Platform
• Google Compute Engine and the Cloud
Platform
• Beyond GCE (Google Compute Engine)
5.
6. Cloud Service Levels
IaaS: Infrastructure as a Service
PaaS: Platform as a Service
SaaS: Software as a Service SaaS
PaaS
IaaS
7. Infrastructure as a Service
• Simplest form: leasing a physical or virtual
server box: RackSpace, SoftLayer
• Includes
• Hardware: servers, network, routers, load
balancers,…
• Software: operating systems, databases (storage),
application servers
8. Infrastructure as a Service
Amazon AWS (Amazon Web Services) (+ S3
(Simple Storage Service) + EC2 (Elastic Cloud
Compute))
Microsoft Azure: VM Role
Google: Google Compute Engine (GCE)*
9. Platform as a Service
• The provider takes care some higher level
functions in the service stack
• Instead of getting servers, you get an
application framework
• Less control over the lower level service
elements, but the abstraction should result in
less hassle and more focus on the goal
10. Platform as a Service
• Google: Google App Engine (GAE)*
• Azure Web Role, Worker Role, Reporting
Services, etc.
11. Software as a Service
• Software deployed on the internet
• Designed for end-users
• Delivered through the web
• The back-end automatically scales, fault-tolerant
persistence
12. Software as a Service
• Usually API (Application Programming
Interface) is available for usage or feature
extension
• Example
• Gmail, Google Docs, Google Spreadsheet
• Office 365
13. IaaS / PaaS / SaaS
SaaS
PaaS
IaaS
Level of Control
Level of Abstraction
22. Google Compute Engine
• IaaS level*
• Minute-by-minute billing (10 minutes minimum)
• Variety of virtual hardware selections (CPU
config and mem size)
• Standard or custom VM images
• Can be accessed through command line and
RESTful API
23. GCE Demo
• Exploring Google Cloud developer console,
creating a VM
• Starting steps for hosting a website
(installing Apache, etc.)
37. Hadoop
• Hadoop is an open-source software
framework that supports data-intensive
distributed applications
• A Hadooop cluster is composed of a single
master node and multiple worker nodes
38. Hadoop
Has two main services:
1. Storing large amounts of data: HDFS, Hadoop
Distributed File System
2. Processing large amounts of data:
implementation of the MapReduce
programming model
39. HDFS
Metadata
Store
Name node
Node 1 Node 2
Block A Block B Block A Block B
Node 3
Block A Block B
Data node Data node Data node
40. Job / task management
Jobtracker
Name node
Heart beat signals and
communication
Tasktracker Tasktracker
Map 1 Reduce 1 Map 2 Reduce 2
Tasktracker
Map 3 Reduce 3
Data node Data node Data node