The 5 principles of google's cloud
Upcoming SlideShare
Loading in...5
×
 

The 5 principles of google's cloud

on

  • 1,903 views

I did this inspired by the Google IO 2011. The animations and videos don't show :-)

I did this inspired by the Google IO 2011. The animations and videos don't show :-)

Statistics

Views

Total Views
1,903
Views on SlideShare
1,860
Embed Views
43

Actions

Likes
5
Downloads
78
Comments
0

4 Embeds 43

http://www.skirtap.com 24
http://cloudbetween.com 17
http://www.linkedin.com 1
http://cloudbetween.wordpress.com 1

Accessibility

Upload Details

Uploaded via as Adobe PDF

Usage Rights

CC Attribution License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    The 5 principles of google's cloud The 5 principles of google's cloud Presentation Transcript

    • E N TE RP RI S EA R C H I TE C T U R E THE 5 PRINCIPLES OF OF GOOGLE’S ”CLOUD” Patrik Svensson, 2011, ptrksvnssn@gmail.comtorsdag den 12 maj 2011
    • E N TE RP RI S E THE VISION OF GOOGLEA R C H I TE C T U R Etorsdag den 12 maj 2011
    • E N TE RP RI S EAE R C C T H U I R T E THE 5 PRINCIPLES • Everything is a service (or an application in Android) • Relentless technical focus (thinking at nanoscale) • Data centers are the foundation • Code is king, Data is king kong • Identify and keep track on your userstorsdag den 12 maj 2011
    • E N TE RP RI S EA R C H I TE C T U R Etorsdag den 12 maj 2011
    • #1 EVERYTHING IS AE N TE RP RI S E SERVICE (OR ANAE R C C T H U I R T E APPLICATION)torsdag den 12 maj 2011
    • E N TE RP RI S E #2 RELENTLESSAE R C C T H U I R T E TECHNICAL FOCUS • Jedis build their own lightsabres • Parallelize, Distribute, Cache, Compress, Redundantize everything • Latency is VERY evil Source: http://www.flickr.com/photos/60994749@N07/5557591956/torsdag den 12 maj 2011
    • EXAMPLE: ”NUMBERSE N TE RP RI S E EVERYONE SHOULDAE R C C T H U I R T E KNOW” 1,000,000 ns = 1 ms 1,000,000,000 ns = 1 s Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • E N TE RP RI S E #3 DATA CENTERS AREAE R C C T H U I R T E THE FOUNDATIONtorsdag den 12 maj 2011
    • E N TE RP RI S EAE R C C T H U I R T E ECONOMIES OF SCALE • ~40 data centers in 2009, 1000,000 machines Source: http://techcrunch.com/2008/04/11/where-are-all-the-google-data-centers/torsdag den 12 maj 2011
    • E N TE RP RI S EA R C H I TE C T U R Etorsdag den 12 maj 2011
    • E N TE RP RI S E #4 CODE IS KING, DATA IS KING KONGA R C H I TE C T U R E Enterprise Architecture Technical Architecture i.e. which technologies do we use DATA CENTERS DATA CODE CONTROL USERS "We need: "We need to build applications "We need: and services, application-, "We need scheduling "We need to identify our One Distributed File Cooling, Power, integration- & data platforms, synchronization, lock users to be able to Systems, Distributed Perimeter Networks, parallell computing platforms & services, i.e. various interact, differentiate and One Shared memory, Containers, Racks, use an open source OS, upon forms of control customize the user & common data Switches & Hardware at our data center/data platform" mechanisms for data and experience" formats to get scale low cost that scale" code" and low cost" Implementation Architecture i.e. how do we implement the technologies Android, Chrome App Engine, Gmail, Search, Index GFS master GFS, Python, Java, C++ Google Work Queue, OpenID, OAuth, Google Google Container- BigTable, Chubby,Netscalar, Google Accounts available for most Protocol Buffers, Json based Data Centers Protocol Buffers HTTP Server, (Spanner) services Sawzall, Dremel, Percolator MapReduce Linuxtorsdag den 12 maj 2011
    • E N TE RP RI S E "Googles mission is toAE R C C T H U I R T E ABOUT DATA organize the worlds information and make it available to all" +20 Petabyte/day 200 150 100 ~10 Terabyte/day 50 ~2,5 Terabyte 0 Structured, Numerical Unstructured, Textual Communication, Traffictorsdag den 12 maj 2011
    • E N TE RP RI S EAE R C C T H U I R T E DATA CENTER ”ENTRY” • The same entry to each Data Center • ~50 caching (using Squid) • Built their own HTTP servers/farms Source: Ed Austin, ”The Anatomy of the Google Architecture”torsdag den 12 maj 2011
    • E N TE RP RI S EAE R C C T H U I R T E INSIDE THE CONTAINERS • Customized commodity servers, is customized racks in containers (+1000 servers), organized into clusters • All containers ”cloned” and look the same Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • THE SAME HW, OS ANDE N TE RP RI S E FILESYSTEMAE R C C T H U I R T E EVERYWHERE Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • E N TE RP RI S EAE R C C T H U I R T E BIGDATA AS DATABASE Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • E N TE RP RI S E BIGDATA IS COLUMN-AE R C C T H U I R T E BASED Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • E N TE RP RI S EAE R C C T H U I R T E BIGDATA NEEDS GFS • Use GFS to store data and logs Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • MAPREDUCE -E N TE RP RI S E A PARALLELLAE R C C T H U I R T E COMPUTING PLATFORM Source: Jeff Dean, ”Designs, Lessons and Advise from Building Large Distributed Systems”torsdag den 12 maj 2011
    • E N TE RP RI S E ABOUT CODING ATAE R C C T H U I R T E GOOGLE • Linux as operating system everywhere - is open source, highly customized for this (Android is also a higly customized version of Linux) • Serialization/Integration - Protocol buffers (RPC) runs at nano speed, internally used for ”everything”, Json and RESTful used for external API’s • Application-oriented Programming languages - mainly Python, Java and C++ • Data-oriented programming languages - Percolator, Sawzall, Dremel for various data processing task (so specialised tools for data!) • The Business Applications - Gmail, Search, App Engine etc - built upon data center infrasctructure, data platform and abovetorsdag den 12 maj 2011
    • E N TE RP RI S E #5 IDENTIFY AND KEEPAE R C C T H U I R T E TRACK OF YOUR USERS • You need a google account to start Android properly • OpenSocial is a collaborate effort to compete against Facebook • OpenID is an identity standard and OAuth is a standard for authorizing services • Google is identifying and tracking every step you take within their domainstorsdag den 12 maj 2011