Can scale to larger instances, can back up, can restore to within the last 5 minutes, all tools work, all patching is managed
Hadoop on AWS - http://wiki.apache.org/hadoop/AmazonEC2
S3 = .12 / GB / month -> ~$150 / yr for 100 GB
EBS = .10 / GB / month -> ~$100 / yr for 100 GB
EC2 = .12 / hr (Small, on-demand, Windows) -> $1051 to run all year (up to 3.85 / hr, down to .01 / hr for spot instances), PLUS possibly other services, i.e. CloudWatch…
RDS = .14 / hr (small, on-demand, SQL 2008 STD) -> $1226 to run all year (up to 3.85 / hr, down to .05 / hr for heavy utilization), PLUS up/down data charges
Dynamo = .01 / 10 writes & .01 / 50 reads, PLUS up/down charges
Elastic Beanstalk / Windows = starter package $42 / month -> $504 / yr
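The yearly figures in these pricing notes come from simple multiplication, which can be sketched as below. The rates are the 2012 numbers quoted in the notes; the function names are illustrative only. Note that the straight formula gives about $144 and $120 for 100 GB of S3 and EBS, so the ~$150 and ~$100 in the notes appear to be rough round figures.

```python
# Rough yearly cost arithmetic for the rates quoted in the notes (2012 prices).
# Function names are hypothetical helpers, not part of any AWS tool.

HOURS_PER_YEAR = 24 * 365  # 8760, ignoring leap years


def yearly_from_hourly(rate_per_hour):
    """Cost of leaving an hourly-billed service running all year."""
    return rate_per_hour * HOURS_PER_YEAR


def yearly_from_monthly(rate_per_month):
    """Cost of a flat monthly package over twelve months."""
    return rate_per_month * 12


def yearly_storage(rate_per_gb_month, gigabytes):
    """Cost of keeping a fixed amount of data stored for a year."""
    return rate_per_gb_month * gigabytes * 12


if __name__ == "__main__":
    print("EC2 small, Windows :", round(yearly_from_hourly(0.12)))
    print("RDS small, SQL STD :", round(yearly_from_hourly(0.14)))
    print("Elastic Beanstalk  :", yearly_from_monthly(42))
    print("S3, 100 GB         :", round(yearly_storage(0.12, 100)))
    print("EBS, 100 GB        :", round(yearly_storage(0.10, 100)))
```

Data transfer (up/down) charges and add-on services such as CloudWatch are not included in this sketch.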
AWS for the SQL Server Pro
AWS for the SQL Server Pro @LynnLangit September 2012 – for SQL Saturday
What and Why AWS?
• Amazon’s web services – Amazon’s cloud
• Set of services
• Market leader
  – In market longest
  – Usually cheapest
  – Most often used in production
Application and/or Data Hosting
• Hosted compute
• Hosted data
  – Relational
  – Non-relational (NoSQL)
  – Other, massively parallel (i.e. Hadoop)
• Other services
Note about EC2 storage
• S3 by default
  – 10 GB max
  – 3 copies
• EBS optional
  – Can store AMIs (persistent)
  – Can ‘stop’ EC2 instances and ‘re-start’ – saves $$$
  – Costs more
  – Can expand
  – One copy only (faster)
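To put a rough number on the "saves $$$" point, here is a minimal sketch comparing an EBS-backed instance left running all year with one that is stopped outside working hours. It uses the .12 / hr EC2 and .10 / GB / month EBS rates from the pricing notes; the 10 GB volume size and the 8 hours x 5 days x 52 weeks schedule are assumptions chosen only for illustration. A stopped instance accrues no instance-hours, but its EBS volume is still billed.

```python
# Sketch: yearly cost of an EBS-backed instance, always-on vs. stopped when idle.
# Rates are the 2012 figures from the notes; the volume size and schedule
# below are hypothetical.

EC2_RATE_PER_HOUR = 0.12      # Small, on-demand, Windows
EBS_RATE_PER_GB_MONTH = 0.10  # EBS storage


def yearly_instance_cost(hours_running, ebs_gb):
    """Instance-hours actually used plus a full year of EBS storage."""
    compute = EC2_RATE_PER_HOUR * hours_running
    storage = EBS_RATE_PER_GB_MONTH * ebs_gb * 12
    return compute + storage


always_on = yearly_instance_cost(hours_running=24 * 365, ebs_gb=10)
office_hours = yearly_instance_cost(hours_running=8 * 5 * 52, ebs_gb=10)

if __name__ == "__main__":
    print(f"Always on    : ${always_on:,.2f}")
    print(f"Office hours : ${office_hours:,.2f}")
    print(f"Saved        : ${always_on - office_hours:,.2f}")
```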
Compare Cloud DB – RDBMS or NoSQL
Category | AWS | Google | Microsoft | Others
Cloud RDBMS | SQL Server | mySQL | SQL Azure | Hosted RDBMS on Rackspace
noSQL buckets | S3, EBS, Glacier | Cloud Storage / Google Drive | Azure Blobs | Many…
NoSQL databases | DynamoDB | H/R Datastore on GAE | Azure Tables | Heroku bundles
Streaming, Machine Learning | Custom EC2 | Prospective Search & Prediction API | StreamInsight & Mahout with Hadoop | Various – Splunk, etc…
Document or Graph | MongoDB on EC2 | Freebase (g) | MongoDB on Windows Azure | Hosted MongoDB
Hadoop | Elastic MapReduce on S3 & EC2 | Big Query (HBase-like) | Hadoop (HDFS) on Azure | Cloudera Whirr
Data sets & other | Karmasphere | Translation API, Full-text search | Azure DataMarket | Database.com
Compare other Cloud Services
Vendor | AWS | Google | Azure
Free tier | For new (6 months) | For developers | For new (30 days)
Compute | Elastic Beanstalk | Google App Engine | Web or Worker Roles
Piece Work | Mechanical Turk | n/a | n/a
Routing | Direct Connect | none | VPN services
Virtual Private Cloud | Virtual Private Cloud | none | Azure Appliance
Auto-scaling for VMs | w/RightScale for EC2 | w/RightScale for GCE | w/RightScale for Azure VMs
Conclusions
• EC2 for testing, training and production (IaaS)
• S3 for archiving R/W
• Glacier for archiving W only
• RDS for HA SQL Server
• DynamoDB for fast NoSQL – on SSDs
• Elastic Map Reduce for easy Hadoop
• Elastic Beanstalk for application scalability (PaaS)
www.TeachingKidsProgramming.org
• Free Courseware (recipes)
• Do a Recipe, Teach a Kid (Ages 10++)
• Java or Microsoft SmallBasic
Toward Data Craftsmanship…
Follow me @LynnLangit
RSS my blog www.LynnLangit.com
Hire me
• To help build your BI/Big Data solution
• To teach your team next gen BI
• To learn more about using NoSQL solutions