TOC• Overview: Cloud Eco System• Amazon Web Services for scientiﬁc computing • EC2 Deployment • S3, EBS and local Storage • Queueing / Messaging
Cloud LayersSoftware as a Service - SaaS running applicationsPlatform as a Service - PaaS software environments and servicesInfrastructure as a Service - IaaS virtual machines and network
Elastic Compute Cloud (EC2)• Virtual Data Center• Instance Types• Regions• Availability Zones• Security Groups (Firewalling) Can go down just like anyone else• Amazon Machine Images (AMI)
EC2 Setup• Management: • AWS web console • CLI: ec2-tools • Python: boto (check dev/src/tba)• Use the setup script• Get SSH ﬁngerprint from instance’s boot console• Use some form of parallel SSH for additional setup (Twisted conch works great, fabric for serial deployment)
The DNS Problem• Instances get new IP on boot• Solutions: • use Virtual Private Cloud (Amazon VPC) • auto-update hosts ﬁle • set up DNS server
Storage• Instance fast, cleared on boot, no charges• Elastic Block Storage (EBS) fast, networked, instant snapshot-able, N attachments per instance• Simple Storage Service (S3) HTTP access, CDN (CloudFront)
Messaging• Simple Queueing Service (Amazon SQS)• AMQP / RabbitMQ (c.f. knk’s recent rocketcircus)• Rolling your own (e.g. XML RPC) is ﬁne, too
TLDRAWS for the occasional number crunch: • 1/2 day to get basic setup running (longer for robust setups) • proﬁt
A particular slide catching your eye?
Clipping is a handy way to collect important slides you want to go back to later.