SlideShare a Scribd company logo
1 of 50
Download to read offline
© 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Chris Johnson, Solutions Architect
Ronen Artzi, R&D Architect and Cloud Evangelist
Michael S. Heimlich, Ph.D., Solution Delivery Manager
December 1, 2016
20k in 20 days – Agile Genomic Analysis
Decoupled Architectures
ENT320
What to Expect from the Session
• Learn about AWS Well-Architected
• Review decoupled architectures in AWS
• Consider how event-based architectures improve this
model
• Learn how AstraZeneca used Agile methods to analyze
20,000 Exomes in 20 days
What is the Well-Architected Program?
Reliability
Ensuring that a given system is
architected to meet operational
thresholds during a specific
period of time.
Performance
Ensuring a system delivers maximal
performance for a set of resources
(instances, storage, database and
space/time).
Cost Optimization
Cost Optimization helps achieve the lowest
price for a workload or set of workloads
while taking into account fluctuating needs.
Security
Complimenting the AWS Security
Best Practices whitepaper, our
Security pillar reviews definitions
and compliance best practices.
5 Architectural Pillars
Operational Excellence
Focusing around operational practices and
procedures used to manage production
workloads.
Reliability Pillar Dimensions
Pillar Area of Focus
• Ensuring a given system is architected
to meet operational thresholds, during
a specific period of time
• Meet increased workload demands
• Recover from failures with minimal
disruption or no disruption
Well-Architecting
For “Reliability”
Service Limits
Multi-
AZ/Region
Scalability
Health
Checking &
Monitoring
Networking
Self-Healing
Automation/DR
/HA/Backup
Security Pillar Dimensions
Pillar Area of Focus
• Adhering and complimenting the AWS
Security Best Practices whitepaper
• Review of definitions and compliance
best practices and methodologies
• Review of enforcement and
governance best practices and
methodologies
Well-Architecting
For “Security”
Identity/Key
Management
Encryption
Security
Monitoring &
Logging
Dedicated
Instances
Compliance
Governance
Operational Excellence Pillar Dimensions
Pillar Area of Focus
• Achieve highly automated and
resilient deployment pipelines for code
changes
• Orchestrate operational requirements
once the workloads are deployed into
production
Well-Architecting
For “Operational Excellence”
Amazon
CloudWatch
Change
Auditing
CI/CD
Pipeline
Configuration
Management
Service
Catalog
AWS SDKs
Cost Optimization Pillar Dimensions
Pillar Area of Focus
• Achieve the lowest price for a
system/workload or set of
systems/workloads
• Optimize cost while taking into
account fluctuating needs
Well-Architecting
For “Cost Optimization”
Spot/RI
Environment/
Volume
Tuning
Service
Selection
Account
Management
Consolidated
Billing
Decommission
Resources
Performance Pillar Dimensions
Pillar Area of Focus
• Ensuring a system/workload delivers
maximum performance for a set of
AWS resources utilized (instances,
storage, database and locality)
• Provide optimal performance
efficiency best practices and
guidelines
Well-Architecting
For “Performance”
Right AWS
Services
Resource
Utilization
Storage
Architecture
Caching
Latency
Requirements
Planning &
Benchmarking
Let’s consider Performance Optimization. . .
In the beginning there was Amazon SQS:
• First service offered by AWS
• Cornerstone for service-oriented architectures
• Enables decoupling of application components
• Allows for asynchronous processing
The Classic Decoupled AWS Architecture
Classic Decoupled Architecture
“Pull”
• Uses a queue (like SQS) for passing information between
systems
• Messages are pulled off a queue
• Requires a process that periodically polls the queue for
messages
• Widely used pattern in service-oriented architecture
The Classic Decoupled AWS Architecture
Advantages
• Decoupled
• Clients don’t know about workers and vice versa
• Scalable
• Easy to add workers and queues
• Asynchronous
• Long running jobs are processed in the background
• Highly Available
• Workers and queues run across Availability Zones in an AWS Region
Classic Decoupled AWS Architecture
Aspects to think about …
• AMI Baking
• Use AMIs so that you’re using a preconfigured image for your worker
tier
• Auto Scale Groups
• Use Auto Scaling Groups minimize idle Amazon EC2 instances, and
consider the right metric for triggers
• Undifferentiated Heavy Lifting
• Message Validity and Timeout configurations to manage your queues
and workers effectively
Event-Driven Architecture
What is it?
• A way of responding to certain actions that occur in an AWS
service
• Provides hooks for an application to execute code in response to
an event
• Eliminates the “undifferentiated heavy lifting” of managing and
scaling compute resources and queues to execute event
handling code
Event-Driven Architecture – S3 Notifications
Event-Driven Architecture - S3 Notifications
S3 will send notifications when certain events happen in a bucket.
Notifications can be published to different targets:
• Amazon SNS
• Useful when the occurrence of an event must be broadcast to a large
number of clients
• Amazon SQS
• Useful when a worker process needs to respond to the S3 event
asynchronously
• AWS Lambda
• Automatically executes code when an S3 event occurs
Event-Driven Architecture – S3 Notifications
SNS Topic
SQS
Email
HTTP/S
SMS
Mobile Push
Event-Driven Architecture & Lambda
“Push”
• S3 Notification pushes an event to Lambda
• Lambda service executes code in response to an event
Event-Driven Architecture
Advantages
• Reduces operational complexity
• No need to manage fleets of EC2 instances that process messages off
a queue
• Cost-Effective
• Pay only for the number of executions of event handling code
• No need to fine-tune auto scaling rules to limit idle CPU cycles
Event-Driven Architecture with Lambda
Aspects to think about …
• Virtual wiring!
• Wire up Lambda using SNS, keep Inputs/Outputs simple and within 5
minute durations
• Scratch space
• Each Lambda function receives 500 MB of nonpersistent disk space in
its own /tmp directory. If you need more, consider other persistent
stores
• Remove Kernel/OS dependencies
• Embrace the simplification of no longer worrying about the underlying
OS
Pricing Comparison
Decoupled Event Driven
Compute Pay for each hour of EC2
compute time.
Pay for each execution of
a Lambda function.
Scaling Pay for more EC2
instances or for larger
instances.
No cost to scale up –
handled by the Lambda
service.
Storage Pay for Amazon EBS or
S3 usage.
Pay for S3 usage.
Network Normal data transfer
charges.
Normal data transfer
charges.
20k in 20 Days - Agile Genomic Analysis
We are a global, science-led
biopharmaceutical business
pushing the boundaries of science
to deliver life-changing medicines.
$24.7bn
2015 Revenue*
100+
Countries
AZ’s Genomic Strategy - Bring Better Medicines to Patients, Faster
Genomic Sequencing – Digitizing the Human
Process and align
to a reference
Fragment the DNA
to create a library
Amplify and read the
fragments with a sequencer
Whole Exome vs. Whole Genome
Whole Exome Advantages Whole Genome Advantages
Protein coding regions, 2% of genome Contains everything
~ 85% of disease causing mutations More reliable sequence coverage
Lower sequencing cost at a high depth Better coverage uniformity
Reduces storage and analysis costs Minimal amplification needed
Increased number of samples Universal for all species
Genomic Mutations – Disease Drivers
X
Deletion Duplication Insertion
+ + +
20 days …
“Give me six hours to chop down a tree and I
will spend the first four sharpening the axe.”
― Abraham Lincoln
“Champions do not become champions when they win
the event, but in the hours, weeks, months and years
they spend preparing for it. The victorious performance
itself is merely the demonstration of their championship
character.”
― Alan Armstrong
The Journey toward 20 Days …
Program
Management
Global Privacy
Office
Technologies
IT
Quality
IT Security
Business
Science Units
Legal
Urgency
.. The Pragmatic way
Do the Right things
… Adjusting to new concepts
Do things right
…The Cloud mindset
Stacked Security and Compliance ModelStackedResponsibility
APIs
Service Endpoint
Storage
Compute
Networking Management
Services
Regions
Availability Zones
Edge Locations
Amazon
Web
Services
Client Side Encryption Server Side Encryption Network Communication Protection
Operating System Network and Instance Firewall Platform Logging/Monitoring
Identify and Access Management
AZ R&D
Cloud
Project Researcher Collaborator Application End User / App
Data Owner
Genomics has challenged our existing
approaches to data privacy:
Genomes are Sensitive Personal Data
31
De-identification and patient consent
Standards & processes
• Your Genome, your fingerprint
• But … not only yours …
Applied Stacked Security and Compliance Model
dbGap / Genomic Data
Provider
Protecting against risk
associated with
releasing individual’s
genomes.
AstraZeneca
(QA/Privacy/Security)
Protecting AZ patient
Information security
and privacy for risk of
exposure
HIPPA
Protecting against risks
in releasing Personal
Information(PHI)
APIs
Service Endpoint
Storage
Compute
Networking
Management
Services
Regions
Availability Zones
Edge Locations
Client/Client Side Encryption Network Communication Protection
Operating System Network and Instance Firewall Platform Logging/Monitoring
Identify and Access Management
Genomic Project
StackedResponsibility
Project
Team
AZ
Science
Cloud
Platform
AWS
Redefining the Landscape for Genetic Drivers in Cancer
Motivation for Re-Analysis of TCGA Exome Data
33
Improvements in tools, references, and resources allow us to better define the causes of cancer
• hg38 Reference Genome
• Updated from hg19 – more accurate mutation detection
• VarDict Variant Caller
• 20% better sensitivity finding mutations compared to the best current algorithms
• Developed by AstraZeneca, now open source
• Better Computational Resources
• Quickly re-analyze the data at scale
• Bring computational resources to the data, enabling us to succeed
A Project is Born
34
Project: Re-analyze 20k TCGA Exomes in 20 days
• Use the new hg38 reference genome
• Utilize VarDict to improve the variant calls
• Complete in ~20 days (Between Thanksgiving and Christmas)
Challenges:
20K Exomes = 270TB of Raw Genomic files
• Current Storage
• 500 TB, mostly filled
• Not enough space!
• HPC utilization
• Used for ongoing projects
• Stop everything else for ~ 1 year!
• Internet Connection
• 1Gbps Fastline
• ~25 days just to download!
• Experience to Date
• On-premises: < 3,000 exomes (3 years)
• Process ~ 7x as many in < 1 month!
The Informatics Workflow – Paradigm Shift
The Old Way
• All at once
• Coupled / Synchronous
Analyze Post-Process
The New Way
• Process when available
• Decoupled / Asynchronous
Download
Project
Project
Download
Analyze
Post-Process
PARALLEL INGESTION
AT SCALE
ASYNCHRONOUS
PARALLEL ANALYSIS
AT SCALE
Alignment
Slice Index and Stats
Variant Call
UPLOAD TO
LOCAL HPC FOR
DOWNSTREAM
ANALYSIS
Simplify: Turn It to a Loosely Coupled At Scale Solution
36
PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS
Simplify: Turn It to a Loosely Coupled At Scale Solution
37
Science
Requested
Target
Samples
CGHub Local Science HPCDevOps Bucket
INGESTION S3 Bucket
Ingestion
Request
Worker
RESULTS S3
Bucket
Automation
Worker VLAD
Automation Step
Workers VLAD
Loader Worker
INGESTION
Request Queue
ANALYSIS
Request Queue
PIPELINE AUTOMATION
Results Queue
PIPELINE
Results Queue
Queue
Process
Storage
PIPELINE S3 Bucket
BAM, BAI
VCF
Analysis
Request Worker
PARALLEL INGESTION
AT SCALE
ASYNCHRONOUS
PARALLEL ANALYSIS
AT SCALE
Alignment
Slice Index and Stats
Variant Call
UPLOAD TO
LOCAL HPC FOR
DOWNSTREAM
ANALYSIS
PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS
ASYNCHRONOUS
PARALLEL ANALYSIS
AT SCALE
Alignment
Slice Index and Stats
Variant Call
UPLOAD TO
LOCAL HPC FOR
DOWNSTREAM
ANALYSIS
Intelligent Scale
Ingestion Process
Analysis:
- 20K Exomes -> ~270TB (BAMs, BAIs)
- Distribution : most files <10GB, 10s 50GB, couple >200GB
- Theoretical best rate to download 270TB: ~25 days
Parallelize/Scale opportunity:
- CGHUB’s GeneTorrent Client uses multiple threads as it
brings a single file. We can run several Clients in parallel.
- S3 allows significant parallel throughput (use TCGA uuid as
randomizer for object name)
- AWS Network Bandwidth is superb
- Spot Instances will provide ingestion at scale with reduced
cost ( $0.2 / $0.08)
- Server Less via Lambda is perfect to trigger post load
Analysis ( generate Analysis Request Message )
Results: We got 1.5-2 TB/Hour with ~300
Workers/Instances !!
IngestionWorkers
N Workers per
Instance
Analysis Request
Queue
ReadyToAnalyze
Queue
80GB/160GB/
320GB
Launch
ConfigGroups
The Data Ingestion Story
39
~52TB/6 Days
~85TB/
2 Days
~115TB
~63TB/
1.5 Days
Cautiously test the water Bullish!!! Play to Win
Simplify: Turn It to a Loosely Coupled At Scale Solution
40
Science
Requested
Target
Samples
CGHub Local Science HPCDevOps Bucket
INGESTION S3 Bucket
Ingestion
Request
Worker
RESULTS S3
Bucket
Automation
Worker VLAD
Automation Step
Workers VLAD
Loader Worker
INGESTION
Request Queue
ANALYSIS
Request Queue
PIPELINE AUTOMATION
Results Queue
PIPELINE
Results Queue
Queue
Process
Storage
PIPELINE S3 Bucket
BAM, BAI
VCF
Analysis
Request Worker
PARALLEL INGESTION
AT SCALE
ASYNCHRONOUS
PARALLEL ANALYSIS
AT SCALE
Alignment
Slice Index and Stats
Variant Call
UPLOAD TO
LOCAL HPC FOR
DOWNSTREAM
ANALYSIS
PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS
PARALLEL
INGESTION
AT SCALE
UPLOAD TO LOCAL
HPC FOR
DOWNSTREAM
ANALYSIS
Pipeline Analysis
41
Design Principles :
- Use all available resources (Local/Cloud)
- Optimize Env to the task
- Align Environment to target goals (time, cost)
Parallelize/Scale opportunity:
- S3 allows significant parallel throughput (use)
- AWS Network Bandwidth is superb
- Single Orchestration Control Plane (Queue Base)
• Use all Available Resources:
• Two Local Pipeline Analysis Engines
• Three Cloud-Based Pipeline Engines
- Optimized Task-Tuned Clusters:
- AWS R3.8xlarge based RAVE Platform
- AWS C3.8xlarge based RAVE Platform
Results: Orchestrated Hybrid On-Demand
Auto Scaled Pipeline.
Univa Based
Grid Engine
On Premise
GPFS
NextGen BcBio
S3
Bina Rave
Template/Automation
Post Alignment Slicing
Post Realignment :
- Slice BAM and extract Gene Of Interest List
- Generate Stats
- Index BAM File
Parallelize/Scale opportunity:
- Operation can be done on every realigned BAM file
while it’s being used for further pipeline activities.
- Slice, Stats, Index can be uploaded to local HPC
regardless to Variant call results as long are
distributed to the right canonical named located
(based on tcga uuid)
- S3 bucket can hold “infinite” number of objects
without “folders” structure
- Spot Instances are perfect for cost reduction
Results:
- When Running at 700 Workers (100 nodes) we did
~6000 Slice ,Stats , Index in less than 3 hours
Post Align
N Workers per
Instances
Pipeline Analysis
Queue
ReadyToUpload
Queue
80GB/160GB/
320GB
Launch
ConfigGroup
S3
Simplify: Turn It to a Loosely Coupled At Scale Solution
43
Science
Requested
Target
Samples
CGHub Local Science HPCDevOps Bucket
INGESTION S3 Bucket
Ingestion
Request
Worker
RESULTS S3
Bucket
Automation
Worker VLAD
Automation Step
Workers VLAD
Loader Worker
INGESTION
Request Queue
ANALYSIS
Request Queue
PIPELINE AUTOMATION
Results Queue
PIPELINE
Results Queue
Queue
Process
Storage
PIPELINE S3 Bucket
BAM, BAI
VCF
Analysis
Request Worker
PARALLEL INGESTION
AT SCALE
ASYNCHRONOUS
PARALLEL ANALYSIS
AT SCALE
Alignment
Slice Index and Stats
Variant Call
UPLOAD TO
LOCAL HPC FOR
DOWNSTREAM
ANALYSIS
PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS
INGESTION
AT SCALE
ASYNCHRONOUS
PARALLEL ANALYSIS
AT SCALE
Alignment
Slice Index and Stats
Variant Call
Cleanup and Bring in What Matters
Loader
Developed
+270TB of raw data  ~9TB Meaningful Analysis
Time Shrinking Machine – Getting Results Faster
Months
01 03 05 07 09 11 13
Start
New Way
1/21/2016
Old Way
2/8/2017
Genomic File Ingestion
1/1/2016 - 1/25/2016
Data Analysis, Exclusive Use
1/26/2016 - 1/25/2017
Post-Processing
1/26/2017 - 2/1/2017
File Cleanup
2/2/2017 - 2/8/2017
Genomic File Ingestion (270 TB @ 5 Gbps) 5 Days
1/1/2016 - 1/5/2016
Data Analysis (7680 CPU Cores)
1/1/2016 - 1/21/2016
Post-Processing
1/1/2016 - 1/21/2016
File Cleanup
1/1/2016 - 1/21/2016
Resource Shrinking Machine – Use/Pay When Needed
Months
01 03 05 07 09 11 13
Start
1/21/2016 2/8/2017
1/1/2016 - 1/25/2016
1/26/2016 - 1/25/2017
1/26/2017 - 2/1/2017
2/2/2017 - 2/8/2017
1/1/2016 - 1/21/2016
Cloud On Premises
$$$ OpEx – No Physical Infrastructure
$$$ CapEx – No Flexible Infrastructure
270 TB
Data Reduction
9 TB
So…How Did We Do?
46
19,690
TCGA EXOMES
RE-ALIGNED TO
HG38
>270 TB
RAW DATA
UPLOADED AT
1.5-2 TB/HR WITH
300 LOADERS
7680 CPU
FOR ALIGNMENT
AND VARIANT
CALLING
>2000/HR
FOR SLICE
STATS AND INDEX
W/ 700 WORKERS
$4
PER EXOME INCL.
COMPUTE,
STORAGE,
NETWORK
Lessons Learned from the 20k in 20 Days Project
• Plan ahead as much as you can 
• Work first on the foundation and enabling factors
• Security and Privacy foundation
• DevOps and Automation
• Understand the details of:
• Process
• Computation and Data Scale
• Leverage the AWS XaaS
• Measure Everything and associate it to Business Outcome
Acknowledgements
AstraZeneca
• Vlad Saveliev
• Ronen Artzi
• Michael Heimlich
• Tristan Lubinski
• Jonathan Dry @DrySci
• Danielle Greenawalt @BostonBioinfX
• Justin Johnson @BioInfo
• Zhongwu Lai @ZhongwuL
• Stefanie Rintoul
• Vitaly Rozenman
• Carl Barrett
• Brian Dougherty
• Bryan Takasaki
AstraZeneca Security/Privacy
• Richard Paul
• Gayle Pearce
• Lee Ann Heckathorn
• Stephen Weil
• Sheri Arnell
• Tommy Farrell
• Victoria Southern
Bina/Roche
• Andi Broka
• Engineering Team
• Product Management Team
• Science Team
Thank you!
Remember to complete
your evaluations!

More Related Content

What's hot

Build a Serverless Web Application in One Day - DevDay Austin 2017
Build a Serverless Web Application in One Day - DevDay Austin 2017Build a Serverless Web Application in One Day - DevDay Austin 2017
Build a Serverless Web Application in One Day - DevDay Austin 2017
Amazon Web Services
 

What's hot (20)

HSBC and AWS Day - Big Data and HPC on AWS
HSBC and AWS Day - Big Data and HPC on AWSHSBC and AWS Day - Big Data and HPC on AWS
HSBC and AWS Day - Big Data and HPC on AWS
 
Wild Rydes - Serverless DevOps to the Rescue
Wild Rydes - Serverless DevOps to the RescueWild Rydes - Serverless DevOps to the Rescue
Wild Rydes - Serverless DevOps to the Rescue
 
AWS re:Invent 2016: Building a Platform for Collaborative Scientific Research...
AWS re:Invent 2016: Building a Platform for Collaborative Scientific Research...AWS re:Invent 2016: Building a Platform for Collaborative Scientific Research...
AWS re:Invent 2016: Building a Platform for Collaborative Scientific Research...
 
SMC302 Building Serverless Web Applications
SMC302 Building Serverless Web ApplicationsSMC302 Building Serverless Web Applications
SMC302 Building Serverless Web Applications
 
AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...
AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...
AWS re:Invent 2016: How Thermo Fisher Is Reducing Mass Spectrometry Experimen...
 
SRV412 Deep Dive on CICD and Docker
SRV412 Deep Dive on CICD and DockerSRV412 Deep Dive on CICD and Docker
SRV412 Deep Dive on CICD and Docker
 
AWS re:Invent 2016: Real-Time Data Exploration and Analytics with Amazon Elas...
AWS re:Invent 2016: Real-Time Data Exploration and Analytics with Amazon Elas...AWS re:Invent 2016: Real-Time Data Exploration and Analytics with Amazon Elas...
AWS re:Invent 2016: Real-Time Data Exploration and Analytics with Amazon Elas...
 
S/4HANA on AWS-SAPPHIRE NOW 2016
S/4HANA on AWS-SAPPHIRE NOW 2016S/4HANA on AWS-SAPPHIRE NOW 2016
S/4HANA on AWS-SAPPHIRE NOW 2016
 
Build a Serverless Web Application in One Day - DevDay Austin 2017
Build a Serverless Web Application in One Day - DevDay Austin 2017Build a Serverless Web Application in One Day - DevDay Austin 2017
Build a Serverless Web Application in One Day - DevDay Austin 2017
 
DevOps on AWS
DevOps on AWSDevOps on AWS
DevOps on AWS
 
WKS401 Deploy a Deep Learning Framework on Amazon ECS and EC2 Spot Instances
WKS401 Deploy a Deep Learning Framework on Amazon ECS and EC2 Spot InstancesWKS401 Deploy a Deep Learning Framework on Amazon ECS and EC2 Spot Instances
WKS401 Deploy a Deep Learning Framework on Amazon ECS and EC2 Spot Instances
 
The Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel AvivThe Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel Aviv
 
BDA402 Deep Dive: Log Analytics with Amazon Elasticsearch Service
BDA402 Deep Dive: Log Analytics with Amazon Elasticsearch ServiceBDA402 Deep Dive: Log Analytics with Amazon Elasticsearch Service
BDA402 Deep Dive: Log Analytics with Amazon Elasticsearch Service
 
Running Relational Databases on AWS
Running Relational Databases on AWS  Running Relational Databases on AWS
Running Relational Databases on AWS
 
Real-time Data Processing Using AWS Lambda
Real-time Data Processing Using AWS LambdaReal-time Data Processing Using AWS Lambda
Real-time Data Processing Using AWS Lambda
 
AWS re:Invent 2016: Running Batch Jobs on Amazon ECS (CON310)
AWS re:Invent 2016: Running Batch Jobs on Amazon ECS (CON310)AWS re:Invent 2016: Running Batch Jobs on Amazon ECS (CON310)
AWS re:Invent 2016: Running Batch Jobs on Amazon ECS (CON310)
 
(CMP201) All You Need To Know About Auto Scaling
(CMP201) All You Need To Know About Auto Scaling(CMP201) All You Need To Know About Auto Scaling
(CMP201) All You Need To Know About Auto Scaling
 
Scaling the Platform for Your Startup - Startup Talks June 2015
Scaling the Platform for Your Startup - Startup Talks June 2015Scaling the Platform for Your Startup - Startup Talks June 2015
Scaling the Platform for Your Startup - Startup Talks June 2015
 
Cost Optimization at Scale
Cost Optimization at ScaleCost Optimization at Scale
Cost Optimization at Scale
 
SMC304 Serverless Orchestration with AWS Step Functions
SMC304 Serverless Orchestration with AWS Step FunctionsSMC304 Serverless Orchestration with AWS Step Functions
SMC304 Serverless Orchestration with AWS Step Functions
 

Viewers also liked

Viewers also liked (20)

AWS re:Invent 2016: Driving AWS Cost Efficiency at Your Company (ENT202)
AWS re:Invent 2016: Driving AWS Cost Efficiency at Your Company (ENT202)AWS re:Invent 2016: Driving AWS Cost Efficiency at Your Company (ENT202)
AWS re:Invent 2016: Driving AWS Cost Efficiency at Your Company (ENT202)
 
AWS re:Invent 2016| DAT318 | Migrating from RDBMS to NoSQL: How Sony Moved fr...
AWS re:Invent 2016| DAT318 | Migrating from RDBMS to NoSQL: How Sony Moved fr...AWS re:Invent 2016| DAT318 | Migrating from RDBMS to NoSQL: How Sony Moved fr...
AWS re:Invent 2016| DAT318 | Migrating from RDBMS to NoSQL: How Sony Moved fr...
 
AWS re:Invent 2016: Deep-Dive: Native, Hybrid and Web patterns with Serverles...
AWS re:Invent 2016: Deep-Dive: Native, Hybrid and Web patterns with Serverles...AWS re:Invent 2016: Deep-Dive: Native, Hybrid and Web patterns with Serverles...
AWS re:Invent 2016: Deep-Dive: Native, Hybrid and Web patterns with Serverles...
 
Database Migration: Simple, Cross-Engine and Cross-Platform Migrations with ...
 Database Migration: Simple, Cross-Engine and Cross-Platform Migrations with ... Database Migration: Simple, Cross-Engine and Cross-Platform Migrations with ...
Database Migration: Simple, Cross-Engine and Cross-Platform Migrations with ...
 
Getting Started with Amazon Enterprise Applications
Getting Started with Amazon Enterprise ApplicationsGetting Started with Amazon Enterprise Applications
Getting Started with Amazon Enterprise Applications
 
Successful Cloud Adoption for the Enterprise. Not If. When.
Successful Cloud Adoption for the Enterprise. Not If. When.Successful Cloud Adoption for the Enterprise. Not If. When.
Successful Cloud Adoption for the Enterprise. Not If. When.
 
Application Migrations
Application MigrationsApplication Migrations
Application Migrations
 
AWS Summit Gold Sponsor Presentation - Soltius
AWS Summit Gold Sponsor Presentation - SoltiusAWS Summit Gold Sponsor Presentation - Soltius
AWS Summit Gold Sponsor Presentation - Soltius
 
Best Practices for Protecting Cloud Workloads - November 2016 Webinar Series
Best Practices for Protecting Cloud Workloads - November 2016 Webinar SeriesBest Practices for Protecting Cloud Workloads - November 2016 Webinar Series
Best Practices for Protecting Cloud Workloads - November 2016 Webinar Series
 
Cost optimization at scale toronto v3
Cost optimization at scale toronto v3Cost optimization at scale toronto v3
Cost optimization at scale toronto v3
 
AWS re:Invent 2016: Learn how IFTTT uses ElastiCache for Redis to predict eve...
AWS re:Invent 2016: Learn how IFTTT uses ElastiCache for Redis to predict eve...AWS re:Invent 2016: Learn how IFTTT uses ElastiCache for Redis to predict eve...
AWS re:Invent 2016: Learn how IFTTT uses ElastiCache for Redis to predict eve...
 
Cloud Adoption
Cloud AdoptionCloud Adoption
Cloud Adoption
 
Deep Dive on Amazon Relational Database Service
Deep Dive on Amazon Relational Database ServiceDeep Dive on Amazon Relational Database Service
Deep Dive on Amazon Relational Database Service
 
Configuration Management with AWS OpsWorks  by Amir Golan, Senior Product Man...
Configuration Management with AWS OpsWorks  by Amir Golan, Senior Product Man...Configuration Management with AWS OpsWorks  by Amir Golan, Senior Product Man...
Configuration Management with AWS OpsWorks  by Amir Golan, Senior Product Man...
 
Ignite eCommerce growth with AWS
Ignite eCommerce growth with AWSIgnite eCommerce growth with AWS
Ignite eCommerce growth with AWS
 
AWS re:Invent 2016: Choosing the Right Partner for Your AWS Journey (ENT310)
AWS re:Invent 2016: Choosing the Right Partner for Your AWS Journey (ENT310)AWS re:Invent 2016: Choosing the Right Partner for Your AWS Journey (ENT310)
AWS re:Invent 2016: Choosing the Right Partner for Your AWS Journey (ENT310)
 
Building Your First Big Data Application on AWS
Building Your First Big Data Application on AWSBuilding Your First Big Data Application on AWS
Building Your First Big Data Application on AWS
 
AWS re:Invent 2016: Chalk Talk: Applying Security-by-Design to Drive Complian...
AWS re:Invent 2016: Chalk Talk: Applying Security-by-Design to Drive Complian...AWS re:Invent 2016: Chalk Talk: Applying Security-by-Design to Drive Complian...
AWS re:Invent 2016: Chalk Talk: Applying Security-by-Design to Drive Complian...
 
Releasing Software Quickly and Reliably With AWS CodePipeline by Mark Mansour...
Releasing Software Quickly and Reliably With AWS CodePipeline by Mark Mansour...Releasing Software Quickly and Reliably With AWS CodePipeline by Mark Mansour...
Releasing Software Quickly and Reliably With AWS CodePipeline by Mark Mansour...
 
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
AWS re:Invent 2016: Case Study: How Spokeo Improved Web Application Response ...
 

Similar to AWS re:Invent 2016: 20k in 20 Days - Agile Genomic Analysis (ENT320)

AWS Cloud Computing for Startups Werner Vogels -part i
AWS Cloud Computing for Startups   Werner Vogels -part iAWS Cloud Computing for Startups   Werner Vogels -part i
AWS Cloud Computing for Startups Werner Vogels -part i
Amazon Web Services
 
Wicked rugby
Wicked rugbyWicked rugby
Wicked rugby
Dklumb4
 
AWS Cloud Kata | Manila - Getting to Profitability on AWS
AWS Cloud Kata | Manila - Getting to Profitability on AWSAWS Cloud Kata | Manila - Getting to Profitability on AWS
AWS Cloud Kata | Manila - Getting to Profitability on AWS
Amazon Web Services
 
Aws what is cloud computing deck 08 14 13
Aws what is cloud computing deck 08 14 13Aws what is cloud computing deck 08 14 13
Aws what is cloud computing deck 08 14 13
Amazon Web Services
 

Similar to AWS re:Invent 2016: 20k in 20 Days - Agile Genomic Analysis (ENT320) (20)

Compliance In The Cloud Using Security By Design
Compliance In The Cloud Using Security By DesignCompliance In The Cloud Using Security By Design
Compliance In The Cloud Using Security By Design
 
Aws re invent 2018 recap
Aws re invent 2018 recapAws re invent 2018 recap
Aws re invent 2018 recap
 
Cloud First: New Architecture for New Infrastructure
Cloud First: New Architecture for New InfrastructureCloud First: New Architecture for New Infrastructure
Cloud First: New Architecture for New Infrastructure
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
 
Effective and Efficient Computing for the Government
Effective and Efficient Computing for the GovernmentEffective and Efficient Computing for the Government
Effective and Efficient Computing for the Government
 
AWS Cloud Computing for Startups Werner Vogels -part i
AWS Cloud Computing for Startups   Werner Vogels -part iAWS Cloud Computing for Startups   Werner Vogels -part i
AWS Cloud Computing for Startups Werner Vogels -part i
 
AWS webinar what is cloud computing 13 09 11
AWS webinar what is cloud computing 13 09 11AWS webinar what is cloud computing 13 09 11
AWS webinar what is cloud computing 13 09 11
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
Achieving Your Department Objectives: Providing Better Citizen Services at Lo...
Achieving Your Department Objectives: Providing Better Citizen Services at Lo...Achieving Your Department Objectives: Providing Better Citizen Services at Lo...
Achieving Your Department Objectives: Providing Better Citizen Services at Lo...
 
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
 
Wicked rugby
Wicked rugbyWicked rugby
Wicked rugby
 
AWS Storage and Edge Processing
AWS Storage and Edge ProcessingAWS Storage and Edge Processing
AWS Storage and Edge Processing
 
AWS Cloud Kata | Manila - Getting to Profitability on AWS
AWS Cloud Kata | Manila - Getting to Profitability on AWSAWS Cloud Kata | Manila - Getting to Profitability on AWS
AWS Cloud Kata | Manila - Getting to Profitability on AWS
 
Uses, considerations, and recommendations for AWS
Uses, considerations, and recommendations for AWSUses, considerations, and recommendations for AWS
Uses, considerations, and recommendations for AWS
 
Being Well Architected in the Cloud (Updated)
Being Well Architected in the Cloud (Updated)Being Well Architected in the Cloud (Updated)
Being Well Architected in the Cloud (Updated)
 
Being Well-Architected in the Cloud
Being Well-Architected in the CloudBeing Well-Architected in the Cloud
Being Well-Architected in the Cloud
 
Aws what is cloud computing deck 08 14 13
Aws what is cloud computing deck 08 14 13Aws what is cloud computing deck 08 14 13
Aws what is cloud computing deck 08 14 13
 
Building a Big Data & Analytics Platform using AWS
Building a Big Data & Analytics Platform using AWS Building a Big Data & Analytics Platform using AWS
Building a Big Data & Analytics Platform using AWS
 
What is Cloud Computing with AWS?
What is Cloud Computing with AWS?What is Cloud Computing with AWS?
What is Cloud Computing with AWS?
 
5 Years Of Building SaaS On AWS
5 Years Of Building SaaS On AWS5 Years Of Building SaaS On AWS
5 Years Of Building SaaS On AWS
 

More from Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

AWS re:Invent 2016: 20k in 20 Days - Agile Genomic Analysis (ENT320)

  • 1. © 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Chris Johnson, Solutions Architect Ronen Artzi, R&D Architect and Cloud Evangelist Michael S. Heimlich, Ph.D., Solution Delivery Manager December 1, 2016 20k in 20 days – Agile Genomic Analysis Decoupled Architectures ENT320
  • 2. What to Expect from the Session • Learn about AWS Well-Architected • Review decoupled architectures in AWS • Consider how event-based architectures improve this model • Learn how AstraZeneca used Agile methods to analyze 20,000 Exomes in 20 days
  • 3. What is the Well-Architected Program? Reliability Ensuring that a given system is architected to meet operational thresholds during a specific period of time. Performance Ensuring a system delivers maximal performance for a set of resources (instances, storage, database and space/time). Cost Optimization Cost Optimization helps achieve the lowest price for a workload or set of workloads while taking into account fluctuating needs. Security Complimenting the AWS Security Best Practices whitepaper, our Security pillar reviews definitions and compliance best practices. 5 Architectural Pillars Operational Excellence Focusing around operational practices and procedures used to manage production workloads.
  • 4. Reliability Pillar Dimensions Pillar Area of Focus • Ensuring a given system is architected to meet operational thresholds, during a specific period of time • Meet increased workload demands • Recover from failures with minimal disruption or no disruption Well-Architecting For “Reliability” Service Limits Multi- AZ/Region Scalability Health Checking & Monitoring Networking Self-Healing Automation/DR /HA/Backup
  • 5. Security Pillar Dimensions Pillar Area of Focus • Adhering and complimenting the AWS Security Best Practices whitepaper • Review of definitions and compliance best practices and methodologies • Review of enforcement and governance best practices and methodologies Well-Architecting For “Security” Identity/Key Management Encryption Security Monitoring & Logging Dedicated Instances Compliance Governance
  • 6. Operational Excellence Pillar Dimensions Pillar Area of Focus • Achieve highly automated and resilient deployment pipelines for code changes • Orchestrate operational requirements once the workloads are deployed into production Well-Architecting For “Operational Excellence” Amazon CloudWatch Change Auditing CI/CD Pipeline Configuration Management Service Catalog AWS SDKs
  • 7. Cost Optimization Pillar Dimensions Pillar Area of Focus • Achieve the lowest price for a system/workload or set of systems/workloads • Optimize cost while taking into account fluctuating needs Well-Architecting For “Cost Optimization” Spot/RI Environment/ Volume Tuning Service Selection Account Management Consolidated Billing Decommission Resources
  • 8. Performance Pillar Dimensions Pillar Area of Focus • Ensuring a system/workload delivers maximum performance for a set of AWS resources utilized (instances, storage, database and locality) • Provide optimal performance efficiency best practices and guidelines Well-Architecting For “Performance” Right AWS Services Resource Utilization Storage Architecture Caching Latency Requirements Planning & Benchmarking
  • 9. Let’s consider Performance Optimization. . . In the beginning there was Amazon SQS: • First service offered by AWS • Cornerstone for service-oriented architectures • Enables decoupling of application components • Allows for asynchronous processing
  • 10. The Classic Decoupled AWS Architecture
  • 11. Classic Decoupled Architecture “Pull” • Uses a queue (like SQS) for passing information between systems • Messages are pulled off a queue • Requires a process that periodically polls the queue for messages • Widely used pattern in service-oriented architecture
  • 12. The Classic Decoupled AWS Architecture Advantages • Decoupled • Clients don’t know about workers and vice versa • Scalable • Easy to add workers and queues • Asynchronous • Long running jobs are processed in the background • Highly Available • Workers and queues run across Availability Zones in an AWS Region
  • 13. Classic Decoupled AWS Architecture Aspects to think about … • AMI Baking • Use AMIs so that you’re using a preconfigured image for your worker tier • Auto Scale Groups • Use Auto Scaling Groups minimize idle Amazon EC2 instances, and consider the right metric for triggers • Undifferentiated Heavy Lifting • Message Validity and Timeout configurations to manage your queues and workers effectively
  • 14. Event-Driven Architecture What is it? • A way of responding to certain actions that occur in an AWS service • Provides hooks for an application to execute code in response to an event • Eliminates the “undifferentiated heavy lifting” of managing and scaling compute resources and queues to execute event handling code
  • 15. Event-Driven Architecture – S3 Notifications
  • 16. Event-Driven Architecture - S3 Notifications S3 will send notifications when certain events happen in a bucket. Notifications can be published to different targets: • Amazon SNS • Useful when the occurrence of an event must be broadcast to a large number of clients • Amazon SQS • Useful when a worker process needs to respond to the S3 event asynchronously • AWS Lambda • Automatically executes code when an S3 event occurs
  • 17. Event-Driven Architecture – S3 Notifications SNS Topic SQS Email HTTP/S SMS Mobile Push
  • 18. Event-Driven Architecture & Lambda “Push” • S3 Notification pushes an event to Lambda • Lambda service executes code in response to an event
  • 19. Event-Driven Architecture Advantages • Reduces operational complexity • No need to manage fleets of EC2 instances that process messages off a queue • Cost-Effective • Pay only for the number of executions of event handling code • No need to fine-tune auto scaling rules to limit idle CPU cycles
  • 20. Event-Driven Architecture with Lambda Aspects to think about … • Virtual wiring! • Wire up Lambda using SNS, keep Inputs/Outputs simple and within 5 minute durations • Scratch space • Each Lambda function receives 500 MB of nonpersistent disk space in its own /tmp directory. If you need more, consider other persistent stores • Remove Kernel/OS dependencies • Embrace the simplification of no longer worrying about the underlying OS
  • 21. Pricing Comparison Decoupled Event Driven Compute Pay for each hour of EC2 compute time. Pay for each execution of a Lambda function. Scaling Pay for more EC2 instances or for larger instances. No cost to scale up – handled by the Lambda service. Storage Pay for Amazon EBS or S3 usage. Pay for S3 usage. Network Normal data transfer charges. Normal data transfer charges.
  • 22. 20k in 20 Days - Agile Genomic Analysis
  • 23. We are a global, science-led biopharmaceutical business pushing the boundaries of science to deliver life-changing medicines. $24.7bn 2015 Revenue* 100+ Countries
  • 24. AZ’s Genomic Strategy - Bring Better Medicines to Patients, Faster
  • 25. Genomic Sequencing – Digitizing the Human Process and align to a reference Fragment the DNA to create a library Amplify and read the fragments with a sequencer
  • 26. Whole Exome vs. Whole Genome Whole Exome Advantages Whole Genome Advantages Protein coding regions, 2% of genome Contains everything ~ 85% of disease causing mutations More reliable sequence coverage Lower sequencing cost at a high depth Better coverage uniformity Reduces storage and analysis costs Minimal amplification needed Increased number of samples Universal for all species
  • 27. Genomic Mutations – Disease Drivers X Deletion Duplication Insertion + + +
  • 28. 20 days … “Give me six hours to chop down a tree and I will spend the first four sharpening the axe.” ― Abraham Lincoln “Champions do not become champions when they win the event, but in the hours, weeks, months and years they spend preparing for it. The victorious performance itself is merely the demonstration of their championship character.” ― Alan Armstrong
  • 29. The Journey toward 20 Days … Program Management Global Privacy Office Technologies IT Quality IT Security Business Science Units Legal Urgency .. The Pragmatic way Do the Right things … Adjusting to new concepts Do things right …The Cloud mindset
  • 30. Stacked Security and Compliance ModelStackedResponsibility APIs Service Endpoint Storage Compute Networking Management Services Regions Availability Zones Edge Locations Amazon Web Services Client Side Encryption Server Side Encryption Network Communication Protection Operating System Network and Instance Firewall Platform Logging/Monitoring Identify and Access Management AZ R&D Cloud Project Researcher Collaborator Application End User / App Data Owner
  • 31. Genomics has challenged our existing approaches to data privacy: Genomes are Sensitive Personal Data 31 De-identification and patient consent Standards & processes • Your Genome, your fingerprint • But … not only yours …
  • 32. Applied Stacked Security and Compliance Model dbGap / Genomic Data Provider Protecting against risk associated with releasing individual’s genomes. AstraZeneca (QA/Privacy/Security) Protecting AZ patient Information security and privacy for risk of exposure HIPPA Protecting against risks in releasing Personal Information(PHI) APIs Service Endpoint Storage Compute Networking Management Services Regions Availability Zones Edge Locations Client/Client Side Encryption Network Communication Protection Operating System Network and Instance Firewall Platform Logging/Monitoring Identify and Access Management Genomic Project StackedResponsibility Project Team AZ Science Cloud Platform AWS
  • 33. Redefining the Landscape for Genetic Drivers in Cancer Motivation for Re-Analysis of TCGA Exome Data 33 Improvements in tools, references, and resources allow us to better define the causes of cancer • hg38 Reference Genome • Updated from hg19 – more accurate mutation detection • VarDict Variant Caller • 20% better sensitivity finding mutations compared to the best current algorithms • Developed by AstraZeneca, now open source • Better Computational Resources • Quickly re-analyze the data at scale • Bring computational resources to the data, enabling us to succeed
  • 34. A Project is Born 34 Project: Re-analyze 20k TCGA Exomes in 20 days • Use the new hg38 reference genome • Utilize VarDict to improve the variant calls • Complete in ~20 days (Between Thanksgiving and Christmas) Challenges: 20K Exomes = 270TB of Raw Genomic files • Current Storage • 500 TB, mostly filled • Not enough space! • HPC utilization • Used for ongoing projects • Stop everything else for ~ 1 year! • Internet Connection • 1Gbps Fastline • ~25 days just to download! • Experience to Date • On-premises: < 3,000 exomes (3 years) • Process ~ 7x as many in < 1 month!
  • 35. The Informatics Workflow – Paradigm Shift The Old Way • All at once • Coupled / Synchronous Analyze Post-Process The New Way • Process when available • Decoupled / Asynchronous Download Project Project Download Analyze Post-Process
  • 36. PARALLEL INGESTION AT SCALE ASYNCHRONOUS PARALLEL ANALYSIS AT SCALE Alignment Slice Index and Stats Variant Call UPLOAD TO LOCAL HPC FOR DOWNSTREAM ANALYSIS Simplify: Turn It to a Loosely Coupled At Scale Solution 36 PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS
  • 37. Simplify: Turn It to a Loosely Coupled At Scale Solution 37 Science Requested Target Samples CGHub Local Science HPCDevOps Bucket INGESTION S3 Bucket Ingestion Request Worker RESULTS S3 Bucket Automation Worker VLAD Automation Step Workers VLAD Loader Worker INGESTION Request Queue ANALYSIS Request Queue PIPELINE AUTOMATION Results Queue PIPELINE Results Queue Queue Process Storage PIPELINE S3 Bucket BAM, BAI VCF Analysis Request Worker PARALLEL INGESTION AT SCALE ASYNCHRONOUS PARALLEL ANALYSIS AT SCALE Alignment Slice Index and Stats Variant Call UPLOAD TO LOCAL HPC FOR DOWNSTREAM ANALYSIS PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS ASYNCHRONOUS PARALLEL ANALYSIS AT SCALE Alignment Slice Index and Stats Variant Call UPLOAD TO LOCAL HPC FOR DOWNSTREAM ANALYSIS
  • 38. Intelligent Scale Ingestion Process Analysis: - 20K Exomes -> ~270TB (BAMs, BAIs) - Distribution : most files <10GB, 10s 50GB, couple >200GB - Theoretical best rate to download 270TB: ~25 days Parallelize/Scale opportunity: - CGHUB’s GeneTorrent Client uses multiple threads as it brings a single file. We can run several Clients in parallel. - S3 allows significant parallel throughput (use TCGA uuid as randomizer for object name) - AWS Network Bandwidth is superb - Spot Instances will provide ingestion at scale with reduced cost ( $0.2 / $0.08) - Server Less via Lambda is perfect to trigger post load Analysis ( generate Analysis Request Message ) Results: We got 1.5-2 TB/Hour with ~300 Workers/Instances !! IngestionWorkers N Workers per Instance Analysis Request Queue ReadyToAnalyze Queue 80GB/160GB/ 320GB Launch ConfigGroups
  • 39. The Data Ingestion Story 39 ~52TB/6 Days ~85TB/ 2 Days ~115TB ~63TB/ 1.5 Days Cautiously test the water Bullish!!! Play to Win
  • 40. Simplify: Turn It to a Loosely Coupled At Scale Solution 40 Science Requested Target Samples CGHub Local Science HPCDevOps Bucket INGESTION S3 Bucket Ingestion Request Worker RESULTS S3 Bucket Automation Worker VLAD Automation Step Workers VLAD Loader Worker INGESTION Request Queue ANALYSIS Request Queue PIPELINE AUTOMATION Results Queue PIPELINE Results Queue Queue Process Storage PIPELINE S3 Bucket BAM, BAI VCF Analysis Request Worker PARALLEL INGESTION AT SCALE ASYNCHRONOUS PARALLEL ANALYSIS AT SCALE Alignment Slice Index and Stats Variant Call UPLOAD TO LOCAL HPC FOR DOWNSTREAM ANALYSIS PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS PARALLEL INGESTION AT SCALE UPLOAD TO LOCAL HPC FOR DOWNSTREAM ANALYSIS
  • 41. Pipeline Analysis 41 Design Principles : - Use all available resources (Local/Cloud) - Optimize Env to the task - Align Environment to target goals (time, cost) Parallelize/Scale opportunity: - S3 allows significant parallel throughput (use) - AWS Network Bandwidth is superb - Single Orchestration Control Plane (Queue Base) • Use all Available Resources: • Two Local Pipeline Analysis Engines • Three Cloud-Based Pipeline Engines - Optimized Task-Tuned Clusters: - AWS R3.8xlarge based RAVE Platform - AWS C3.8xlarge based RAVE Platform Results: Orchestrated Hybrid On-Demand Auto Scaled Pipeline. Univa Based Grid Engine On Premise GPFS NextGen BcBio S3 Bina Rave Template/Automation
  • 42. Post Alignment Slicing Post Realignment : - Slice BAM and extract Gene Of Interest List - Generate Stats - Index BAM File Parallelize/Scale opportunity: - Operation can be done on every realigned BAM file while it’s being used for further pipeline activities. - Slice, Stats, Index can be uploaded to local HPC regardless to Variant call results as long are distributed to the right canonical named located (based on tcga uuid) - S3 bucket can hold “infinite” number of objects without “folders” structure - Spot Instances are perfect for cost reduction Results: - When Running at 700 Workers (100 nodes) we did ~6000 Slice ,Stats , Index in less than 3 hours Post Align N Workers per Instances Pipeline Analysis Queue ReadyToUpload Queue 80GB/160GB/ 320GB Launch ConfigGroup S3
  • 43. Simplify: Turn It to a Loosely Coupled At Scale Solution 43 Science Requested Target Samples CGHub Local Science HPCDevOps Bucket INGESTION S3 Bucket Ingestion Request Worker RESULTS S3 Bucket Automation Worker VLAD Automation Step Workers VLAD Loader Worker INGESTION Request Queue ANALYSIS Request Queue PIPELINE AUTOMATION Results Queue PIPELINE Results Queue Queue Process Storage PIPELINE S3 Bucket BAM, BAI VCF Analysis Request Worker PARALLEL INGESTION AT SCALE ASYNCHRONOUS PARALLEL ANALYSIS AT SCALE Alignment Slice Index and Stats Variant Call UPLOAD TO LOCAL HPC FOR DOWNSTREAM ANALYSIS PARALLEL INGESTION ASYNCHRONOUS PARALLEL ANALYSIS UPLOAD RESULTS INGESTION AT SCALE ASYNCHRONOUS PARALLEL ANALYSIS AT SCALE Alignment Slice Index and Stats Variant Call Cleanup and Bring in What Matters Loader Developed +270TB of raw data  ~9TB Meaningful Analysis
  • 44. Time Shrinking Machine – Getting Results Faster Months 01 03 05 07 09 11 13 Start New Way 1/21/2016 Old Way 2/8/2017 Genomic File Ingestion 1/1/2016 - 1/25/2016 Data Analysis, Exclusive Use 1/26/2016 - 1/25/2017 Post-Processing 1/26/2017 - 2/1/2017 File Cleanup 2/2/2017 - 2/8/2017 Genomic File Ingestion (270 TB @ 5 Gbps) 5 Days 1/1/2016 - 1/5/2016 Data Analysis (7680 CPU Cores) 1/1/2016 - 1/21/2016 Post-Processing 1/1/2016 - 1/21/2016 File Cleanup 1/1/2016 - 1/21/2016
  • 45. Resource Shrinking Machine – Use/Pay When Needed Months 01 03 05 07 09 11 13 Start 1/21/2016 2/8/2017 1/1/2016 - 1/25/2016 1/26/2016 - 1/25/2017 1/26/2017 - 2/1/2017 2/2/2017 - 2/8/2017 1/1/2016 - 1/21/2016 Cloud On Premises $$$ OpEx – No Physical Infrastructure $$$ CapEx – No Flexible Infrastructure
  • 46. 270 TB Data Reduction 9 TB So…How Did We Do? 46 19,690 TCGA EXOMES RE-ALIGNED TO HG38 >270 TB RAW DATA UPLOADED AT 1.5-2 TB/HR WITH 300 LOADERS 7680 CPU FOR ALIGNMENT AND VARIANT CALLING >2000/HR FOR SLICE STATS AND INDEX W/ 700 WORKERS $4 PER EXOME INCL. COMPUTE, STORAGE, NETWORK
  • 47. Lessons Learned from the 20k in 20 Days Project • Plan ahead as much as you can  • Work first on the foundation and enabling factors • Security and Privacy foundation • DevOps and Automation • Understand the details of: • Process • Computation and Data Scale • Leverage the AWS XaaS • Measure Everything and associate it to Business Outcome
  • 48. Acknowledgements AstraZeneca • Vlad Saveliev • Ronen Artzi • Michael Heimlich • Tristan Lubinski • Jonathan Dry @DrySci • Danielle Greenawalt @BostonBioinfX • Justin Johnson @BioInfo • Zhongwu Lai @ZhongwuL • Stefanie Rintoul • Vitaly Rozenman • Carl Barrett • Brian Dougherty • Bryan Takasaki AstraZeneca Security/Privacy • Richard Paul • Gayle Pearce • Lee Ann Heckathorn • Stephen Weil • Sheri Arnell • Tommy Farrell • Victoria Southern Bina/Roche • Andi Broka • Engineering Team • Product Management Team • Science Team