SlideShare a Scribd company logo
1 of 24
Download to read offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Move Data to AWS Faster for Migrations, DR,
& Bidirectional Workflows
S T G 3 8 2
Olga Kogan
Sr. Product Manager
Amazon Web Services
David Green
Enterprise Solutions Architect
Amazon Web Services
Lance Smith
Associate Director
Celgene
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Celgene case study
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What is AWS DataSync?
Online transfer service that simplifies, automates, and
accelerates moving data between on-premises storage and AWS
Fast data
transfer
Cost-
effective
Combines the speed and reliability of network acceleration
software with the cost-effectiveness of open source tools
Easy to use Secure and
reliable
Cloud
integrated
AWS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What’s the problem?
As more and more critical workloads move to the cloud …
… you need to
move increasingly
large datasets
along with them
AWS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Common use patterns for AWS DataSync
Migration of
active
application data
to AWS
Transferring
data for time
sensitive in-
cloud analysis
Replication of
data to AWS for
business
continuity
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Shared
file system
NFS TLS
How AWS DataSync works
On-Premise
Amazon S3
bucket
AWS storage resources
AWS
DataSync
Agent deployed
on-premises for
fast access to
local storage
Region
Amazon EFS
file system
AWS DataSync
agent
Data transfer
over the WAN via
efficient purpose-
built protocol
Managed from
AWS Console or
AWS Command
Line Interface
(AWS CLI)
Service in AWS
writes or reads
data from AWS
storage services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Celgene
Celgene is building a preeminent global
biopharmaceutical company focused on the
discovery, development, and commercialization of
innovative therapies for patients with cancer,
immune-inflammatory, and other unmet medical
needs
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Biotech lab environment challenges
• Multiple lab systems
• Different standards
• Distributed sites
• Various lab processes
• Diverse storage &
networking
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Moving data to the cloud
Use cases
• Rein in ”hidden” files
stored on local lab disk
• Data archive
• Data transfer to compute
• Centralize data for machine
learning & analytics
Challenges
• Impossible to change
applications in near-term
• Very hard to change people
and processes
• Data transfers & latency
• Offline data movement
possible, but lengthy
Previous options
• Amazon Simple Storage Service
(Amazon S3) upload clients, but
resistance to change by users
• Multiple backup tools
• Custom scripted utilizing cron
• One backup software cost $25K
for one instrument; we have 1K
instruments …
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Moving data to the cloud
Use cases
• Rein in ”hidden” files
stored on local lab disk
• Data archive
• Data transfer to compute
• Centralize data for machine
learning & analytics
Challenges
• Impossible to change
applications in near-term
• Very hard to change people
and processes
• Data transfers & latency
• Offline data movement
possible, but lengthy
Previous options
• Amazon S3 upload clients, but
resistance to change by users
• Multiple backup tools
• Custom scripted utilizing cron
• One backup software cost $25K
for one instrument; we have 1K
instruments …
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Moving data to the cloud
Use cases
• Rein in ”hidden” files
stored on local lab disk
• Data archive
• Data transfer to compute
• Centralize data for machine
learning & analytics
Challenges
• Impossible to change
applications in near-term
• Very hard to change people
and processes
• Data transfers & latency
• Offline data movement
possible, but lengthy
Previous options
• Amazon S3 upload clients, but
resistance to change by users
• Multiple backup tools
• Custom scripted utilizing cron
• One backup software cost $25K
for one instrument; we have 1K
instruments …
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Our hybrid architecture AWS Cloud
Region
VPC
On-premises architecture
NFS Server/
NAS
Lab PCResearch
Scientists
Data Acquisition
Instrumentation
Scientific
Experiment
CambridgeSeattleSan FranciscoSevilleSan Diego
1. Instruments and users write to PCs and then NAS
2. AWS DataSync pulls or reads data from NAS, and …
3. Sends it over the internet, or through our colo and AWS Direct
Connect, based on data type and size, and validates data integrity
4. Once the data lands in Amazon S3, archiving, analytics, machine
learning, and processing actions occur
Internet
Colocation
facilities
Or
AWS
DataSync
AWS
DataSync
Agent
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Terabytes
files
size per file
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS EFS
Demo architecture AWS Cloud
us-west-2
Oregon
On-premises architecture
NFS Server /
NAS:/data
Los Angeles
1. AWS DataSync moves files from NAS to Amazon S3 us-west-1
2. AWS DataSync writes data to Amazon S3 and Amazon Elastic File
System (Amazon EFS)
3. S3 Cross Region Replication (CRR) moves data to another region
within another account
4. S3 Lifecycle Policy writes data to Amazon Glacier
5. File access provided by AWS Storage Gateway (file gateway)
ap-northeast-2
Seoul
NFS Server /
NAS:/home/users
AWS
DataSync
Agent
AWS
DataSync
Demo architecture―DX
On-premises architecture
Los Angeles
group DirectConnect-Public-VIF {
type external;
peer-as 7224;
multipath;
neighbor 1.2.3.1 {
description "pubic-vif-01 dxvif-fg3qmoaq";
local-address 1.2.3.2;
authentication-key ""; ## SECRET-DATA
export EXPORT-PREFIXES;
}
neighbor 2.3.4.1 {
description "pubic-vif-02 dxvif-fgafmipq";
local-address 2.3.4.2;
authentication-key ""; ## SECRET-DATA
export EXPORT-PREFIXES;
}
}
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

More Related Content

What's hot

What's hot (20)

SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
SaaS Analytics and Metrics: Capturing and Surfacing the Data That's Fundament...
 
Manage Queries, and Audit Usage & Control Costs at Scale on Amazon Athena (AN...
Manage Queries, and Audit Usage & Control Costs at Scale on Amazon Athena (AN...Manage Queries, and Audit Usage & Control Costs at Scale on Amazon Athena (AN...
Manage Queries, and Audit Usage & Control Costs at Scale on Amazon Athena (AN...
 
The Future of Enterprise Applications is Serverless (ENT314-R1) - AWS re:Inve...
The Future of Enterprise Applications is Serverless (ENT314-R1) - AWS re:Inve...The Future of Enterprise Applications is Serverless (ENT314-R1) - AWS re:Inve...
The Future of Enterprise Applications is Serverless (ENT314-R1) - AWS re:Inve...
 
DevOps Concepts for Data Science (DEV347-R2) - AWS re:Invent 2018
DevOps Concepts for Data Science (DEV347-R2) - AWS re:Invent 2018DevOps Concepts for Data Science (DEV347-R2) - AWS re:Invent 2018
DevOps Concepts for Data Science (DEV347-R2) - AWS re:Invent 2018
 
Cost Optimization Tooling (ARC301) - AWS re:Invent 2018
Cost Optimization Tooling (ARC301) - AWS re:Invent 2018Cost Optimization Tooling (ARC301) - AWS re:Invent 2018
Cost Optimization Tooling (ARC301) - AWS re:Invent 2018
 
Post-Production Media Delivery at Scale with AWS (STG391) - AWS re:Invent 2018
Post-Production Media Delivery at Scale with AWS (STG391) - AWS re:Invent 2018Post-Production Media Delivery at Scale with AWS (STG391) - AWS re:Invent 2018
Post-Production Media Delivery at Scale with AWS (STG391) - AWS re:Invent 2018
 
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
Amazon EMR: Optimize Transient Clusters for Data Processing & ETL (ANT341) - ...
 
Reliability of the Cloud: How AWS Achieves High Availability (ARC317-R1) - AW...
Reliability of the Cloud: How AWS Achieves High Availability (ARC317-R1) - AW...Reliability of the Cloud: How AWS Achieves High Availability (ARC317-R1) - AW...
Reliability of the Cloud: How AWS Achieves High Availability (ARC317-R1) - AW...
 
Hands-On Building and Deploying .NET Applications on AWS (DEV331-R1) - AWS re...
Hands-On Building and Deploying .NET Applications on AWS (DEV331-R1) - AWS re...Hands-On Building and Deploying .NET Applications on AWS (DEV331-R1) - AWS re...
Hands-On Building and Deploying .NET Applications on AWS (DEV331-R1) - AWS re...
 
Lessons Learned from Building an AWS Service on AWS Lambda (SRV327-R1) - AWS ...
Lessons Learned from Building an AWS Service on AWS Lambda (SRV327-R1) - AWS ...Lessons Learned from Building an AWS Service on AWS Lambda (SRV327-R1) - AWS ...
Lessons Learned from Building an AWS Service on AWS Lambda (SRV327-R1) - AWS ...
 
Migrating Real-Time Sports Scores to the Cloud via Low-Latency Messaging (API...
Migrating Real-Time Sports Scores to the Cloud via Low-Latency Messaging (API...Migrating Real-Time Sports Scores to the Cloud via Low-Latency Messaging (API...
Migrating Real-Time Sports Scores to the Cloud via Low-Latency Messaging (API...
 
Best Practices for Running SQL Server on Amazon RDS (DAT323) - AWS re:Invent ...
Best Practices for Running SQL Server on Amazon RDS (DAT323) - AWS re:Invent ...Best Practices for Running SQL Server on Amazon RDS (DAT323) - AWS re:Invent ...
Best Practices for Running SQL Server on Amazon RDS (DAT323) - AWS re:Invent ...
 
Developing with .NET Core on AWS: What's New (DEV318-R1) - AWS re:Invent 2018
Developing with .NET Core on AWS: What's New (DEV318-R1) - AWS re:Invent 2018Developing with .NET Core on AWS: What's New (DEV318-R1) - AWS re:Invent 2018
Developing with .NET Core on AWS: What's New (DEV318-R1) - AWS re:Invent 2018
 
Implementing Multi-Region AWS IoT, ft. Analog Devices (IOT401) - AWS re:Inven...
Implementing Multi-Region AWS IoT, ft. Analog Devices (IOT401) - AWS re:Inven...Implementing Multi-Region AWS IoT, ft. Analog Devices (IOT401) - AWS re:Inven...
Implementing Multi-Region AWS IoT, ft. Analog Devices (IOT401) - AWS re:Inven...
 
Architecture Patterns of Serverless Microservices (ARC304-R1) - AWS re:Invent...
Architecture Patterns of Serverless Microservices (ARC304-R1) - AWS re:Invent...Architecture Patterns of Serverless Microservices (ARC304-R1) - AWS re:Invent...
Architecture Patterns of Serverless Microservices (ARC304-R1) - AWS re:Invent...
 
Migrating Data to the Cloud: Exploring Your Options from AWS (STG205-R1) - AW...
Migrating Data to the Cloud: Exploring Your Options from AWS (STG205-R1) - AW...Migrating Data to the Cloud: Exploring Your Options from AWS (STG205-R1) - AW...
Migrating Data to the Cloud: Exploring Your Options from AWS (STG205-R1) - AW...
 
Enabling Your Organization’s Amazon Redshift Adoption – Going from Zero to He...
Enabling Your Organization’s Amazon Redshift Adoption – Going from Zero to He...Enabling Your Organization’s Amazon Redshift Adoption – Going from Zero to He...
Enabling Your Organization’s Amazon Redshift Adoption – Going from Zero to He...
 
Computing at the Edge with AWS Greengrass and Amazon FreeRTOS, ft. General El...
Computing at the Edge with AWS Greengrass and Amazon FreeRTOS, ft. General El...Computing at the Edge with AWS Greengrass and Amazon FreeRTOS, ft. General El...
Computing at the Edge with AWS Greengrass and Amazon FreeRTOS, ft. General El...
 
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
 
Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...
Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...
Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...
 

Similar to Move Data to AWS Faster for Migrations, DR, & Bidirectional Workflows (STG382-R1) - AWS re:Invent 2018

Similar to Move Data to AWS Faster for Migrations, DR, & Bidirectional Workflows (STG382-R1) - AWS re:Invent 2018 (20)

Building Data Lakes and Analytics on AWS. IPExpo Manchester.
Building Data Lakes and Analytics on AWS. IPExpo Manchester.Building Data Lakes and Analytics on AWS. IPExpo Manchester.
Building Data Lakes and Analytics on AWS. IPExpo Manchester.
 
AWS Storage State of the Union
AWS Storage State of the UnionAWS Storage State of the Union
AWS Storage State of the Union
 
Big data journey to the cloud rohit pujari 5.30.18
Big data journey to the cloud   rohit pujari 5.30.18Big data journey to the cloud   rohit pujari 5.30.18
Big data journey to the cloud rohit pujari 5.30.18
 
Backup & Recovery - Optimize Your Backup and Restore Architectures in the Cloud
Backup & Recovery - Optimize Your Backup and Restore Architectures in the CloudBackup & Recovery - Optimize Your Backup and Restore Architectures in the Cloud
Backup & Recovery - Optimize Your Backup and Restore Architectures in the Cloud
 
AWS Storage State of the Union
AWS Storage State of the UnionAWS Storage State of the Union
AWS Storage State of the Union
 
Building a modern data platform in the cloud. AWS DevDay Nordics
Building a modern data platform in the cloud. AWS DevDay NordicsBuilding a modern data platform in the cloud. AWS DevDay Nordics
Building a modern data platform in the cloud. AWS DevDay Nordics
 
NetApp Cloud Data Services & AWS Empower Your Cloud Champions
NetApp Cloud Data Services & AWS Empower Your Cloud ChampionsNetApp Cloud Data Services & AWS Empower Your Cloud Champions
NetApp Cloud Data Services & AWS Empower Your Cloud Champions
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
 
Architecting a Serverless Data Lake on AWS
Architecting a Serverless Data Lake on AWSArchitecting a Serverless Data Lake on AWS
Architecting a Serverless Data Lake on AWS
 
Introducing AWS DataSync - Simplify, automate, and accelerate online data tra...
Introducing AWS DataSync - Simplify, automate, and accelerate online data tra...Introducing AWS DataSync - Simplify, automate, and accelerate online data tra...
Introducing AWS DataSync - Simplify, automate, and accelerate online data tra...
 
Building-a-Modern-Data-Platform-in-the-Cloud.pdf
Building-a-Modern-Data-Platform-in-the-Cloud.pdfBuilding-a-Modern-Data-Platform-in-the-Cloud.pdf
Building-a-Modern-Data-Platform-in-the-Cloud.pdf
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
 
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Atlanta ...
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Atlanta ...Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Atlanta ...
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Atlanta ...
 
AWS Data Transfer Services: Deep Dive - SRV302 - Chicago AWS Summit
AWS Data Transfer Services: Deep Dive - SRV302 - Chicago AWS SummitAWS Data Transfer Services: Deep Dive - SRV302 - Chicago AWS Summit
AWS Data Transfer Services: Deep Dive - SRV302 - Chicago AWS Summit
 
AWS Data Transfer Services Deep Dive
AWS Data Transfer Services Deep Dive AWS Data Transfer Services Deep Dive
AWS Data Transfer Services Deep Dive
 
AWS Storage State of the Union & APN Storage Ecosystem
AWS Storage State of the Union & APN Storage EcosystemAWS Storage State of the Union & APN Storage Ecosystem
AWS Storage State of the Union & APN Storage Ecosystem
 
How a Biotech Firm Streamlined Data Protection on AWS
 How a Biotech Firm Streamlined Data Protection on AWS How a Biotech Firm Streamlined Data Protection on AWS
How a Biotech Firm Streamlined Data Protection on AWS
 
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Toronto ...
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Toronto ...Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Toronto ...
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Toronto ...
 
Migrating your IT - Final
Migrating your IT - FinalMigrating your IT - Final
Migrating your IT - Final
 

More from Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Move Data to AWS Faster for Migrations, DR, & Bidirectional Workflows (STG382-R1) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Move Data to AWS Faster for Migrations, DR, & Bidirectional Workflows S T G 3 8 2 Olga Kogan Sr. Product Manager Amazon Web Services David Green Enterprise Solutions Architect Amazon Web Services Lance Smith Associate Director Celgene
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Celgene case study
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What is AWS DataSync? Online transfer service that simplifies, automates, and accelerates moving data between on-premises storage and AWS Fast data transfer Cost- effective Combines the speed and reliability of network acceleration software with the cost-effectiveness of open source tools Easy to use Secure and reliable Cloud integrated AWS
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What’s the problem? As more and more critical workloads move to the cloud … … you need to move increasingly large datasets along with them AWS
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Common use patterns for AWS DataSync Migration of active application data to AWS Transferring data for time sensitive in- cloud analysis Replication of data to AWS for business continuity
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Shared file system NFS TLS How AWS DataSync works On-Premise Amazon S3 bucket AWS storage resources AWS DataSync Agent deployed on-premises for fast access to local storage Region Amazon EFS file system AWS DataSync agent Data transfer over the WAN via efficient purpose- built protocol Managed from AWS Console or AWS Command Line Interface (AWS CLI) Service in AWS writes or reads data from AWS storage services
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Celgene Celgene is building a preeminent global biopharmaceutical company focused on the discovery, development, and commercialization of innovative therapies for patients with cancer, immune-inflammatory, and other unmet medical needs
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Biotech lab environment challenges • Multiple lab systems • Different standards • Distributed sites • Various lab processes • Diverse storage & networking
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Moving data to the cloud Use cases • Rein in ”hidden” files stored on local lab disk • Data archive • Data transfer to compute • Centralize data for machine learning & analytics Challenges • Impossible to change applications in near-term • Very hard to change people and processes • Data transfers & latency • Offline data movement possible, but lengthy Previous options • Amazon Simple Storage Service (Amazon S3) upload clients, but resistance to change by users • Multiple backup tools • Custom scripted utilizing cron • One backup software cost $25K for one instrument; we have 1K instruments …
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Moving data to the cloud Use cases • Rein in ”hidden” files stored on local lab disk • Data archive • Data transfer to compute • Centralize data for machine learning & analytics Challenges • Impossible to change applications in near-term • Very hard to change people and processes • Data transfers & latency • Offline data movement possible, but lengthy Previous options • Amazon S3 upload clients, but resistance to change by users • Multiple backup tools • Custom scripted utilizing cron • One backup software cost $25K for one instrument; we have 1K instruments …
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Moving data to the cloud Use cases • Rein in ”hidden” files stored on local lab disk • Data archive • Data transfer to compute • Centralize data for machine learning & analytics Challenges • Impossible to change applications in near-term • Very hard to change people and processes • Data transfers & latency • Offline data movement possible, but lengthy Previous options • Amazon S3 upload clients, but resistance to change by users • Multiple backup tools • Custom scripted utilizing cron • One backup software cost $25K for one instrument; we have 1K instruments …
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Our hybrid architecture AWS Cloud Region VPC On-premises architecture NFS Server/ NAS Lab PCResearch Scientists Data Acquisition Instrumentation Scientific Experiment CambridgeSeattleSan FranciscoSevilleSan Diego 1. Instruments and users write to PCs and then NAS 2. AWS DataSync pulls or reads data from NAS, and … 3. Sends it over the internet, or through our colo and AWS Direct Connect, based on data type and size, and validates data integrity 4. Once the data lands in Amazon S3, archiving, analytics, machine learning, and processing actions occur Internet Colocation facilities Or AWS DataSync AWS DataSync Agent
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 18. files
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 21. AWS EFS Demo architecture AWS Cloud us-west-2 Oregon On-premises architecture NFS Server / NAS:/data Los Angeles 1. AWS DataSync moves files from NAS to Amazon S3 us-west-1 2. AWS DataSync writes data to Amazon S3 and Amazon Elastic File System (Amazon EFS) 3. S3 Cross Region Replication (CRR) moves data to another region within another account 4. S3 Lifecycle Policy writes data to Amazon Glacier 5. File access provided by AWS Storage Gateway (file gateway) ap-northeast-2 Seoul NFS Server / NAS:/home/users AWS DataSync Agent AWS DataSync
  • 22. Demo architecture―DX On-premises architecture Los Angeles group DirectConnect-Public-VIF { type external; peer-as 7224; multipath; neighbor 1.2.3.1 { description "pubic-vif-01 dxvif-fg3qmoaq"; local-address 1.2.3.2; authentication-key ""; ## SECRET-DATA export EXPORT-PREFIXES; } neighbor 2.3.4.1 { description "pubic-vif-02 dxvif-fgafmipq"; local-address 2.3.4.2; authentication-key ""; ## SECRET-DATA export EXPORT-PREFIXES; } }
  • 23. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.