SlideShare a Scribd company logo
1 of 19
3 ways to efficiently
migrate your big data
to AWS cloud
AWS Services useful
in the migration
process
Amazon EMR is a service that allows
cost-effective and fast processing of large
amounts of data. It uses the Hadoop and
Spark frameworks based on Amazon EC2 and Amazon
S3. It allows for efficient processing of large amounts
of data in processes such as indexing, data mining,
machine learning or financial analysis.
Amazon S3 (Simple Storage Service) is a
fully manager extraction, transformation and
loading (ETL) service that makes it easier for
clients to prepare and load data for analysis. It also allows
you to configure, coordinate and monitor complex data
flows.
AWS Glue is a fully managed
extraction, transformation and
loading (ETL) service that makes it
easier for clients to prepare and load data for
analysis. It also allows you to configure,
coordinate and monitor complex data flows.
Open source software
supporting big data
Open source software
Apache Hadoop is software for distributed
storage and processing of large data sets using
computer clusters.
Apache Spark is a software that is a
programming platform for distributed
computing.
▪ Hadoop is designed to efficiently support batch processing, while Spark is
designed to efficiently handle data in real-time.
▪ Hadoop is a high-latency computing structure that has no interactive mode,
while Spark gives low-latency computing and can process data interactively.
▪ Apache Spark is also a component of the Hadoop Ecosystem. Spark’s main
idea was to perform memory processing.
3 approaches to the
migration process
There are few approaches in cloud migration, but
these 3 allow you to make conscious decisions
about your architecture.
3 APPROACHES TO THE
MIGRATION PROCESS
It relies on redesigning the existing
infrastructure in such a way to make
full use of cloud computing. The
approach relies on the analysing the
existing architecture and the way it’s
being designed, which will allow to
provide benefits such as lower
memory and hardware costs, increase
operational flexibility to ensure
business benefits.
Re-architecting
It is an ideal solution when we need
more efficient infrastructure. By
transferring the workloads of the
existing environment, we can avoid
most of the changes that can occur
during re-architecting. A smaller
number of changes also reduces the
risk associated with unexpected work,
and thus your solution can come back
sooner or enter the market.
Lift and shift
It’s a combination of two previous
approaches. In this mode, the part
responsible for fast migration is
associated with lift and shift. Re-
architecting, in turn, supports the
possibilities of redesigning the needed
solutions. This approach allows a great
deal of flexibility, which allows you to
experiment with cloud solutions and
gain the necessary experience before
you permanently decide to move to
the cloud.
Hybrid
Prototyping in
the spirit of
best practices
Knowing the migration possibilities to the cloud,
let’s move on to prototyping. When learning new
solutions, there is always a learning stage. And as
you know, practice is its best form. Prototyping
should be crucial when implementing new
services and products. Here is the scenario the
same as before – the cheaper option is to check the application at the
prototyping stage. There is a similar story with instance types. The worst
assumption is that the application running in the on-premise
environment will work the same way in the cloud environment. There are
many factors that affect this. It’s worth running applications with loads
that can occur in the real world in a test environment.
Best Preactices
in prototyping
1. Make a list of all potential assumptions and uncertainties
while remembering what may have the greatest impact
on the environment.
2. First, select and implement the most risky aspects of
migration.
3. Set your goals in advance and don’t be afraid to ask. The
answers will help in project verification or answer the
question of how a given solution works.
4. Always prototype under similar conditions in which you
want to operate. You can start with a smaller
environment or set of features and then use the scale.
5. Iteration and Continuous Integration as the basis for creating
implementation tests. Using an automated environment and
scripts, you can run the test in several environments.
6. Ask the expert for verification to be able to check the test
configuration and environment. This will allow you to eliminate
errors and check if the results are not falsified.
7. Correctly running the tests will allow you to remove variables
that may be due to dependencies.
8. Document the test results and ask for verification to ensure
they are reliable.
9. Don’t take all assumptions for granted! In the big data
area, too many factors affect performance, functionality
and cost.
10.Prototyping aims to verify the assumptions of the project
with a fairly high degree of certainty. In general, more
effort put into the prototype, taking into account many
factors, will give greater confidence that the project will
operate in a production environment.
11. And above all, don’t be afraid to seek help – from AWS
Authorized Partners, AWS Support and in documentation
Any questions?
We can help you!
Feel free to contact us
kontakt@lcloud.pl
www.lcloud.pl
Thank you for your time!
All source materials in the presentation have been appropriately marked.

More Related Content

Similar to 3 ways to efficiently migrate your big data to AWS cloud | LCloud

Migrating to Cloud: Inhouse Hadoop to Databricks (3)
Migrating to Cloud: Inhouse Hadoop to Databricks (3)Migrating to Cloud: Inhouse Hadoop to Databricks (3)
Migrating to Cloud: Inhouse Hadoop to Databricks (3)Knoldus Inc.
 
White paper on testing in cloud
White paper on testing in cloudWhite paper on testing in cloud
White paper on testing in cloudimkulu
 
Cloud First Architecture
Cloud First ArchitectureCloud First Architecture
Cloud First ArchitectureCameron Vetter
 
Harnessing the Cloud for Performance Testing- Impetus White Paper
Harnessing the Cloud for Performance Testing- Impetus White PaperHarnessing the Cloud for Performance Testing- Impetus White Paper
Harnessing the Cloud for Performance Testing- Impetus White PaperImpetus Technologies
 
Migrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scaleMigrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scaleTom Laszewski
 
Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...
Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...
Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...Amazon Web Services
 
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | QuboleVasu S
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionAccenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionTom Laszewski
 
Qubole on AWS - White paper
Qubole on AWS - White paper Qubole on AWS - White paper
Qubole on AWS - White paper Vasu S
 
Microsoft SQL Server - SQL Server Migrations Presentation
Microsoft SQL Server - SQL Server Migrations PresentationMicrosoft SQL Server - SQL Server Migrations Presentation
Microsoft SQL Server - SQL Server Migrations PresentationMicrosoft Private Cloud
 
Achieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsAchieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsSense Corp
 
MS Cloud Design Patterns Infographic 2015
MS Cloud Design Patterns Infographic 2015MS Cloud Design Patterns Infographic 2015
MS Cloud Design Patterns Infographic 2015James Tramel
 
Ms cloud design patterns infographic 2015
Ms cloud design patterns infographic 2015Ms cloud design patterns infographic 2015
Ms cloud design patterns infographic 2015Kesavan Munuswamy
 
Take testing-to-cloud
Take testing-to-cloudTake testing-to-cloud
Take testing-to-cloudVipin Jain
 
From Relational Database Management to Big Data: Solutions for Data Migration...
From Relational Database Management to Big Data: Solutions for Data Migration...From Relational Database Management to Big Data: Solutions for Data Migration...
From Relational Database Management to Big Data: Solutions for Data Migration...Cognizant
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
Migration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication TechnologyMigration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication TechnologyDonna Guazzaloca-Zehl
 
A Survey on Heuristic Based Techniques in Cloud Computing
A Survey on Heuristic Based Techniques in Cloud ComputingA Survey on Heuristic Based Techniques in Cloud Computing
A Survey on Heuristic Based Techniques in Cloud ComputingIRJET Journal
 
Best practice adoption (and lack there of)
Best practice adoption (and lack there of)Best practice adoption (and lack there of)
Best practice adoption (and lack there of)John Pape
 

Similar to 3 ways to efficiently migrate your big data to AWS cloud | LCloud (20)

Migrating to Cloud: Inhouse Hadoop to Databricks (3)
Migrating to Cloud: Inhouse Hadoop to Databricks (3)Migrating to Cloud: Inhouse Hadoop to Databricks (3)
Migrating to Cloud: Inhouse Hadoop to Databricks (3)
 
White paper on testing in cloud
White paper on testing in cloudWhite paper on testing in cloud
White paper on testing in cloud
 
Cloud First Architecture
Cloud First ArchitectureCloud First Architecture
Cloud First Architecture
 
Harnessing the Cloud for Performance Testing- Impetus White Paper
Harnessing the Cloud for Performance Testing- Impetus White PaperHarnessing the Cloud for Performance Testing- Impetus White Paper
Harnessing the Cloud for Performance Testing- Impetus White Paper
 
Migrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scaleMigrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scale
 
Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...
Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...
Migrating Thousands of Workloads to AWS at Enterprise Scale – Chris Wegmann, ...
 
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionAccenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
 
AWS Services 7 Transformation Media
AWS Services 7 Transformation MediaAWS Services 7 Transformation Media
AWS Services 7 Transformation Media
 
Qubole on AWS - White paper
Qubole on AWS - White paper Qubole on AWS - White paper
Qubole on AWS - White paper
 
Microsoft SQL Server - SQL Server Migrations Presentation
Microsoft SQL Server - SQL Server Migrations PresentationMicrosoft SQL Server - SQL Server Migrations Presentation
Microsoft SQL Server - SQL Server Migrations Presentation
 
Achieve New Heights with Modern Analytics
Achieve New Heights with Modern AnalyticsAchieve New Heights with Modern Analytics
Achieve New Heights with Modern Analytics
 
MS Cloud Design Patterns Infographic 2015
MS Cloud Design Patterns Infographic 2015MS Cloud Design Patterns Infographic 2015
MS Cloud Design Patterns Infographic 2015
 
Ms cloud design patterns infographic 2015
Ms cloud design patterns infographic 2015Ms cloud design patterns infographic 2015
Ms cloud design patterns infographic 2015
 
Take testing-to-cloud
Take testing-to-cloudTake testing-to-cloud
Take testing-to-cloud
 
From Relational Database Management to Big Data: Solutions for Data Migration...
From Relational Database Management to Big Data: Solutions for Data Migration...From Relational Database Management to Big Data: Solutions for Data Migration...
From Relational Database Management to Big Data: Solutions for Data Migration...
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Migration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication TechnologyMigration to Oracle 12c Made Easy Using Replication Technology
Migration to Oracle 12c Made Easy Using Replication Technology
 
A Survey on Heuristic Based Techniques in Cloud Computing
A Survey on Heuristic Based Techniques in Cloud ComputingA Survey on Heuristic Based Techniques in Cloud Computing
A Survey on Heuristic Based Techniques in Cloud Computing
 
Best practice adoption (and lack there of)
Best practice adoption (and lack there of)Best practice adoption (and lack there of)
Best practice adoption (and lack there of)
 

More from LCloud

Well architected tool - Serverless and Machine Learning Lens
Well architected tool - Serverless and Machine Learning LensWell architected tool - Serverless and Machine Learning Lens
Well architected tool - Serverless and Machine Learning LensLCloud
 
On a trail with Amazon Detective | LCloud
On a trail with Amazon Detective | LCloudOn a trail with Amazon Detective | LCloud
On a trail with Amazon Detective | LCloudLCloud
 
"Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,...
"Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,..."Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,...
"Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,...LCloud
 
"Building a Production-Grade Serverless Deployment" - Eoin Shanaghy, CTO, fo...
"Building a Production-Grade Serverless Deployment"  - Eoin Shanaghy, CTO, fo..."Building a Production-Grade Serverless Deployment"  - Eoin Shanaghy, CTO, fo...
"Building a Production-Grade Serverless Deployment" - Eoin Shanaghy, CTO, fo...LCloud
 
Amazon Aurora MySQL - tips & tricks in configuration | LCloud
Amazon Aurora MySQL - tips & tricks in configuration | LCloudAmazon Aurora MySQL - tips & tricks in configuration | LCloud
Amazon Aurora MySQL - tips & tricks in configuration | LCloudLCloud
 
AWS Landing Zone Essentials PL | LCloud
AWS Landing Zone Essentials PL | LCloudAWS Landing Zone Essentials PL | LCloud
AWS Landing Zone Essentials PL | LCloudLCloud
 
Security management using devops | LCloud
Security management using devops | LCloudSecurity management using devops | LCloud
Security management using devops | LCloudLCloud
 
Amazon Neptune - visually more options
Amazon Neptune - visually more optionsAmazon Neptune - visually more options
Amazon Neptune - visually more optionsLCloud
 
How to use AWS practices to provide the enterprise architecture in the cloud
How to use AWS practices to provide the enterprise architecture in the cloudHow to use AWS practices to provide the enterprise architecture in the cloud
How to use AWS practices to provide the enterprise architecture in the cloudLCloud
 
Overview of Amazon Web Services - kwiecień 2017
Overview of Amazon Web Services - kwiecień 2017Overview of Amazon Web Services - kwiecień 2017
Overview of Amazon Web Services - kwiecień 2017LCloud
 
Good practices to design and implement IT architecture based on AWS
Good practices to design and implement IT architecture based on AWSGood practices to design and implement IT architecture based on AWS
Good practices to design and implement IT architecture based on AWSLCloud
 
Lcloud na AWS re: Invent 2016 w Las Vegas
Lcloud na AWS re: Invent 2016 w Las Vegas Lcloud na AWS re: Invent 2016 w Las Vegas
Lcloud na AWS re: Invent 2016 w Las Vegas LCloud
 
What to know about Amazon Elastic Block Store (EBS)
What to know about Amazon Elastic Block Store (EBS)What to know about Amazon Elastic Block Store (EBS)
What to know about Amazon Elastic Block Store (EBS)LCloud
 

More from LCloud (13)

Well architected tool - Serverless and Machine Learning Lens
Well architected tool - Serverless and Machine Learning LensWell architected tool - Serverless and Machine Learning Lens
Well architected tool - Serverless and Machine Learning Lens
 
On a trail with Amazon Detective | LCloud
On a trail with Amazon Detective | LCloudOn a trail with Amazon Detective | LCloud
On a trail with Amazon Detective | LCloud
 
"Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,...
"Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,..."Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,...
"Don’t Run with Scissors: Serverless Security Survival Guide" | Hillel Solow,...
 
"Building a Production-Grade Serverless Deployment" - Eoin Shanaghy, CTO, fo...
"Building a Production-Grade Serverless Deployment"  - Eoin Shanaghy, CTO, fo..."Building a Production-Grade Serverless Deployment"  - Eoin Shanaghy, CTO, fo...
"Building a Production-Grade Serverless Deployment" - Eoin Shanaghy, CTO, fo...
 
Amazon Aurora MySQL - tips & tricks in configuration | LCloud
Amazon Aurora MySQL - tips & tricks in configuration | LCloudAmazon Aurora MySQL - tips & tricks in configuration | LCloud
Amazon Aurora MySQL - tips & tricks in configuration | LCloud
 
AWS Landing Zone Essentials PL | LCloud
AWS Landing Zone Essentials PL | LCloudAWS Landing Zone Essentials PL | LCloud
AWS Landing Zone Essentials PL | LCloud
 
Security management using devops | LCloud
Security management using devops | LCloudSecurity management using devops | LCloud
Security management using devops | LCloud
 
Amazon Neptune - visually more options
Amazon Neptune - visually more optionsAmazon Neptune - visually more options
Amazon Neptune - visually more options
 
How to use AWS practices to provide the enterprise architecture in the cloud
How to use AWS practices to provide the enterprise architecture in the cloudHow to use AWS practices to provide the enterprise architecture in the cloud
How to use AWS practices to provide the enterprise architecture in the cloud
 
Overview of Amazon Web Services - kwiecień 2017
Overview of Amazon Web Services - kwiecień 2017Overview of Amazon Web Services - kwiecień 2017
Overview of Amazon Web Services - kwiecień 2017
 
Good practices to design and implement IT architecture based on AWS
Good practices to design and implement IT architecture based on AWSGood practices to design and implement IT architecture based on AWS
Good practices to design and implement IT architecture based on AWS
 
Lcloud na AWS re: Invent 2016 w Las Vegas
Lcloud na AWS re: Invent 2016 w Las Vegas Lcloud na AWS re: Invent 2016 w Las Vegas
Lcloud na AWS re: Invent 2016 w Las Vegas
 
What to know about Amazon Elastic Block Store (EBS)
What to know about Amazon Elastic Block Store (EBS)What to know about Amazon Elastic Block Store (EBS)
What to know about Amazon Elastic Block Store (EBS)
 

Recently uploaded

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 

Recently uploaded (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

3 ways to efficiently migrate your big data to AWS cloud | LCloud

  • 1. 3 ways to efficiently migrate your big data to AWS cloud
  • 2. AWS Services useful in the migration process
  • 3. Amazon EMR is a service that allows cost-effective and fast processing of large amounts of data. It uses the Hadoop and Spark frameworks based on Amazon EC2 and Amazon S3. It allows for efficient processing of large amounts of data in processes such as indexing, data mining, machine learning or financial analysis.
  • 4. Amazon S3 (Simple Storage Service) is a fully manager extraction, transformation and loading (ETL) service that makes it easier for clients to prepare and load data for analysis. It also allows you to configure, coordinate and monitor complex data flows.
  • 5. AWS Glue is a fully managed extraction, transformation and loading (ETL) service that makes it easier for clients to prepare and load data for analysis. It also allows you to configure, coordinate and monitor complex data flows.
  • 7. Open source software Apache Hadoop is software for distributed storage and processing of large data sets using computer clusters. Apache Spark is a software that is a programming platform for distributed computing. ▪ Hadoop is designed to efficiently support batch processing, while Spark is designed to efficiently handle data in real-time. ▪ Hadoop is a high-latency computing structure that has no interactive mode, while Spark gives low-latency computing and can process data interactively. ▪ Apache Spark is also a component of the Hadoop Ecosystem. Spark’s main idea was to perform memory processing.
  • 8. 3 approaches to the migration process
  • 9. There are few approaches in cloud migration, but these 3 allow you to make conscious decisions about your architecture. 3 APPROACHES TO THE MIGRATION PROCESS
  • 10. It relies on redesigning the existing infrastructure in such a way to make full use of cloud computing. The approach relies on the analysing the existing architecture and the way it’s being designed, which will allow to provide benefits such as lower memory and hardware costs, increase operational flexibility to ensure business benefits. Re-architecting
  • 11. It is an ideal solution when we need more efficient infrastructure. By transferring the workloads of the existing environment, we can avoid most of the changes that can occur during re-architecting. A smaller number of changes also reduces the risk associated with unexpected work, and thus your solution can come back sooner or enter the market. Lift and shift
  • 12. It’s a combination of two previous approaches. In this mode, the part responsible for fast migration is associated with lift and shift. Re- architecting, in turn, supports the possibilities of redesigning the needed solutions. This approach allows a great deal of flexibility, which allows you to experiment with cloud solutions and gain the necessary experience before you permanently decide to move to the cloud. Hybrid
  • 13. Prototyping in the spirit of best practices
  • 14. Knowing the migration possibilities to the cloud, let’s move on to prototyping. When learning new solutions, there is always a learning stage. And as you know, practice is its best form. Prototyping should be crucial when implementing new services and products. Here is the scenario the same as before – the cheaper option is to check the application at the prototyping stage. There is a similar story with instance types. The worst assumption is that the application running in the on-premise environment will work the same way in the cloud environment. There are many factors that affect this. It’s worth running applications with loads that can occur in the real world in a test environment.
  • 16. 1. Make a list of all potential assumptions and uncertainties while remembering what may have the greatest impact on the environment. 2. First, select and implement the most risky aspects of migration. 3. Set your goals in advance and don’t be afraid to ask. The answers will help in project verification or answer the question of how a given solution works. 4. Always prototype under similar conditions in which you want to operate. You can start with a smaller environment or set of features and then use the scale.
  • 17. 5. Iteration and Continuous Integration as the basis for creating implementation tests. Using an automated environment and scripts, you can run the test in several environments. 6. Ask the expert for verification to be able to check the test configuration and environment. This will allow you to eliminate errors and check if the results are not falsified. 7. Correctly running the tests will allow you to remove variables that may be due to dependencies. 8. Document the test results and ask for verification to ensure they are reliable.
  • 18. 9. Don’t take all assumptions for granted! In the big data area, too many factors affect performance, functionality and cost. 10.Prototyping aims to verify the assumptions of the project with a fairly high degree of certainty. In general, more effort put into the prototype, taking into account many factors, will give greater confidence that the project will operate in a production environment. 11. And above all, don’t be afraid to seek help – from AWS Authorized Partners, AWS Support and in documentation
  • 19. Any questions? We can help you! Feel free to contact us kontakt@lcloud.pl www.lcloud.pl Thank you for your time! All source materials in the presentation have been appropriately marked.