SlideShare a Scribd company logo
1 of 7
Download to read offline
AMAZON GLUE
Elif Nurber KARAKAS
AWS Glue is a serverless data integration service that
makes it easy to discover, prepare, and combine data
for analytics, machine learning, and application
development. AWS Glue provides all the capabilities
needed for data integration so that you can start
analyzing your data and putting it to use in minutes
instead of months.
What is AWS Glue?
USE CASES
Glue can integrate with Snowflake data
warehouse to help manage the data
integration process.
AWS data lake can integrate with Glue.
AWS Glue can integrate with Athena to
create schemas.
ETL code can be used for Glue on GitHub
as well.
Benefits of using Glue
Fault-tolerance: Failed jobs in Glue are retrievable, and
logs in Glue can be debugged.
Filtering: Filters for bad data.
Support: Supports several non-native Java Database
Connectivity (JDBC) data sources.
Maintenance and deployment: Simple maintenance
and deployment, because the service is completely
managed by AWS.
Drawbacks of
Using Glue
Limited compatibility: While AWS Glue does work with
a variety of commonly used data sources, it only works
with services running on AWS. Organizations may need
a third-party ETL service if sources are not AWS-based.
No incremental data sync: All data is staged on S3 first,
so Glue is not the best option for real-time ETL jobs.
Learning curve: Teams using Glue should have a strong
understanding of Apache spark.
Relational database queries: Glue has limited support
for queries of traditional relational databases, only SQL
queries.
SOME OF GLUE
CUSTOMERS
THANK YOU!
nurberelif@gmail.com

More Related Content

Similar to Amazon Glue

Extending your Hadoop Implementation to the Cloud
Extending your Hadoop Implementation to the CloudExtending your Hadoop Implementation to the Cloud
Extending your Hadoop Implementation to the Cloud
DataWorks Summit
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
Girish Kalamati
 

Similar to Amazon Glue (20)

5 Reasons to Move Your BI to the Cloud
5 Reasons to Move Your BI to the Cloud5 Reasons to Move Your BI to the Cloud
5 Reasons to Move Your BI to the Cloud
 
Azure quick-start-for-net-developers
Azure quick-start-for-net-developersAzure quick-start-for-net-developers
Azure quick-start-for-net-developers
 
cloud computing.pptx
cloud computing.pptxcloud computing.pptx
cloud computing.pptx
 
Extending your Hadoop Implementation to the Cloud
Extending your Hadoop Implementation to the CloudExtending your Hadoop Implementation to the Cloud
Extending your Hadoop Implementation to the Cloud
 
Aswin
AswinAswin
Aswin
 
UNIT -IV.docx
UNIT -IV.docxUNIT -IV.docx
UNIT -IV.docx
 
Cloud Computing Serverless Architecture
Cloud Computing Serverless ArchitectureCloud Computing Serverless Architecture
Cloud Computing Serverless Architecture
 
Aws vs azure bakeoff
Aws vs azure bakeoffAws vs azure bakeoff
Aws vs azure bakeoff
 
Working with azure database services platform
Working with azure database services platformWorking with azure database services platform
Working with azure database services platform
 
Azure fundamental -Introduction
Azure fundamental -IntroductionAzure fundamental -Introduction
Azure fundamental -Introduction
 
Introdcution to Azure
Introdcution to AzureIntrodcution to Azure
Introdcution to Azure
 
When Should You Use AWS Lambda?
When Should You Use AWS Lambda?When Should You Use AWS Lambda?
When Should You Use AWS Lambda?
 
Google apps engine
Google apps engineGoogle apps engine
Google apps engine
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
 
Google apps engine
Google apps engineGoogle apps engine
Google apps engine
 
Cloud service providers
Cloud service providersCloud service providers
Cloud service providers
 
Aws community day pune 2020 v3
Aws community day pune 2020 v3Aws community day pune 2020 v3
Aws community day pune 2020 v3
 
Inside Microsoft Azure
Inside Microsoft AzureInside Microsoft Azure
Inside Microsoft Azure
 
Exploring Microsoft Azure Infrastructures
Exploring Microsoft Azure InfrastructuresExploring Microsoft Azure Infrastructures
Exploring Microsoft Azure Infrastructures
 
Serverless Extract-transform-load (ETL) on AWS Webinar
Serverless Extract-transform-load (ETL) on AWS WebinarServerless Extract-transform-load (ETL) on AWS Webinar
Serverless Extract-transform-load (ETL) on AWS Webinar
 

More from Elif Nurber Karakaş

More from Elif Nurber Karakaş (10)

Amazon Lake Formation
Amazon Lake FormationAmazon Lake Formation
Amazon Lake Formation
 
Amazon Data Pipeline
Amazon Data PipelineAmazon Data Pipeline
Amazon Data Pipeline
 
Amazon Quicksight
Amazon QuicksightAmazon Quicksight
Amazon Quicksight
 
Amazon Redshift
Amazon RedshiftAmazon Redshift
Amazon Redshift
 
Amazon OpenSearch Service
Amazon OpenSearch ServiceAmazon OpenSearch Service
Amazon OpenSearch Service
 
Amazon Managed Streaming for Apache Kafka
Amazon Managed Streaming for Apache KafkaAmazon Managed Streaming for Apache Kafka
Amazon Managed Streaming for Apache Kafka
 
Amazon Kinesis Data Streams
Amazon Kinesis Data StreamsAmazon Kinesis Data Streams
Amazon Kinesis Data Streams
 
Amazon CloudSearch
Amazon CloudSearchAmazon CloudSearch
Amazon CloudSearch
 
Amazon Elastic MapReduce
Amazon Elastic MapReduceAmazon Elastic MapReduce
Amazon Elastic MapReduce
 
Amazon athena
Amazon athenaAmazon athena
Amazon athena
 

Recently uploaded

Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 

Recently uploaded (20)

Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 

Amazon Glue

  • 2. AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development. AWS Glue provides all the capabilities needed for data integration so that you can start analyzing your data and putting it to use in minutes instead of months. What is AWS Glue?
  • 3. USE CASES Glue can integrate with Snowflake data warehouse to help manage the data integration process. AWS data lake can integrate with Glue. AWS Glue can integrate with Athena to create schemas. ETL code can be used for Glue on GitHub as well.
  • 4. Benefits of using Glue Fault-tolerance: Failed jobs in Glue are retrievable, and logs in Glue can be debugged. Filtering: Filters for bad data. Support: Supports several non-native Java Database Connectivity (JDBC) data sources. Maintenance and deployment: Simple maintenance and deployment, because the service is completely managed by AWS.
  • 5. Drawbacks of Using Glue Limited compatibility: While AWS Glue does work with a variety of commonly used data sources, it only works with services running on AWS. Organizations may need a third-party ETL service if sources are not AWS-based. No incremental data sync: All data is staged on S3 first, so Glue is not the best option for real-time ETL jobs. Learning curve: Teams using Glue should have a strong understanding of Apache spark. Relational database queries: Glue has limited support for queries of traditional relational databases, only SQL queries.