SlideShare a Scribd company logo
1 of 31
Download to read offline
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
for Data Engineering
and Analytics
2 Ponto
Lorem ipsum dolor sit
amet, consec tetur
adipiscing elit. at efficitur
tortor. Donec congue odio
bibendum
1 Ponto
Lorem ipsum dolor sit
amet, consec tetur
adipiscing elit. at
efficitur tortor. Donec
congue odio bibendum
3 Ponto
Lorem ipsum dolor sit
amet, consec tetur
adipiscing elit. at efficitur
tortor. Donec congue odio
bibendum
4 Ponto
Lorem ipsum dolor sit
amet, consec tetur
adipiscing elit. at efficitur
tortor. Donec congue odio
bibendum
➔ Gerente de Produto e Plataforma na Dadosfera;
➔ AWS Community Builder;
➔ Bacharel em Sistemas de Informação pelo IFG;
➔ Pós graduado em Big Data e Machine Learning
pela Fasam;
➔ Palestrante em eventos como Campus Party,
TDC, Join Community, Google Dev Fest…
Cicero Moura
cicerojmm
/in/cicero-moura
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
APPS
DEVICES
PEOPLE
APP/LOGS
THIRD-PARTY DATA
IOT/DEVICES
Data sources
FOR
APPLICATIONS
Amazon
Aurora
Amazon Kinesis
& Amazon MSK
FOR ANALYTICS AND
MACHINE LEARNING
Data Lake
Amazon S3
Amazon Redshift
Data Warehouse
Amazon
Redshift
Amazon
EMR
BUSINESS
INTELLIGENCE
Amazon
QuickSight
MACHINE LEARNING
Amazon
SageMaker
ANALYTICS
Amazon
DynamoDB
AWS Glue
Act
Integrate
Store and query
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Catalog and govern | AWS Lake Formation, Amazon DataZone
Building an end-to-end data strategy
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Pick the right tool
DATABASES ANALYTICS ML / AI
Amazon Relational
Database Service (MySQL,
PostgreSQL, MariaDB,
Oracle, SQL Server)
Amazon Relational
Database Service Custom
Amazon Aurora
Amazon DynamoDB
Amazon DocumentDB
Amazon Redshift
Amazon Athena
Amazon OpenSearch
Service
Amazon EMR
AWS Lake Formation
Amazon MSK
Amazon Keyspaces
Amazon Neptune
Amazon Timestream
Amazon ElastiCache
Amazon MemoryDB
for Redis
AWS Database
Migration Service
Amazon Kinesis Data
Streams
Amazon Kinesis Data
Firehose
Amazon Kinesis Data
Analytics
Amazon Kinesis Video
Streams
Amazon QuickSight
AWS Glue
Amazon SageMaker
Amazon Comprehend
Amazon Polly
Amazon Rekognition
Amazon Textract
Amazon Transcribe
Amazon Translate
Amazon Fraud Detector
Amazon Personalize
Amazon Forecast
Amazon Augmented AI
Amazon DevOps Guru
Amazon CodeGuru
Amazon HealthLake
Amazon Kendra
Amazon Lex
Amazon Lookout for
Equipment
Amazon Monitron
AWS Panorama
New Tools and
Features
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Aurora zero-ETL integration
NEW
INTEGRATION WITH AMAZON REDSHIFT
Preview
• Drive holistic insights across applications
or partitions
• Analyze data from multiple Aurora
databases in the same Redshift cluster
• Leverage Redshift features such as
materialized views, data sharing and
federated access to data lakes
Amazon Redshift Amazon Aurora
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift integration for Apache Spark
NEW
BUILD INTERACTIVE SPARK APPLICATIONS
GA
Redshift
Connector for
Spark
Amazon Redshift
Amazon Glue
Amazon EMR
• Apache Spark applications accessing
Amazon Redshift data from AWS analytics
services such as Amazon EMR, AWS Glue,
and Amazon SageMaker
• Build Apache Spark applications that read
from and write to your Amazon Redshift
data warehouse, without compromising
performance or transactional consistency.
Amazon SageMaker
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift integration for Apache Spark
BUILD INTERACTIVE SPARK APPLICATIONS
NEW GA
With Amazon EMR
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift integration for Apache Spark
BUILD INTERACTIVE SPARK APPLICATIONS
NEW GA
With Amazon EMR
With AWS Glue
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift integration for Apache Spark
BUILD INTERACTIVE SPARK APPLICATIONS
NEW GA
With Amazon EMR
With AWS Glue
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift Multi-AZ for RA3 clusters
NEW
HIGHLY RESILIENT DATA WAREHOUSE
Amazon Redshift
managed storage
AZ 1 AZ 2
Preview
• Auto-failover with zero data loss and
no manual intervention
• Easy management through a single
endpoint
• Workload processing across AZs
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift Auto-Copy From Amazon S3
NEW
SIMPLIFIED & AUTOMATED FILE INGESTION FROM AMAZON S3 INTO REDSHIFT
Preview
• Simple, low code data ingestion
• Easily convert your existing COPY
statements into automatic ingestion
jobs
• Automatic ingestion of new data from
Amazon S3 based on user defined
configurations
Amazon S3
Redshift
Copy Job
Redshift
Table
Continuously
monitoring S3
folder
New file(s)
detected Ingestion
automatically starts
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS Glue for Ray
SCALING YOUR DATA INTEGRATION WORKLOADS USING PYTHON
• AWS Glue for Ray is a new engine
option on AWS Glue.
• Data engineers can use AWS Glue for
Ray to process large datasets with
Python and popular Python libraries.
• AWS Glue for Ray facilitates the
distributed processing of your Python
code over multi-node clusters.
NEW Preview
AWS Glue
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS Glue Data Quality
DELIVER HIGH QUALITY DATA ACROSS YOUR DATA LAKES AND DATA PIPELINES
• Automatic data quality rule
recommendations based on your data
• Keep data quality high with ongoing
data analysis
• Data quality for datasets in your data
lake and data pipelines
• Cost-effective to scale with
pay-as-you-go billing, with no lock-in
NEW Preview
AWS Glue
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Athena for Apache Spark
BUILD INTERACTIVE SPARK APPLICATIONS
• Interactive Spark applications start
under a second and run faster with our
optimized Spark runtime
• Harness Spark for complex, powerful
analytics using the expressive power
of Python along with its wide
ecosystem
• Build applications without managing
resources or configuring software with
Athena
NEW GA
Amazon Athena
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Athena for Apache Spark
BUILD INTERACTIVE SPARK APPLICATIONS
NEW GA
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Athena for Apache Spark
BUILD INTERACTIVE SPARK APPLICATIONS
NEW GA
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon RDS Optimized Writes for MySQL
UP TO 2X HIGHER WRITE THROUGHPUT AT NO ADDITIONAL COST
• Write only once when flushing data to
durable storage without the need for
the double write buffer.
• Great fit for write-intensive database
workloads, such as digital payments,
financial trading, and online gaming.
• Uses the AWS Nitro System, to reliably
and durably write to table storage in
one step.
NEW GA
Amazon RDS
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon RDS Optimized Reads for MySQL
UP TO 50% FASTER QUERIES
• Places temporary tables generated by
MySQL on the local NVMe-based SSD
block-level storage that’s physically
connected to the host server.
• Great for workloads that rely heavily
on temporary tables such as high-load
joins with complex common table
expressions (CTEs) etc.
NEW GA
Amazon RDS
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon RDS Blue/Green Deployments
SAFER, SIMPLER, AND FASTER UPDATES
• Keep your current production database
safe
• Promote the staging environment to
production in as fast as a minute
• The staging environment stays in sync
with the current production
environment using logical replication.
• No data loss
NEW GA
Amazon RDS
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon DocumentDB Elastic Clusters
WITH MONGODB COMPATIBILITY
• Elastically scale workloads in minutes
• Elastic Clusters automatically scales to
handle virtually any number of reads
and writes
• Fully managed solution to scale
document workloads of virtually any
size and scale
• Zero impact to application availability
or performance
NEW GA
Amazon DocumentDB
(with MongoDB compatibility)
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon OpenSearch Serverless (Preview)
RUN LARGE-SCALE SEARCH AND ANALYTICS WORKLOADS
• Only pay for the resources that you
consume.
• Automatically scales resources to
match your application's usage
• Cost-effective for infrequent,
intermittent, or unpredictable
workloads.
• Primary use-cases - Log analytics,
Full-text search
NEW
Amazon OpenSearch Service
Preview
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon ElastiCache and Amazon MemoryDB for Redis
• Redis 7 support in Amazon
ElastiCache for Redis - Redis
Functions, Sharded Pub/Sub and more
• Data tiering for Amazon MemoryDB for
Redis - Ideal for workloads that access
up to 20% of their overall dataset
• IAM Authentication in Amazon
ElastiCache for Redis
NEW GA
Amazon ElastiCache
for Redis
REDIS 7, IAM AND DATA TIERING
Amazon MemoryDB
for Redis
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS DMS Schema Conversion
• Convert the schema, views, stored
procedures, and functions from a
source database into the schema for
the target database service.
• Prescriptive guidance on how to
resolve any incompatibilities between
the source and target database
engines.
NEW GA
AWS Database Migration
Service
(DMS)
MIGRATE DATABASES QUICKLY AND SECURELY TO AWS
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Expanded API capabilities for Amazon QuickSight
PROGRAMMATIC ACCESS TO THE UNDERLYING STRUCTURE OF QUICKSIGHT DASHBOARDS
• Access underlying data models of
Amazon QuickSight dashboards,
reports, analyses and templates via
the SDK.
• Translate legacy BI assets to
cloud-native dashboards quickly
• Integrate into DevOps processes such
as code reviews, audits, and audit
every change before deployment
NEW GA
Amazon QuickSight
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon QuickSight Paginated Reports
DAY-TO-DAY DETAILED OPERATIONAL DATA IN CUSTOM FORMATS
• Create, schedule, and share highly
formatted multipage reports
• Build all insights, independent of
preferred consumption model, on
single source of truth governed
datasets
• Single authoring experience for
dashboards and reports
• Pay for what you use.
NEW GA
Amazon QuickSight
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon QuickSight Paginated Reports
DAY-TO-DAY DETAILED OPERATIONAL DATA IN CUSTOM FORMATS
NEW GA
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon DataZone
UNLOCK DATA ACROSS ORGANIZATIONAL BOUNDARIES WITH BUILT-IN GOVERNANCE
Preview
NEW
• A new data management service
• Discover and share data at scale
across organizational
• Data catalog populated by ML is easy
to search using business terms
• Integrated with Redshift, Athena, and
QuickSight and provides APIs to third
party sources
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon DataZone
UNLOCK DATA ACROSS ORGANIZATIONAL BOUNDARIES WITH BUILT-IN GOVERNANCE
Preview
NEW
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Questions?
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.

More Related Content

Similar to Re:cap do AWS re:Invet 2022 for Data Engineer and Analytics

Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Amazon Web Services
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSAmazon Web Services
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSAmazon Web Services
 
Migrazione di Database e Data Warehouse su AWS
Migrazione di Database e Data Warehouse su AWSMigrazione di Database e Data Warehouse su AWS
Migrazione di Database e Data Warehouse su AWSAmazon Web Services
 
DAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudDAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudAmazon Web Services
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Amazon Web Services
 
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...Amazon Web Services
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesAmazon Web Services
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesAmazon Web Services
 
DAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDS
DAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDSDAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDS
DAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDSAmazon Web Services
 
ABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWSABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWSAmazon Web Services
 
GPSWKS401_Designing a Cloud Enterprise Data Warehouse
GPSWKS401_Designing a Cloud Enterprise Data WarehouseGPSWKS401_Designing a Cloud Enterprise Data Warehouse
GPSWKS401_Designing a Cloud Enterprise Data WarehouseAmazon Web Services
 
Implementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfImplementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfAmazon Web Services
 
AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...
AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...
AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...Amazon Web Services
 
Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...
Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...
Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...Amazon Web Services
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 

Similar to Re:cap do AWS re:Invet 2022 for Data Engineer and Analytics (20)

Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 
Migrazione di Database e Data Warehouse su AWS
Migrazione di Database e Data Warehouse su AWSMigrazione di Database e Data Warehouse su AWS
Migrazione di Database e Data Warehouse su AWS
 
DAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudDAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the Cloud
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
 
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
 
Amazon Aurora: Database Week SF
Amazon Aurora: Database Week SFAmazon Aurora: Database Week SF
Amazon Aurora: Database Week SF
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
 
DAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDS
DAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDSDAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDS
DAT309_Best Practices for Migrating from Oracle and SQL Server to Amazon RDS
 
ABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWSABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWS
 
GPSWKS401_Designing a Cloud Enterprise Data Warehouse
GPSWKS401_Designing a Cloud Enterprise Data WarehouseGPSWKS401_Designing a Cloud Enterprise Data Warehouse
GPSWKS401_Designing a Cloud Enterprise Data Warehouse
 
Implementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfImplementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdf
 
AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...
AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...
AWS re:Invent 2016: Relational and NoSQL Databases on AWS: NBC, MarkLogic, an...
 
Migrating database to cloud
Migrating database to cloudMigrating database to cloud
Migrating database to cloud
 
Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...
Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...
Replicate & Manage Data Using Managed Databases & Serverless Technologies (DA...
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 
Data_Analytics_and_AI_ML
Data_Analytics_and_AI_MLData_Analytics_and_AI_ML
Data_Analytics_and_AI_ML
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 

More from Cicero Joasyo Mateus de Moura

Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...Cicero Joasyo Mateus de Moura
 
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...Cicero Joasyo Mateus de Moura
 
JoinCommunity 2022 - Modern Data Stack - As tecnologias e práticas mais mode...
JoinCommunity 2022 -  Modern Data Stack - As tecnologias e práticas mais mode...JoinCommunity 2022 -  Modern Data Stack - As tecnologias e práticas mais mode...
JoinCommunity 2022 - Modern Data Stack - As tecnologias e práticas mais mode...Cicero Joasyo Mateus de Moura
 
CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss...
 CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss... CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss...
CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss...Cicero Joasyo Mateus de Moura
 
Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...
Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...
Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...Cicero Joasyo Mateus de Moura
 
Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...
Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...
Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...Cicero Joasyo Mateus de Moura
 
TDC Connections 2022 - Segurança e Anonimização de Dados em Larga Escala com...
TDC Connections 2022 - Segurança e Anonimização de Dados  em Larga Escala com...TDC Connections 2022 - Segurança e Anonimização de Dados  em Larga Escala com...
TDC Connections 2022 - Segurança e Anonimização de Dados em Larga Escala com...Cicero Joasyo Mateus de Moura
 
CPBSB 2022 - Big Data e Machine Learning na Prática Construindo um Data Lake...
CPBSB 2022 - Big Data e  Machine Learning na Prática Construindo um Data Lake...CPBSB 2022 - Big Data e  Machine Learning na Prática Construindo um Data Lake...
CPBSB 2022 - Big Data e Machine Learning na Prática Construindo um Data Lake...Cicero Joasyo Mateus de Moura
 
Big Data Analytics - Data Engineer, Arquitetura, AWS e Mais
Big Data Analytics - Data Engineer, Arquitetura, AWS e MaisBig Data Analytics - Data Engineer, Arquitetura, AWS e Mais
Big Data Analytics - Data Engineer, Arquitetura, AWS e MaisCicero Joasyo Mateus de Moura
 
Stream de dados e Data Lake com Debezium, Delta Lake e EMR
Stream de dados e Data Lake com Debezium, Delta Lake e EMRStream de dados e Data Lake com Debezium, Delta Lake e EMR
Stream de dados e Data Lake com Debezium, Delta Lake e EMRCicero Joasyo Mateus de Moura
 
Big Data na prática: como construir um Data Lake para análise de dados
Big Data na prática: como construir um Data Lake para análise de dadosBig Data na prática: como construir um Data Lake para análise de dados
Big Data na prática: como construir um Data Lake para análise de dadosCicero Joasyo Mateus de Moura
 
Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...
Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...
Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...Cicero Joasyo Mateus de Moura
 
Processamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearch
Processamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearchProcessamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearch
Processamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearchCicero Joasyo Mateus de Moura
 
Construindo sua Assistente Virtual Integrada a Dispositivos IoT com Python ...
Construindo sua  Assistente Virtual  Integrada a Dispositivos IoT com Python ...Construindo sua  Assistente Virtual  Integrada a Dispositivos IoT com Python ...
Construindo sua Assistente Virtual Integrada a Dispositivos IoT com Python ...Cicero Joasyo Mateus de Moura
 
IoT e TA: Soluções Com Potencial Para Melhorar a Vida das Pessoas
IoT e TA: Soluções Com Potencial Para Melhorar a Vida das PessoasIoT e TA: Soluções Com Potencial Para Melhorar a Vida das Pessoas
IoT e TA: Soluções Com Potencial Para Melhorar a Vida das PessoasCicero Joasyo Mateus de Moura
 

More from Cicero Joasyo Mateus de Moura (18)

Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
 
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
Escalando a Qualidade de Dados com Great Expectations e Spark no Modern Data ...
 
JoinCommunity 2022 - Modern Data Stack - As tecnologias e práticas mais mode...
JoinCommunity 2022 -  Modern Data Stack - As tecnologias e práticas mais mode...JoinCommunity 2022 -  Modern Data Stack - As tecnologias e práticas mais mode...
JoinCommunity 2022 - Modern Data Stack - As tecnologias e práticas mais mode...
 
CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss...
 CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss... CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss...
CPGOIAS2 2022: Big Data e Machine Learning: vida real, cases práticos e poss...
 
Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...
Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...
Modern Data Stack - As tecnologias e práticas mais modernas atualmente em Big...
 
Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...
Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...
Big Data e Machine Learning: vida real, cases práticos e possível futuro da á...
 
TDC Connections 2022 - Segurança e Anonimização de Dados em Larga Escala com...
TDC Connections 2022 - Segurança e Anonimização de Dados  em Larga Escala com...TDC Connections 2022 - Segurança e Anonimização de Dados  em Larga Escala com...
TDC Connections 2022 - Segurança e Anonimização de Dados em Larga Escala com...
 
CPBSB 2022 - Big Data e Machine Learning na Prática Construindo um Data Lake...
CPBSB 2022 - Big Data e  Machine Learning na Prática Construindo um Data Lake...CPBSB 2022 - Big Data e  Machine Learning na Prática Construindo um Data Lake...
CPBSB 2022 - Big Data e Machine Learning na Prática Construindo um Data Lake...
 
Big Data Analytics - Data Engineer, Arquitetura, AWS e Mais
Big Data Analytics - Data Engineer, Arquitetura, AWS e MaisBig Data Analytics - Data Engineer, Arquitetura, AWS e Mais
Big Data Analytics - Data Engineer, Arquitetura, AWS e Mais
 
Stream de dados e Data Lake com Debezium, Delta Lake e EMR
Stream de dados e Data Lake com Debezium, Delta Lake e EMRStream de dados e Data Lake com Debezium, Delta Lake e EMR
Stream de dados e Data Lake com Debezium, Delta Lake e EMR
 
Big Data na prática: como construir um Data Lake para análise de dados
Big Data na prática: como construir um Data Lake para análise de dadosBig Data na prática: como construir um Data Lake para análise de dados
Big Data na prática: como construir um Data Lake para análise de dados
 
Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...
Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...
Processamento e Análise de Dados em Tempo Real com Kafka, ElasticSearch e PyS...
 
Processamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearch
Processamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearchProcessamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearch
Processamento e Análise de Dados em Tempo Real com Python, Kafka e ElasticSearch
 
Construindo sua Assistente Virtual Integrada a Dispositivos IoT com Python ...
Construindo sua  Assistente Virtual  Integrada a Dispositivos IoT com Python ...Construindo sua  Assistente Virtual  Integrada a Dispositivos IoT com Python ...
Construindo sua Assistente Virtual Integrada a Dispositivos IoT com Python ...
 
IoT e TA: Soluções Com Potencial Para Melhorar a Vida das Pessoas
IoT e TA: Soluções Com Potencial Para Melhorar a Vida das PessoasIoT e TA: Soluções Com Potencial Para Melhorar a Vida das Pessoas
IoT e TA: Soluções Com Potencial Para Melhorar a Vida das Pessoas
 
Joincommunity - Goiânia 2018
Joincommunity - Goiânia 2018Joincommunity - Goiânia 2018
Joincommunity - Goiânia 2018
 
IoT: construindo ideias com nodeMCU e MQTT
IoT: construindo ideias com nodeMCU e MQTTIoT: construindo ideias com nodeMCU e MQTT
IoT: construindo ideias com nodeMCU e MQTT
 
PHP: Como não programar em POG
PHP: Como não programar em POGPHP: Como não programar em POG
PHP: Como não programar em POG
 

Recently uploaded

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 

Recently uploaded (20)

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Re:cap do AWS re:Invet 2022 for Data Engineer and Analytics

  • 1. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. for Data Engineering and Analytics
  • 2. 2 Ponto Lorem ipsum dolor sit amet, consec tetur adipiscing elit. at efficitur tortor. Donec congue odio bibendum 1 Ponto Lorem ipsum dolor sit amet, consec tetur adipiscing elit. at efficitur tortor. Donec congue odio bibendum 3 Ponto Lorem ipsum dolor sit amet, consec tetur adipiscing elit. at efficitur tortor. Donec congue odio bibendum 4 Ponto Lorem ipsum dolor sit amet, consec tetur adipiscing elit. at efficitur tortor. Donec congue odio bibendum ➔ Gerente de Produto e Plataforma na Dadosfera; ➔ AWS Community Builder; ➔ Bacharel em Sistemas de Informação pelo IFG; ➔ Pós graduado em Big Data e Machine Learning pela Fasam; ➔ Palestrante em eventos como Campus Party, TDC, Join Community, Google Dev Fest… Cicero Moura cicerojmm /in/cicero-moura
  • 3. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. APPS DEVICES PEOPLE APP/LOGS THIRD-PARTY DATA IOT/DEVICES Data sources FOR APPLICATIONS Amazon Aurora Amazon Kinesis & Amazon MSK FOR ANALYTICS AND MACHINE LEARNING Data Lake Amazon S3 Amazon Redshift Data Warehouse Amazon Redshift Amazon EMR BUSINESS INTELLIGENCE Amazon QuickSight MACHINE LEARNING Amazon SageMaker ANALYTICS Amazon DynamoDB AWS Glue Act Integrate Store and query © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Catalog and govern | AWS Lake Formation, Amazon DataZone Building an end-to-end data strategy
  • 4. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Pick the right tool DATABASES ANALYTICS ML / AI Amazon Relational Database Service (MySQL, PostgreSQL, MariaDB, Oracle, SQL Server) Amazon Relational Database Service Custom Amazon Aurora Amazon DynamoDB Amazon DocumentDB Amazon Redshift Amazon Athena Amazon OpenSearch Service Amazon EMR AWS Lake Formation Amazon MSK Amazon Keyspaces Amazon Neptune Amazon Timestream Amazon ElastiCache Amazon MemoryDB for Redis AWS Database Migration Service Amazon Kinesis Data Streams Amazon Kinesis Data Firehose Amazon Kinesis Data Analytics Amazon Kinesis Video Streams Amazon QuickSight AWS Glue Amazon SageMaker Amazon Comprehend Amazon Polly Amazon Rekognition Amazon Textract Amazon Transcribe Amazon Translate Amazon Fraud Detector Amazon Personalize Amazon Forecast Amazon Augmented AI Amazon DevOps Guru Amazon CodeGuru Amazon HealthLake Amazon Kendra Amazon Lex Amazon Lookout for Equipment Amazon Monitron AWS Panorama
  • 6. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Aurora zero-ETL integration NEW INTEGRATION WITH AMAZON REDSHIFT Preview • Drive holistic insights across applications or partitions • Analyze data from multiple Aurora databases in the same Redshift cluster • Leverage Redshift features such as materialized views, data sharing and federated access to data lakes Amazon Redshift Amazon Aurora
  • 7. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift integration for Apache Spark NEW BUILD INTERACTIVE SPARK APPLICATIONS GA Redshift Connector for Spark Amazon Redshift Amazon Glue Amazon EMR • Apache Spark applications accessing Amazon Redshift data from AWS analytics services such as Amazon EMR, AWS Glue, and Amazon SageMaker • Build Apache Spark applications that read from and write to your Amazon Redshift data warehouse, without compromising performance or transactional consistency. Amazon SageMaker
  • 8. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift integration for Apache Spark BUILD INTERACTIVE SPARK APPLICATIONS NEW GA With Amazon EMR
  • 9. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift integration for Apache Spark BUILD INTERACTIVE SPARK APPLICATIONS NEW GA With Amazon EMR With AWS Glue
  • 10. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift integration for Apache Spark BUILD INTERACTIVE SPARK APPLICATIONS NEW GA With Amazon EMR With AWS Glue
  • 11. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift Multi-AZ for RA3 clusters NEW HIGHLY RESILIENT DATA WAREHOUSE Amazon Redshift managed storage AZ 1 AZ 2 Preview • Auto-failover with zero data loss and no manual intervention • Easy management through a single endpoint • Workload processing across AZs
  • 12. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift Auto-Copy From Amazon S3 NEW SIMPLIFIED & AUTOMATED FILE INGESTION FROM AMAZON S3 INTO REDSHIFT Preview • Simple, low code data ingestion • Easily convert your existing COPY statements into automatic ingestion jobs • Automatic ingestion of new data from Amazon S3 based on user defined configurations Amazon S3 Redshift Copy Job Redshift Table Continuously monitoring S3 folder New file(s) detected Ingestion automatically starts
  • 13. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS Glue for Ray SCALING YOUR DATA INTEGRATION WORKLOADS USING PYTHON • AWS Glue for Ray is a new engine option on AWS Glue. • Data engineers can use AWS Glue for Ray to process large datasets with Python and popular Python libraries. • AWS Glue for Ray facilitates the distributed processing of your Python code over multi-node clusters. NEW Preview AWS Glue
  • 14. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS Glue Data Quality DELIVER HIGH QUALITY DATA ACROSS YOUR DATA LAKES AND DATA PIPELINES • Automatic data quality rule recommendations based on your data • Keep data quality high with ongoing data analysis • Data quality for datasets in your data lake and data pipelines • Cost-effective to scale with pay-as-you-go billing, with no lock-in NEW Preview AWS Glue
  • 15. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Athena for Apache Spark BUILD INTERACTIVE SPARK APPLICATIONS • Interactive Spark applications start under a second and run faster with our optimized Spark runtime • Harness Spark for complex, powerful analytics using the expressive power of Python along with its wide ecosystem • Build applications without managing resources or configuring software with Athena NEW GA Amazon Athena
  • 16. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Athena for Apache Spark BUILD INTERACTIVE SPARK APPLICATIONS NEW GA
  • 17. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Athena for Apache Spark BUILD INTERACTIVE SPARK APPLICATIONS NEW GA
  • 18. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon RDS Optimized Writes for MySQL UP TO 2X HIGHER WRITE THROUGHPUT AT NO ADDITIONAL COST • Write only once when flushing data to durable storage without the need for the double write buffer. • Great fit for write-intensive database workloads, such as digital payments, financial trading, and online gaming. • Uses the AWS Nitro System, to reliably and durably write to table storage in one step. NEW GA Amazon RDS
  • 19. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon RDS Optimized Reads for MySQL UP TO 50% FASTER QUERIES • Places temporary tables generated by MySQL on the local NVMe-based SSD block-level storage that’s physically connected to the host server. • Great for workloads that rely heavily on temporary tables such as high-load joins with complex common table expressions (CTEs) etc. NEW GA Amazon RDS
  • 20. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon RDS Blue/Green Deployments SAFER, SIMPLER, AND FASTER UPDATES • Keep your current production database safe • Promote the staging environment to production in as fast as a minute • The staging environment stays in sync with the current production environment using logical replication. • No data loss NEW GA Amazon RDS
  • 21. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon DocumentDB Elastic Clusters WITH MONGODB COMPATIBILITY • Elastically scale workloads in minutes • Elastic Clusters automatically scales to handle virtually any number of reads and writes • Fully managed solution to scale document workloads of virtually any size and scale • Zero impact to application availability or performance NEW GA Amazon DocumentDB (with MongoDB compatibility)
  • 22. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon OpenSearch Serverless (Preview) RUN LARGE-SCALE SEARCH AND ANALYTICS WORKLOADS • Only pay for the resources that you consume. • Automatically scales resources to match your application's usage • Cost-effective for infrequent, intermittent, or unpredictable workloads. • Primary use-cases - Log analytics, Full-text search NEW Amazon OpenSearch Service Preview
  • 23. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon ElastiCache and Amazon MemoryDB for Redis • Redis 7 support in Amazon ElastiCache for Redis - Redis Functions, Sharded Pub/Sub and more • Data tiering for Amazon MemoryDB for Redis - Ideal for workloads that access up to 20% of their overall dataset • IAM Authentication in Amazon ElastiCache for Redis NEW GA Amazon ElastiCache for Redis REDIS 7, IAM AND DATA TIERING Amazon MemoryDB for Redis
  • 24. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS DMS Schema Conversion • Convert the schema, views, stored procedures, and functions from a source database into the schema for the target database service. • Prescriptive guidance on how to resolve any incompatibilities between the source and target database engines. NEW GA AWS Database Migration Service (DMS) MIGRATE DATABASES QUICKLY AND SECURELY TO AWS
  • 25. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Expanded API capabilities for Amazon QuickSight PROGRAMMATIC ACCESS TO THE UNDERLYING STRUCTURE OF QUICKSIGHT DASHBOARDS • Access underlying data models of Amazon QuickSight dashboards, reports, analyses and templates via the SDK. • Translate legacy BI assets to cloud-native dashboards quickly • Integrate into DevOps processes such as code reviews, audits, and audit every change before deployment NEW GA Amazon QuickSight
  • 26. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon QuickSight Paginated Reports DAY-TO-DAY DETAILED OPERATIONAL DATA IN CUSTOM FORMATS • Create, schedule, and share highly formatted multipage reports • Build all insights, independent of preferred consumption model, on single source of truth governed datasets • Single authoring experience for dashboards and reports • Pay for what you use. NEW GA Amazon QuickSight
  • 27. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon QuickSight Paginated Reports DAY-TO-DAY DETAILED OPERATIONAL DATA IN CUSTOM FORMATS NEW GA
  • 28. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon DataZone UNLOCK DATA ACROSS ORGANIZATIONAL BOUNDARIES WITH BUILT-IN GOVERNANCE Preview NEW • A new data management service • Discover and share data at scale across organizational • Data catalog populated by ML is easy to search using business terms • Integrated with Redshift, Athena, and QuickSight and provides APIs to third party sources
  • 29. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon DataZone UNLOCK DATA ACROSS ORGANIZATIONAL BOUNDARIES WITH BUILT-IN GOVERNANCE Preview NEW
  • 30. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Questions?
  • 31. © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.