SlideShare a Scribd company logo
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Considerations for Building Your
First Streaming Application
0 7 / 1 1 / 2 0 1 8
Praveen Gattu
Software Developer Manager
AWS, Amazon Kinesis
Ryan Nienhuis
Senior Technical Product
Manager
AWS, Amazon Kinesis
A N T 3 5 9
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Talk outline
• Use case – operational dashboard
• Implement real time and batch analysis
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Application architecture
Generate web logs
Collect web logs
and deliver to
Amazon Simple
Storage Service
(Amazon S3)
Process & compute
aggregate web log metrics
Deliver processed web
log metrics to Amazon
CloudWatch
Raw web logs from
Data Firehose
Interactive
analysisof
web logs
Interactive
querying of
web logs
Alarm
Client
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Streaming with Amazon Kinesis
Easily collect, process, and analyze video and data streams in real time
Capture, process, and
store video streams
Load data streams
into AWS data stores
Analyze data streams
in real time
Capture, process, and
store data streams
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Application architecture
Generate web logs
Collect web logs
and deliver to
Amazon Simple
Storage Service
(Amazon S3)
Process & compute
aggregate web log metrics
Deliver processed web
log metrics to Amazon
CloudWatch
Raw web logs from
Data Firehose
Interactive
analysisof
web logs
Interactive
querying of
web logs
Alarm
Client
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Collect logs with a Kinesis Data Firehose delivery stream
We are going to
• Write to a Data Firehose delivery stream - Simulate writing transformed
Apache web logs to a Kinesis Data Firehose delivery stream that is
configured to deliver data into an S3 bucket
• There are many different libraries that can be used to write data to a Data
Firehose delivery stream; one popular option is called the Amazon Kinesis
Agent
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Collect logs with a Kinesis Data Firehose delivery stream
• So that we don’t have to install or set up software on your machine, we are
going to use a lambda function to simulate using the Amazon Kinesis agent.
The lambda function can populate a Data Firehose delivery stream using a
template and is simple to setup.
• Let’s get started!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Kinesis Data Firehose delivery to S3 stats
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Monitoring Kinesis Data Firehose delivery to S3
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Application architecture
Generate web logs
Collect web logs
and deliver to
Amazon Simple
Storage Service
(Amazon S3)
Process & compute
aggregate web log metrics
Deliver processed web
log metrics to Amazon
CloudWatch
Raw web logs from
Data Firehose
Interactive
analysisof
web logs
Interactive
querying of
web logs
Alarm
Client
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Kinesis Data Analytics
• Powerful real time applications
• Easy to use, fully managed
• Automatic elasticity
• Windowed aggregations
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Kinesis Data Analytics applications
Easily write SQL code to process streaming data
Connect to streaming source
Continuously deliver SQL results
1011101
1011010
0101010
1011101
1011010
0101010
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Process data using Kinesis Data Analytics
• SQL query to compute an aggregate metric for an interesting statistic on
the incoming data – Error Count
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
View sample records in Amazon Kinesis Data Analytics app
• Review sample records delivered to the source stream
(SOURCE_SQL_STREAM_001)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Kinesis Data Analytics application metadata
• Note that Amazon Kinesis adds metadata to each record being sent that was
shown in the formatted record sample
• The ROWTIME represents the time when the Kinesis application inserts a
row in the first in-application stream. It’s a special column used for time
series analytics. This is also known as a the processing time.
• The APPROXIMATE_ARRIVAL_TIME is the time the record was added to the
streaming source. This is also known as ingest time or server-side time.
• The event time is the timestamp when the event occurred. It’s a also called
client side time. Its useful because it’s the time when an event occurred at the
client.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Calculate an aggregate metric
Tumbling
Sliding
Custom
• Fixed size and non-overlapping
• Use FLOOR() or STEP()function in a GROUP BY statement
• Fixed size and overlapping; row boundaries are determined when
new rows enter window
• Use standard OVERand WINDOW clause
• Not fixed size and overlapping; row boundaries by conditions
• Implementations vary, but typically require two steps (step 1 –
identify boundaries, step 2 – perform computation)
Stagger • Not fixed size and non-overlapping; windows open when the first
event matching the partition key arrives
• Use WINDOWED BY STAGGER and PARTITION BY statements
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Calculate error count metric
In Kinesis Data Analytics
Application Editor Page author
following SQL
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Application architecture
Generate web logs
Collect web logs
and deliver to
Amazon Simple
Storage Service
(Amazon S3)
Process & compute
aggregate web log metrics
Deliver processed web
log metrics to Amazon
CloudWatch
Raw web logs from
Data Firehose
Interactive
analysisof
web logs
Interactive
querying of
web logs
Alarm
Client
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deliver output data to Amazon CloudWatch
• Connect Kinesis Data Analytics output to lambda function
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deliver output data to Amazon CloudWatch
• Lambda function delivers results to CloudWatch metrics
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Real-time alerts on error rate from CloudWatch alarms
• Alarms fire when error-rate breaches a threshold
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
HOW TO KNOW THE IMPACT ?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Application architecture
Generate web logs
Collect web logs
and deliver to
Amazon Simple
Storage Service
(Amazon S3)
Process & compute
aggregate web log metrics
Deliver processed web
log metrics to Amazon
CloudWatch
Raw web logs from
Data Firehose
Interactive
analysisof
web logs
Interactive
querying of
web logs
Alarm
Client
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Interactive query service
• Query directly from Amazon S3
• Use ANSI SQL
• Serverless
• Multiple data formats
• Cost effective
Amazon
Athena
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Familiar technologies under the covers
Used for SQL Queries
In-memory distributed query engine
ANSI-SQL compatible with extensions
Used for DDL functionality
Complex data types
Multitude of formats
Supports data partitioning
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Want to learn more?
Workshop sessions covering streaming and big data
• ANT213-R and ANT213-R1 – Build Your First Big Data Application on
AWS
• ANT362 - Use Streaming Data to Gain Real-Time Insights into Your
Business
• ANT318-R and ANT318-R1 - Build, Deploy and Serve Machine
Learning Models on Streaming Data Using Amazon SageMaker,
Apache Spark on Amazon EMR and Amazon Kinesis
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Praveen Gattu
Ryan Nienhuis
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

More Related Content

What's hot

Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Amazon Web Services
 
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Amazon Web Services
 
Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...
Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...
Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...
Amazon Web Services
 
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Amazon Web Services
 
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Amazon Web Services
 
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Amazon Web Services
 
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
Amazon Web Services
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
Amazon Web Services
 
Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...
Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...
Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...
Amazon Web Services
 
Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...
Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...
Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...
Amazon Web Services
 
Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018
Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018
Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018
Amazon Web Services
 
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech TalksHow to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
Amazon Web Services
 
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
Amazon Web Services
 
Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...
Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...
Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...
Amazon Web Services
 
Loading Data into Redshift
Loading Data into RedshiftLoading Data into Redshift
Loading Data into Redshift
Amazon Web Services
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
Amazon Web Services
 
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Amazon Web Services
 
Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...
Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...
Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...
Amazon Web Services
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Amazon Web Services
 
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Amazon Web Services
 

What's hot (20)

Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
 
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
 
Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...
Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...
Effective Data Lakes: Challenges and Design Patterns (ANT316) - AWS re:Invent...
 
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
 
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018
 
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
Building Your Geospatial Data Lake (WPS324) - AWS re:Invent 2018
 
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
How to Build HR Lakes on AWS to Unlock New Business Insights (DAT367) - AWS r...
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
 
Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...
Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...
Learn How You Can Accelerate Engineering Workloads with AppStream 2.0 (BAP318...
 
Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...
Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...
Enable Programmatic and Federated Access to Amazon Athena (ANT380-R1) - AWS r...
 
Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018
Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018
Don’t Wait Until Tomorrow: From Batch to Streaming (ANT360) - AWS re:Invent 2018
 
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech TalksHow to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
 
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
 
Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...
Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...
Data Patterns and Analysis with Amazon Neptune: A Case Study in Healthcare Bi...
 
Loading Data into Redshift
Loading Data into RedshiftLoading Data into Redshift
Loading Data into Redshift
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
 
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
Query in Place with AWS (STG315-R1) - AWS re:Invent 2018
 
Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...
Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...
Discover & Migrate at Scale with AWS Migration Hub & Application Discovery Se...
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
 
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
Running Oracle Databases on Amazon RDS and Migrating to PostgreSQL (DAT307-R1...
 

Similar to Considerations for Building Your First Streaming Application (ANT359) - AWS re:Invent 2018

Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...
Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...
Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...
Amazon Web Services
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
Amazon Web Services
 
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Amazon Web Services
 
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Amazon Web Services
 
Analyzing Streams: Data Analytics Week SF
Analyzing Streams: Data Analytics Week SFAnalyzing Streams: Data Analytics Week SF
Analyzing Streams: Data Analytics Week SF
Amazon Web Services
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
Amazon Web Services
 
Analyzing Streams: Data Analytics Week at the SF Loft
Analyzing Streams: Data Analytics Week at the SF LoftAnalyzing Streams: Data Analytics Week at the SF Loft
Analyzing Streams: Data Analytics Week at the SF Loft
Amazon Web Services
 
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Amazon Web Services
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
Amazon Web Services
 
BDA309 Build Your First Big Data Application on AWS
BDA309 Build Your First Big Data Application on AWSBDA309 Build Your First Big Data Application on AWS
BDA309 Build Your First Big Data Application on AWS
Amazon Web Services
 
Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018
Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018
Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018
Amazon Web Services
 
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Amazon Web Services
 
Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...
Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...
Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...
Amazon Web Services
 
From Batch to Streaming - How Amazon Flex Uses Real-time Analytics
From Batch to Streaming - How Amazon Flex Uses Real-time AnalyticsFrom Batch to Streaming - How Amazon Flex Uses Real-time Analytics
From Batch to Streaming - How Amazon Flex Uses Real-time Analytics
Amazon Web Services
 
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
AWS Summits
 
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Amazon Web Services
 
SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study
 SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study
SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study
Amazon Web Services
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
Amazon Web Services
 
Workshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data LakeWorkshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data Lake
Amazon Web Services
 
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
AWS Riyadh User Group
 

Similar to Considerations for Building Your First Streaming Application (ANT359) - AWS re:Invent 2018 (20)

Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...
Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...
Real-Time Web Analytics with Amazon Kinesis Data Analytics (ADT401) - AWS re:...
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
 
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
Using Amazon VPC Flow Logs for Predictive Security Analytics (NET319) - AWS r...
 
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
Serverless Stream Processing Pipeline Best Practices (SRV316-R1) - AWS re:Inv...
 
Analyzing Streams: Data Analytics Week SF
Analyzing Streams: Data Analytics Week SFAnalyzing Streams: Data Analytics Week SF
Analyzing Streams: Data Analytics Week SF
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
 
Analyzing Streams: Data Analytics Week at the SF Loft
Analyzing Streams: Data Analytics Week at the SF LoftAnalyzing Streams: Data Analytics Week at the SF Loft
Analyzing Streams: Data Analytics Week at the SF Loft
 
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
 
BDA309 Build Your First Big Data Application on AWS
BDA309 Build Your First Big Data Application on AWSBDA309 Build Your First Big Data Application on AWS
BDA309 Build Your First Big Data Application on AWS
 
Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018
Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018
Real Time Data Ingestion & Analysis - AWS Summit Sydney 2018
 
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
 
Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...
Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...
Analyzing Data Streams in Real Time with Amazon Kinesis: PNNL's Serverless Da...
 
From Batch to Streaming - How Amazon Flex Uses Real-time Analytics
From Batch to Streaming - How Amazon Flex Uses Real-time AnalyticsFrom Batch to Streaming - How Amazon Flex Uses Real-time Analytics
From Batch to Streaming - How Amazon Flex Uses Real-time Analytics
 
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
 
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
 
SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study
 SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study
SRV316 Serverless Data Processing at Scale: An Amazon.com Case Study
 
Analyzing Streams
Analyzing StreamsAnalyzing Streams
Analyzing Streams
 
Workshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data LakeWorkshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data Lake
 
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
Amazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
Amazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
Amazon Web Services
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Amazon Web Services
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
Amazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
Amazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Amazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
Amazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Amazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Considerations for Building Your First Streaming Application (ANT359) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Considerations for Building Your First Streaming Application 0 7 / 1 1 / 2 0 1 8 Praveen Gattu Software Developer Manager AWS, Amazon Kinesis Ryan Nienhuis Senior Technical Product Manager AWS, Amazon Kinesis A N T 3 5 9
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Talk outline • Use case – operational dashboard • Implement real time and batch analysis
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Application architecture Generate web logs Collect web logs and deliver to Amazon Simple Storage Service (Amazon S3) Process & compute aggregate web log metrics Deliver processed web log metrics to Amazon CloudWatch Raw web logs from Data Firehose Interactive analysisof web logs Interactive querying of web logs Alarm Client
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Streaming with Amazon Kinesis Easily collect, process, and analyze video and data streams in real time Capture, process, and store video streams Load data streams into AWS data stores Analyze data streams in real time Capture, process, and store data streams
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Application architecture Generate web logs Collect web logs and deliver to Amazon Simple Storage Service (Amazon S3) Process & compute aggregate web log metrics Deliver processed web log metrics to Amazon CloudWatch Raw web logs from Data Firehose Interactive analysisof web logs Interactive querying of web logs Alarm Client
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Collect logs with a Kinesis Data Firehose delivery stream We are going to • Write to a Data Firehose delivery stream - Simulate writing transformed Apache web logs to a Kinesis Data Firehose delivery stream that is configured to deliver data into an S3 bucket • There are many different libraries that can be used to write data to a Data Firehose delivery stream; one popular option is called the Amazon Kinesis Agent
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Collect logs with a Kinesis Data Firehose delivery stream • So that we don’t have to install or set up software on your machine, we are going to use a lambda function to simulate using the Amazon Kinesis agent. The lambda function can populate a Data Firehose delivery stream using a template and is simple to setup. • Let’s get started!
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Kinesis Data Firehose delivery to S3 stats
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Monitoring Kinesis Data Firehose delivery to S3
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Application architecture Generate web logs Collect web logs and deliver to Amazon Simple Storage Service (Amazon S3) Process & compute aggregate web log metrics Deliver processed web log metrics to Amazon CloudWatch Raw web logs from Data Firehose Interactive analysisof web logs Interactive querying of web logs Alarm Client
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Kinesis Data Analytics • Powerful real time applications • Easy to use, fully managed • Automatic elasticity • Windowed aggregations
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Kinesis Data Analytics applications Easily write SQL code to process streaming data Connect to streaming source Continuously deliver SQL results 1011101 1011010 0101010 1011101 1011010 0101010
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Process data using Kinesis Data Analytics • SQL query to compute an aggregate metric for an interesting statistic on the incoming data – Error Count
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. View sample records in Amazon Kinesis Data Analytics app • Review sample records delivered to the source stream (SOURCE_SQL_STREAM_001)
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Kinesis Data Analytics application metadata • Note that Amazon Kinesis adds metadata to each record being sent that was shown in the formatted record sample • The ROWTIME represents the time when the Kinesis application inserts a row in the first in-application stream. It’s a special column used for time series analytics. This is also known as a the processing time. • The APPROXIMATE_ARRIVAL_TIME is the time the record was added to the streaming source. This is also known as ingest time or server-side time. • The event time is the timestamp when the event occurred. It’s a also called client side time. Its useful because it’s the time when an event occurred at the client.
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Calculate an aggregate metric Tumbling Sliding Custom • Fixed size and non-overlapping • Use FLOOR() or STEP()function in a GROUP BY statement • Fixed size and overlapping; row boundaries are determined when new rows enter window • Use standard OVERand WINDOW clause • Not fixed size and overlapping; row boundaries by conditions • Implementations vary, but typically require two steps (step 1 – identify boundaries, step 2 – perform computation) Stagger • Not fixed size and non-overlapping; windows open when the first event matching the partition key arrives • Use WINDOWED BY STAGGER and PARTITION BY statements
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Calculate error count metric In Kinesis Data Analytics Application Editor Page author following SQL
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Application architecture Generate web logs Collect web logs and deliver to Amazon Simple Storage Service (Amazon S3) Process & compute aggregate web log metrics Deliver processed web log metrics to Amazon CloudWatch Raw web logs from Data Firehose Interactive analysisof web logs Interactive querying of web logs Alarm Client
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deliver output data to Amazon CloudWatch • Connect Kinesis Data Analytics output to lambda function
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deliver output data to Amazon CloudWatch • Lambda function delivers results to CloudWatch metrics
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Real-time alerts on error rate from CloudWatch alarms • Alarms fire when error-rate breaches a threshold
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. HOW TO KNOW THE IMPACT ?
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Application architecture Generate web logs Collect web logs and deliver to Amazon Simple Storage Service (Amazon S3) Process & compute aggregate web log metrics Deliver processed web log metrics to Amazon CloudWatch Raw web logs from Data Firehose Interactive analysisof web logs Interactive querying of web logs Alarm Client
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Interactive query service • Query directly from Amazon S3 • Use ANSI SQL • Serverless • Multiple data formats • Cost effective Amazon Athena
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Familiar technologies under the covers Used for SQL Queries In-memory distributed query engine ANSI-SQL compatible with extensions Used for DDL functionality Complex data types Multitude of formats Supports data partitioning
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Want to learn more? Workshop sessions covering streaming and big data • ANT213-R and ANT213-R1 – Build Your First Big Data Application on AWS • ANT362 - Use Streaming Data to Gain Real-Time Insights into Your Business • ANT318-R and ANT318-R1 - Build, Deploy and Serve Machine Learning Models on Streaming Data Using Amazon SageMaker, Apache Spark on Amazon EMR and Amazon Kinesis
  • 33. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Praveen Gattu Ryan Nienhuis
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.