www.immobilienscout24.de
Visual Dataflows with Apache NiFi
- and how they interact with AWS
AWS Berlin UserGroup – 2016-04-19 – Speaker: @KayLerch
Agenda
1. Why Apache NiFi?
2. That‘s Apache NiFi - Exploring the UI
3. AWS Integration Capabilities
4. AWS IoT – Basics (Recap)
5. Apache NiFi and AWS IoT
Apache NiFi & AWS | Kay Lerch
Why Apache NiFi
A brief overview of data processing and analysis
A brief overview of data processing and analysis
Stone age: no tooling at all
Data
Producers
Data
Consumers
Potential
Bottleneck
Integration
challenges
Left alone with
analytic challenges
unreliable
delivery
IoAT
A brief overview of data processing and analysis
Bronze age: invent the wheel (event broker) for reliable (message) transportation
Data
Producers
Data
Consumers
Event
Broker
Limited durability
Hidden complexities
Left alone with
analytic challenges
A brief overview of data processing and analysis
Industrialization: stores for massive production of durable yet unstructured information
Hidden complexities
Data
Producers
Data
Consumers
Event
Broker
Ingestion challenges
Realtime lag
Left alone with
analytic challenges
Data analysis challenges
Data processing challenges
Data security challenges
(Big) data stores
A brief overview of data processing and analysis
Digital age: realtime processing and analysis of (streaming) data
Hidden complexities
Data
Producers
Data
Consumers
Event
Broker
(Big) data stores
Ingestion challenges
Data analysis challenges
(Realtime) Data Processing & Analytics
Integration
challenges
Data security
challenges
A brief overview of data processing and analysis
That‘s quite a lot of tooling and technology …
Hidden complexities
Data
Producers
Data
Consumers
Event
Broker
(Big) data stores
Data analysis challenges
(Realtime) Data Processing & Analytics
Integration
challenges
Data security challenges
Ingestion challenges
Data security
challenges
If you want …
- a (realtime) big picture of your dataflows
- a way to trace the lineage of each data element
- the flexibility to change things on the fly
- to prioritize data
- to overcome the challenges of integrating a variety of technologies with one overarching solution
- to enforce security and compliance along dataflows
- to rely on extensibility driven by the open-source community
- to satisfy all those needs and keep your tools
- to get rid of only those tools focused on moving data, without making concessions to overall performance
… then you might love:
That’s Apache NiFi
That‘s Apache NiFi
in one page
Apache NiFi supports powerful and scalable directed graphs of data
routing, transformation, and system mediation logic.
Web-based user interface
Seamless experience between design, control, feedback, and monitoring
Highly configurable
Loss tolerant vs guaranteed delivery, Low latency vs high throughput, Dynamic prioritization, Flow can be modified at runtime, Back pressure
Data Provenance
Track dataflow from beginning to end
Designed for extension
Build your own processors and more, Enables rapid development and effective testing
Secure
SSL, SSH, HTTPS, encrypted content, etc..., Pluggable role-based authentication/authorization
Source: https://nifi.apache.org/
That‘s Apache NiFi
in real and feel
Go to NiFi’s interface and understand:
Processors
Templates
Concept of back pressure
Concept of data prioritization
Provenance graph
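Back pressure and prioritization can be pictured with a small model. The sketch below is not NiFi code: the `Connection` class, its threshold parameter and the rule that a lower number means higher urgency are all assumptions made for illustration. It only shows the idea that a queue between two processors refuses new items once a threshold is reached and hands the most urgent item downstream first.

```python
import heapq
from itertools import count


class Connection:
    """A bounded, prioritized queue between two processors (hypothetical model)."""

    def __init__(self, back_pressure_threshold):
        self.back_pressure_threshold = back_pressure_threshold
        self._heap = []
        self._order = count()  # tie-breaker keeps FIFO order within one priority

    def is_full(self):
        # Once the threshold is reached, the upstream side should pause.
        return len(self._heap) >= self.back_pressure_threshold

    def offer(self, flowfile, priority):
        """Enqueue unless back pressure applies; return True if accepted."""
        if self.is_full():
            return False
        heapq.heappush(self._heap, (priority, next(self._order), flowfile))
        return True

    def poll(self):
        """Return the most urgent item (lowest priority number), or None if empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]
```

An upstream producer would call `offer` and back off while it returns `False`; the downstream consumer calls `poll` at its own pace.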
NiFi Clustered Architecture
NiFi Cluster Manager (NCM), own JVM: Webserver, REST-API, Admin UI, Cluster Manager
Node (Primary), own JVM: Webserver, REST-API, Admin UI, Flow Controller, Processor 1, Processor 2, Isolated Processor, Controller Service 1 to n, Flowfile Repository, Content Repository, Provenance Repository
Node (Slave), own JVM: same components as the primary node
Embedded Apache Zookeeper
Connections shown: Heartbeat, Report change, Sync State, Leader election
AWS Integration Capabilities
Demos
to be covered
AWS Credential Provider Service
Integrating SQS
Integrating S3
Integrating Lambda
Integrating Kinesis Firehose
Integrating SNS
Integrating IoT
AWS IoT
Basics (Recap)
AWS IoT
The Shadow
Thing: reports its state to the shadow
Thing Shadow: mirrors the reported state
AWS services: get the reported state from the shadow or set a desired state in it
Thing Shadow: propagates the desired state
Thing: receives the desired state and fulfills it
Rule: subscribes to particular messages and routes each message to some AWS resource
Connections are secured with TLS 1.2 and governed by a policy
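The relation between reported and desired state can be sketched in a few lines. The example assumes the shadow is a JSON document with a `state` object holding `desired` and `reported` sections, as in the AWS IoT shadow format; the function name `shadow_delta` and the sample attributes are hypothetical, and only top-level attributes are compared, so this is a simplification of what the service computes.

```python
def shadow_delta(shadow):
    """Return the desired attributes that the thing has not reported yet.

    Simplified sketch: compares top-level attributes only.
    """
    state = shadow.get("state", {})
    desired = state.get("desired", {})
    reported = state.get("reported", {})
    return {
        key: value
        for key, value in desired.items()
        if key not in reported or reported[key] != value
    }


# Hypothetical shadow of a lamp that was asked to switch on.
example_shadow = {
    "state": {
        "desired": {"power": "on", "brightness": 80},
        "reported": {"power": "off", "brightness": 80},
    }
}
pending = shadow_delta(example_shadow)
```

Here `pending` holds only the `power` attribute, which is the part the thing still has to fulfill.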
AWS IoT
MQTT topics
Thing shadow topics in AWS IoT:
get: request state
get/accepted: get shadow state
get/rejected: get error
update: update state
update/accepted: confirmation
update/rejected: get error
update/delta: changed state
Thing topics name pattern: $aws/things/thing_name/...
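As a small illustration, the topic names can be assembled by a helper. The `shadow/` path segment follows the AWS IoT documentation for shadow topics; the function name `shadow_topic` and the thing name `lamp` are hypothetical.

```python
SHADOW_SUFFIXES = (
    "get",
    "get/accepted",
    "get/rejected",
    "update",
    "update/accepted",
    "update/rejected",
    "update/delta",
)


def shadow_topic(thing_name, suffix):
    """Build the MQTT topic for one shadow operation of a thing."""
    if suffix not in SHADOW_SUFFIXES:
        raise ValueError(f"unknown shadow topic suffix: {suffix}")
    return f"$aws/things/{thing_name}/shadow/{suffix}"


delta_topic = shadow_topic("lamp", "update/delta")
```

A client interested in state changes would subscribe to `delta_topic`, and a client requesting the current state would publish to the `get` topic and listen on `get/accepted` and `get/rejected`.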
Apache NiFi & AWS IoT
New processors
Apache NiFi & AWS IoT
Where NiFi comes in
If the managed services you want to integrate with your "things" run on AWS, you are good to go => thing rules
If not, you need either an MQTT client (=> live data) or an application that communicates with the managed AWS API (for shadow data)
AWS announced MQTT over WebSockets in January 2016
Which means you’re not limited to TLS connections anymore
Establish a durable connection to the AWS IoT endpoint
Then talk MQTT over WebSockets in order to subscribe or publish to the thing topics
AWS service limit on connection duration: 300 seconds
You need a way to reconnect your client to keep your MQTT subscriptions
NiFi processors have the potential to become MQTT clients
Apache NiFi & AWS IoT
GetIOTMqtt – an MQTT client
1. Establish connection to AWS IoT
2. Subscribe to the update topic of the thing shadow
3. Receive state whenever the thing updates its state
Output: FlowFile
Apache NiFi & AWS IoT
GetIOTMqtt – Reconnect accordingly
First of all: I don’t want to wait for the auto-termination. I want to act upfront.
AWS IoT does not support persistent client sessions
Therefore:
If I disconnect and then reconnect, there is a short gap in which I may miss a message
If I reconnect and then disconnect, there is a short overlap in which I may receive messages twice
Fortunately, the client has already accepted one of these effects through its quality-of-service level
If a subscription is desired with QoS=0 (“at most once” message delivery)
=> disconnect, then reconnect
=> maybe message loss
=> that’s fine
If a subscription is desired with QoS=1 (“at least once” message delivery)
=> reconnect, then disconnect
=> maybe duplicate message
=> that’s fine
QoS=2 (“exactly once” message delivery) is not supported by AWS IoT
Timeline for QoS=0: Session 1 closes before Session 2 connects; potential message loss in the gap
Timeline for QoS=1: Session 2 connects before Session 1 closes, and Session 3 connects before Session 2 closes; potential duplicates during each overlap
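The two reconnect orders can be simulated without any MQTT library. The sketch below is an illustration only, not the processor's implementation: the function names, the session lifetime of 290 seconds (chosen to stay below the 300-second limit mentioned earlier) and the handover time of 2 seconds are all assumptions.

```python
def plan_sessions(qos, start, lifetime, handover, count):
    """Return `count` (connect, close) intervals for a rolling reconnect.

    qos 0: close first, connect `handover` seconds later (gap, possible loss).
    qos 1: connect first, close `handover` seconds later (overlap, possible duplicates).
    """
    if qos not in (0, 1):
        raise ValueError("only QoS 0 and QoS 1 are covered here")
    sessions = []
    connect = start
    for _ in range(count):
        close = connect + lifetime
        sessions.append((connect, close))
        # The next session starts after the close (gap) or before it (overlap).
        connect = close + handover if qos == 0 else close - handover
    return sessions


def deliveries(message_times, sessions):
    """Count how often each message is received across all sessions."""
    return {
        t: sum(1 for connect, close in sessions if connect <= t < close)
        for t in message_times
    }


# Assumed numbers: 290 s session lifetime, 2 s handover.
qos0 = deliveries([100, 291, 300], plan_sessions(0, 0, 290, 2, 2))
qos1 = deliveries([100, 289, 300], plan_sessions(1, 0, 290, 2, 2))
```

With QoS 0 the message at second 291 falls into the gap and is never received; with QoS 1 the message at second 289 arrives in both overlapping sessions. Both outcomes match what the respective QoS level already allows.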
Apache NiFi & AWS IoT
GetIOTMqtt – Configuration
Apache NiFi & AWS IoT
GetIOTMqtt – Live demo
Apache NiFi & AWS IoT
PutIOTMqtt – instruct a „thing“ (but bypass the shadow)
Input: FlowFile
1. Establish connection to AWS IoT
2. Publish state to the update/delta topic, from which the thing updates its state
Output: FlowFile
Apache NiFi & AWS IoT
PutIOTMqtt – Configuration
Apache NiFi & AWS IoT
PutIOTMqtt – Live demo
Apache NiFi & AWS IoT
GetIOTShadow – constantly check last reported state
Thing: reports its state to the shadow (update)
GetIOTShadow: requests the shadow from AWS IoT
Input: FlowFile
Output: FlowFile
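Outside of NiFi, the same request can be sketched against the AWS data-plane API. The example assumes a client object that behaves like the boto3 `iot-data` client, whose `get_thing_shadow(thingName=...)` call returns a dictionary with a readable `payload` stream; the function name `read_reported_state` is hypothetical, and the processor shown in the talk may work differently.

```python
import json


def read_reported_state(iot_data_client, thing_name):
    """Fetch a thing shadow and return its last reported state.

    `iot_data_client` is assumed to behave like boto3.client("iot-data").
    """
    response = iot_data_client.get_thing_shadow(thingName=thing_name)
    document = json.loads(response["payload"].read())
    return document.get("state", {}).get("reported", {})
```

Calling this function on a schedule corresponds to "constantly check last reported state"; how often to poll is a design choice that depends on how fresh the state needs to be.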
Apache NiFi & AWS IoT
GetIOTShadow – Configuration
Apache NiFi & AWS IoT
PutIOTShadow – instruct a „thing“ over its shadow
PutIOTShadow: updates the shadow with a desired state
Thing Shadow: passes the desired state on to the thing (update/delta)
Input: FlowFile
Output: FlowFile
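Setting a desired state comes down to sending a small JSON document to the shadow. The sketch assumes a client that behaves like the boto3 `iot-data` client with its `update_thing_shadow(thingName=..., payload=...)` call; the helper names and the sample attribute are hypothetical and not taken from the processor shown in the talk.

```python
import json


def desired_state_payload(desired):
    """Serialize a shadow update that only sets the desired section."""
    return json.dumps({"state": {"desired": desired}}).encode("utf-8")


def request_state(iot_data_client, thing_name, desired):
    """Ask a thing, via its shadow, to reach the given state.

    `iot_data_client` is assumed to behave like boto3.client("iot-data").
    """
    return iot_data_client.update_thing_shadow(
        thingName=thing_name,
        payload=desired_state_payload(desired),
    )


payload = desired_state_payload({"power": "on"})
```

Because only the `desired` section is sent, the reported state stays untouched until the thing itself confirms the change.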
Apache NiFi & AWS IoT
PutIOTShadow – Configuration
More to come
MiNiFi (lightweight agents as data collectors)
Variable registry
Improvements to HA / cluster management
Multi-tenancy
More processors
Extension registry (choose NARs from a central repository)
Thanks for your attention. Any questions?
Contact:
Immobilien Scout GmbH
Andreasstraße 10
10243 Berlin
Kay Lerch
Fon +49 30 24 301-1149
kay.lerch@immobilienscout24.de

APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 

Recently uploaded (20)

Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 

AWS User Group Meetup Berlin - Kay Lerch on Apache NiFi (2016-04-19)

  • 1. www.immobilienscout24.de Visual Dataflows with Apache NiFi - and how they interact with AWS AWS Berlin UserGroup – 2016-04-19 – Speaker: @KayLerch
  • 2. Agenda
      1. Why Apache NiFi?
      2. That's Apache NiFi - Exploring the UI
      3. AWS Integration Capabilities
      4. AWS IoT – Basics (Recap)
      5. Apache NiFi and AWS IoT
      Apache NiFi & AWS | Kay Lerch
  • 3. Why Apache NiFi: A brief overview of data processing and analysis
  • 4. A brief overview of data processing and analysis. Stone age: no tooling at all. Diagram labels: Data Producers, Data Consumers, IoAT, potential bottleneck, integration challenges, unreliable delivery, left alone with analytic challenges.
  • 5. A brief overview of data processing and analysis. Bronze age: invent the wheel (event broker) for reliable (message) transportation. Diagram labels: Data Producers, Data Consumers, Event Broker, limited durability, hidden complexities, left alone with analytic challenges.
  • 6. A brief overview of data processing and analysis. Industrialization: stores for massive production of durable yet unstructured information. Diagram labels: Data Producers, Data Consumers, Event Broker, (Big) data stores, hidden complexities, ingestion challenges, realtime lag, data analysis challenges, data processing challenges, data security challenges, left alone with analytic challenges.
  • 7. A brief overview of data processing and analysis. Digital age: realtime processing and analysis of (streaming) data. Diagram labels: Data Producers, Data Consumers, Event Broker, (Big) data stores, (Realtime) Data Processing & Analytics, hidden complexities, ingestion challenges, data analysis challenges, integration challenges, data security challenges.
  • 8. A brief overview of data processing and analysis. That's quite a lot of tooling and technology … Diagram labels: Data Producers, Data Consumers, Event Broker, (Big) data stores, (Realtime) Data Processing & Analytics, hidden complexities, ingestion challenges, data analysis challenges, integration challenges, data security challenges.
  • 9. A brief overview of data processing and analysis. That's quite a lot of tooling and technology … Diagram labels: Data Producers, Data Consumers, Event Broker, (Big) data stores, (Realtime) Data Processing & Analytics, hidden complexities, ingestion challenges, data analysis challenges, integration challenges, data security challenges.
      If you want to …
      • get a (realtime) big picture of your dataflows
      • keep an overview of the lineage of each data element
      • have the flexibility to change things on the fly
      • prioritize data
      • overcome the challenges of integrating a variety of technologies with one overarching solution
      • enforce security and compliance along dataflows
      • rely on extensibility driven by the open-source community
      • satisfy all those needs and keep your tools
      • get rid of only those tools that focus on moving data, without making concessions to overall performance
      … then you might love:
  • 10. That's Apache NiFi
  • 11. That's Apache NiFi in one page. Apache NiFi supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.
      • Web-based user interface: seamless experience between design, control, feedback, and monitoring
      • Highly configurable: loss tolerant vs guaranteed delivery, low latency vs high throughput, dynamic prioritization, flow can be modified at runtime, back pressure
      • Data provenance: track dataflow from beginning to end
      • Designed for extension: build your own processors and more, enables rapid development and effective testing
      • Secure: SSL, SSH, HTTPS, encrypted content, etc.; pluggable role-based authentication/authorization
      Source: https://nifi.apache.org/
  • 12. That's Apache NiFi in real and feel. Go to NiFi's interface and understand:
      • Processors
      • Templates
      • Concept of back pressure
      • Concept of data prioritization
      • Provenance graph
  • 13. NiFi Clustered Architecture (diagram). The NiFi cluster is drawn as a NiFi Cluster Manager (NCM), a primary node and a slave node, each in its own JVM. Diagram labels: NCM with webserver (REST API, admin UI) and cluster manager; nodes with webserver (REST API, admin UI), flow controller, processors 1 and 2, isolated processor, controller services 1 to n, and FlowFile, content and provenance repositories; embedded Apache ZooKeeper; arrows labelled heartbeat, leader election, report change and sync state.
  • 14. AWS Integration Capabilities
  • 15. Demos to be covered
      • AWS Credential Provider Service
      • Integrating SQS
      • Integrating S3
      • Integrating Lambda
      • Integrating Kinesis Firehose
      • Integrating SNS
      • Integrating IoT
  • 16. AWS IoT Basics (Recap)
  • 17. AWS IoT: The Shadow (diagram). Components: AWS IoT, Thing, Thing Shadow, Rule, Policy, AWS Services, some AWS resource; connections use TLS 1.2. Labelled interactions: reports state; mirrors state in shadow; gets reported state or sets desired state; propagates desired state; receives desired state; fulfills desired state; subscribes to particular messages; routes message.
  • 18. AWS IoT MQTT topics (diagram: AWS IoT, Thing, Shadow). Thing topics name pattern: $aws/things/thing_name/...
      • get: request state
      • get/accepted: get shadow state
      • get/rejected: get error
      • update: update state
      • update/accepted: confirmation
      • update/rejected: get error
      • update/delta: changed state
  • 19. Apache NiFi & AWS IoT: New processors
  • 20. Apache NiFi & AWS IoT: Where NiFi comes in
      • If the managed services you want to integrate with your "things" run on AWS, you are good to go => Thing rules
      • If not, you need either an MQTT client (=> live data) or an application that communicates with the managed AWS API (for shadow data)
      • AWS announced MQTT over WebSockets in January 2016, which means you are no longer limited to MQTT over plain TLS connections
      • Establish a durable connection to the AWS IoT endpoint, then talk MQTT over WebSockets in order to subscribe or publish to the thing topics
      • AWS service limit on connection duration: 300 seconds
      • You need a way to reconnect your client in order to hold your MQTT subscriptions
      • NiFi processors have the potential to become MQTT clients
  • 21. Apache NiFi & AWS IoT: GetIOTMqtt – an MQTT client. Diagram labels: AWS IoT, Thing, Shadow, topic update, update state; numbered steps: 1. establish connection, 2. subscribe, 3. receive state; output: flow file.
  • 22. Apache NiFi & AWS IoT: GetIOTMqtt – Reconnect accordingly
      • First of all: I don't want to wait for the auto-termination. I want to act upfront.
      • AWS IoT does not support persistent client sessions. Therefore:
      • If I disconnect and then reconnect, there is a short gap in which I may miss a message
      • If I reconnect and then disconnect, there is a short overlap in which I may receive messages twice
      • Fortunately, one of these effects is officially accepted by the client anyway because of the quality of service level
      • If a subscription is desired with QoS=0 ("at most once" message delivery) => disconnect, then reconnect => maybe message loss => that's fine
      • If a subscription is desired with QoS=1 ("at least once" message delivery) => reconnect, then disconnect => maybe duplicate messages => that's fine
      • QoS=2 ("exactly once" message delivery) is not supported by AWS IoT
      Timeline diagrams: closing session 1 before connecting session 2 leaves a gap with potential message loss; connecting the next session before closing the previous one (sessions 1, 2, 3) creates an overlap with potential duplicates.
  • 23. Apache NiFi & AWS IoT: GetIOTMqtt – Configuration
  • 24. Apache NiFi & AWS IoT: GetIOTMqtt – Live demo
  • 25. Apache NiFi & AWS IoT: GetIOTMqtt – Live demo
  • 26. Apache NiFi & AWS IoT: PutIOTMqtt – instruct a "thing" (but bypass the shadow). Diagram labels: AWS IoT, Thing, Shadow, topic update/delta, update state; numbered steps: 1. establish connection, 2. publish state; flow file in, flow file out.
  • 27. Apache NiFi & AWS IoT: PutIOTMqtt – Configuration
  • 28. Apache NiFi & AWS IoT: PutIOTMqtt – Live demo
  • 29. Apache NiFi & AWS IoT: GetIOTShadow – constantly check last reported state. Diagram labels: AWS IoT, Thing, Shadow, topic update, report state, request shadow; flow file in, flow file out.
  • 30. Apache NiFi & AWS IoT: GetIOTShadow – Configuration
  • 31. Apache NiFi & AWS IoT: PutIOTShadow – instruct a "thing" over its shadow. Diagram labels: AWS IoT, Thing, Shadow, topic update/delta, desire state, update shadow; flow file in, flow file out.
  • 32. Apache NiFi & AWS IoT: PutIOTShadow – Configuration
  • 33. More to come
      • MiNiFi (lightweight agents as data collectors)
      • Variable registry
      • Improvements on HA / cluster management
      • Multi-tenancy
      • More processors
      • Extension registry (choose NARs from a central repository)
  • 34. www.immobilienscout24.de Thanks for your attention. Any questions? Contact: Immobilien Scout GmbH, Andreasstraße 10, 10243 Berlin. Kay Lerch, Phone +49 30 24 301-1149, kay.lerch@immobilienscout24.de
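
The shadow topics on slide 18 share one prefix per thing, so they can be built and parsed mechanically. The sketch below is only an illustration: the helper names `shadow_topic` and `parse_shadow_topic` are hypothetical and not part of NiFi or any AWS SDK, and the `/shadow/` path segment comes from the AWS IoT shadow topic naming rather than from the slide, which abbreviates the pattern as `$aws/things/thing_name/...`.

```python
# Illustrative helpers only; names are hypothetical, not a NiFi or AWS API.

# Topic suffix -> meaning, as labelled on slide 18.
SHADOW_TOPICS = {
    "get": "request state",
    "get/accepted": "get shadow state",
    "get/rejected": "get error",
    "update": "update state",
    "update/accepted": "confirmation",
    "update/rejected": "get error",
    "update/delta": "changed state",
}


def shadow_topic(thing_name: str, suffix: str) -> str:
    """Build the full MQTT topic for one shadow operation of a thing."""
    if suffix not in SHADOW_TOPICS:
        raise ValueError(f"unknown shadow topic suffix: {suffix!r}")
    # Reject separators and wildcards that would change the topic structure.
    if not thing_name or any(ch in thing_name for ch in "/#+"):
        raise ValueError(f"invalid thing name: {thing_name!r}")
    return f"$aws/things/{thing_name}/shadow/{suffix}"


def parse_shadow_topic(topic: str) -> tuple:
    """Split a shadow topic back into (thing_name, suffix)."""
    parts = topic.split("/")
    if (len(parts) < 5 or parts[0] != "$aws"
            or parts[1] != "things" or parts[3] != "shadow"):
        raise ValueError(f"not a shadow topic: {topic!r}")
    suffix = "/".join(parts[4:])
    if suffix not in SHADOW_TOPICS:
        raise ValueError(f"unknown shadow topic suffix: {suffix!r}")
    return parts[2], suffix
```

A subscriber interested in state changes of a thing named `lamp` (an example name) would use `shadow_topic("lamp", "update/delta")`.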

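The reconnect rule from slide 22 amounts to choosing the order of "close" and "connect" by QoS level. The following is a minimal sketch under stated assumptions, not the GetIOTMqtt implementation: `FakeSession` is a hypothetical stand-in that only records calls instead of opening a network connection, and the 30-second safety margin is an assumed value; only the 300-second limit and the QoS ordering come from the slides.

```python
from dataclasses import dataclass


@dataclass
class FakeSession:
    """Hypothetical stand-in for an MQTT connection; it only logs calls."""
    name: str
    log: list

    def connect(self) -> None:
        self.log.append(f"connect {self.name}")

    def subscribe(self, topic: str, qos: int) -> None:
        self.log.append(f"subscribe {self.name} {topic} qos={qos}")

    def close(self) -> None:
        self.log.append(f"close {self.name}")


def should_rotate(age_seconds: float, limit_seconds: float = 300.0,
                  safety_margin: float = 30.0) -> bool:
    """Act upfront: rotate before the connection limit is reached.

    The margin is an assumption for illustration, not a value from the slides.
    """
    return age_seconds >= limit_seconds - safety_margin


def rotate_session(old: FakeSession, new: FakeSession,
                   topic: str, qos: int) -> FakeSession:
    """Replace an expiring session in the order that matches the QoS level."""
    if qos == 0:
        # At most once: close first, accept a gap with possible message loss.
        old.close()
        new.connect()
        new.subscribe(topic, qos)
    elif qos == 1:
        # At least once: overlap sessions, accept possible duplicates.
        new.connect()
        new.subscribe(topic, qos)
        old.close()
    else:
        raise ValueError("only QoS 0 and 1 are handled; the slides state "
                         "that AWS IoT does not support QoS 2")
    return new
```

With QoS 1 a downstream consumer still has to tolerate duplicates, for example by deduplicating on a message identifier carried in the payload.
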
Editor's Notes

  1. Kafka, RabbitMQ, SNS, TIBCO
  2. HBase, HDFS, Cassandra, S3, DynamoDB, Redshift, RDS
  3. Hadoop, Spark, Storm, Flink, Elasticsearch, Solr, EMR, Lambda, Kinesis