The document discusses how Apache NiFi can empower self-organizing teams by allowing them to control their own data movement pipelines. It describes NiFi as a data-movement production line that enables teams to build and change pipelines quickly. This helps reduce time-to-production and feedback loops by moving responsibility for data integration to individual teams rather than having changes controlled in a centralized manner.
[ 1 min ] [ 1.12 min ]
Hi There! I’m Sebastian
I’m a senior consultant here in EMEA and I focus on HDF technologies and Apache NiFi
Started using NiFi about three years ago in Australia.
Pre open-source days -- very different to now but that early learning has definitely helped
Using NiFi ever since those early days and really enjoyed the NiFi journey.
Fantastic product that really solves a lot of problems unlike anything else out there
One of the most transformational NiFi applications I have seen
[ 1 min ] [ 1 min ]
So first a quick poll:
Who has heard of NiFi?
Keep them up if you know what NiFi does?
Keep them up if you have ever used NiFi?
I summarise NiFi as the Data-Movement Production-line
[ 1 min ]
Data movement
Moving Data around
See on image
Business units
Machines
Bandwidth links
Availability times
Sounds easy but it isn’t
There are some tools that specialise in certain areas, but *in general*, if you're moving data, NiFi is probably what you want
[ 1:50 min ]
Production line
Flow based programming model
FlowFiles - the actual data - product
Processors - the work units
Queues - connectors
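As a rough mental model (not NiFi's actual implementation), the production-line analogy above can be sketched in a few lines of Python. The FlowFile, Processor and Queue names mirror the bullets; the uppercase transform is a made-up example of a work unit:

```python
from collections import deque
from dataclasses import dataclass, field

# Toy sketch of the flow-based model: FlowFiles (the data) move between
# Processors (the work units) through Queues (the connectors).

@dataclass
class FlowFile:
    content: bytes
    attributes: dict = field(default_factory=dict)

class Queue:
    """A connector between two processors."""
    def __init__(self):
        self._items = deque()
    def put(self, flowfile):
        self._items.append(flowfile)
    def get(self):
        return self._items.popleft() if self._items else None

class UppercaseProcessor:
    """A made-up work unit: take a FlowFile in, transform it, pass it on."""
    def __init__(self, inbound, outbound):
        self.inbound, self.outbound = inbound, outbound
    def on_trigger(self):
        ff = self.inbound.get()
        if ff is not None:
            ff.content = ff.content.upper()
            ff.attributes["processed.by"] = "UppercaseProcessor"
            self.outbound.put(ff)

# Wire a tiny two-queue "production line" and push one FlowFile through it.
inbound, outbound = Queue(), Queue()
proc = UppercaseProcessor(inbound, outbound)
inbound.put(FlowFile(b"hello nifi"))
proc.on_trigger()
result = outbound.get()
print(result.content)  # b'HELLO NIFI'
```

In real NiFi the framework schedules processors and persists queues for you; the point of the sketch is just the shape: data flows as discrete items through connected work units.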
[ 30 sec ]
Why would I use NiFi? Moving Data
What does it resemble? The production line
Don’t worry, you’ll see more later
[ 1 min ] [ 2:40 ]
So armed with that ridiculously brief overview, I want to talk about how NiFi can be used to facilitate the agile philosophy of the self-organising team
What is a self-organising team? One where the team defines the how, and management defines the what.
Why is this good? Well, let’s take an example:
Management says - at 3pm, take the red truck out the back, load all the apples and deliver across the bridge to the supermarket.
For this to go well, everything has to work, all the steps have to be known in advance.
What happens if … truck is blue? Take it? No fuel? Break down?
Putting the decisions into the hands of the people who are most qualified to make them - speeds up delivery and increases quality (often solving the problems in novel ways to boot)
Generally it increases quality and decreases time to completion
How does this work with Data Ingestion Pipelines?
[ 1 min ]
Using NiFi can help to speed up the process:
Less risk of losing perishable insights
Reduce costs
[ 2 min ]
You might have a very simplistic representation of the organisation like so
We’re agile, so we have cross-functional teams (maybe consisting of developers, sysadmins, data professionals and subject matter experts)
This depicts the information flow through the teams
This isn’t that special at the moment, looks like a normal ‘change-managed data backbone’ - everything goes through core
And anytime one of the teams needs a change, it goes through the core team.
For example, if the In Store team wants information from the Supply Chain team, they have to coordinate with core, who talks to Supply Chain, with all the complexities there.
It’s difficult to get core to prioritise your work because their requests come in thick and fast
Often leads to prioritisation by volume - or those who yell the loudest win
Often have to do testing in test infrastructure to verify that changes work
Then can submit change request
Only to find out that test isn’t actually like production, you exceed the change window and have to revert
Starting the change management cycle over again.
Even companies that don’t have these specific procedures are often bogged down in slow change cycles
BUT .. let’s assume these are all NiFi instances.
Website, core, in store etc all have their own NiFi instance - we haven’t changed the organisational structure at all
Just changed the tool that passes data around - so right now it looks the same. But let’s replay the scenario from above - In Store wants supply chain data
[ 1 min ]
Direct Connect
S2S
Easy to use once set up
Intuitive UI
Flow based
For simple data movement, anyone can use
[ 1 min ]
Team to Team - No Core
Direct connect - straight to the team in question
No core
No change requests minimal process
Team affected by change are the ones who implement it
Decisions made by people most well positioned to do so
[ 1 min ]
Individual to Individual
Just 2 people!
[ 1 min ]
Not just techies
Could theoretically be anyone on the team - not just the NiFi guru
Whoever is using the data, can get it
[ 1 min ]
Productionable
NiFi itself and these features are not gimmicks
They are the same robust, secure features used in all our deployments
[ 1 min ]
Immediate
Changes take effect immediately
Can quickly see and debug issues
[ 1 min ]
Not Just S2S!
[ 4 min ]
Flexible - almost any endpoint
Integrators dream!
Options include: files, Hadoop, Kafka, plain TCP, HTTP, JMS, CDC, WebSockets, email, HBase, MongoDB, SNMP, Solr, Splunk and even Twitter!
[ 1 min ]
Now I want to look at improvement across the organisation as a whole
You’ve seen the improvements that NiFi can bring to a team if all teams have their own NiFi instance that is under their control
But this requires NiFi everywhere, which is the case for only a handful of organisations
So how do we get there? Well we could just change overnight, right …? Mandate that all teams use NiFi? Pump huge amounts of money into looking at the potential risks, mitigating those, rolling out changes and all the traditional stuff? But ...
[ 2 min ]
[ .5 min ]
Here we have a traditional data movement pipeline
The Buyers are looking at historical sales trends and trying to see if their predictions were correct. For this to happen we need to go all the way back to the warehouse database:
Start at the database, the warehouse team want to get a report on all the items in the database
Ops probably does some sort of manual process - logging in to a firewalled machine, bringing up a shell and executing the required report
Then placed on the Shared Drive
The warehouse team pick this up and load it into Excel
Check that things look OK and pass to the Supply Chain team
However, supply chain don’t sit in the warehouse and use a different Shared Drive. So it’s placed on an SFTP server
It’s joined with other reports from other warehouses, reconciled and placed on the second HQ internal SAN
Picked up by the buyers
They don’t need to modify the report, simply ingest the data for analysis
Do so and pass the results to the business by email.
While obviously fictional, these sorts of ingestion pipelines are the norm, not the exception. They are especially hard to root out because each team is generally siloed and doesn’t have visibility of the process as a whole
Let’s pick a team that has decided to try out NiFi for automating some of this movement - the Supply Chain team.
We go ahead and install NiFi inside the Supply chain team
We change just one task at first: Using the FetchSFTP processor to watch the server and
pick up the warehouse report
And place it in a location where it can be worked on
So we have just automated one step
The employee looking after this step now knows that the file will appear, ready to be worked on
To simplify another step: once this file is received, we can send an email to the SC team notifying them of its arrival.
Now the warehouse employee is freed from doing that step.
Great! That’s two down. There’s still the manual analysis, but that’s what humans are good at, so let them do that.
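The automated step above can be sketched in plain Python. In NiFi it would be the FetchSFTP processor feeding something like PutFile and PutEmail; here the directory and file names are invented for illustration, and the "notification" is just appended to a list:

```python
import shutil
import tempfile
from pathlib import Path

def poll_and_deliver(drop_dir: Path, work_dir: Path, notifications: list) -> bool:
    """If a warehouse report has landed in drop_dir, move it to the team's
    working area and record a notification for the Supply Chain team."""
    reports = sorted(drop_dir.glob("warehouse_report_*.csv"))
    if not reports:
        return False  # nothing to do this polling cycle
    report = reports[0]
    dest = work_dir / report.name
    shutil.move(str(report), str(dest))
    notifications.append(f"Report {dest.name} is ready in {work_dir}")
    return True

# Demo with temporary directories standing in for the SFTP server and shared drive.
drop = Path(tempfile.mkdtemp())
work = Path(tempfile.mkdtemp())
(drop / "warehouse_report_2018-01.csv").write_text("sku,qty\nA1,10\n")
sent = []
moved = poll_and_deliver(drop, work, sent)
print(moved, sent)
```

The value of NiFi here is that nobody has to write or schedule this script: the same polling, moving and emailing is a couple of drag-and-drop processors.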
Now the warehouse guy comments that that is a cool thing
You say: well, if you had your own NiFi, you could do the same thing!
So you convince them to try setting up their own NiFi
At first, it just automates one step: moving the file to the FTP server
That’s great for the warehouse guys, one less thing
But now we can eliminate the clunky file → FTP → pickup → put-down chain with NiFi S2S
[ 10 sec ]
We’ve seen what benefits can come from the NiFi web
But why NiFI?
What features make it a good fit?
[ 2 min ]
Cross-functional teams - wide skill set
Want anyone to be able to come in with minimal training and be in control of their own data
Most people are aware of flow charts and Graphical web interfaces
Don’t need special software - web browser
Changes take effect immediately:
Allows faster feed-back cycles
Avoids long or specialised deployment cycles (e.g. must know how to use git, jenkins or pull requests)
Lots of good visual cues to help show where issues are and how to resolve them
[ 1:30 min ]
In many systems it’s difficult to get a snapshot of the stages
Would have to get a sample, transport to somewhere you could access and then probably download
This is often just more work for the developer
Fast feedback
Again fast feedback!
Can see exactly what has changed
(12:20 to here from slide 41)
So as I’ve mentioned previously, S2S is a very nice way to communicate with NiFi clusters
Only have to open one port
Easy to configure
Ties in nicely with the UI
Maintains Attributes!
Balances across clusters
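As a concrete illustration, enabling S2S on a receiving instance is typically just a handful of nifi.properties entries (the host name and port below are invented examples; only that one extra port needs to be reachable from the sender):

```properties
# Illustrative nifi.properties entries for Site-to-Site (example values)
nifi.remote.input.host=nifi-supplychain.example.com
nifi.remote.input.secure=true
nifi.remote.input.socket.port=10443
nifi.remote.input.http.enabled=true
```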
But as previously mentioned
Integrates with bloody everything!
Can see anyone who has made changes
Strong authorisation: you can be sure that the people you have authorised are the ones making the change
Can then reverse changes if required (no undo, but can just re-apply the old setting)
Can very tightly control who can do what on the system
E.g. have someone who can see the data but can’t move it, someone who can move but not see
Can now use process groups to group flows and secure those
Could have one group with access to one and one group with access to the other but no cross talk
Similar model to the NiFi Web but on one cluster
Has pros/cons but is possible
Make managing these things easier
Can get started in 10 minutes, but will also handle gigabytes and thousands of events per second
Can be secured