SlideShare a Scribd company logo
Dataflow	with	
Apache	NiFi
Aldrin	Piri	- @aldrinpiri
Apache	NiFi Crash	Course
Hadoop Summit	2016	– San	Jose
29	June	2016
2 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Key:	'Apache	NiFi’
Value:	'PMC	Member'
Key:	'Work’
Value:	’Sr.	Member	of	Technical	Staff	@	Hortonworks'
Key:	'Working	with	NiFi Since’
Value:	'2010’
3 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
4 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
5 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Let’s	Connect	A	to	B
Producers	A.K.A	Things
Anything
AND	
Everything
Internet!
Consumers
• User
• Storage
• System
• …More	Things
6 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Moving	data	effectively	is	hard
Standards:		http://xkcd.com/927/
7 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Why	is	moving	data	effectively	hard?	
à Standards
à Formats
à “Exactly	Once”	Delivery
à Protocols
à Veracity	of	Information
à Validity	of	Information
à Ensuring	Security
à Overcoming	Security
à Compliance
à Schemas
à Consumers	Change
à Credential	Management
à “That [person|team|group]”
à Network
à “Exactly	Once”	Delivery
8 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Let’s	Connect	Lots	of	As	to	Bs to	As	to	Cs	to	Bs to	Δs to	Cs	to	ϕs
Let’s	consider	the	needs	of	a	courier	service
Physical	Store
Gateway	
Server
Mobile	Devices
Registers
Server	Cluster
Distribution	Center Core	Data	Center	at	HQ
Server	Cluster
On	Delivery	Routes
Trucks Deliverers
Delivery	Truck:	 Creative	Stall,	https://thenounproject.com/creativestall/
Deliverer:	RigoPeter,	https://thenounproject.com/rigo/
Cash	Register:	Sergey	Patutin,	https://thenounproject.com/bdesign.by/
Hand	Scanner:	Eric	Pearson,	https://thenounproject.com/epearson001/
9 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Great!	I	am	collecting	all	this	data!		Let’s	use	it!
Finding	our	needles	in	the	haystack
Physical	Store
Gateway	
Server
Mobile	Devices
Registers
Server	Cluster
Distribution	Center
Kafka
Core	Data	Center	at	HQ
Server	Cluster
Others
Storm	/	Spark	/	
Flink /	Apex
Kafka
Storm	/	Spark	/	Flink /	Apex
On	Delivery	Routes
Trucks Deliverers
Delivery	Truck:	 Creative	Stall,	https://thenounproject.com/creativestall/
Deliverer:	RigoPeter,	https://thenounproject.com/rigo/
Cash	Register:	Sergey	Patutin,	https://thenounproject.com/bdesign.by/
Hand	Scanner:	Eric	Pearson,	https://thenounproject.com/epearson001/
10 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Why	is	moving	data	effectively	hard	when	scoped	internally?	
à Standards
à Formats
à “Exactly	Once”	Delivery
à Protocols
à Veracity	of	Information
à Validity	of	Information
à Ensuring	Security
à Overcoming	Security
à Compliance
à Schemas
à Consumers	Change
à Credential	Management
à “That [person|team|group]”
à Network
à “Exactly	Once”	Delivery
11 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Let’s	Connect	Lots	of	As	to	Bs to	As	to	Cs	to	Bs to	Δs to	Cs	to	ϕs
Oh,	that	courier	service	is	global
12 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Why	is	moving	data	effectively	hard	when	scoped	globally?	
à Standards
à Formats
à “Exactly	Once”	Delivery
à Protocols
à Veracity	of	Information
à Validity	of	Information
à Ensuring	Security
à Overcoming	Security
à Compliance
à Schemas
à Consumers	Change
à Credential	Management
à “That [person|team|group]”
à Network
à “Exactly	Once”	Delivery
13 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
The	Unassuming	Line:		A	Case	Study
We’ve	seen	a	few	lines	show	up	in	the	wild	thus	far
Internet! Inter- &	Intra- connections	in
our	global	courier	enterprise
Spotlight:	Arthur	Lacôte,	https://thenounproject.com/turo/
14 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Dataflow	Line	Anatomy	101
Let’s	dissect	what	this	line	typically	represents
Fig	1.		Lineus Worldwidewebus.	Common	Name:	Internet!
Script	or	
Application
Script	or	
Application
Data Data
Disparate	Transport
Mechanisms
15 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Dataflow	Line	Anatomy	201
Sometimes	that	transport	is	just	more	lines
Fig	1.		Lineus Worldwidewebus.	Common	Name:	Internet!
Script	or	
Application
Script	or	
Application
Line	Inception
Data Data
16 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Dataflow	Line	Anatomy	301
But	those	lines	could	also	have	components…
Fig	1.		Lineus Worldwidewebus.	Common	Name:	Internet! Fig	2.		Good Recursion	Joke
NoSuchJokeException
footage	not	found
17 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
18 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Apache	NiFi
Key	Features
• Guaranteed	delivery
• Data	buffering	
- Backpressure
- Pressure	release
• Prioritized	queuing
• Flow	specific	QoS
- Latency	vs.	throughput
- Loss	tolerance
• Data	provenance
• Supports	push	and	pull	
models
• Recovery/recording	
a	rolling	log	of	fine-
grained	history
• Visual	command	and	
control
• Flow	templates
• Pluggable/multi-role	
security
• Designed	for	extension
• Clustering
19 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Apache	NiFi Subproject:	MiNiFi
à Let	me	get	the	key	parts	of	NiFi close	to	where	data	begins	and	provide	bidrectional
communication
à NiFi lives	in	the	data	center.		Give	it	an	enterprise	server	or	a	cluster	of	them.
à MiNiFi lives	as	close	to	where	data	is	born	and	is	a	guest	on	that	device	or	system
20 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Let’s	revisit	our	courier	service	from	the	perspective	of	NiFi
Physical	Store
Gateway	
Server
Mobile	Devices
Registers
Server	Cluster
Distribution	Center
Kafka
Core	Data	Center	at	HQ
Server	Cluster
Others
Storm	/	Spark	/	
Flink /	Apex
Kafka
Storm	/	Spark	/	Flink /	Apex
On	Delivery	Routes
Trucks Deliverers
Delivery	Truck:	 Creative	Stall,	https://thenounproject.com/creativestall/
Deliverer:	RigoPeter,	https://thenounproject.com/rigo/
Cash	Register:	Sergey	Patutin,	https://thenounproject.com/bdesign.by/
Hand	Scanner:	Eric	Pearson,	https://thenounproject.com/epearson001/
Client	
Libraries
Client	
Libraries
MiNiFi
MiNiFi
NiFi NiFi NiFi NiFi NiFi NiFi
Client	
Libraries
21 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Apache	NiFi Managed	Dataflow
SOURCES
REGIONAL	
INFRASTRUCTURE
CORE	
INFRASTRUCTURE
22 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
NiFi is	based	on	Flow	Based	Programming	(FBP)
FBP	Term NiFi Term Description
Information	
Packet
FlowFile Each object	moving	through	the	system.
Black Box FlowFile	
Processor
Performs	the	work, doing	some	combination	of	data	routing,	transformation,	
or	mediation	between	systems.
Bounded	
Buffer
Connection The	linkage between	processors,acting	as	queues	and	allowing	various	
processes	to	interact	at	differing	rates.
Scheduler Flow	
Controller
Maintains	the	knowledge	of	how	processes	are	connected, and	manages	the	
threads	and	allocations	thereof	which	all	processes	use.
Subnet Process	
Group
A	set	of	processes	and	their	connections,	which	can	receive	and	send	data	via	
ports.	A	process group	allows	creation	of	entirely	new	component	simply	by	
composition	of	its components.
23 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
FlowFiles &	Data	Agnosticism
à NiFi is	data	agnostic!
à But,	NiFi was	designed	understanding	that	users
can	care	about	specifics	and	provides	tooling	
to	interact	with	specific	formats,	protocols,	etc.
ISO	8601	- http://xkcd.com/1179/
Robustness	principle
Be	conservative	in	what	you	do,	
be	liberal	in	what	you	accept	from	others“
24 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
FlowFiles are	like	HTTP	data
HTTP	Data FlowFile
HTTP/1.1	200	OK
Date:	Sun,	10	Oct	2010	23:26:07	GMT
Server:	Apache/2.2.8	(CentOS)	OpenSSL/0.9.8g
Last-Modified:	Sun,	26	Sep	2010	22:04:35	GMT
ETag:	"45b6-834-49130cc1182c0"
Accept-Ranges:	bytes
Content-Length:	13
Connection:	 close
Content-Type:	text/html
Hello	world!
Standard	FlowFile Attributes
Key:	'entryDate’ Value:	'Fri	Jun	17	17:15:04	EDT	2016'
Key:	'lineageStartDate’			Value:	'Fri	Jun	17	17:15:04	EDT	2016'
Key:	'fileSize’ Value:	'23609'
FlowFile Attribute	Map	Content
Key:	'filename’ Value:	'15650246997242'
Key:	'path’ Value:	'./’
Binary	Content	*
Header
Content
25 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
26 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Extension	/	Integration	Points
NiFi Term Description
Flow File	
Processor
Push/Pull behavior.		Custom	UI
Reporting
Task
Used to	push	data	from	NiFi to	some	external	service	(metrics,	provenance,	
etc..)
Controller	
Service
Used	to	enable	reusable	components	/ shared	services	throughout	the	flow
REST	API Allows	clients	to	connect	to	pull	information,	change	behavior,	etc..
27 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
Architecture* OS/Host
JVM
NiFi	Cluster	Manger	– Request	Replicator
Web	Server
Master
NiFi	Cluster	
Manager	(NCM)
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile
Repository
Content
Repository
Provenance
Repository
Local	Storage
Slaves
NiFi	Nodes
28 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
NiFi	Architecture	– Repositories	- Pass	by	reference
FlowFile Content Provenance
F1à C1 C1 P1à F1
Excerpt	of	demo	flow… What’s	happening	inside	the	repositories…
BEFORE
AFTER
F2à C1 C1 P3à F2 – Clone	(F1)
F1à C1 P2à F1 – Route	
P1à F1 – Create
29 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
NiFi	Architecture	– Repositories	– Copy	on	Write
FlowFile Content Provenance
F1à C1 C1 P1à F1	- CREATE
Excerpt	of	demo	flow… What’s	happening	inside	the	repositories…
BEFORE
AFTER
F1à C1
F1.1à C2 C2	(encrypted)
C1	(plaintext)
P2à F1.1 - MODIFY
P1à F1	- CREATE
30 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Demo
Community
31 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Learn,	Share	at	Birds	of	a	Feather
Streaming,	DataFlow &	Cybersecurity
Thursday	June	30
6:30	pm,	Ballroom	C
32 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Why	NiFi?
à Moving	data	is	multifaceted	in	its	challenges	and	these	are	present	in	different	contexts	
at	varying	scopes
– Think	of	our	courier	example	and	organizations	like	it:	inter	vs intra,	domestically,	internationally
à Provide	common	tooling	and	extensions	that	are	commonly	needed	but	be	flexible	for	
extension
– Leverage	existing	libraries	and	expansive	Java	ecosystem	for	functionality
– Allow	organizations	to	integrate	with	their	existing	infrastructure	
à Empower	folks	managing	your	infrastructure	to	make	changes	and	reason	about	issues	
that	are	occurring
– Data	Provenance	to	show	context	and	data’s	journey
– User	Interface/Experience	a	key	component
33 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Learn	more	and	join	us!
Apache NiFi site
http://nifi.apache.org
Subproject MiNiFi site
http://nifi.apache.org/minifi/
Subscribe to and collaborate at
dev@nifi.apache.org
users@nifi.apache.org
Submit Ideas or Issues
https://issues.apache.org/jira/browse/NIFI
Follow us on Twitter
@apachenifi
34 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Our	Lab	for	Today
à We	will	be	exploring	some	examples	to	work	through	creating	a	dataflow	with	Apache	
NiFi
à Use	Case:			An	urban	planning	board	is	evaluating	the	need	for	a	new	highway,	
dependent	on	current	traffic	patterns,	particularly	as	other	roadwork	initiatives	are	
under	way.	Integrating	live	data	poses	a	problem	because	traffic	analysis	has	
traditionally	been	done	using	historical,	aggregated	traffic	counts.	To	improve	traffic	
analysis,	the	city	planner	wants	to	leverage	real-time	data	to	get	a	deeper	understanding	
of	traffic	patterns.	NiFi was	selected	for	for	this	real-time	data	integration.
à Labs	are	available	at	http://tinyurl.com/nificrashcourse
35 ©	Hortonworks	Inc.	2011	–2016.	All	Rights	Reserved
Thank	You

More Related Content

What's hot

Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Welcome to Apache Hadoop's Teenage Years, Arun Murthy KeynoteWelcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
DataWorks Summit/Hadoop Summit
 
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFiIntelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
DataWorks Summit
 
Apache NiFi Meetup - Princeton NJ 2016
Apache NiFi Meetup - Princeton NJ 2016Apache NiFi Meetup - Princeton NJ 2016
Apache NiFi Meetup - Princeton NJ 2016
Timothy Spann
 
Apache NiFi: Ingesting Enterprise Data At Scale
Apache NiFi:   Ingesting Enterprise Data At Scale Apache NiFi:   Ingesting Enterprise Data At Scale
Apache NiFi: Ingesting Enterprise Data At Scale
Timothy Spann
 
Data Science with Apache Spark - Crash Course - HS16SJ
Data Science with Apache Spark - Crash Course - HS16SJData Science with Apache Spark - Crash Course - HS16SJ
Data Science with Apache Spark - Crash Course - HS16SJ
DataWorks Summit/Hadoop Summit
 
REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...
REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...
REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...
Timothy Spann
 
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFiReal-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Timothy Spann
 
Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...
DataWorks Summit/Hadoop Summit
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
HDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi IntroductionHDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi Introduction
Milind Pandit
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
Timothy Spann
 
What the #$* is a Business Catalog and why you need it
What the #$* is a Business Catalog and why you need it What the #$* is a Business Catalog and why you need it
What the #$* is a Business Catalog and why you need it
DataWorks Summit/Hadoop Summit
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
DataWorks Summit
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
Hortonworks
 
Apache Atlas: Governance for your Data
Apache Atlas: Governance for your DataApache Atlas: Governance for your Data
Apache Atlas: Governance for your Data
DataWorks Summit/Hadoop Summit
 
Apache NiFi 1.0 in Nutshell
Apache NiFi 1.0 in NutshellApache NiFi 1.0 in Nutshell
Apache NiFi 1.0 in Nutshell
DataWorks Summit/Hadoop Summit
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Hortonworks
 
SparkR Best Practices for R Data Scientists
SparkR Best Practices for R Data ScientistsSparkR Best Practices for R Data Scientists
SparkR Best Practices for R Data Scientists
DataWorks Summit
 
Intro to Spark with Zeppelin
Intro to Spark with ZeppelinIntro to Spark with Zeppelin
Intro to Spark with Zeppelin
Hortonworks
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Aldrin Piri
 

What's hot (20)

Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Welcome to Apache Hadoop's Teenage Years, Arun Murthy KeynoteWelcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
Welcome to Apache Hadoop's Teenage Years, Arun Murthy Keynote
 
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFiIntelligently Collecting Data at the Edge - Intro to Apache MiNiFi
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
 
Apache NiFi Meetup - Princeton NJ 2016
Apache NiFi Meetup - Princeton NJ 2016Apache NiFi Meetup - Princeton NJ 2016
Apache NiFi Meetup - Princeton NJ 2016
 
Apache NiFi: Ingesting Enterprise Data At Scale
Apache NiFi:   Ingesting Enterprise Data At Scale Apache NiFi:   Ingesting Enterprise Data At Scale
Apache NiFi: Ingesting Enterprise Data At Scale
 
Data Science with Apache Spark - Crash Course - HS16SJ
Data Science with Apache Spark - Crash Course - HS16SJData Science with Apache Spark - Crash Course - HS16SJ
Data Science with Apache Spark - Crash Course - HS16SJ
 
REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...
REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...
REAL-TIME INGESTING AND TRANSFORMING SENSOR DATA & SOCIAL DATA w/ NIFI + TENS...
 
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFiReal-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
 
Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...Automatic Detection, Classification and Authorization of Sensitive Personal D...
Automatic Detection, Classification and Authorization of Sensitive Personal D...
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
HDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi IntroductionHDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi Introduction
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
What the #$* is a Business Catalog and why you need it
What the #$* is a Business Catalog and why you need it What the #$* is a Business Catalog and why you need it
What the #$* is a Business Catalog and why you need it
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
 
Apache Atlas: Governance for your Data
Apache Atlas: Governance for your DataApache Atlas: Governance for your Data
Apache Atlas: Governance for your Data
 
Apache NiFi 1.0 in Nutshell
Apache NiFi 1.0 in NutshellApache NiFi 1.0 in Nutshell
Apache NiFi 1.0 in Nutshell
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 
SparkR Best Practices for R Data Scientists
SparkR Best Practices for R Data ScientistsSparkR Best Practices for R Data Scientists
SparkR Best Practices for R Data Scientists
 
Intro to Spark with Zeppelin
Intro to Spark with ZeppelinIntro to Spark with Zeppelin
Intro to Spark with Zeppelin
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
 

Similar to Apache NiFi Crash Course San Jose Hadoop Summit

Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
Apache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop SummitApache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop Summit
Aldrin Piri
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
Joe Percivall
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
DataWorks Summit/Hadoop Summit
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
DataWorks Summit
 
Apache NiFi Crash Course Intro
Apache NiFi Crash Course IntroApache NiFi Crash Course Intro
Apache NiFi Crash Course Intro
DataWorks Summit/Hadoop Summit
 
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
DataWorks Summit/Hadoop Summit
 
Using Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising TeamsUsing Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising Teams
Sebastian Carroll
 
Hadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash CourseHadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash Course
DataWorks Summit/Hadoop Summit
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Aldrin Piri
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
DataWorks Summit
 
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
Raúl Marín
 
Introduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability MeetupIntroduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability Meetup
Saptak Sen
 
NiFi Best Practices for the Enterprise
NiFi Best Practices for the EnterpriseNiFi Best Practices for the Enterprise
NiFi Best Practices for the Enterprise
Gregory Keys
 
そのデータフロー NiFiで楽にしてあげましょう
そのデータフロー NiFiで楽にしてあげましょうそのデータフロー NiFiで楽にしてあげましょう
そのデータフロー NiFiで楽にしてあげましょう
Koji Kawamura
 
You Can't Search Without Data
You Can't Search Without DataYou Can't Search Without Data
You Can't Search Without Data
Bryan Bende
 
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
DataWorks Summit/Hadoop Summit
 
Apache NiFi + Tensorflow + Hadoop: Big Data AI サンドイッチの作り方
Apache NiFi + Tensorflow + Hadoop:Big Data AI サンドイッチの作り方Apache NiFi + Tensorflow + Hadoop:Big Data AI サンドイッチの作り方
Apache NiFi + Tensorflow + Hadoop: Big Data AI サンドイッチの作り方
HortonworksJapan
 
Intelligently collecting data at the edge—intro to Apache MiNiFi
Intelligently collecting data at the edge—intro to Apache MiNiFiIntelligently collecting data at the edge—intro to Apache MiNiFi
Intelligently collecting data at the edge—intro to Apache MiNiFi
DataWorks Summit
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Data Con LA
 

Similar to Apache NiFi Crash Course San Jose Hadoop Summit (20)

Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Apache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop SummitApache NiFi Crash Course - San Jose Hadoop Summit
Apache NiFi Crash Course - San Jose Hadoop Summit
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
 
Apache NiFi Crash Course Intro
Apache NiFi Crash Course IntroApache NiFi Crash Course Intro
Apache NiFi Crash Course Intro
 
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
 
Using Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising TeamsUsing Apache® NiFi to Empower Self-Organising Teams
Using Apache® NiFi to Empower Self-Organising Teams
 
Hadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash CourseHadoop Summit Tokyo Apache NiFi Crash Course
Hadoop Summit Tokyo Apache NiFi Crash Course
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
 
Introduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability MeetupIntroduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability Meetup
 
NiFi Best Practices for the Enterprise
NiFi Best Practices for the EnterpriseNiFi Best Practices for the Enterprise
NiFi Best Practices for the Enterprise
 
そのデータフロー NiFiで楽にしてあげましょう
そのデータフロー NiFiで楽にしてあげましょうそのデータフロー NiFiで楽にしてあげましょう
そのデータフロー NiFiで楽にしてあげましょう
 
You Can't Search Without Data
You Can't Search Without DataYou Can't Search Without Data
You Can't Search Without Data
 
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + Livy: Bringing Multi Tenancy to Interactive Data Analysis
 
Apache NiFi + Tensorflow + Hadoop: Big Data AI サンドイッチの作り方
Apache NiFi + Tensorflow + Hadoop:Big Data AI サンドイッチの作り方Apache NiFi + Tensorflow + Hadoop:Big Data AI サンドイッチの作り方
Apache NiFi + Tensorflow + Hadoop: Big Data AI サンドイッチの作り方
 
Intelligently collecting data at the edge—intro to Apache MiNiFi
Intelligently collecting data at the edge—intro to Apache MiNiFiIntelligently collecting data at the edge—intro to Apache MiNiFi
Intelligently collecting data at the edge—intro to Apache MiNiFi
 
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
 

Recently uploaded

在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样
在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样
在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样
v6ldcxuq
 
Wayanad-The-Touristry-Heaven to the tour.pptx
Wayanad-The-Touristry-Heaven to the tour.pptxWayanad-The-Touristry-Heaven to the tour.pptx
Wayanad-The-Touristry-Heaven to the tour.pptx
cosmo-soil
 
Assessing the Influence of Transportation on the Tourism Industry in Nigeria
Assessing the Influence of Transportation on the  Tourism Industry in NigeriaAssessing the Influence of Transportation on the  Tourism Industry in Nigeria
Assessing the Influence of Transportation on the Tourism Industry in Nigeria
gsochially
 
The Power of a Glamping Go-To-Market Accelerator Plan.pptx
The Power of a Glamping Go-To-Market Accelerator Plan.pptxThe Power of a Glamping Go-To-Market Accelerator Plan.pptx
The Power of a Glamping Go-To-Market Accelerator Plan.pptx
RezStream
 
How To Talk To a Live Person at American Airlines
How To Talk To a Live Person at American AirlinesHow To Talk To a Live Person at American Airlines
How To Talk To a Live Person at American Airlines
flyn goo
 
Uk Visa Complete Guide and application process
Uk Visa Complete Guide and application processUk Visa Complete Guide and application process
Uk Visa Complete Guide and application process
pandeypratikwgblindi
 
Hidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETS
Hidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETSHidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETS
Hidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETS
Kamil Uğraş TÜRKOĞLU
 

Recently uploaded (7)

在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样
在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样
在线办理(BU毕业证书)波士顿大学毕业证录取通知书一模一样
 
Wayanad-The-Touristry-Heaven to the tour.pptx
Wayanad-The-Touristry-Heaven to the tour.pptxWayanad-The-Touristry-Heaven to the tour.pptx
Wayanad-The-Touristry-Heaven to the tour.pptx
 
Assessing the Influence of Transportation on the Tourism Industry in Nigeria
Assessing the Influence of Transportation on the  Tourism Industry in NigeriaAssessing the Influence of Transportation on the  Tourism Industry in Nigeria
Assessing the Influence of Transportation on the Tourism Industry in Nigeria
 
The Power of a Glamping Go-To-Market Accelerator Plan.pptx
The Power of a Glamping Go-To-Market Accelerator Plan.pptxThe Power of a Glamping Go-To-Market Accelerator Plan.pptx
The Power of a Glamping Go-To-Market Accelerator Plan.pptx
 
How To Talk To a Live Person at American Airlines
How To Talk To a Live Person at American AirlinesHow To Talk To a Live Person at American Airlines
How To Talk To a Live Person at American Airlines
 
Uk Visa Complete Guide and application process
Uk Visa Complete Guide and application processUk Visa Complete Guide and application process
Uk Visa Complete Guide and application process
 
Hidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETS
Hidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETSHidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETS
Hidden Gems of Europe - DISCOVERING THE CONTINENT'S BEST-KEPT SECRETS
 

Apache NiFi Crash Course San Jose Hadoop Summit