Design a Dataflow in 7 minutes with Apache NiFi/HDF

•Download as PPTX, PDF•

16 likes•12,107 views

Hortonworks

How to create a real-time dataflow in 7 Minutes with Hortonworks DataFlow, powered by Apache NiFi”.

Technology

1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Create a live dataflow in minutes
How would that change your business?

2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Add processor for data intake. Time: 1 minute
1 Drag and drop processor from top menu

3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Choose the specific processor
2 Choose one of the processors – currently 170+ available

4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Example: Pick Twitter Processor

5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Configure the processor. Time: 2 minutes
3
4
Select processor and choose
option to Configure
Adjust
parameters as
required

6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Another processor for data output. Time: 1 minute
5
6 Filter for and select a “Put” processor
Drag and drop processor from top menu

7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Configure second processor. Time: 1 minute
7 Configure 2nd processor

8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Connect processors, configure connection. 2 minutes
Configure Connection8
Note: Sample Flow is different from previous example of PutHDFS. This dataflow is PutFile. Same concepts apply.

9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Click Start to Begin Processing. Time total: 7 minutes
9 Click start “play” to begin processing
(will run continuously until you select stop)

10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
See Processors Update with Real Time Changes
10 As data flows, GUI interface updates in real time.
11 If destination is stopped or unable to receive, queue builds

11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Dynamically adjust and tune data flow as needed
12
Dynamically configure/ start/ stop/ tune/
reroute change/ pause dataflows as needed.

12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Powerful Tools to Quickly Replicate, Group, Repurpose, Tune and Test
in Real-Time
13
14 Create a new template
Group multiple processes together to create a process group

13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Provenance Means
Real-Time Traceability of:
Data Flow
Data Content
Data Context

14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Watch Real Time Flow of Data: Data Provenance
Select Data Provenance15

15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Trace Lineage of a Particular Piece of Data
Icon for Data Lineage16

16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Every Change to Data is Tracked in Real-Time: processing, views
Every event is traceable
17

17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Real-Time Updates of Dataflow: Traceable Context & Content
Know immediately both context and content18

18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Easily access and trace changes to dataflow

19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Audit trail of Hortonworks DataFlow User Actions

20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Questions?
Hortonworks Community Connection:
Data Ingestion and Streaming
https://community.hortonworks.com/

Viewers also liked

Apache NiFi- MiNiFi meetup SlidesIsheeta Sanghi

Apache NiFi in the Hadoop Ecosystem DataWorks Summit/Hadoop Summit

HDF: Hortonworks DataFlow: Technical WorkshopHortonworks

Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks

Real-Time Data Flows with Apache NiFiManish Gupta

Webinar Series Part 5 New Features of HDF 5Hortonworks

Apache NiFi Toronto MeetupHortonworks

Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks

Hortonworks Data in Motion Webinar Series - Part 1Hortonworks

Taking DataFlow Management to the Edge with Apache NiFi/MiNiFiBryan Bende

Hortonworks Data In Motion Series Part 4Hortonworks

Integrating Apache Spark and NiFi for Data LakesDataWorks Summit/Hadoop Summit

Dynamic Column Masking and Row-Level Filtering in HDPHortonworks

Enabling the Real Time Analytical EnterpriseHortonworks

Double Your Hadoop Hardware Performance with SmartSenseHortonworks

KBM Equipamentos Agrícolaskbm_br

Admiral GroupDataWorks Summit/Hadoop Summit

Beyond Messaging Enterprise Dataflow powered by Apache NiFiIsheeta Sanghi

Extendible data model for real-time business process analysisMarcello Leida

Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big DataMats Johansson

Viewers also liked (20)

Apache NiFi- MiNiFi meetup Slides

Apache NiFi in the Hadoop Ecosystem

HDF: Hortonworks DataFlow: Technical Workshop

Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...

Real-Time Data Flows with Apache NiFi

Webinar Series Part 5 New Features of HDF 5

Apache NiFi Toronto Meetup

Hortonworks Data In Motion Webinar Series Pt. 2

Hortonworks Data in Motion Webinar Series - Part 1

Taking DataFlow Management to the Edge with Apache NiFi/MiNiFi

Hortonworks Data In Motion Series Part 4

Integrating Apache Spark and NiFi for Data Lakes

Dynamic Column Masking and Row-Level Filtering in HDP

Enabling the Real Time Analytical Enterprise

Double Your Hadoop Hardware Performance with SmartSense

KBM Equipamentos Agrícolas

Admiral Group

Beyond Messaging Enterprise Dataflow powered by Apache NiFi

Extendible data model for real-time business process analysis

Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data

Similar to Design a Dataflow in 7 minutes with Apache NiFi/HDF

Introduction to Apache NiFi - Seattle Scalability MeetupSaptak Sen

Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFIHaimo Liu

Unlocking insights in streaming dataCarolyn Duby

HDF Powered by Apache NiFi IntroductionMilind Pandit

Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseHortonworks

Using Apache® NiFi to Empower Self-Organising TeamsSebastian Carroll

[Hortonworks] Future Of Data: Madrid - HDF & Data in motionRaúl Marín

Hadoop Operations - Past, Present, and FutureDataWorks Summit

Hive Performance Dataworks Summit Melbourne February 2019alanfgates

Fast SQL on Hadoop, Really?DataWorks Summit

Streamline - Stream Analytics for EveryoneDataWorks Summit/Hadoop Summit

NJ Hadoop Meetup - Apache NiFi Deep DiveBryan Bende

Hadoop Summit Tokyo Apache NiFi Crash CourseDataWorks Summit/Hadoop Summit

Hadoop & devOps : better togetherMaxime Lanciaux

Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...Data Con LA

Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseDataWorks Summit

Druid: Sub-Second OLAP queries over Petabytes of Streaming DataDataWorks Summit

そのデータフロー NiFiで楽にしてあげましょうKoji Kawamura

Log Analytics OptimizationHortonworks

Log Analytics OptimizationIsheeta Sanghi

Similar to Design a Dataflow in 7 minutes with Apache NiFi/HDF (20)

Introduction to Apache NiFi - Seattle Scalability Meetup

Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI

Unlocking insights in streaming data

HDF Powered by Apache NiFi Introduction

Streamline Apache Hadoop Operations with Apache Ambari and SmartSense

Using Apache® NiFi to Empower Self-Organising Teams

[Hortonworks] Future Of Data: Madrid - HDF & Data in motion

Hadoop Operations - Past, Present, and Future

Hive Performance Dataworks Summit Melbourne February 2019

Fast SQL on Hadoop, Really?

Streamline - Stream Analytics for Everyone

NJ Hadoop Meetup - Apache NiFi Deep Dive

Hadoop Summit Tokyo Apache NiFi Crash Course

Hadoop & devOps : better together

Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...

Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise

Druid: Sub-Second OLAP queries over Petabytes of Streaming Data

そのデータフロー NiFiで楽にしてあげましょう

Log Analytics Optimization

Recently uploaded

GenAI Risks & Security Meetup 01052024.pdflior mazor

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

A Domino Admins Adventures (Engage 2024)Gabriella Davis

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Real Time Object Detection Using Open CVKhem

Partners Life - Insurer Innovation Award 2024The Digital Insurer

Why Teams call analytics are critical to your entire businesspanagenda

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

Recently uploaded (20)

GenAI Risks & Security Meetup 01052024.pdf

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

A Domino Admins Adventures (Engage 2024)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Boost PC performance: How more available memory can improve productivity

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving

Scaling API-first – The story of a global engineering organization

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams

presentation ICT roal in 21st century education

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Real Time Object Detection Using Open CV

Partners Life - Insurer Innovation Award 2024

Why Teams call analytics are critical to your entire business

Powerful Google developer tools for immediate impact! (2023-24 C)

Exploring the Future Potential of AI-Enabled Smartphone Processors

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...

AWS Community Day CPH - Three problems of Terraform

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Artificial Intelligence Chap.5 : Uncertainty