Entity Extraction from Natural Language Text using Apache NiFi and Idyl E3
Entity Extraction from Natural Language Text in a
Data Flow Pipeline
Copyright 2017 Mountain Fog, Inc. All Rights Reserved.
Tools in Use
● Apache NiFi
● Facilitates data flow
sources and services.
● Idyl E3 Entity Extraction
● Extracts entities from natural
language text via user-
generated entity models
through a REST API.
● Ingest text files from the file system.
● Extract entities from the text.
● Store entities in a MongoDB database.
Apache NiFi Dataflow
Idyl E3 Entity Extraction Engine
Launch via the AWS Marketplace.
Comes with an entity model for English-language
REST API for entity extraction.
A processor executes the dataflow work “of data
routing, transformation, or mediation between
We are using:
Ingest Text Files
Processor’s properties set to read all files from /ingest.
Send Text to Idyl E3
The Idyl E3 endpoint is set in the processor’s properties.
Store Entities in MongoDB
The MongoDB URI set in the processor’s properties.