SlideShare a Scribd company logo
Industrial Natural Language Processing &
Information Extraction
Industrial Natural Language
Processing
3
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Overview
NLP
NLU
summarization
semantic
parsing
sentiment
analysis
dialogue
agents
natural
language
inference
question
answering
machine
translation
text
categorization
syntactic
parsing
POS
tagging
keyword
extraction
named
entity
recognition
topic
recognition
Natural Language Processing
Developing and applying techniques
and methods for the automatic
processing of text
Industrial Natural Language
Processing
Developing and applying techniques
and methods for the automatic
processing of text in industry by
explicitly considering the
requirements and circumstances of
industrial environments
4
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Research Goals
Reliable application and deployment of NLP in industrial
environments
Anonymization of textual data in order to be able to forward
it to third parties
Exploration of new areas for the use of natural language
processing in the wild
5
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
NLP in Industrial Environments
Reliable application and deployment of NLP in industrial
environments
 Design, develop and evaluate software architectures to make NLP useable
by non-technical users
 Improve the process of deploying and using NLP in industrial environments
 Analyze textual data based on state-of-the-art NLP approaches
Software Architectures Usability Analyze
6
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Anonymization
Anonymization of textual data in order to be able to forward
it to third parties
 The analysis of unstructured company data in the cloud is either undesired
by the company itself or even forbidden by law (DSGVO)
 Cloud services provide more accurate and sophisticated analytics
 Develop new anonymization approaches by using machine learning as the
current approaches are too inaccurate
7
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Exploiting new Application Domains
Exploration of new areas for the use of natural language
processing in the wild
 Most of the data that is available comprises unstructured data and especially
textual documents
 Identify available data sources and derive meaningful use cases from it
 Develop appropriate models and applications for the identified use cases that
generate an additional value
Identify Data Sources Derive Use Cases Explore possible Solutions
Selected Research Projects
and Applications
9
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Integrating External Data into an Enterprise Information System
External Information Extraction Tool
 Personalized quick and easy access to a large amount of
data from several different sources within a single tool
 Identifying relevant data sources (e.g., new websites,
social media, internal enterprise data)
 Integrating data into a common data storage
 Creation of dedicated analytical services for specific user
requirements, like
 Natural Language Processing
 Translations
 Overview of business knowledge graph
 Sentiment analysis
 Recommendation of relevant data
Results
 Integrating internal & external data into an enterprise
information system to gain faster insights into changing
markets, relations etc.
Approach
News Websites Social Media
Enterprise Information System
Information
Data
Goal
10
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Utilizing Textual Maintenance Data from Production
Maintenance Data Insights Tool
 Tool for
 assisted generation of maintenance report texts
 supported finding of solutions
 visualization of errors and costs
 Extracting textual reports from maintenance staff
 Classify text into description of symptoms, causes and
solutions
 Calculation of relevant statistics
 Creation of dedicated analytical services for staff and
decision makers, like
 Occurrence of similar error descriptions over time and
location
 Costs per machine location
 Troubleshooting proposal for specified symptoms
Results
 Utilizing unstructured textual information from machines’
maintenance protocols to gain insights and optimize
processes
Approach
Maintenance Data Platform
Information
Goal
„… defect, please check“
„… part was exchanged“
„… machine losing oil“
„… spare part ordered “
Solution Hints
11
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Anonymization of Enterprise Documents using the Cloud
Hybrid Anonymizer
 Functional hybrid system for the automatic
anonymization/pseudonymization of textual data
 Enabled the use of cloud analysis for textual documents
 Development of an anonymization approach based on
predefined rules and deep learning
 Implementation and testing of the hybrid anonymizer
 Deployment of the anonymizer within the customers
ecosystem
 Methods: Natural Language Processing, Deep Learning,
Micro-Service Architecture
Results
 Enable the usage of cloud services for data processing and
analysis without revealing sensitive information
Approach
Goal
12
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
AISLE – Support learning academic phrases
AISLE
 Web platform that is actively used by students to improve
their vocabulary
 User studies showed the system's positive impact on
vocabulary growth
 Construct a large domain and target group specific text
corpus using NLP methods
 Use recent methods in the area of natural language
processing for extracting and evaluating words and
phrases based on their relevance
 Development of an adaptive learning system to improve
vocabulary on the basis of a developed learning algorithm
and the built up corpora
Results
 Support students at the beginning of their studies in reading
and understanding scientific publications
Approach
Goal
Interact
Enter
Word:
Vocabulary
Size
Evaluate &
Select Words
View
Results
Analyze
Results
Information Extraction
14
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Overview
POS
tagging
Unstructured
Data
Information Extraction
“… Application of methods from practical computer science,
artificial intelligence and computational linguistics to the
problem of automatic machine processing of unstructured
information … ” Source: Wikipedia
Different Types of
Unstructured Data
named
entity
recognition
Data-Specific
Processing
Structured
Data
Structured
Datanamed
entity
recognition
Data
Analysis
Results
15
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Research Goals
Leveraging of machine learning techniques to improve
information extraction
Transformation of unstructured data into useful structured
information and knowledge
16
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Structuring unstructured data
Transformation of unstructured data into useful structured
information and knowledge
 Identify all the relevant information that need to be extracted
 Identify approaches for extracting information from unstructured data and
turning it into valuable knowledge
 Develop processing pipelines to automatically extract the identified
information and make them accessible in a structured way
Identify Information Choose Approaches
Develop Processing
Pipeline
17
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Machine Learning based Information Extraction
Leveraging of machine learning techniques to improve
information extraction
Combine Machine Learning &
Classical Approaches
Data Annotation Model Training & Refinement
 Combine or substitute classical information extraction approaches with
machine learning
 Development of tools to improve the process of annotating unstructured data
in order to create a suitable data set for the training of ML models
 Development & Refinement of ML models
Selected Research Projects
and Applications
19
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Structuring PDF Documents
PDF Analyzer
 Tool for identifying document elements in PDF files
 Header
 Text body
 Tables
 Figures
 Formulas and Algorithms
 First approaches for deriving information from diagrams
and tables exist
 Classifying diagram types
 Extracting values, axis labels etc.
 Consider additional context
 Using Deep Learning (CNNs) to detect different elements
within PDF documents
 Extract additional information from diagrams and tables
for further processing
Results
 Structuring of unstructured PDF documents to extract
additional information and prepare the data for further
analytics
Approach
Goal
Your Contact Person:
André Pomp, M.Sc.
Tel: +49 (0)202 439 1153
pomp@uni-wuppertal.de
Chair for Technologies and Management of Digital Transformation
Univ. Prof. Dr. Ing. Tobias Meisen
https://www.tmdt.uni-wuppertal.de/
Campus Freudenberg
Rainer-Gruenter-Str. 21
D-42119 Wuppertal
Germany
University of Wuppertal
School of Electrical, Information and Media Engineering

More Related Content

What's hot

Is Your Software Development Process Green?
Is Your Software Development Process Green?Is Your Software Development Process Green?
Is Your Software Development Process Green?
Förderverein Technische Fakultät
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
byteLAKE
 
Resume
ResumeResume
IBM Think Milano
IBM Think MilanoIBM Think Milano
IBM Think Milano
ATMOSPHERE .
 
Engineering 4.0: Digitization through task automation and reuse
Engineering 4.0:  Digitization through task automation and reuseEngineering 4.0:  Digitization through task automation and reuse
Engineering 4.0: Digitization through task automation and reuse
CARLOS III UNIVERSITY OF MADRID
 
The Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningThe Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine Learning
IRJET Journal
 
The Power of Declarative Analytics
The Power of Declarative AnalyticsThe Power of Declarative Analytics
The Power of Declarative Analytics
Yunyao Li
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...
CARLOS III UNIVERSITY OF MADRID
 
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
Dr. Haxel Consult
 
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
ijcsit
 
Material management – using barcode system
Material management – using barcode systemMaterial management – using barcode system
Material management – using barcode system
vivatechijri
 
outsourcing for competitive advantage
outsourcing for competitive advantageoutsourcing for competitive advantage
outsourcing for competitive advantage
packets dontlie
 
Software application:assignment
Software application:assignmentSoftware application:assignment
Software application:assignment
마 이환
 
Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)
varun teja G.V.V
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineering
university of sust.
 
Interventions for scientific and enterprise applications
Interventions for scientific and enterprise applicationsInterventions for scientific and enterprise applications
Interventions for scientific and enterprise applications
eSAT Publishing House
 
Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...
eSAT Journals
 
Examination into it & competitive strategies within construction
Examination into it & competitive strategies within constructionExamination into it & competitive strategies within construction
Examination into it & competitive strategies within construction
sai0513
 
Rdp Software & IT Eligibility
Rdp Software & IT EligibilityRdp Software & IT Eligibility
Rdp Software & IT Eligibility
rkun
 
VamsiKrishna Maddiboina
VamsiKrishna MaddiboinaVamsiKrishna Maddiboina
VamsiKrishna Maddiboina
Maddiboina VamsiKrishna
 

What's hot (20)

Is Your Software Development Process Green?
Is Your Software Development Process Green?Is Your Software Development Process Green?
Is Your Software Development Process Green?
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
 
Resume
ResumeResume
Resume
 
IBM Think Milano
IBM Think MilanoIBM Think Milano
IBM Think Milano
 
Engineering 4.0: Digitization through task automation and reuse
Engineering 4.0:  Digitization through task automation and reuseEngineering 4.0:  Digitization through task automation and reuse
Engineering 4.0: Digitization through task automation and reuse
 
The Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningThe Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine Learning
 
The Power of Declarative Analytics
The Power of Declarative AnalyticsThe Power of Declarative Analytics
The Power of Declarative Analytics
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...
 
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
 
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
 
Material management – using barcode system
Material management – using barcode systemMaterial management – using barcode system
Material management – using barcode system
 
outsourcing for competitive advantage
outsourcing for competitive advantageoutsourcing for competitive advantage
outsourcing for competitive advantage
 
Software application:assignment
Software application:assignmentSoftware application:assignment
Software application:assignment
 
Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineering
 
Interventions for scientific and enterprise applications
Interventions for scientific and enterprise applicationsInterventions for scientific and enterprise applications
Interventions for scientific and enterprise applications
 
Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...
 
Examination into it & competitive strategies within construction
Examination into it & competitive strategies within constructionExamination into it & competitive strategies within construction
Examination into it & competitive strategies within construction
 
Rdp Software & IT Eligibility
Rdp Software & IT EligibilityRdp Software & IT Eligibility
Rdp Software & IT Eligibility
 
VamsiKrishna Maddiboina
VamsiKrishna MaddiboinaVamsiKrishna Maddiboina
VamsiKrishna Maddiboina
 

Similar to Industrial Natural Language Processing and Information Extraction

A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...
ijnlc
 
Unlocking Value from Unstructured Data
Unlocking Value from Unstructured DataUnlocking Value from Unstructured Data
Unlocking Value from Unstructured Data
Accenture Insurance
 
Data science - An Introduction
Data science - An IntroductionData science - An Introduction
Data science - An Introduction
Ravishankar Rajagopalan
 
Information Technology for Management and Business
Information Technology for Management and BusinessInformation Technology for Management and Business
Information Technology for Management and Business
Ganta Kishore Kumar
 
OpenKM commercial
OpenKM commercialOpenKM commercial
OpenKM commercial
gpalmerpujol
 
Natural Language Processing at Scale
Natural Language Processing at ScaleNatural Language Processing at Scale
Natural Language Processing at Scale
Andrei Lopatenko
 
Deep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language TextDeep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language Text
Pankaj Gupta, PhD
 
Commercializing Alternative Data
Commercializing Alternative DataCommercializing Alternative Data
Commercializing Alternative Data
Databricks
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
Venkatesh Umaashankar
 
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
KDZ - Zentrum für Verwaltungsforschung
 
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
Michael Mortenson
 
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
Michael Mortenson
 
Semantic Data Management
Semantic Data ManagementSemantic Data Management
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
IRJET Journal
 
Industry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software EngineeringIndustry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software Engineering
Per Runeson
 
YASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptx
YashShiva3
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
phdAssistance1
 
sample PPT.pptx
sample PPT.pptxsample PPT.pptx
sample PPT.pptx
ManishDubey91569
 
The Future of Search - Martin White
The Future of Search - Martin WhiteThe Future of Search - Martin White
The Future of Search - Martin White
Findwise
 
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Rio Info
 

Similar to Industrial Natural Language Processing and Information Extraction (20)

A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...
 
Unlocking Value from Unstructured Data
Unlocking Value from Unstructured DataUnlocking Value from Unstructured Data
Unlocking Value from Unstructured Data
 
Data science - An Introduction
Data science - An IntroductionData science - An Introduction
Data science - An Introduction
 
Information Technology for Management and Business
Information Technology for Management and BusinessInformation Technology for Management and Business
Information Technology for Management and Business
 
OpenKM commercial
OpenKM commercialOpenKM commercial
OpenKM commercial
 
Natural Language Processing at Scale
Natural Language Processing at ScaleNatural Language Processing at Scale
Natural Language Processing at Scale
 
Deep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language TextDeep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language Text
 
Commercializing Alternative Data
Commercializing Alternative DataCommercializing Alternative Data
Commercializing Alternative Data
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
 
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
 
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
 
Semantic Data Management
Semantic Data ManagementSemantic Data Management
Semantic Data Management
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
 
Industry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software EngineeringIndustry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software Engineering
 
YASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptx
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
sample PPT.pptx
sample PPT.pptxsample PPT.pptx
sample PPT.pptx
 
The Future of Search - Martin White
The Future of Search - Martin WhiteThe Future of Search - Martin White
The Future of Search - Martin White
 
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
 

Recently uploaded

一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 

Recently uploaded (20)

一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 

Industrial Natural Language Processing and Information Extraction

  • 1. Industrial Natural Language Processing & Information Extraction
  • 3. 3 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Overview NLP NLU summarization semantic parsing sentiment analysis dialogue agents natural language inference question answering machine translation text categorization syntactic parsing POS tagging keyword extraction named entity recognition topic recognition Natural Language Processing Developing and applying techniques and methods for the automatic processing of text Industrial Natural Language Processing Developing and applying techniques and methods for the automatic processing of text in industry by explicitly considering the requirements and circumstances of industrial environments
  • 4. 4 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Research Goals Reliable application and deployment of NLP in industrial environments Anonymization of textual data in order to be able to forward it to third parties Exploration of new areas for the use of natural language processing in the wild
  • 5. 5 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing NLP in Industrial Environments Reliable application and deployment of NLP in industrial environments  Design, develop and evaluate software architectures to make NLP useable by non-technical users  Improve the process of deploying and using NLP in industrial environments  Analyze textual data based on state-of-the-art NLP approaches Software Architectures Usability Analyze
  • 6. 6 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Anonymization Anonymization of textual data in order to be able to forward it to third parties  The analysis of unstructured company data in the cloud is either undesired by the company itself or even forbidden by law (DSGVO)  Cloud services provide more accurate and sophisticated analytics  Develop new anonymization approaches by using machine learning as the current approaches are too inaccurate
  • 7. 7 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Exploiting new Application Domains Exploration of new areas for the use of natural language processing in the wild  Most of the data that is available comprises unstructured data and especially textual documents  Identify available data sources and derive meaningful use cases from it  Develop appropriate models and applications for the identified use cases that generate an additional value Identify Data Sources Derive Use Cases Explore possible Solutions
  • 9. 9 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Integrating External Data into an Enterprise Information System External Information Extraction Tool  Personalized quick and easy access to a large amount of data from several different sources within a single tool  Identifying relevant data sources (e.g., new websites, social media, internal enterprise data)  Integrating data into a common data storage  Creation of dedicated analytical services for specific user requirements, like  Natural Language Processing  Translations  Overview of business knowledge graph  Sentiment analysis  Recommendation of relevant data Results  Integrating internal & external data into an enterprise information system to gain faster insights into changing markets, relations etc. Approach News Websites Social Media Enterprise Information System Information Data Goal
  • 10. 10 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Utilizing Textual Maintenance Data from Production Maintenance Data Insights Tool  Tool for  assisted generation of maintenance report texts  supported finding of solutions  visualization of errors and costs  Extracting textual reports from maintenance staff  Classify text into description of symptoms, causes and solutions  Calculation of relevant statistics  Creation of dedicated analytical services for staff and decision makers, like  Occurrence of similar error descriptions over time and location  Costs per machine location  Troubleshooting proposal for specified symptoms Results  Utilizing unstructured textual information from machines’ maintenance protocols to gain insights and optimize processes Approach Maintenance Data Platform Information Goal „… defect, please check“ „… part was exchanged“ „… machine losing oil“ „… spare part ordered “ Solution Hints
  • 11. 11 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Anonymization of Enterprise Documents using the Cloud Hybrid Anonymizer  Functional hybrid system for the automatic anonymization/pseudonymization of textual data  Enabled the use of cloud analysis for textual documents  Development of an anonymization approach based on predefined rules and deep learning  Implementation and testing of the hybrid anonymizer  Deployment of the anonymizer within the customers ecosystem  Methods: Natural Language Processing, Deep Learning, Micro-Service Architecture Results  Enable the usage of cloud services for data processing and analysis without revealing sensitive information Approach Goal
  • 12. 12 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing AISLE – Support learning academic phrases AISLE  Web platform that is actively used by students to improve their vocabulary  User studies showed the system's positive impact on vocabulary growth  Construct a large domain and target group specific text corpus using NLP methods  Use recent methods in the area of natural language processing for extracting and evaluating words and phrases based on their relevance  Development of an adaptive learning system to improve vocabulary on the basis of a developed learning algorithm and the built up corpora Results  Support students at the beginning of their studies in reading and understanding scientific publications Approach Goal Interact Enter Word: Vocabulary Size Evaluate & Select Words View Results Analyze Results
  • 14. 14 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Overview POS tagging Unstructured Data Information Extraction “… Application of methods from practical computer science, artificial intelligence and computational linguistics to the problem of automatic machine processing of unstructured information … ” Source: Wikipedia Different Types of Unstructured Data named entity recognition Data-Specific Processing Structured Data Structured Datanamed entity recognition Data Analysis Results
  • 15. 15 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Research Goals Leveraging of machine learning techniques to improve information extraction Transformation of unstructured data into useful structured information and knowledge
  • 16. 16 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Structuring unstructured data Transformation of unstructured data into useful structured information and knowledge  Identify all the relevant information that need to be extracted  Identify approaches for extracting information from unstructured data and turning it into valuable knowledge  Develop processing pipelines to automatically extract the identified information and make them accessible in a structured way Identify Information Choose Approaches Develop Processing Pipeline
  • 17. 17 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Machine Learning based Information Extraction Leveraging of machine learning techniques to improve information extraction Combine Machine Learning & Classical Approaches Data Annotation Model Training & Refinement  Combine or substitute classical information extraction approaches with machine learning  Development of tools to improve the process of annotating unstructured data in order to create a suitable data set for the training of ML models  Development & Refinement of ML models
  • 19. 19 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Structuring PDF Documents PDF Analyzer  Tool for identifying document elements in PDF files  Header  Text body  Tables  Figures  Formulas and Algorithms  First approaches for deriving information from diagrams and tables exist  Classifying diagram types  Extracting values, axis labels etc.  Consider additional context  Using Deep Learning (CNNs) to detect different elements within PDF documents  Extract additional information from diagrams and tables for further processing Results  Structuring of unstructured PDF documents to extract additional information and prepare the data for further analytics Approach Goal
  • 20. Your Contact Person: André Pomp, M.Sc. Tel: +49 (0)202 439 1153 pomp@uni-wuppertal.de Chair for Technologies and Management of Digital Transformation Univ. Prof. Dr. Ing. Tobias Meisen https://www.tmdt.uni-wuppertal.de/ Campus Freudenberg Rainer-Gruenter-Str. 21 D-42119 Wuppertal Germany University of Wuppertal School of Electrical, Information and Media Engineering