SlideShare a Scribd company logo
1 of 20
Industrial Natural Language Processing &
Information Extraction
Industrial Natural Language
Processing
3
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Overview
NLP
NLU
summarization
semantic
parsing
sentiment
analysis
dialogue
agents
natural
language
inference
question
answering
machine
translation
text
categorization
syntactic
parsing
POS
tagging
keyword
extraction
named
entity
recognition
topic
recognition
Natural Language Processing
Developing and applying techniques
and methods for the automatic
processing of text
Industrial Natural Language
Processing
Developing and applying techniques
and methods for the automatic
processing of text in industry by
explicitly considering the
requirements and circumstances of
industrial environments
4
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Research Goals
Reliable application and deployment of NLP in industrial
environments
Anonymization of textual data in order to be able to forward
it to third parties
Exploration of new areas for the use of natural language
processing in the wild
5
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
NLP in Industrial Environments
Reliable application and deployment of NLP in industrial
environments
 Design, develop and evaluate software architectures to make NLP useable
by non-technical users
 Improve the process of deploying and using NLP in industrial environments
 Analyze textual data based on state-of-the-art NLP approaches
Software Architectures Usability Analyze
6
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Anonymization
Anonymization of textual data in order to be able to forward
it to third parties
 The analysis of unstructured company data in the cloud is either undesired
by the company itself or even forbidden by law (DSGVO)
 Cloud services provide more accurate and sophisticated analytics
 Develop new anonymization approaches by using machine learning as the
current approaches are too inaccurate
7
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Exploiting new Application Domains
Exploration of new areas for the use of natural language
processing in the wild
 Most of the data that is available comprises unstructured data and especially
textual documents
 Identify available data sources and derive meaningful use cases from it
 Develop appropriate models and applications for the identified use cases that
generate an additional value
Identify Data Sources Derive Use Cases Explore possible Solutions
Selected Research Projects
and Applications
9
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Integrating External Data into an Enterprise Information System
External Information Extraction Tool
 Personalized quick and easy access to a large amount of
data from several different sources within a single tool
 Identifying relevant data sources (e.g., new websites,
social media, internal enterprise data)
 Integrating data into a common data storage
 Creation of dedicated analytical services for specific user
requirements, like
 Natural Language Processing
 Translations
 Overview of business knowledge graph
 Sentiment analysis
 Recommendation of relevant data
Results
 Integrating internal & external data into an enterprise
information system to gain faster insights into changing
markets, relations etc.
Approach
News Websites Social Media
Enterprise Information System
Information
Data
Goal
10
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Utilizing Textual Maintenance Data from Production
Maintenance Data Insights Tool
 Tool for
 assisted generation of maintenance report texts
 supported finding of solutions
 visualization of errors and costs
 Extracting textual reports from maintenance staff
 Classify text into description of symptoms, causes and
solutions
 Calculation of relevant statistics
 Creation of dedicated analytical services for staff and
decision makers, like
 Occurrence of similar error descriptions over time and
location
 Costs per machine location
 Troubleshooting proposal for specified symptoms
Results
 Utilizing unstructured textual information from machines’
maintenance protocols to gain insights and optimize
processes
Approach
Maintenance Data Platform
Information
Goal
„… defect, please check“
„… part was exchanged“
„… machine losing oil“
„… spare part ordered “
Solution Hints
11
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
Anonymization of Enterprise Documents using the Cloud
Hybrid Anonymizer
 Functional hybrid system for the automatic
anonymization/pseudonymization of textual data
 Enabled the use of cloud analysis for textual documents
 Development of an anonymization approach based on
predefined rules and deep learning
 Implementation and testing of the hybrid anonymizer
 Deployment of the anonymizer within the customers
ecosystem
 Methods: Natural Language Processing, Deep Learning,
Micro-Service Architecture
Results
 Enable the usage of cloud services for data processing and
analysis without revealing sensitive information
Approach
Goal
12
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Industrial Natural Language Processing
AISLE – Support learning academic phrases
AISLE
 Web platform that is actively used by students to improve
their vocabulary
 User studies showed the system's positive impact on
vocabulary growth
 Construct a large domain and target group specific text
corpus using NLP methods
 Use recent methods in the area of natural language
processing for extracting and evaluating words and
phrases based on their relevance
 Development of an adaptive learning system to improve
vocabulary on the basis of a developed learning algorithm
and the built up corpora
Results
 Support students at the beginning of their studies in reading
and understanding scientific publications
Approach
Goal
Interact
Enter
Word:
Vocabulary
Size
Evaluate &
Select Words
View
Results
Analyze
Results
Information Extraction
14
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Overview
POS
tagging
Unstructured
Data
Information Extraction
“… Application of methods from practical computer science,
artificial intelligence and computational linguistics to the
problem of automatic machine processing of unstructured
information … ” Source: Wikipedia
Different Types of
Unstructured Data
named
entity
recognition
Data-Specific
Processing
Structured
Data
Structured
Datanamed
entity
recognition
Data
Analysis
Results
15
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Research Goals
Leveraging of machine learning techniques to improve
information extraction
Transformation of unstructured data into useful structured
information and knowledge
16
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Structuring unstructured data
Transformation of unstructured data into useful structured
information and knowledge
 Identify all the relevant information that need to be extracted
 Identify approaches for extracting information from unstructured data and
turning it into valuable knowledge
 Develop processing pipelines to automatically extract the identified
information and make them accessible in a structured way
Identify Information Choose Approaches
Develop Processing
Pipeline
17
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Machine Learning based Information Extraction
Leveraging of machine learning techniques to improve
information extraction
Combine Machine Learning &
Classical Approaches
Data Annotation Model Training & Refinement
 Combine or substitute classical information extraction approaches with
machine learning
 Development of tools to improve the process of annotating unstructured data
in order to create a suitable data set for the training of ML models
 Development & Refinement of ML models
Selected Research Projects
and Applications
19
Industrial Natural Language Processing & Information Extraction
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Information Extraction
Structuring PDF Documents
PDF Analyzer
 Tool for identifying document elements in PDF files
 Header
 Text body
 Tables
 Figures
 Formulas and Algorithms
 First approaches for deriving information from diagrams
and tables exist
 Classifying diagram types
 Extracting values, axis labels etc.
 Consider additional context
 Using Deep Learning (CNNs) to detect different elements
within PDF documents
 Extract additional information from diagrams and tables
for further processing
Results
 Structuring of unstructured PDF documents to extract
additional information and prepare the data for further
analytics
Approach
Goal
Your Contact Person:
André Pomp, M.Sc.
Tel: +49 (0)202 439 1153
pomp@uni-wuppertal.de
Chair for Technologies and Management of Digital Transformation
Univ. Prof. Dr. Ing. Tobias Meisen
https://www.tmdt.uni-wuppertal.de/
Campus Freudenberg
Rainer-Gruenter-Str. 21
D-42119 Wuppertal
Germany
University of Wuppertal
School of Electrical, Information and Media Engineering

More Related Content

What's hot

AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)byteLAKE
 
Engineering 4.0: Digitization through task automation and reuse
Engineering 4.0:  Digitization through task automation and reuseEngineering 4.0:  Digitization through task automation and reuse
Engineering 4.0: Digitization through task automation and reuseCARLOS III UNIVERSITY OF MADRID
 
The Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningThe Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningIRJET Journal
 
The Power of Declarative Analytics
The Power of Declarative AnalyticsThe Power of Declarative Analytics
The Power of Declarative AnalyticsYunyao Li
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...CARLOS III UNIVERSITY OF MADRID
 
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...Dr. Haxel Consult
 
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...ijcsit
 
Material management – using barcode system
Material management – using barcode systemMaterial management – using barcode system
Material management – using barcode systemvivatechijri
 
outsourcing for competitive advantage
outsourcing for competitive advantageoutsourcing for competitive advantage
outsourcing for competitive advantagepackets dontlie
 
Software application:assignment
Software application:assignmentSoftware application:assignment
Software application:assignment마 이환
 
Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)varun teja G.V.V
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringuniversity of sust.
 
Interventions for scientific and enterprise applications
Interventions for scientific and enterprise applicationsInterventions for scientific and enterprise applications
Interventions for scientific and enterprise applicationseSAT Publishing House
 
Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...eSAT Journals
 
Examination into it & competitive strategies within construction
Examination into it & competitive strategies within constructionExamination into it & competitive strategies within construction
Examination into it & competitive strategies within constructionsai0513
 
Rdp Software & IT Eligibility
Rdp Software & IT EligibilityRdp Software & IT Eligibility
Rdp Software & IT Eligibilityrkun
 

What's hot (20)

Is Your Software Development Process Green?
Is Your Software Development Process Green?Is Your Software Development Process Green?
Is Your Software Development Process Green?
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
 
Resume
ResumeResume
Resume
 
IBM Think Milano
IBM Think MilanoIBM Think Milano
IBM Think Milano
 
Engineering 4.0: Digitization through task automation and reuse
Engineering 4.0:  Digitization through task automation and reuseEngineering 4.0:  Digitization through task automation and reuse
Engineering 4.0: Digitization through task automation and reuse
 
The Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine LearningThe Study of the Large Scale Twitter on Machine Learning
The Study of the Large Scale Twitter on Machine Learning
 
The Power of Declarative Analytics
The Power of Declarative AnalyticsThe Power of Declarative Analytics
The Power of Declarative Analytics
 
Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...Challenges in the integration of Systems Engineering and the AI/ML model life...
Challenges in the integration of Systems Engineering and the AI/ML model life...
 
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
II-SDV 2012 Expert System Driven Insights into Patent Quality and Competitive...
 
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
IMMERSIVE TECHNOLOGIES IN 5G-ENABLED APPLICATIONS: SOME TECHNICAL CHALLENGES ...
 
Material management – using barcode system
Material management – using barcode systemMaterial management – using barcode system
Material management – using barcode system
 
outsourcing for competitive advantage
outsourcing for competitive advantageoutsourcing for competitive advantage
outsourcing for competitive advantage
 
Software application:assignment
Software application:assignmentSoftware application:assignment
Software application:assignment
 
Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)Mi 291 chapter 3 (reverse engineering)(1)
Mi 291 chapter 3 (reverse engineering)(1)
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineering
 
Interventions for scientific and enterprise applications
Interventions for scientific and enterprise applicationsInterventions for scientific and enterprise applications
Interventions for scientific and enterprise applications
 
Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...Interventions for scientific and enterprise applications based on high perfor...
Interventions for scientific and enterprise applications based on high perfor...
 
Examination into it & competitive strategies within construction
Examination into it & competitive strategies within constructionExamination into it & competitive strategies within construction
Examination into it & competitive strategies within construction
 
Rdp Software & IT Eligibility
Rdp Software & IT EligibilityRdp Software & IT Eligibility
Rdp Software & IT Eligibility
 
VamsiKrishna Maddiboina
VamsiKrishna MaddiboinaVamsiKrishna Maddiboina
VamsiKrishna Maddiboina
 

Similar to Industrial Natural Language Processing and Information Extraction

A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...ijnlc
 
Unlocking Value from Unstructured Data
Unlocking Value from Unstructured DataUnlocking Value from Unstructured Data
Unlocking Value from Unstructured DataAccenture Insurance
 
Information Technology for Management and Business
Information Technology for Management and BusinessInformation Technology for Management and Business
Information Technology for Management and BusinessGanta Kishore Kumar
 
Natural Language Processing at Scale
Natural Language Processing at ScaleNatural Language Processing at Scale
Natural Language Processing at ScaleAndrei Lopatenko
 
Deep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language TextDeep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language TextPankaj Gupta, PhD
 
Commercializing Alternative Data
Commercializing Alternative DataCommercializing Alternative Data
Commercializing Alternative DataDatabricks
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teamsVenkatesh Umaashankar
 
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...KDZ - Zentrum für Verwaltungsforschung
 
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...Michael Mortenson
 
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...Michael Mortenson
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech TranslationIRJET Journal
 
Industry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software EngineeringIndustry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software EngineeringPer Runeson
 
YASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYashShiva3
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...phdAssistance1
 
The Future of Search - Martin White
The Future of Search - Martin WhiteThe Future of Search - Martin White
The Future of Search - Martin WhiteFindwise
 
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Rio Info
 

Similar to Industrial Natural Language Processing and Information Extraction (20)

A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...A Semantic Question Answering through Heterogeneous Data Source in the Domain...
A Semantic Question Answering through Heterogeneous Data Source in the Domain...
 
Unlocking Value from Unstructured Data
Unlocking Value from Unstructured DataUnlocking Value from Unstructured Data
Unlocking Value from Unstructured Data
 
Data science - An Introduction
Data science - An IntroductionData science - An Introduction
Data science - An Introduction
 
Information Technology for Management and Business
Information Technology for Management and BusinessInformation Technology for Management and Business
Information Technology for Management and Business
 
OpenKM commercial
OpenKM commercialOpenKM commercial
OpenKM commercial
 
Natural Language Processing at Scale
Natural Language Processing at ScaleNatural Language Processing at Scale
Natural Language Processing at Scale
 
Deep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language TextDeep Learning for Information Extraction in Natural Language Text
Deep Learning for Information Extraction in Natural Language Text
 
Commercializing Alternative Data
Commercializing Alternative DataCommercializing Alternative Data
Commercializing Alternative Data
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...Enriching SMW based Virtual Research Environments with external data, Jan Nov...
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
 
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
A Topic Model of Analytics Job Adverts (Operational Research Society Annual C...
 
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
A Topic Model of Analytics Job Adverts (The Operational Research Society 55th...
 
Semantic Data Management
Semantic Data ManagementSemantic Data Management
Semantic Data Management
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
 
Industry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software EngineeringIndustry-Academia Communication In Empirical Software Engineering
Industry-Academia Communication In Empirical Software Engineering
 
YASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptx
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
sample PPT.pptx
sample PPT.pptxsample PPT.pptx
sample PPT.pptx
 
The Future of Search - Martin White
The Future of Search - Martin WhiteThe Future of Search - Martin White
The Future of Search - Martin White
 
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
 

Recently uploaded

办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 

Recently uploaded (20)

办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 

Industrial Natural Language Processing and Information Extraction

  • 1. Industrial Natural Language Processing & Information Extraction
  • 3. 3 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Overview NLP NLU summarization semantic parsing sentiment analysis dialogue agents natural language inference question answering machine translation text categorization syntactic parsing POS tagging keyword extraction named entity recognition topic recognition Natural Language Processing Developing and applying techniques and methods for the automatic processing of text Industrial Natural Language Processing Developing and applying techniques and methods for the automatic processing of text in industry by explicitly considering the requirements and circumstances of industrial environments
  • 4. 4 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Research Goals Reliable application and deployment of NLP in industrial environments Anonymization of textual data in order to be able to forward it to third parties Exploration of new areas for the use of natural language processing in the wild
  • 5. 5 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing NLP in Industrial Environments Reliable application and deployment of NLP in industrial environments  Design, develop and evaluate software architectures to make NLP useable by non-technical users  Improve the process of deploying and using NLP in industrial environments  Analyze textual data based on state-of-the-art NLP approaches Software Architectures Usability Analyze
  • 6. 6 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Anonymization Anonymization of textual data in order to be able to forward it to third parties  The analysis of unstructured company data in the cloud is either undesired by the company itself or even forbidden by law (DSGVO)  Cloud services provide more accurate and sophisticated analytics  Develop new anonymization approaches by using machine learning as the current approaches are too inaccurate
  • 7. 7 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Exploiting new Application Domains Exploration of new areas for the use of natural language processing in the wild  Most of the data that is available comprises unstructured data and especially textual documents  Identify available data sources and derive meaningful use cases from it  Develop appropriate models and applications for the identified use cases that generate an additional value Identify Data Sources Derive Use Cases Explore possible Solutions
  • 9. 9 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Integrating External Data into an Enterprise Information System External Information Extraction Tool  Personalized quick and easy access to a large amount of data from several different sources within a single tool  Identifying relevant data sources (e.g., new websites, social media, internal enterprise data)  Integrating data into a common data storage  Creation of dedicated analytical services for specific user requirements, like  Natural Language Processing  Translations  Overview of business knowledge graph  Sentiment analysis  Recommendation of relevant data Results  Integrating internal & external data into an enterprise information system to gain faster insights into changing markets, relations etc. Approach News Websites Social Media Enterprise Information System Information Data Goal
  • 10. 10 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Utilizing Textual Maintenance Data from Production Maintenance Data Insights Tool  Tool for  assisted generation of maintenance report texts  supported finding of solutions  visualization of errors and costs  Extracting textual reports from maintenance staff  Classify text into description of symptoms, causes and solutions  Calculation of relevant statistics  Creation of dedicated analytical services for staff and decision makers, like  Occurrence of similar error descriptions over time and location  Costs per machine location  Troubleshooting proposal for specified symptoms Results  Utilizing unstructured textual information from machines’ maintenance protocols to gain insights and optimize processes Approach Maintenance Data Platform Information Goal „… defect, please check“ „… part was exchanged“ „… machine losing oil“ „… spare part ordered “ Solution Hints
  • 11. 11 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing Anonymization of Enterprise Documents using the Cloud Hybrid Anonymizer  Functional hybrid system for the automatic anonymization/pseudonymization of textual data  Enabled the use of cloud analysis for textual documents  Development of an anonymization approach based on predefined rules and deep learning  Implementation and testing of the hybrid anonymizer  Deployment of the anonymizer within the customers ecosystem  Methods: Natural Language Processing, Deep Learning, Micro-Service Architecture Results  Enable the usage of cloud services for data processing and analysis without revealing sensitive information Approach Goal
  • 12. 12 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Industrial Natural Language Processing AISLE – Support learning academic phrases AISLE  Web platform that is actively used by students to improve their vocabulary  User studies showed the system's positive impact on vocabulary growth  Construct a large domain and target group specific text corpus using NLP methods  Use recent methods in the area of natural language processing for extracting and evaluating words and phrases based on their relevance  Development of an adaptive learning system to improve vocabulary on the basis of a developed learning algorithm and the built up corpora Results  Support students at the beginning of their studies in reading and understanding scientific publications Approach Goal Interact Enter Word: Vocabulary Size Evaluate & Select Words View Results Analyze Results
  • 14. 14 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Overview POS tagging Unstructured Data Information Extraction “… Application of methods from practical computer science, artificial intelligence and computational linguistics to the problem of automatic machine processing of unstructured information … ” Source: Wikipedia Different Types of Unstructured Data named entity recognition Data-Specific Processing Structured Data Structured Datanamed entity recognition Data Analysis Results
  • 15. 15 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Research Goals Leveraging of machine learning techniques to improve information extraction Transformation of unstructured data into useful structured information and knowledge
  • 16. 16 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Structuring unstructured data Transformation of unstructured data into useful structured information and knowledge  Identify all the relevant information that need to be extracted  Identify approaches for extracting information from unstructured data and turning it into valuable knowledge  Develop processing pipelines to automatically extract the identified information and make them accessible in a structured way Identify Information Choose Approaches Develop Processing Pipeline
  • 17. 17 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Machine Learning based Information Extraction Leveraging of machine learning techniques to improve information extraction Combine Machine Learning & Classical Approaches Data Annotation Model Training & Refinement  Combine or substitute classical information extraction approaches with machine learning  Development of tools to improve the process of annotating unstructured data in order to create a suitable data set for the training of ML models  Development & Refinement of ML models
  • 19. 19 Industrial Natural Language Processing & Information Extraction Chair of Technologies and Management of Digital Transformation, University of Wuppertal Information Extraction Structuring PDF Documents PDF Analyzer  Tool for identifying document elements in PDF files  Header  Text body  Tables  Figures  Formulas and Algorithms  First approaches for deriving information from diagrams and tables exist  Classifying diagram types  Extracting values, axis labels etc.  Consider additional context  Using Deep Learning (CNNs) to detect different elements within PDF documents  Extract additional information from diagrams and tables for further processing Results  Structuring of unstructured PDF documents to extract additional information and prepare the data for further analytics Approach Goal
  • 20. Your Contact Person: André Pomp, M.Sc. Tel: +49 (0)202 439 1153 pomp@uni-wuppertal.de Chair for Technologies and Management of Digital Transformation Univ. Prof. Dr. Ing. Tobias Meisen https://www.tmdt.uni-wuppertal.de/ Campus Freudenberg Rainer-Gruenter-Str. 21 D-42119 Wuppertal Germany University of Wuppertal School of Electrical, Information and Media Engineering