SlideShare a Scribd company logo
1 of 18
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu ::
Faculty of Science, Technology and Communication (FSTC)
Bachelor en informatique (professionnel)
-- Media IT -–
¯_(ツ)_/¯
Unit 3
Natural language
processing
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu ::
3. Natural language processing
2
You may be surprised to learn that it is possible to see her spinning both
clockwise and counterclockwise. It is related to bistable perception in
which an ambiguous 2-dimensional figure can be seen in from two different
perspectives. Because there is no third dimension, our brains try to
construct space around the figure.
Try looking at the figure and then blink; she may appear to change
directions immediately after you blink. Another strategy is to focus on a
specific part of the figure.
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu ::
3. Natural language processing
3
3.1 Semantics and Artificial Intelligence
3.2 Computational linguistics
3.3 Syntax, Semantics and Pragmatics
3.4 Exercise
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 4
3. Natural language processing
3.1 Semantics and Artificial Intelligence
Art
For a human:
Art
Porn
Art
For a machine:
??!!
Weak AI
• Logical reasoning
• Knowledge
representation
• Natural Language
Processing
• Machine Learning
• …
Strong AI
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 5
3. Natural language processing
3.1 Semantics and Artificial Intelligence
Weak AI
• Logical reasoning
• Knowledge
representation
• Natural Language
Processing
• Machine Learning
• …
Voting
photography
best picture 2010
Tagging
Statistical solution
Requires critical mass of
“good” users
Deterministic solution
Requires machine-readable
metadata
Photo  hasColor.BW
 photoOf.(Woman 
isNude) 
isOutdoors.Daylight
Description Logics
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 6
3. Natural language processing
3.1 Semantics and Artificial Intelligence
Deterministic solution
Requires machine-readable
metadata
Photo  hasColor.BW
 photoOf.(Woman 
isNude) 
isOutdoors.Daylight
Description Logics
?
NLP is a subfield of linguistics and artificial intelligence. It studies the problems
inherent to the processing and manipulation of natural language. The ultimate
goal of NLP is to make computers “understand” statements written in human
languages. The definition of “understanding” is one of the major problems in
NLP.
Natural Language Processing (NLP)
Examples of applications:
• Optical character recognition (OCR)
• Machine translation
• Speech recognition
https://translate.google.com/
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 7
3. Natural language processing
3.1 Semantics and Artificial Intelligence
https://www.deepl.com/
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 8
3. Natural language processing
3.1 Semantics and Artificial Intelligence
wrong interpretation
false positive
too much of information
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 9
3.2 Computational linguistics
Tagging and Parsing natural language
noun phrase (NP),
prepositional phrase (PP),
adjective phrase (ADJP),
verb phrase (VP)
Phrase types
adjective (JJ),
conjunction (CC),
preposition (IN),
determiner (DT),
noun (NN),
verb (VBZ)
Phrase tags:
3. Natural language processing
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 10
Tagging and Parsing natural language
3. Natural language processing
The TreeTagger is a tool for annotating
text with part-of-speech and lemma
information. It was developed by Helmut
Schmid in the TC project at the Institute for
Computational Linguistics of the University
of Stuttgart.
The TreeTagger has been successfully used
to tag German, English, French, Italian,
Danish, Dutch, Spanish, Bulgarian, Russian,
Portuguese, Galician, Greek, Chinese,
Swahili, Slovak, Slovenian, Latin, Estonian,
Polish, Romanian, Czech, Coptic and old
French texts and is adaptable to other
languages if a lexicon and a manually
tagged training corpus are available.
http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/
3.2 Computational linguistics
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 11
a date
3. Natural language processing
Ambiguities in parsing a natural language
? Date ?
3.2 Computational linguistics
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 12
3. Natural language processing
WordNet – lexical database
http://wordnet.princeton.edu/
117.000 semantically equivalent words (synsets) are interlinked by means of conceptual-semantic and lexical relations
hypernym: a word with a more general meaning
E.g., animal is a hypernym of cat
hyponym: a word with a more specific meaning
E.g., cat is a hyponym of animal
synonym: a word with identical meaning
E.g., car and automobile are synonyms
homonym: words with identical spelling but different meaning
E.g., Ada is a programming language but also a person
3.2 Computational linguistics
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu ::
Lecture "WWW basics" by Prof. Christoph Meinel at the Hasso-Plattner-Institute (HPI) in Potsdam
#n
How TCP/IP works
#1
Intro
#2
Protocols in general
#3
Error-handling as
task of a protocol
#4
Error-
handling
13
3. Natural language processing
Semantic annotation of lecture videos
Natural language explanation
Protocol
hasTask.ErrorHandling
Description logics
<owl:Class rdf:about="#LO3">
<owl:intersectionOf rdf:parseType="Collection">
<owl:Class rdf:about="#Protocol" />
<owl:restriction>
<owl:onProperty rdf:resource="#hasTask" />
<owl:someValuesFrom rdf:resource="#ErrorHandling" />
</owl:restriction>
</owl:intersectionOf>
</owl:Class>
Machine readable XML
3.2 Computational linguistics
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu ::
14
How old are my 3
children?
1. The multiplication of their ages is 36
2. The addition of their ages is the number on the door
3. The oldest is blonde 1 x 1 x 36 = 36
1 x 2 x 18 = 36
1 x 3 x 12 = 36
1 x 4 x 9 = 36
1 x 6 x 6 = 36
2 x 2 x 9 = 36
2 x 3 x 6 = 36
3 x 3 x 4 = 36
1 + 1 + 36 = 38
1 + 2 + 18 = 21
1 + 3 + 12 = 16
1 + 4 + 9 = 14
1 + 6 + 6 = 13
2 + 2 + 9 = 13
2 + 3 + 6 = 11
3 + 3 + 4 = 10
use the same possible ages
you will ask a 3d question
because you have 2 x the
same number
there is just one who
the oldest
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 15
3. Natural language processing
3.3 Syntax, Semantics and Pragmatics
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 16
3. Natural language processing
3.3 Syntax, Semantics and Pragmatics
“...a goal of the Web was that, if the interaction
between person and hypertext could be so intuitive
that the machine-readable information space gave an
accurate representation of the state of people's
thoughts, interactions, and work patterns, then
machine analysis could become a very powerful
management tool, seeing patterns in our work and
facilitating our working together through the typical
problems which beset the management of large
organizations.”
A vision of a smarter web by Tim Berners-Lee
Scientific American, May, 2001
Classical Web is build upon HTML (HyperText Markup Language)
Can HTML be used to build the Semantic Web?
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 17
3. Natural language processing
3.3 Syntax, Semantics and Pragmatics
Semantics refers to aspects of meaning, as
expressed in language or other systems of
signs
Syntax is the study of the structure of sign
systems, focusing on the form, not the
meaning
Pragmatics is the study of the practical use
of signs by agents or communities of
interpretation within particular circumstances
and contexts
I movies the go to with my wife
The movies went to me
Is the window open?
[The asking person may feel cold]
I go to the movies with my wife
I went to the movies
"Classical Web"
Semantic Web applications
Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu ::
3. Natural language processing
3.4 Exercise
Practical exercise
Install Tree Tagger and test it in terminal / command line
1.
How it works
• You work in teams of 2 students.
• Submit the running program and the source code by November 30, 2018 via Moodle!
• This work is considered 10% of your final grade.
Develop a program, e.g., in Java, that asks the user a question and returns an exciting statement
2.
USER: I have a beer after the class
SYSTEM: Cool, you have a beer after the class
I
have
a
beer
after
the
class
./bin/tree-tagger -token ./lib/english-utf8.par input.txt output.txt
I PP
have VHP
a DT
beer NN
after IN
the DT
class NN
USER: My mom is great
SYSTEM: Cool, your mom is great

More Related Content

Similar to Media IT - Natural Language Processing

Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)IT Industry
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceRohan Vadsola
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Pythonshanbady
 
Semantic Web - XML and sublanguages
Semantic Web - XML and sublanguagesSemantic Web - XML and sublanguages
Semantic Web - XML and sublanguagesSerge Linckels
 
The Mystery of Natural Language Processing
The Mystery of Natural Language ProcessingThe Mystery of Natural Language Processing
The Mystery of Natural Language ProcessingMahmood Aijazi, MD
 
Digital Technologies 2014 (ICTENSW)
Digital Technologies 2014 (ICTENSW)Digital Technologies 2014 (ICTENSW)
Digital Technologies 2014 (ICTENSW)Jason Zagami
 
Artificial intelligence uses in productive systems and impacts on the world...
Artificial intelligence   uses in productive systems and impacts on the world...Artificial intelligence   uses in productive systems and impacts on the world...
Artificial intelligence uses in productive systems and impacts on the world...Fernando Alcoforado
 
Automated Metadata Annotation What Is And Is Not Possible With Machine Learning
Automated Metadata Annotation  What Is And Is Not Possible With Machine LearningAutomated Metadata Annotation  What Is And Is Not Possible With Machine Learning
Automated Metadata Annotation What Is And Is Not Possible With Machine LearningJim Webb
 
Combining Multimedia and Semantics (LACNEM2010)
Combining Multimedia and Semantics (LACNEM2010)Combining Multimedia and Semantics (LACNEM2010)
Combining Multimedia and Semantics (LACNEM2010)Oscar Corcho
 
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...Micah Altman
 
"Hour of Code": Back to the roots... [1987-1993]
"Hour of Code": Back to the roots... [1987-1993]"Hour of Code": Back to the roots... [1987-1993]
"Hour of Code": Back to the roots... [1987-1993]Yannis Kotsanis
 
Advanced Artificial Intelligence
Advanced Artificial IntelligenceAdvanced Artificial Intelligence
Advanced Artificial IntelligenceAshik Iqbal
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptOlusolaTop
 
Digital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the fieldDigital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the fieldaelang
 
A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...
A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...
A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...guestac67362
 

Similar to Media IT - Natural Language Processing (20)

Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
NLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in PythonNLTK - Natural Language Processing in Python
NLTK - Natural Language Processing in Python
 
Semantic Web - XML and sublanguages
Semantic Web - XML and sublanguagesSemantic Web - XML and sublanguages
Semantic Web - XML and sublanguages
 
The Mystery of Natural Language Processing
The Mystery of Natural Language ProcessingThe Mystery of Natural Language Processing
The Mystery of Natural Language Processing
 
Digital Technologies 2014 (ICTENSW)
Digital Technologies 2014 (ICTENSW)Digital Technologies 2014 (ICTENSW)
Digital Technologies 2014 (ICTENSW)
 
Artificial intelligence uses in productive systems and impacts on the world...
Artificial intelligence   uses in productive systems and impacts on the world...Artificial intelligence   uses in productive systems and impacts on the world...
Artificial intelligence uses in productive systems and impacts on the world...
 
Automated Metadata Annotation What Is And Is Not Possible With Machine Learning
Automated Metadata Annotation  What Is And Is Not Possible With Machine LearningAutomated Metadata Annotation  What Is And Is Not Possible With Machine Learning
Automated Metadata Annotation What Is And Is Not Possible With Machine Learning
 
Combining Multimedia and Semantics (LACNEM2010)
Combining Multimedia and Semantics (LACNEM2010)Combining Multimedia and Semantics (LACNEM2010)
Combining Multimedia and Semantics (LACNEM2010)
 
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
MIT Program on Information Science Talk -- Julia Flanders on Jobs, Roles, Ski...
 
LuisValeroInterests
LuisValeroInterestsLuisValeroInterests
LuisValeroInterests
 
"Hour of Code": Back to the roots... [1987-1993]
"Hour of Code": Back to the roots... [1987-1993]"Hour of Code": Back to the roots... [1987-1993]
"Hour of Code": Back to the roots... [1987-1993]
 
Advanced Artificial Intelligence
Advanced Artificial IntelligenceAdvanced Artificial Intelligence
Advanced Artificial Intelligence
 
Ai notes
Ai notesAi notes
Ai notes
 
FinalReport
FinalReportFinalReport
FinalReport
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
Sangeetha seminar (1)
Sangeetha  seminar (1)Sangeetha  seminar (1)
Sangeetha seminar (1)
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
Digital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the fieldDigital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the field
 
A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...
A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...
A Paper Presentation On Artificial Intelligence And Global Risk Paper Present...
 

More from Serge Linckels

Media IT - XML and XML Transformation (XSLT)
Media IT - XML and XML Transformation (XSLT)Media IT - XML and XML Transformation (XSLT)
Media IT - XML and XML Transformation (XSLT)Serge Linckels
 
Media IT - XML and sublanguages
Media IT - XML and sublanguagesMedia IT - XML and sublanguages
Media IT - XML and sublanguagesSerge Linckels
 
Media IT - author rights
Media IT - author rightsMedia IT - author rights
Media IT - author rightsSerge Linckels
 
Semantic Web - Search engines
Semantic Web - Search enginesSemantic Web - Search engines
Semantic Web - Search enginesSerge Linckels
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - OntologiesSerge Linckels
 
Semantic Web - Overview
Semantic Web - OverviewSemantic Web - Overview
Semantic Web - OverviewSerge Linckels
 
Semantic Web - Introduction
Semantic Web - IntroductionSemantic Web - Introduction
Semantic Web - IntroductionSerge Linckels
 

More from Serge Linckels (13)

Media IT - XML and XML Transformation (XSLT)
Media IT - XML and XML Transformation (XSLT)Media IT - XML and XML Transformation (XSLT)
Media IT - XML and XML Transformation (XSLT)
 
Media IT - XML and sublanguages
Media IT - XML and sublanguagesMedia IT - XML and sublanguages
Media IT - XML and sublanguages
 
Media IT - author rights
Media IT - author rightsMedia IT - author rights
Media IT - author rights
 
Media IT - Images
Media IT - ImagesMedia IT - Images
Media IT - Images
 
Media IT - Entropy
Media IT - EntropyMedia IT - Entropy
Media IT - Entropy
 
Media IT - Coding
Media IT - CodingMedia IT - Coding
Media IT - Coding
 
Semantic Web - Search engines
Semantic Web - Search enginesSemantic Web - Search engines
Semantic Web - Search engines
 
Semantic Web - OWL
Semantic Web - OWLSemantic Web - OWL
Semantic Web - OWL
 
Semantic Web - Ontologies
Semantic Web - OntologiesSemantic Web - Ontologies
Semantic Web - Ontologies
 
Semantic Web - RDF
Semantic Web - RDFSemantic Web - RDF
Semantic Web - RDF
 
Semantic Web - Overview
Semantic Web - OverviewSemantic Web - Overview
Semantic Web - Overview
 
Semantic Web - Introduction
Semantic Web - IntroductionSemantic Web - Introduction
Semantic Web - Introduction
 
E-Librarian Service
E-Librarian ServiceE-Librarian Service
E-Librarian Service
 

Recently uploaded

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Media IT - Natural Language Processing

  • 1. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: Faculty of Science, Technology and Communication (FSTC) Bachelor en informatique (professionnel) -- Media IT -– ¯_(ツ)_/¯ Unit 3 Natural language processing
  • 2. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 3. Natural language processing 2 You may be surprised to learn that it is possible to see her spinning both clockwise and counterclockwise. It is related to bistable perception in which an ambiguous 2-dimensional figure can be seen in from two different perspectives. Because there is no third dimension, our brains try to construct space around the figure. Try looking at the figure and then blink; she may appear to change directions immediately after you blink. Another strategy is to focus on a specific part of the figure.
  • 3. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 3. Natural language processing 3 3.1 Semantics and Artificial Intelligence 3.2 Computational linguistics 3.3 Syntax, Semantics and Pragmatics 3.4 Exercise
  • 4. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 4 3. Natural language processing 3.1 Semantics and Artificial Intelligence Art For a human: Art Porn Art For a machine: ??!! Weak AI • Logical reasoning • Knowledge representation • Natural Language Processing • Machine Learning • … Strong AI
  • 5. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 5 3. Natural language processing 3.1 Semantics and Artificial Intelligence Weak AI • Logical reasoning • Knowledge representation • Natural Language Processing • Machine Learning • … Voting photography best picture 2010 Tagging Statistical solution Requires critical mass of “good” users Deterministic solution Requires machine-readable metadata Photo  hasColor.BW  photoOf.(Woman  isNude)  isOutdoors.Daylight Description Logics
  • 6. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 6 3. Natural language processing 3.1 Semantics and Artificial Intelligence Deterministic solution Requires machine-readable metadata Photo  hasColor.BW  photoOf.(Woman  isNude)  isOutdoors.Daylight Description Logics ? NLP is a subfield of linguistics and artificial intelligence. It studies the problems inherent to the processing and manipulation of natural language. The ultimate goal of NLP is to make computers “understand” statements written in human languages. The definition of “understanding” is one of the major problems in NLP. Natural Language Processing (NLP) Examples of applications: • Optical character recognition (OCR) • Machine translation • Speech recognition https://translate.google.com/
  • 7. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 7 3. Natural language processing 3.1 Semantics and Artificial Intelligence https://www.deepl.com/
  • 8. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 8 3. Natural language processing 3.1 Semantics and Artificial Intelligence wrong interpretation false positive too much of information
  • 9. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 9 3.2 Computational linguistics Tagging and Parsing natural language noun phrase (NP), prepositional phrase (PP), adjective phrase (ADJP), verb phrase (VP) Phrase types adjective (JJ), conjunction (CC), preposition (IN), determiner (DT), noun (NN), verb (VBZ) Phrase tags: 3. Natural language processing
  • 10. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 10 Tagging and Parsing natural language 3. Natural language processing The TreeTagger is a tool for annotating text with part-of-speech and lemma information. It was developed by Helmut Schmid in the TC project at the Institute for Computational Linguistics of the University of Stuttgart. The TreeTagger has been successfully used to tag German, English, French, Italian, Danish, Dutch, Spanish, Bulgarian, Russian, Portuguese, Galician, Greek, Chinese, Swahili, Slovak, Slovenian, Latin, Estonian, Polish, Romanian, Czech, Coptic and old French texts and is adaptable to other languages if a lexicon and a manually tagged training corpus are available. http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/ 3.2 Computational linguistics
  • 11. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 11 a date 3. Natural language processing Ambiguities in parsing a natural language ? Date ? 3.2 Computational linguistics
  • 12. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 12 3. Natural language processing WordNet – lexical database http://wordnet.princeton.edu/ 117.000 semantically equivalent words (synsets) are interlinked by means of conceptual-semantic and lexical relations hypernym: a word with a more general meaning E.g., animal is a hypernym of cat hyponym: a word with a more specific meaning E.g., cat is a hyponym of animal synonym: a word with identical meaning E.g., car and automobile are synonyms homonym: words with identical spelling but different meaning E.g., Ada is a programming language but also a person 3.2 Computational linguistics
  • 13. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: Lecture "WWW basics" by Prof. Christoph Meinel at the Hasso-Plattner-Institute (HPI) in Potsdam #n How TCP/IP works #1 Intro #2 Protocols in general #3 Error-handling as task of a protocol #4 Error- handling 13 3. Natural language processing Semantic annotation of lecture videos Natural language explanation Protocol hasTask.ErrorHandling Description logics <owl:Class rdf:about="#LO3"> <owl:intersectionOf rdf:parseType="Collection"> <owl:Class rdf:about="#Protocol" /> <owl:restriction> <owl:onProperty rdf:resource="#hasTask" /> <owl:someValuesFrom rdf:resource="#ErrorHandling" /> </owl:restriction> </owl:intersectionOf> </owl:Class> Machine readable XML 3.2 Computational linguistics
  • 14. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 14 How old are my 3 children? 1. The multiplication of their ages is 36 2. The addition of their ages is the number on the door 3. The oldest is blonde 1 x 1 x 36 = 36 1 x 2 x 18 = 36 1 x 3 x 12 = 36 1 x 4 x 9 = 36 1 x 6 x 6 = 36 2 x 2 x 9 = 36 2 x 3 x 6 = 36 3 x 3 x 4 = 36 1 + 1 + 36 = 38 1 + 2 + 18 = 21 1 + 3 + 12 = 16 1 + 4 + 9 = 14 1 + 6 + 6 = 13 2 + 2 + 9 = 13 2 + 3 + 6 = 11 3 + 3 + 4 = 10 use the same possible ages you will ask a 3d question because you have 2 x the same number there is just one who the oldest
  • 15. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 15 3. Natural language processing 3.3 Syntax, Semantics and Pragmatics
  • 16. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 16 3. Natural language processing 3.3 Syntax, Semantics and Pragmatics “...a goal of the Web was that, if the interaction between person and hypertext could be so intuitive that the machine-readable information space gave an accurate representation of the state of people's thoughts, interactions, and work patterns, then machine analysis could become a very powerful management tool, seeing patterns in our work and facilitating our working together through the typical problems which beset the management of large organizations.” A vision of a smarter web by Tim Berners-Lee Scientific American, May, 2001 Classical Web is build upon HTML (HyperText Markup Language) Can HTML be used to build the Semantic Web?
  • 17. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 17 3. Natural language processing 3.3 Syntax, Semantics and Pragmatics Semantics refers to aspects of meaning, as expressed in language or other systems of signs Syntax is the study of the structure of sign systems, focusing on the form, not the meaning Pragmatics is the study of the practical use of signs by agents or communities of interpretation within particular circumstances and contexts I movies the go to with my wife The movies went to me Is the window open? [The asking person may feel cold] I go to the movies with my wife I went to the movies "Classical Web" Semantic Web applications
  • 18. Media IT :: Dr Serge Linckels :: http://www.linckels.lu/ :: serge@linckels.lu :: 3. Natural language processing 3.4 Exercise Practical exercise Install Tree Tagger and test it in terminal / command line 1. How it works • You work in teams of 2 students. • Submit the running program and the source code by November 30, 2018 via Moodle! • This work is considered 10% of your final grade. Develop a program, e.g., in Java, that asks the user a question and returns an exciting statement 2. USER: I have a beer after the class SYSTEM: Cool, you have a beer after the class I have a beer after the class ./bin/tree-tagger -token ./lib/english-utf8.par input.txt output.txt I PP have VHP a DT beer NN after IN the DT class NN USER: My mom is great SYSTEM: Cool, your mom is great