SlideShare a Scribd company logo
1 of 164
Download to read offline
Never-Ending Learning Language
Estevam R. Hruschka Jr.
Federal University of São Carlos
Never-Ending Learning Language
Joint work with Carnegie Mellon Read The Web Project Group
(http://rtw.ml.cmu.edu/rtw/)
and MaLL (Machine Learning Lab) from Federal University of São Carlos
(http://www.dc.ufscar.br/MaLL/MaLL.html)
Never-Ending Learning Language
Picture taken from [Fern, 2008]
Never-Ending Learning Language
Years of Relevant AI/ML Research
• Architectures for problem solving/learning – SOAR [Newell, Laird,
Rosenbloom 1986] – ICARUS [Langley], PRODIGY [Carbonell], ...
• Large scale knowledge construction/extraction – Cyc [Lenat],
KnowItAll, TextRunner [Etzioni et al 2004], WOE [Weld et al. 2009]
• Life long learning – Learning to learn [Thrun & Pratt, 1998], EBNN [Thrun &
Mitchell 1993]
• Transfer learning – Multitask learning [Caruana 1995]
– Transfer reinforcement learning [Parr & Russell 1998] – Learning with structured
outputs [Taskar, 2009; Roth 2009]
• Active Learning – survey [Settles 2010]; Multi-task active learning [Harpale &
Yang, 2010]
• Curriculum learning – [Bengio, et al., 2009; Krueger & Dayan, 2009; Ni & Ling,
2010]
Never-Ending Learning
Main Task: acquire a growing competence without
asymptote
•  over years
•  multiple functions
•  where learning one thing improves ability to learn the next
•  acquiring data from humans, environment
•  Many candidate domains:
•  Robots
•  Softbots
•  Game players
Never-Ending Learning
Picture taken from [DARPA, 2012]
Picture taken from [DARPA, 2012]
NELL: Never-Ending Language
Learner
Inputs:
  initial ontology
  handful of examples of each predicate in ontology
  the web
  occasional interaction with human trainers
The task:
  run 24x7, forever
• each day:
1. extract more facts from the web to populate the initial ontology
2. learn to read (perform #1) better than yesterday
NELL: Never-Ending Language
Learner
Goal:
• run 24x7, forever
• each day:
1. extract more facts from the web to populate given ontology
2. learn to read better than yesterday
Today...
Running 24 x 7, since January, 2010
Input:
• ontology defining ~800 categories and relations
• 10-20 seed examples of each
• 1 billion web pages (ClueWeb – Jamie Callan)
Result:
• continuously growing KB with +1,800,000 extracted beliefs
http://rtw.ml.cmu.edu
ECML/PKDD2012 Bristol, UK September, 26th, 2012
NELL: Never-Ending Language
Learner
http://rtw.ml.cmu.edu
The Problem with Semi-Supervised
Bootstrap Learning
Paris
Pittsburgh
Seattle
Cupertino
The Problem with Semi-Supervised
Bootstrap Learning
Paris
Pittsburgh
Seattle
Cupertino
mayor of arg1
live in arg1
The Problem with Semi-Supervised
Bootstrap Learning
Paris
Pittsburgh
Seattle
Cupertino
mayor of arg1
live in arg1
San Francisco
Austin
denial
The Problem with Semi-Supervised
Bootstrap Learning
Paris
Pittsburgh
Seattle
Cupertino
mayor of arg1
live in arg1
San Francisco
Austin
denial
The Problem with Semi-Supervised
Bootstrap Learning
Paris
Pittsburgh
Seattle
Cupertino
mayor of arg1
live in arg1
San Francisco
Austin
denial
arg1 is home of
traits such as arg1
The Problem with Semi-Supervised
Bootstrap Learning
Paris
Pittsburgh
Seattle
Cupertino
mayor of arg1
live in arg1
…
San Francisco
Austin
denial
arg1 is home of
traits such as arg1
it’s underconstrained!!
Key Idea 1: Coupled semi-supervised training of
many functions
Coupled Training Type 1: Co-training, Multiview,
Co-regularization
Coupled Training Type 1: Co-training, Multiview,
Co-regularization
Coupled Training Type 1: Co-training, Multiview,
Co-regularization
Type 1 Coupling Constraints in NELL
Coupled Training Type 2:
Structured Outputs, Multitask, Posterior
Regularization, Multilabel
Learn functions with the same input, different outputs, where
we know some constraint
Coupled Training Type 2:
Structured Outputs, Multitask, Posterior
Regularization, Multilabel
Learn functions with the same input, different outputs, where
we know some constraint
Coupled Training Type 2:
Structured Outputs, Multitask, Posterior
Regularization, Multilabel
Learn functions with the same input, different outputs, where
we know some constraint
Type 2 Coupling Constraints in NELL
Multi-view, Multi-Task Coupling
Learning Relations between NP’s
Learning Relations between NP’s
Type 3 Coupling: Argument Types
Pure EM Approach to Coupled
Training
E: jointly estimate latent labels
for each function of each
unlabeled example
M: retrain all functions, based on
these probabilistic labels
Scaling problem:
• E step: 20M NP’s, 1014 NP pairs to label
• M step: 50M text contexts to consider for each function  1010
parameters to retrain
• even more URL-HTML contexts..
NELL’s Approximation to EM
E’ step:
• Consider only a growing subset of the latent variable
assignments
– category variables: up to 250 NP’s per category per iteration
– relation variables: add only if confident and args of correct type
– this set of explicit latent assignments *IS* the knowledge base
M’ step:
• Each view-based learner retrains itself from the updated KB
• “context” methods create growing subsets of contexts
NELL Architecture
If coupled learning is the key idea, how can
we get new coupling constraints?
Key Idea 2: Discover New Coupling
Constraints
• first order, probabilistic horn clause
constraints
0.93 athletePlaysSport(?x,?y) :- #athletePlaysForTeam(?x,?z),
teamPlaysSport(?
z,?y)
– connects previously uncoupled relation
predicates
– infers new beliefs for KB
Discover New Coupling Constraints
For each relation: seek probabilistic first order Horn
Clauses
• Positive examples: extracted beliefs in the KB
• Negative examples: ???
Ontology to the rescue:
numberOfValues(teamPlaysSport) = 1
numberOfValues(competesWith) = any
Example Learned Horn Clauses
0.95 athletePlaysSport(?x,basketball) :- athleteInLeague(?x,NBA)
0.93 athletePlaysSport(?x,?y) :- athletePlaysForTeam(?x,?z)
teamPlaysSport(?z,?y)
0.91 teamPlaysInLeague(?x,NHL) :- teamWonTrophy(?x,Stanley_Cup)
0.90 athleteInLeague(?x,?y):- athletePlaysForTeam(?x,?z),
teamPlaysInLeague(?z,?y)
0.88 cityInState(?x,?y) :- #cityCapitalOfState(?x,?y),
cityInCountry(?y,USA)
0.62* newspaperInCity(?x,New_York) :- companyEconomicSector(?x,media),
generalizations(?x,blog)
Learned Probabilistic Horn Clause Rules
Learned Probabilistic Horn Clause Rules
Key Idea 3: Automatically Extending
the Ontology
Ontology Extension in 2012
Categories Relations
KB ? Prophet
Corpus ? OntExt
EMNLP0211 paper
Reading
Hearst Patterns Relation
?
Ontology Extension in 2013
Categories Relations
KB
Every Relation
Generates
Subcategories
Prophet
Needs new
Implementation for
larger Graphs
Corpus ?
New OntExt based on
Verbs
Every Verb is a
Relation
Reading
Hearst Patterns Relation
?
Ontology Extension in 2014
Categories Relations
KB
Every Relation
Generates
Subcategories
To be done
Prophet
Needs new
Implementation for
larger Graphs
Corpus ?
New OntExt based on
Verbs
Every Verb is a
Relation
Reading Hearst Patterns Relation ?
OntExt (Ontology Extension)
Everything
PersonCompany CitySport
WorksFor PlayedIn
OntExt (Ontology Extension)
Everything
PersonCompany CitySport
WorksFor PlayedInPlays
OntExt (Ontology Extension)
Everything
PersonCompany CitySport
WorksFor PlayedIn
LocatedIn
Plays
Prophet
Mining the Graph representing NELL’s KB to:
1.  Extend the KB by predicting new relations
(edges)that might exist between pairs of
nodes;
2.  Induce inference rules;
3.  Identify misplaced edges which can be used by
NELL as hints to identify wrong connections
between nodes (wrong fats);
Prophet
Find open triangles in the Graph
Prophet
open triangles
Basketball NBA
Milwaukee Bucks
Sport Sport’s League
Prophet
open triangles
Basketball NBA
Milwaukee Bucks
Sport Sport’s League
Prophet
open triangles
Hokey NHL
Pittsburgh Penguins
Prophet
open triangles
Hokey NHL
Pittsburgh Penguins
Sport Sport’s League
Prophet
open triangles
Football NFL
Dallas Cowboys
Prophet
open triangles
Football NFL
Dallas Cowboys
Sport Sport’s League
Prophet
open triangles
sport sportsLeague
sportsTeam
Prophet
open triangles
sport sportsLeague
sportsTeam
Prophet
u w
v
•  Compute the number of
common neighbors
Prophet
u w
v
Category u Category w
Σ
•  Compute the number of
common neighbors
•  Compute the cumulative
number of instances for the
categories nodes Σ
Prophet
u w
v
Category u Category w
Σ
•  Compute the number of
common neighbors
•  Compute the cumulative
number of instances for the
categories nodes Σ
•  NΛc(uc,wc ) is the number of
open triangles for categories
u and w.
Prophet
sport sportsLeague
sportsTeam
Prophet
If > ξ then create the new relation
ξ = 10 (empirically)
sport sportsLeague
sportsTeam
Prophet
If > ξ then create the new relation
ξ = 10 (empirically)
Name the new relation based on ReVerb
sport sportsLeague
sportsTeam
isPlayedIn
Ontology Extension in 2014
Categories Relations
KB
Every Relation
Generates
Subcategories
To be done
Prophet
Needs new
Implementation for
larger Graphs
Corpus ?
New OntExt based on
Verbs
Every Verb is a
Relation
Reading Hearst Patterns Relation ?
Ontology Extension in 2014
Categories Relations
KB
Every Relation
Generates
Subcategories
To be done
Prothet needs new
Implementation for
larger Graphs
GraphDB
Corpus ?
New OntExt based on
Verbs
Every Verb is a
Relation
Reading Hearst Patterns Relation ?
Ontology Extension in 2014
Navarro et al., 2013
Ontology Extension in 2014
Categories Relations
KB
Every Relation
Generates
Subcategories
To be done
Prothet needs new
Implementation for
larger Graphs
GraphDB
Corpus ?
New OntExt
based on Verbs
Every Verb is a
Relation
Reading Hearst Patterns Relation ?
How to Extract New Relations?
Proposed Approach - OntExt
Traditional IE + Open IE
Cluster context patterns which are semantically similar although they
may be lexically dissimilar
Scalability: Context-pattern X Context-pattern matrix
Classifier learns to distinguish valid relations from semantically invalid
relations
OntExt
Input:
Preprocessed 2 billion sentences from ClueWeb09 data [Callan and Hoy, 2009].
Category instances (e.g. city(Ottawa), city(Berlin), country(Canada), etc.) are
used to find context patterns
Context x Context Matrix
OntExt
Key Idea 4: Cumulative, Staged Learning
Learning X improves ability to learn Y
1. Classify noun phrases (NPs) by category
2. Classify NP pairs by relation
3. Discover rules patterns to predict new relation instances
4. Learn which NPs (co)refer to which latent concepts
5. Discover new relations to extend the initial ontology
6. Learn to assign temporal scope to beliefs
7. Learn to microread single sentences
8. Vision: co-train text and visual object recognition
9. Goal-driven reading: predict, then read to corroborate/
correct
10. Make NELL a conversational agent
11. Add a robot body to NELL
NELL: Never-Ending Language Learner
NELL is grown enough for new
steps
NELL turned 4 on Jan 12!
Congratulations NELL!!
NELL: Never-Ending Language Learner
NELL is grown enough for new
steps
NELL: Never-Ending Language Learner
NELL is grown enough for new
steps
Knowledge on Demand
NELL: Never-Ending Language Learner
NELL is grown enough for new
steps
Knowledge on Demand
NELL: Never-Ending Language Learner
NELL is grown enough for new
steps
Knowledge on Demand – Ask NELL
NELL Architecture
Knowledge Base
(latent variables)
Text
Context
patterns
(CPL)
Orthographic
classifier
(CML)
Beliefs
Candidate
Beliefs
Knowledge
Integrator
Human
advice
Actively
search for
web text
(OpenEval)
Infer new
beliefs from
old
(PRA)
Image
classifier
(NEIL)
Ontology
extender
(OntExt)
URL specific
HTML
patterns
(SEAL)
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
NELL: Never-Ending Language Learner
Knowledge Base Validation in
NELL
Human Supervision: RTW group members;
NELL: Never-Ending Language Learner
Knowledge Base Validation in
NELL
Human Supervision: RTW group members;
NELL: Never-Ending Language Learner
Knowledge Base Validation in
NELL
Human Supervision: RTW group members;
NELL: Never-Ending Language Learner
Knowledge Base Validation in
NELL
Human Supervision: RTW group members;
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
Conversing Learning
Conversing Learning
Basic Steps:
Decide which task is going to be asked
Determine who are the oracles the ML system is
going to consult
Propose a method of conversation with oracles, often
humans
Determine how to feedback the ML system with the
community inputs
Conversing Learning
Basic Steps:
Decide which task is going to be asked
Determine who are the oracles the ML system is
going to consult
Propose a method of conversation with oracles, often
humans
Determine how to feedback the ML system with the
community inputs
Conversing Learning
Decide which task is going to be asked
Learned facts
Learned Inference Rules
Metadata (mainly for automatically extending the
ontology)
Conversing Learning
Basic Steps:
Decide which task is going to be asked
Determine who are the oracles the ML system is
going to consult
Propose a method of conversation with oracles, often
humans
Determine how to feedback the ML system with the
community inputs
Conversing Learning
who are the oracles the ML system is going to
consult
Yahoo! Answers
very popular on the Web
a lot of metadata to harvest
Twitter
millions of users worldwide
a system that was not designed to work as a
QA environment
Both web communities have API to connect to
their database
Conversing Learning
Conversing Learning
Basic Steps:
Decide which task is going to be asked
Determine who are the oracles the ML system is
going to consult
Propose a method of conversation with oracles,
often humans
Determine how to feedback the ML system with the
community inputs
Conversing Learning
Propose a method of conversation with
oracles, often humans
Macro Question-Answering
For each posted question:
Ask for yes/no simple answers
Try to understand every answer
Discard answers too difficult to understand
Conclude based only on fully understood answers
Conversing Learning
Basic Steps:
Decide which task is going to be asked
Determine who are the oracles the ML system is
going to consult
Propose a method of conversation with oracles, often
humans
Determine how to feedback the ML system with
the community inputs
Conversing Learning
how to feedback the ML system with the
community inputs?
Suggested actions to NELL:
Synonym/co-reference resolution
Automatically update the Knowledge Base
Conversing Learning
Some Initial Results with First Order
Rules:
Take top 10% of rules from Rule Learner
60 rules were converted into questions
and asked with both the regular and the
Yes/No question approach
The 120 questions received a total of 350
answers.
Conversing Learning
Some Initial Results with First Order Rules:
Rule extracted from NELL in PROLOG
format
stateLocatedInCountry(x,y):-
statehascapital(x,z), citylocatedincoutry(z,y)
converted into question:
Is this statement always true? If state X has
capital Z and city Z is located in country Y
then state X is located in country Y.
Conversing Learning
Question: (Yes or No?) If athlete Z is member of team X
and athlete Z plays in league Y, then team X plays in
league Y.
Twitter answers sample:
No. (Z in X) ∧ (Z in Y) → (X in Y)
Yahoo! Answers sample:
NO, Not in EVERY case. Athlete Z could be a
member of football team X and he could also play in his
pub’s Friday nights dart team. The Dart team could play
in league Y (and Z therefore by definition plays in league
Y). This does not mean that the football team plays in the
darts league!
Conversing Learning
Conversing Learning
Conversing Learning
Some Initial Results with Facts
Validation:
Conversing Learning
Some Initial Results with Facts
Validation:
Conversing Learning
Some Initial Results with Facts
Validation:
Conversing Learning
Some Initial Results with Facts
Validation:
Some Initial Results with Metadata:
Question: Could you please give me some examples of
clothing?
Answer 01: Snowshoes, rain ponchos, galoshes,
sunhats, visors, scarves, mittens, and wellies are all
examples of weather specific clothing!
Answer 02: pants
Answer 03: Training shoes can be worn by anyone for
any purpose, but the term means to train in sports
Conversing Learning
Some Initial Results with Metadata:
Users replied with 552 seeds for 129 categories
Total of 5900 promotions with seeds created by
NELL’s developers
Total of 5300 promotions with seeds extracted from
answers of Twitter users (similar precision)
Conversing Learning
Some Initial Results with Metadata:
For Relation Discovery Components
Symmetry: Is it always true that if a person P1 is neighbor
of a person P2, then P2 is neighbor of P1?
Anti-symmetry: Is it always true that if a person P1 is the
coach of a person P2, then P2 is not coach of P1?
Conversing Learning
Some Initial Results with Metadata:
Feature Weighting/Selection for CMC
Logistic Regression features are based on noun phrase
morphology
(true or false) hotel names tend to be compound noun
phrases having “hotel” as last the word.
(true or false) a word having “burgh” as sufix (ex.
Pittsburgh) tend to be a city name.
Conversing Learning
On going and future work
Asking to the right community and to the right person
Asking the right thing to maximize the results with
minimum questions (multi-view Active Learning)
Better Question-Answering methods
Asking in different languages and explore time zones.
Conversing Learning
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
OpenEval: Web Information
Query Evaluation
Mehdi Samadi, Manuela Veloso and Manuel Blum
Computer Science Department
Carnegie Mellon University, Pittsburgh, PA
AAAI 2013, July 16,
Bellevue, WA, USA
I can wait
more…
Shrimp
is healthy
0.72
12
0
Information
Validation
healthyFood
(shrimp)
healthyFood
(shrimp)
healthyFood
(apple)
0.88
•  Querying by human or agent
•  Information validation
•  Open Web
•  Online/Anytime
•  Scalable
•  Few seed examples
for training
•  Small
ontology
Motivation
Learning
healthyFood unHealthyFood . . .
12
1
Food
Apple
Kale
Black Beans
Salmon
Walnut
Banana
…
Animal
Learning
healthyFood unHealthyFood . . .
12
2
Food
1- Given an input predicate instance and a keyword,
OpenEval first formulates a search query.
A predicate
instance
healthyFood(Appl
e)
Convert to a
query:
{“apple”}.
Animal
Learning
healthyFood unHealthyFood . . .Food
2- OpenEval queries the open Web and processes the
retrieved unstructured Web pages.
A predicate
instance
healthyFood(Appl
e)
Convert to a
query:
{“apple”}.
.
.
.
Animal
Extracting CBIs
healthyFood unHealthyFood . . .
12
4
Food
3- OpenEval extracts a set of Context-Based Instances
(CBI).
A predicate
instance
healthyFood(Shri
mp)
Convert to a
query:
{“shrimp”}.
.
.
.
X pomaceous fruit apple tree,
species Malus domestica rose
family
widely known members genus
Malus used humans. X grow
small, deciduous trees. tree
originated Central Asia, wild
ancestora
.
.
.
Animal
Learning
healthyFood unHealthyFood . . .
OpenEval extracts CBIs for each predicate.
. . .
. . .+ + + + . . .+ + + +
healthyFood unHealthyFood
. . .+ + - -
healthyFood
-+
CBI
12
5
Food Animal
Learning
healthyFood unHealthyFood . . .
OpenEval extracts CBIs for each predicate.
. . .
. . .+ + + + . . .+ + + +
healthyFood unHealthyFood
healthyFood
-+
CBI
Food
. . .+ + -- . . .
OpenEval trains a SVM for each predicate using training CBIs.
Animal
What does OpenEval learn?
healthyFood(apple) healthyFood(apple) “vitamin”
Learn how to map instances to an appropriate predicate
(i.e., sense) that they belong to. 12
7
Learning
. . .. . .+ + --
healthyFood
. . . . . .+ + --
unHealthyFood
. . .
Learning
. . .
Choose predicate with maximum entropy.
. . .+ + + + . . .+ + + +
healthyFood unHealthyFood
. . .+ + --
healthyFood
-+ -
. . .
. . .+ + --
healthyFood
. . . . . .+ + --
unHealthyFood
. . .
Choose a keyword for the selected predicate.
Extract CBIs for the predicate using the selected keyword.
+ + . .
Re-train a SVM for the predicate.
Predicate Instance Evaluator
keywords:
healthyFood(shrimp)?
Given the input time, which CBIs should be
extracted?
13
0
Vitamin 0.88
Calories 0.83
Grow 0.69
Tree 0.66
Amount 0.59
Minerals 0.49
.
.
.
Precision/Recall Results
OpenEval uses 100 iterations for training and 5 iterations for evaluation. 13
1
NELL: Never-Ending Language Learner
OpenEval in the last iteration:
academicfield
0.8976357986206526Environmental
Anthropology.
Several excellent textbooks and
readers in environmental
anthropology have now appeared,
establishing a basic survey of the
field.
NELL: Never-Ending Language Learner
OpenEval in the last iteration:
academicfield0.912473775634353
Anesthesiology.
The Department of Anesthesiology is
committed to excellence in clinical
service, education, research and
faculty development.
NELL: Never-Ending Language Learner
OpenEval in the last iteration:
worksfor 0.9845774661303888
(charles osgood, cbs).
Charles Osgood, often referred to as
CBS News' poet-in-residence, has
been anchor of quot;CBS News
Sunday Morningquot; since 1994.
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
NELL: Never-Ending Language Learner
Knowledge Base Validation in NELL
Human Supervision: RTW group members;
Conversing Learning: NELL can autonomously talk
to people in web communities and ask for help
Web Querying: NELL can query the Web on specific
facts to verify correctness, or to predict the validity
of a new fact;
Hiring Labelers: NELL can autonomously hire
people (using web services such as Mechanical
Turk) to label data and help the system to validate
acquired knowledge.
NELL: Never-Ending Language Learner
Hiring Labelers:
Currently NELL can autonomously hire people
(using Amazon’s Mechanical Turk)
Default number of instances is (uniformly
distributed) sampled from each Category and
each Relation
Can be used to precision estimate
NELL: Never-Ending Language Learner
Hiring Labelers:
Task is to validate Category and Relation
instances
Category instances: Is Bloomberg a company? Is New
York City a city?
Relation instances: Is Bloomberg headquartered in
New York City? Does Tom Mitchell work for
Carnegie Mellon?
NELL: Never-Ending Language Learner
Hiring Labelers:
Research Questions:
Sampling Strategies/Adaptive Sampling
Quality of answers/turkers
Coupled Bayesian Sets
•  semi-supervised learning approach to extract category
instances (e.g. country(USA), city(New York) from web
pages;
•  can be used to automatically generate new constraints to
the set expansion task even when no mutually
exclusiveness relationship is previously defined
•  It might bring independent errors when compared to CPL
What’s Next?
•  Keep ranked lists for inference on any NP (or pair of NPs) in
the corpus
NELL Architecture
Knowledge Base
(latent variables)
Text
Context
patterns
(CPL)
Orthographic
classifier
(CML)
Beliefs
Candidate
Beliefs
Knowledge
Integrator
Human
advice
Actively
search for
web text
(OpenEval)
Infer new
beliefs from
old
(PRA)
Image
classifier
(NEIL)
Ontology
extender
(OntExt)
URL specific
HTML
patterns
(SEAL)
How to Read the Web in Many Languages?
Read The Web in Portuguese
2012
Brazilian PhD Theses corpus
•  .pdf files
•  Many files are images :-(
ClueWeb2009
•  WARC Format
•  Expected to have +150M sentences
ClueWeb2012
•  Focus on English web pages
Read The Web in Portuguese
2013
Brazilian Corpus: “Brasileiro”
1 billion words
http://corpusbrasileiro.pucsp.br/cb/Inicial.html
ClueWeb2009
•  WARC Format
•  Expected to have +150M sentences
•  “NP x Context” all-pairs-data was started
Read The Web in Portuguese
2014
POS-Parsed PortugueseClueWeb2009
•  “NP x Context” all-pairs-data
•  “Nppair x Context” all-pairs-data
Translating current ontology to Portuguese
Running NELL in Portuguese
NELL: Never-Ending Language Learner
English Version
NELL: Never-Ending Language Learner
English
NELL
NELL: Never-Ending Language Learner
English
NELL
Portuguese
NELL
NELL: Never-Ending Language Learner
English
NELL
Portuguese
NELL
…
NELL: Never-Ending Language Learner
English
NELL
Portuguese
NELL
…
French
NELL
Multilingual
NELL
NELL: Never-Ending Language Learner
English
NELL
Portuguese
NELL
…
French
NELL
Multilingual
NELL
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
Multilingual Reading The Web
Picture Adapted from http://www.xlike.org
http://rtw.ml.cmu.edu
estevam.hruschka@gmail.com
Thank you very much JSI!
and thanks to Yahoo!, Google, NSF, DARPA, Intel,
Microsoft, Fulbright, CNPq and FAPESP for partial
funding and thanks to Carnegie Mellon University and
thanks to Federal University of São Carlos
References
[Fern, 2008] Xiaoli Z. Fern, CS 434: Machine Learning and Data Mining, School of Electrical Engineering and Computer
Science, Oregon State University, Fall 2008.
[DARPA, 2012] DARPA Machine Reading Program, http://www.darpa.mil/Our_Work/I2O/Programs/
Machine_Reading.aspx.
[Mitchell, 2006] Tom M. Mitchell, The Discipline of Machine Learning, my perspective on this research field, July 2006
(http://www.cs.cmu.edu/~tom/pubs/MachineLearning.pdf).
[Mitchell, 1997] Tom M. Mitchell, Machine Learning. McGraw-Hill, 1997.
[Etzioni et al., 2007] Oren Etzioni, Michele Banko, and Michael J. Cafarella, Machine Reading.The 2007 AAAI Spring
Symposium. Published by The AAAI Press, Menlo Park, California, 2007.
[Clark et al., 2007] Peter Clark, Phil Harrison, John Thompson, Rick Wojcik, Tom Jenkins, David Israel, Reading to Learn:
An Investigation into Language Understanding. The 2007 AAAI Spring Symposium. Published by The AAAI Press,
Menlo Park, California, 2007.
[Norvig, 2007] Peter Norvig, Inference in Text Understanding. The 2007 AAAI Spring Symposium. Published by The AAAI
Press, Menlo Park, California, 2007.
[Wang	
  	
  Cohen,	
  2007]	
  Richard	
  C.	
  Wang	
  and	
  William	
  W.	
  Cohen:	
  Language-­‐Independent	
  Set	
  Expansion	
  of	
  Named	
  EnFFes	
  using	
  the	
  
Web.	
  In	
  Proceedings	
  of	
  IEEE	
  Interna1onal	
  Conference	
  on	
  Data	
  Mining	
  (ICDM	
  2007),	
  Omaha,	
  NE,	
  USA.	
  2007.	
  
[Etzioni,	
  2008]	
  Oren Etzioni. 2008. Machine reading at web scale. In Proceedings of the international conference on Web
search and web data mining (WSDM '08). ACM, New York, NY, USA, 2-2.
[Banko,	
  et	
  al.,	
  2007]	
  Michele Banko, Michael J. Cafarella, Stephen Soderland, Matthew Broadhead, Oren Etzioni: Open
Information Extraction from the Web. IJCAI 2007: 2670-2676
References
[Weikum et al., 2009] G. Weikum, G., Kasneci, M. Ramanath, F. Suchanek. DB  IR methods for
knowledge discovery. Communications of the ACM 52(4), 2009.
[Theobald  Weikum, 2012] Martin Theobald and Gerhard Weikum. From Information to Knowledge: Harvesting Entities
and Relationships from Web Sources. Tutorial at PODS 2012
[Hoffart et al., 2012] Johannes Hoffart, Fabian Suchanek, Klaus Berberich, Gerhard Weikum. YAGO2: A Spatially and
Temporally Enhanced Knowledge Base from Wikipedia. Special issue of the Artificial Intelligence Journal, 2012
[Etzioni et al., 2011] Oren Etzioni, Anthony Fader, Janara Christensen, Stephen Soderland, and Mausam Open
Information Extraction: the Second Generation“. Proceedings of the 22nd International Joint Conference on Artificial
Intelligence (IJCAI 2011).
[Hady et al., 2011] Hady W. Lauw, Ralf Schenkel, Fabian Suchanek, Martin Theobald, and Gerhard Weikum, Semantic
Knowledge Bases from Web Sources at IJCAI 2011, Barcelona, July 2011
[Fader et al., 2011] Anthony Fader, Stephen Soderland, and Oren Etzioni. Identifying Relations for Open Information
Extraction”. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP
2011)
Settles, B.: Closing the loop: Fast, interactive semi-supervised annotation with queries on features and instances. In: Proc.
of the EMNLP’11, Edinburgh, ACL (2011) 1467–1478 5.
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E.R.H., Mitchell, T.M.: Toward an architecture for never-ending
language learning. In: Proceedings of the Twenty-Fourth Conference on Artificial Intelligence (AAAI 2010).
Pedro, S.D.S., Hruschka Jr., E.R.: Collective intelligence as a source for machine learning self-supervision. In: Proc. of the
4th International Workshop on Web Intelligence and Communities. WIC12, NY, USA, ACM (2012) 5:1–5:9
References
[Appel  Hruschka Jr., 2011] Appel, A.P., Hruschka Jr., E.R.: Prophet – a link-predictor to learn new rules on Nell. In:
Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops. pp. 917–924. ICDMW ’11,
IEEE Computer Society, Washington, DC, USA (2011)
[Mohamed et al., 2011] Mohamed, T.P., Hruschka, Jr., E.R., Mitchell, T.M.: Discovering relations between noun
categories. In: Proceedings of the Conference on Empirical Methods in Nat- ural Language Processing. pp. 1447–
1455. EMNLP ’11, Association for Computa- tional Linguistics, Stroudsburg, PA, USA (2011)
[Pedro  Hruschka Jr., 2012] Saulo D.S. Pedro and Estevam R. Hruschka Jr., Conversing Learning: active learning and
active social interaction for human supervision in never-ending learning systems. Xiii Ibero‐american Conference On
Artificial Intelligence, IBERAMIA 2012, 2012.
Krishnamurthy, J., Mitchell, T.M.: Which noun phrases denote which concepts. In: Proceedings of the Forty Ninth Annual
Meeting of the Association for Compu- tational Linguistics (2011)
Lao, N., Mitchell, T., Cohen, W.W.: Random walk inference and learning in a large scale knowledge base. In: Proceedings
of the 2011 Conference on Empirical Methods in Natural Language Processing. pp. 529–539. Associa- tion for
Computational Linguistics, Edinburgh, Scotland, UK. (July 2011), http://www.aclweb.org/anthology/D11-1049
E. R. Hruschka Jr. and M. C. Duarte and M. C. Nicoletti. Coupling as Strategy for Reducing Concept-Drift in Never-ending
Learning Environments. Fundamenta Informaticae, IOS Press, 2012.
Saulo D.S. Pedro, Ana Paula Appel, and Estevam R. Hruschka, Jr. Autonomously reviewing and validating the knowledge
base of a never-ending learning system. In Proceedings of the 22nd international conference on World Wide Web
companion (WWW '13 Companion), 1195-120, 2013.
S. Verma and E. R. Hruschka Jr. Coupled Bayesian Sets Algorithm for Semi-supervised Learning and Information
Extraction. In Proceedings of the European Conference on Machine Learning and Principles and Practice of
Knowledge Discovery in Databases (ECML PKDD), 2012.
Navarro, L. F. and Appel, A. P. and Hruschka Jr., E. R., GraphDB – Storing Large Graphs on Secondary Memory. In New
Trends in Databases and Information. Advances in Intelligent Systems and Computing, Springer, 177-186, 2013.
References
Assuming Facts Are Expressed More Than Once.
J. Betteridge, A. Ritter and T. Mitchell In Proceedings of the 27th International Florida Artificial Intelligence
Research Society Conference (FLAIRS-27), 2014.
Estimating Accuracy from Unlabeled Data.
E. A. Platanios, A. Blum, T. Mitchell. In Uncertainty in Artificial Intelligence (UAI), 2014.
CTPs: Contextual Temporal Profiles for Time Scoping Facts via Entity State Change Detection.
D.T. Wijaya, N. Nakashole and T.M. Mitchell. In Proceedings of the Conference on Empirical Methods in
Natural Language Processing (EMNLP), 2014.
Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases.
M. Gardner, P. Talukdar, J. Krishnamurthy and T.M. Mitchell. In Proceedings of the Conference on Empirical
Methods in Natural Language Processing (EMNLP), 2014.
Scaling Graph-based Semi Supervised Learning to Large Number of Labels Using Count-Min Sketch
P. P. Talukdar, and W. Cohen In 17th International Conference on Artificial Intelligence and Statistics
(AISTATS, 2014.
Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic.
W.Y. Wang, K. Mazaitis and W.W. Cohen. In Proceedings of the Conference on Information and Knowledge
Management (CIKM), 2013.
Improving Learning and Inference in a Large Knowledge-base using Latent Syntactic Cues.
Matt Gardner, Partha Pratim Talukdar, Bryan Kisiel, and Tom Mitchell. In Proceedings of the 2013 Conference
on Empirical Methods in Natural Language Processing (EMNLP 2013), 2013.

More Related Content

Viewers also liked

Semi-supervised Learning
Semi-supervised LearningSemi-supervised Learning
Semi-supervised Learning
butest
 
A line bridesmaid dresses uk - dress mebridal.co.uk
A line bridesmaid dresses uk - dress mebridal.co.ukA line bridesmaid dresses uk - dress mebridal.co.uk
A line bridesmaid dresses uk - dress mebridal.co.uk
Gudeer Kitty
 
Leslie.Archambeault.Thesis.December.4.2015.Final
Leslie.Archambeault.Thesis.December.4.2015.FinalLeslie.Archambeault.Thesis.December.4.2015.Final
Leslie.Archambeault.Thesis.December.4.2015.Final
Leslie Archambeault, JD
 
BrownRChanae_HappinessQuest
BrownRChanae_HappinessQuestBrownRChanae_HappinessQuest
BrownRChanae_HappinessQuest
R'Chanae Brown
 
NMC presentation Sababa Partners Inc
NMC presentation Sababa Partners IncNMC presentation Sababa Partners Inc
NMC presentation Sababa Partners Inc
Veronica Mendoza
 

Viewers also liked (19)

Being Sherlock Holmes: Guest Lecture, 9 January 2014
Being Sherlock Holmes: Guest Lecture, 9 January 2014Being Sherlock Holmes: Guest Lecture, 9 January 2014
Being Sherlock Holmes: Guest Lecture, 9 January 2014
 
Semi-supervised Learning
Semi-supervised LearningSemi-supervised Learning
Semi-supervised Learning
 
D010312230
D010312230D010312230
D010312230
 
A line bridesmaid dresses uk - dress mebridal.co.uk
A line bridesmaid dresses uk - dress mebridal.co.ukA line bridesmaid dresses uk - dress mebridal.co.uk
A line bridesmaid dresses uk - dress mebridal.co.uk
 
One shoulder bridesmaid dresses under 200 gudeer.com
One shoulder bridesmaid dresses  under 200 gudeer.comOne shoulder bridesmaid dresses  under 200 gudeer.com
One shoulder bridesmaid dresses under 200 gudeer.com
 
Izin Pembuatan NPIK, Tanda Pengenal Importir.
Izin Pembuatan NPIK, Tanda Pengenal Importir.Izin Pembuatan NPIK, Tanda Pengenal Importir.
Izin Pembuatan NPIK, Tanda Pengenal Importir.
 
Affordable & cheap bridesmaids dresses online page 5 gudeer.com
Affordable & cheap bridesmaids dresses online page 5   gudeer.comAffordable & cheap bridesmaids dresses online page 5   gudeer.com
Affordable & cheap bridesmaids dresses online page 5 gudeer.com
 
R130304103108
R130304103108R130304103108
R130304103108
 
DOMISILI USAHA
DOMISILI USAHADOMISILI USAHA
DOMISILI USAHA
 
Efficiency of Women’s Technical Institutions By Using Bcc Model Through Dea A...
Efficiency of Women’s Technical Institutions By Using Bcc Model Through Dea A...Efficiency of Women’s Technical Institutions By Using Bcc Model Through Dea A...
Efficiency of Women’s Technical Institutions By Using Bcc Model Through Dea A...
 
Leslie.Archambeault.Thesis.December.4.2015.Final
Leslie.Archambeault.Thesis.December.4.2015.FinalLeslie.Archambeault.Thesis.December.4.2015.Final
Leslie.Archambeault.Thesis.December.4.2015.Final
 
PEN initiatives 2014
PEN initiatives 2014PEN initiatives 2014
PEN initiatives 2014
 
A10130110
A10130110A10130110
A10130110
 
Cheap flower girl dresses nz,latest flower girl gowns online cmdress.co.nz
Cheap flower girl dresses nz,latest flower girl gowns online   cmdress.co.nzCheap flower girl dresses nz,latest flower girl gowns online   cmdress.co.nz
Cheap flower girl dresses nz,latest flower girl gowns online cmdress.co.nz
 
Pembuatan Izin Postel
Pembuatan Izin PostelPembuatan Izin Postel
Pembuatan Izin Postel
 
BrownRChanae_HappinessQuest
BrownRChanae_HappinessQuestBrownRChanae_HappinessQuest
BrownRChanae_HappinessQuest
 
NMC presentation Sababa Partners Inc
NMC presentation Sababa Partners IncNMC presentation Sababa Partners Inc
NMC presentation Sababa Partners Inc
 
Strategi yang tepat untuk merevolusi mental petani
Strategi yang tepat untuk merevolusi mental petaniStrategi yang tepat untuk merevolusi mental petani
Strategi yang tepat untuk merevolusi mental petani
 
LordJeshuainheritancemay2016
LordJeshuainheritancemay2016LordJeshuainheritancemay2016
LordJeshuainheritancemay2016
 

Similar to NELL: The Never-Ending Language Learning System

Similar to NELL: The Never-Ending Language Learning System (20)

A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
 
Revealing Entities From Texts With a Hybrid Approach
Revealing Entities From Texts With a Hybrid ApproachRevealing Entities From Texts With a Hybrid Approach
Revealing Entities From Texts With a Hybrid Approach
 
Sergey Nikolenko and Anton Alekseev User Profiling in Text-Based Recommende...
Sergey Nikolenko and  Anton Alekseev  User Profiling in Text-Based Recommende...Sergey Nikolenko and  Anton Alekseev  User Profiling in Text-Based Recommende...
Sergey Nikolenko and Anton Alekseev User Profiling in Text-Based Recommende...
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & Statistics
 
What might a spoken corpus tell us about language
What might a spoken corpus tell us about languageWhat might a spoken corpus tell us about language
What might a spoken corpus tell us about language
 
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsDeep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word Embeddings
 
Four Languages From Forty Years Ago
Four Languages From Forty Years AgoFour Languages From Forty Years Ago
Four Languages From Forty Years Ago
 
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
 
Netizen style commenting on fashion photos
Netizen style commenting on fashion photosNetizen style commenting on fashion photos
Netizen style commenting on fashion photos
 
NLP & DBpedia
 NLP & DBpedia NLP & DBpedia
NLP & DBpedia
 
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & OpportunitiesDeep Learning for Information Retrieval: Models, Progress, & Opportunities
Deep Learning for Information Retrieval: Models, Progress, & Opportunities
 
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
 
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
 
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
Knowledge extraction in Web media: at the frontier of NLP, Machine Learning a...
 
[246]QANet: Towards Efficient and Human-Level Reading Comprehension on SQuAD
[246]QANet: Towards Efficient and Human-Level Reading Comprehension on SQuAD[246]QANet: Towards Efficient and Human-Level Reading Comprehension on SQuAD
[246]QANet: Towards Efficient and Human-Level Reading Comprehension on SQuAD
 
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems[KDD 2018 tutorial] End to-end goal-oriented question answering systems
[KDD 2018 tutorial] End to-end goal-oriented question answering systems
 
Table Retrieval and Generation
Table Retrieval and GenerationTable Retrieval and Generation
Table Retrieval and Generation
 
Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation
Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine TranslationRoee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation
Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation
 
Deep learning with Keras
Deep learning with KerasDeep learning with Keras
Deep learning with Keras
 

Recently uploaded

Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
gajnagarg
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 

Recently uploaded (20)

RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 

NELL: The Never-Ending Language Learning System

  • 1. Never-Ending Learning Language Estevam R. Hruschka Jr. Federal University of São Carlos
  • 2. Never-Ending Learning Language Joint work with Carnegie Mellon Read The Web Project Group (http://rtw.ml.cmu.edu/rtw/) and MaLL (Machine Learning Lab) from Federal University of São Carlos (http://www.dc.ufscar.br/MaLL/MaLL.html)
  • 3.
  • 5. Picture taken from [Fern, 2008]
  • 7. Years of Relevant AI/ML Research • Architectures for problem solving/learning – SOAR [Newell, Laird, Rosenbloom 1986] – ICARUS [Langley], PRODIGY [Carbonell], ... • Large scale knowledge construction/extraction – Cyc [Lenat], KnowItAll, TextRunner [Etzioni et al 2004], WOE [Weld et al. 2009] • Life long learning – Learning to learn [Thrun & Pratt, 1998], EBNN [Thrun & Mitchell 1993] • Transfer learning – Multitask learning [Caruana 1995] – Transfer reinforcement learning [Parr & Russell 1998] – Learning with structured outputs [Taskar, 2009; Roth 2009] • Active Learning – survey [Settles 2010]; Multi-task active learning [Harpale & Yang, 2010] • Curriculum learning – [Bengio, et al., 2009; Krueger & Dayan, 2009; Ni & Ling, 2010]
  • 8. Never-Ending Learning Main Task: acquire a growing competence without asymptote •  over years •  multiple functions •  where learning one thing improves ability to learn the next •  acquiring data from humans, environment •  Many candidate domains: •  Robots •  Softbots •  Game players
  • 10. Picture taken from [DARPA, 2012]
  • 11. NELL: Never-Ending Language Learner Inputs:   initial ontology   handful of examples of each predicate in ontology   the web   occasional interaction with human trainers The task:   run 24x7, forever • each day: 1. extract more facts from the web to populate the initial ontology 2. learn to read (perform #1) better than yesterday
  • 12. NELL: Never-Ending Language Learner Goal: • run 24x7, forever • each day: 1. extract more facts from the web to populate given ontology 2. learn to read better than yesterday Today... Running 24 x 7, since January, 2010 Input: • ontology defining ~800 categories and relations • 10-20 seed examples of each • 1 billion web pages (ClueWeb – Jamie Callan) Result: • continuously growing KB with +1,800,000 extracted beliefs
  • 15.
  • 16. The Problem with Semi-Supervised Bootstrap Learning Paris Pittsburgh Seattle Cupertino
  • 17. The Problem with Semi-Supervised Bootstrap Learning Paris Pittsburgh Seattle Cupertino mayor of arg1 live in arg1
  • 18. The Problem with Semi-Supervised Bootstrap Learning Paris Pittsburgh Seattle Cupertino mayor of arg1 live in arg1 San Francisco Austin denial
  • 19. The Problem with Semi-Supervised Bootstrap Learning Paris Pittsburgh Seattle Cupertino mayor of arg1 live in arg1 San Francisco Austin denial
  • 20. The Problem with Semi-Supervised Bootstrap Learning Paris Pittsburgh Seattle Cupertino mayor of arg1 live in arg1 San Francisco Austin denial arg1 is home of traits such as arg1
  • 21. The Problem with Semi-Supervised Bootstrap Learning Paris Pittsburgh Seattle Cupertino mayor of arg1 live in arg1 … San Francisco Austin denial arg1 is home of traits such as arg1 it’s underconstrained!!
  • 22. Key Idea 1: Coupled semi-supervised training of many functions
  • 23. Coupled Training Type 1: Co-training, Multiview, Co-regularization
  • 24. Coupled Training Type 1: Co-training, Multiview, Co-regularization
  • 25. Coupled Training Type 1: Co-training, Multiview, Co-regularization
  • 26. Type 1 Coupling Constraints in NELL
  • 27. Coupled Training Type 2: Structured Outputs, Multitask, Posterior Regularization, Multilabel Learn functions with the same input, different outputs, where we know some constraint
  • 28. Coupled Training Type 2: Structured Outputs, Multitask, Posterior Regularization, Multilabel Learn functions with the same input, different outputs, where we know some constraint
  • 29. Coupled Training Type 2: Structured Outputs, Multitask, Posterior Regularization, Multilabel Learn functions with the same input, different outputs, where we know some constraint
  • 30. Type 2 Coupling Constraints in NELL
  • 34. Type 3 Coupling: Argument Types
  • 35. Pure EM Approach to Coupled Training E: jointly estimate latent labels for each function of each unlabeled example M: retrain all functions, based on these probabilistic labels Scaling problem: • E step: 20M NP’s, 1014 NP pairs to label • M step: 50M text contexts to consider for each function  1010 parameters to retrain • even more URL-HTML contexts..
  • 36. NELL’s Approximation to EM E’ step: • Consider only a growing subset of the latent variable assignments – category variables: up to 250 NP’s per category per iteration – relation variables: add only if confident and args of correct type – this set of explicit latent assignments *IS* the knowledge base M’ step: • Each view-based learner retrains itself from the updated KB • “context” methods create growing subsets of contexts
  • 38.
  • 39. If coupled learning is the key idea, how can we get new coupling constraints?
  • 40. Key Idea 2: Discover New Coupling Constraints • first order, probabilistic horn clause constraints 0.93 athletePlaysSport(?x,?y) :- #athletePlaysForTeam(?x,?z), teamPlaysSport(? z,?y) – connects previously uncoupled relation predicates – infers new beliefs for KB
  • 41. Discover New Coupling Constraints For each relation: seek probabilistic first order Horn Clauses • Positive examples: extracted beliefs in the KB • Negative examples: ??? Ontology to the rescue: numberOfValues(teamPlaysSport) = 1 numberOfValues(competesWith) = any
  • 42. Example Learned Horn Clauses 0.95 athletePlaysSport(?x,basketball) :- athleteInLeague(?x,NBA) 0.93 athletePlaysSport(?x,?y) :- athletePlaysForTeam(?x,?z) teamPlaysSport(?z,?y) 0.91 teamPlaysInLeague(?x,NHL) :- teamWonTrophy(?x,Stanley_Cup) 0.90 athleteInLeague(?x,?y):- athletePlaysForTeam(?x,?z), teamPlaysInLeague(?z,?y) 0.88 cityInState(?x,?y) :- #cityCapitalOfState(?x,?y), cityInCountry(?y,USA) 0.62* newspaperInCity(?x,New_York) :- companyEconomicSector(?x,media), generalizations(?x,blog)
  • 45.
  • 46. Key Idea 3: Automatically Extending the Ontology
  • 47. Ontology Extension in 2012 Categories Relations KB ? Prophet Corpus ? OntExt EMNLP0211 paper Reading Hearst Patterns Relation ?
  • 48. Ontology Extension in 2013 Categories Relations KB Every Relation Generates Subcategories Prophet Needs new Implementation for larger Graphs Corpus ? New OntExt based on Verbs Every Verb is a Relation Reading Hearst Patterns Relation ?
  • 49. Ontology Extension in 2014 Categories Relations KB Every Relation Generates Subcategories To be done Prophet Needs new Implementation for larger Graphs Corpus ? New OntExt based on Verbs Every Verb is a Relation Reading Hearst Patterns Relation ?
  • 51. OntExt (Ontology Extension) Everything PersonCompany CitySport WorksFor PlayedInPlays
  • 52. OntExt (Ontology Extension) Everything PersonCompany CitySport WorksFor PlayedIn LocatedIn Plays
  • 53. Prophet Mining the Graph representing NELL’s KB to: 1.  Extend the KB by predicting new relations (edges)that might exist between pairs of nodes; 2.  Induce inference rules; 3.  Identify misplaced edges which can be used by NELL as hints to identify wrong connections between nodes (wrong fats);
  • 55. Prophet open triangles Basketball NBA Milwaukee Bucks Sport Sport’s League
  • 56. Prophet open triangles Basketball NBA Milwaukee Bucks Sport Sport’s League
  • 58. Prophet open triangles Hokey NHL Pittsburgh Penguins Sport Sport’s League
  • 60. Prophet open triangles Football NFL Dallas Cowboys Sport Sport’s League
  • 63. Prophet u w v •  Compute the number of common neighbors
  • 64. Prophet u w v Category u Category w Σ •  Compute the number of common neighbors •  Compute the cumulative number of instances for the categories nodes Σ
  • 65. Prophet u w v Category u Category w Σ •  Compute the number of common neighbors •  Compute the cumulative number of instances for the categories nodes Σ •  NΛc(uc,wc ) is the number of open triangles for categories u and w.
  • 67. Prophet If > ξ then create the new relation ξ = 10 (empirically) sport sportsLeague sportsTeam
  • 68. Prophet If > ξ then create the new relation ξ = 10 (empirically) Name the new relation based on ReVerb sport sportsLeague sportsTeam isPlayedIn
  • 69. Ontology Extension in 2014 Categories Relations KB Every Relation Generates Subcategories To be done Prophet Needs new Implementation for larger Graphs Corpus ? New OntExt based on Verbs Every Verb is a Relation Reading Hearst Patterns Relation ?
  • 70. Ontology Extension in 2014 Categories Relations KB Every Relation Generates Subcategories To be done Prothet needs new Implementation for larger Graphs GraphDB Corpus ? New OntExt based on Verbs Every Verb is a Relation Reading Hearst Patterns Relation ?
  • 71. Ontology Extension in 2014 Navarro et al., 2013
  • 72. Ontology Extension in 2014 Categories Relations KB Every Relation Generates Subcategories To be done Prothet needs new Implementation for larger Graphs GraphDB Corpus ? New OntExt based on Verbs Every Verb is a Relation Reading Hearst Patterns Relation ?
  • 73. How to Extract New Relations? Proposed Approach - OntExt Traditional IE + Open IE Cluster context patterns which are semantically similar although they may be lexically dissimilar Scalability: Context-pattern X Context-pattern matrix Classifier learns to distinguish valid relations from semantically invalid relations
  • 74. OntExt Input: Preprocessed 2 billion sentences from ClueWeb09 data [Callan and Hoy, 2009]. Category instances (e.g. city(Ottawa), city(Berlin), country(Canada), etc.) are used to find context patterns Context x Context Matrix
  • 76.
  • 77. Key Idea 4: Cumulative, Staged Learning Learning X improves ability to learn Y 1. Classify noun phrases (NPs) by category 2. Classify NP pairs by relation 3. Discover rules patterns to predict new relation instances 4. Learn which NPs (co)refer to which latent concepts 5. Discover new relations to extend the initial ontology 6. Learn to assign temporal scope to beliefs 7. Learn to microread single sentences 8. Vision: co-train text and visual object recognition 9. Goal-driven reading: predict, then read to corroborate/ correct 10. Make NELL a conversational agent 11. Add a robot body to NELL
  • 78. NELL: Never-Ending Language Learner NELL is grown enough for new steps NELL turned 4 on Jan 12! Congratulations NELL!!
  • 79. NELL: Never-Ending Language Learner NELL is grown enough for new steps
  • 80. NELL: Never-Ending Language Learner NELL is grown enough for new steps Knowledge on Demand
  • 81. NELL: Never-Ending Language Learner NELL is grown enough for new steps Knowledge on Demand
  • 82. NELL: Never-Ending Language Learner NELL is grown enough for new steps Knowledge on Demand – Ask NELL
  • 83. NELL Architecture Knowledge Base (latent variables) Text Context patterns (CPL) Orthographic classifier (CML) Beliefs Candidate Beliefs Knowledge Integrator Human advice Actively search for web text (OpenEval) Infer new beliefs from old (PRA) Image classifier (NEIL) Ontology extender (OntExt) URL specific HTML patterns (SEAL)
  • 84. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 85. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 86. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members;
  • 87. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members;
  • 88. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members;
  • 89. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members;
  • 90. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 91. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 93. Conversing Learning Basic Steps: Decide which task is going to be asked Determine who are the oracles the ML system is going to consult Propose a method of conversation with oracles, often humans Determine how to feedback the ML system with the community inputs
  • 94. Conversing Learning Basic Steps: Decide which task is going to be asked Determine who are the oracles the ML system is going to consult Propose a method of conversation with oracles, often humans Determine how to feedback the ML system with the community inputs
  • 95. Conversing Learning Decide which task is going to be asked Learned facts Learned Inference Rules Metadata (mainly for automatically extending the ontology)
  • 96. Conversing Learning Basic Steps: Decide which task is going to be asked Determine who are the oracles the ML system is going to consult Propose a method of conversation with oracles, often humans Determine how to feedback the ML system with the community inputs
  • 97. Conversing Learning who are the oracles the ML system is going to consult Yahoo! Answers very popular on the Web a lot of metadata to harvest Twitter millions of users worldwide a system that was not designed to work as a QA environment Both web communities have API to connect to their database
  • 99. Conversing Learning Basic Steps: Decide which task is going to be asked Determine who are the oracles the ML system is going to consult Propose a method of conversation with oracles, often humans Determine how to feedback the ML system with the community inputs
  • 100. Conversing Learning Propose a method of conversation with oracles, often humans Macro Question-Answering For each posted question: Ask for yes/no simple answers Try to understand every answer Discard answers too difficult to understand Conclude based only on fully understood answers
  • 101. Conversing Learning Basic Steps: Decide which task is going to be asked Determine who are the oracles the ML system is going to consult Propose a method of conversation with oracles, often humans Determine how to feedback the ML system with the community inputs
  • 102. Conversing Learning how to feedback the ML system with the community inputs? Suggested actions to NELL: Synonym/co-reference resolution Automatically update the Knowledge Base
  • 103. Conversing Learning Some Initial Results with First Order Rules: Take top 10% of rules from Rule Learner 60 rules were converted into questions and asked with both the regular and the Yes/No question approach The 120 questions received a total of 350 answers.
  • 104. Conversing Learning Some Initial Results with First Order Rules: Rule extracted from NELL in PROLOG format stateLocatedInCountry(x,y):- statehascapital(x,z), citylocatedincoutry(z,y) converted into question: Is this statement always true? If state X has capital Z and city Z is located in country Y then state X is located in country Y.
  • 105. Conversing Learning Question: (Yes or No?) If athlete Z is member of team X and athlete Z plays in league Y, then team X plays in league Y. Twitter answers sample: No. (Z in X) ∧ (Z in Y) → (X in Y) Yahoo! Answers sample: NO, Not in EVERY case. Athlete Z could be a member of football team X and he could also play in his pub’s Friday nights dart team. The Dart team could play in league Y (and Z therefore by definition plays in league Y). This does not mean that the football team plays in the darts league!
  • 108. Conversing Learning Some Initial Results with Facts Validation:
  • 109. Conversing Learning Some Initial Results with Facts Validation:
  • 110. Conversing Learning Some Initial Results with Facts Validation:
  • 111. Conversing Learning Some Initial Results with Facts Validation:
  • 112. Some Initial Results with Metadata: Question: Could you please give me some examples of clothing? Answer 01: Snowshoes, rain ponchos, galoshes, sunhats, visors, scarves, mittens, and wellies are all examples of weather specific clothing! Answer 02: pants Answer 03: Training shoes can be worn by anyone for any purpose, but the term means to train in sports Conversing Learning
  • 113. Some Initial Results with Metadata: Users replied with 552 seeds for 129 categories Total of 5900 promotions with seeds created by NELL’s developers Total of 5300 promotions with seeds extracted from answers of Twitter users (similar precision) Conversing Learning
  • 114. Some Initial Results with Metadata: For Relation Discovery Components Symmetry: Is it always true that if a person P1 is neighbor of a person P2, then P2 is neighbor of P1? Anti-symmetry: Is it always true that if a person P1 is the coach of a person P2, then P2 is not coach of P1? Conversing Learning
  • 115. Some Initial Results with Metadata: Feature Weighting/Selection for CMC Logistic Regression features are based on noun phrase morphology (true or false) hotel names tend to be compound noun phrases having “hotel” as last the word. (true or false) a word having “burgh” as sufix (ex. Pittsburgh) tend to be a city name. Conversing Learning
  • 116. On going and future work Asking to the right community and to the right person Asking the right thing to maximize the results with minimum questions (multi-view Active Learning) Better Question-Answering methods Asking in different languages and explore time zones. Conversing Learning
  • 117. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 118. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 119. OpenEval: Web Information Query Evaluation Mehdi Samadi, Manuela Veloso and Manuel Blum Computer Science Department Carnegie Mellon University, Pittsburgh, PA AAAI 2013, July 16, Bellevue, WA, USA
  • 120. I can wait more… Shrimp is healthy 0.72 12 0 Information Validation healthyFood (shrimp) healthyFood (shrimp) healthyFood (apple) 0.88 •  Querying by human or agent •  Information validation •  Open Web •  Online/Anytime •  Scalable •  Few seed examples for training •  Small ontology Motivation
  • 121. Learning healthyFood unHealthyFood . . . 12 1 Food Apple Kale Black Beans Salmon Walnut Banana … Animal
  • 122. Learning healthyFood unHealthyFood . . . 12 2 Food 1- Given an input predicate instance and a keyword, OpenEval first formulates a search query. A predicate instance healthyFood(Appl e) Convert to a query: {“apple”}. Animal
  • 123. Learning healthyFood unHealthyFood . . .Food 2- OpenEval queries the open Web and processes the retrieved unstructured Web pages. A predicate instance healthyFood(Appl e) Convert to a query: {“apple”}. . . . Animal
  • 124. Extracting CBIs healthyFood unHealthyFood . . . 12 4 Food 3- OpenEval extracts a set of Context-Based Instances (CBI). A predicate instance healthyFood(Shri mp) Convert to a query: {“shrimp”}. . . . X pomaceous fruit apple tree, species Malus domestica rose family widely known members genus Malus used humans. X grow small, deciduous trees. tree originated Central Asia, wild ancestora . . . Animal
  • 125. Learning healthyFood unHealthyFood . . . OpenEval extracts CBIs for each predicate. . . . . . .+ + + + . . .+ + + + healthyFood unHealthyFood . . .+ + - - healthyFood -+ CBI 12 5 Food Animal
  • 126. Learning healthyFood unHealthyFood . . . OpenEval extracts CBIs for each predicate. . . . . . .+ + + + . . .+ + + + healthyFood unHealthyFood healthyFood -+ CBI Food . . .+ + -- . . . OpenEval trains a SVM for each predicate using training CBIs. Animal
  • 127. What does OpenEval learn? healthyFood(apple) healthyFood(apple) “vitamin” Learn how to map instances to an appropriate predicate (i.e., sense) that they belong to. 12 7
  • 128. Learning . . .. . .+ + -- healthyFood . . . . . .+ + -- unHealthyFood . . .
  • 129. Learning . . . Choose predicate with maximum entropy. . . .+ + + + . . .+ + + + healthyFood unHealthyFood . . .+ + -- healthyFood -+ - . . . . . .+ + -- healthyFood . . . . . .+ + -- unHealthyFood . . . Choose a keyword for the selected predicate. Extract CBIs for the predicate using the selected keyword. + + . . Re-train a SVM for the predicate.
  • 130. Predicate Instance Evaluator keywords: healthyFood(shrimp)? Given the input time, which CBIs should be extracted? 13 0 Vitamin 0.88 Calories 0.83 Grow 0.69 Tree 0.66 Amount 0.59 Minerals 0.49 . . .
  • 131. Precision/Recall Results OpenEval uses 100 iterations for training and 5 iterations for evaluation. 13 1
  • 132. NELL: Never-Ending Language Learner OpenEval in the last iteration: academicfield 0.8976357986206526Environmental Anthropology. Several excellent textbooks and readers in environmental anthropology have now appeared, establishing a basic survey of the field.
  • 133. NELL: Never-Ending Language Learner OpenEval in the last iteration: academicfield0.912473775634353 Anesthesiology. The Department of Anesthesiology is committed to excellence in clinical service, education, research and faculty development.
  • 134. NELL: Never-Ending Language Learner OpenEval in the last iteration: worksfor 0.9845774661303888 (charles osgood, cbs). Charles Osgood, often referred to as CBS News' poet-in-residence, has been anchor of quot;CBS News Sunday Morningquot; since 1994.
  • 135. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 136. NELL: Never-Ending Language Learner Knowledge Base Validation in NELL Human Supervision: RTW group members; Conversing Learning: NELL can autonomously talk to people in web communities and ask for help Web Querying: NELL can query the Web on specific facts to verify correctness, or to predict the validity of a new fact; Hiring Labelers: NELL can autonomously hire people (using web services such as Mechanical Turk) to label data and help the system to validate acquired knowledge.
  • 137. NELL: Never-Ending Language Learner Hiring Labelers: Currently NELL can autonomously hire people (using Amazon’s Mechanical Turk) Default number of instances is (uniformly distributed) sampled from each Category and each Relation Can be used to precision estimate
  • 138. NELL: Never-Ending Language Learner Hiring Labelers: Task is to validate Category and Relation instances Category instances: Is Bloomberg a company? Is New York City a city? Relation instances: Is Bloomberg headquartered in New York City? Does Tom Mitchell work for Carnegie Mellon?
  • 139. NELL: Never-Ending Language Learner Hiring Labelers: Research Questions: Sampling Strategies/Adaptive Sampling Quality of answers/turkers
  • 140. Coupled Bayesian Sets •  semi-supervised learning approach to extract category instances (e.g. country(USA), city(New York) from web pages; •  can be used to automatically generate new constraints to the set expansion task even when no mutually exclusiveness relationship is previously defined •  It might bring independent errors when compared to CPL What’s Next? •  Keep ranked lists for inference on any NP (or pair of NPs) in the corpus NELL Architecture Knowledge Base (latent variables) Text Context patterns (CPL) Orthographic classifier (CML) Beliefs Candidate Beliefs Knowledge Integrator Human advice Actively search for web text (OpenEval) Infer new beliefs from old (PRA) Image classifier (NEIL) Ontology extender (OntExt) URL specific HTML patterns (SEAL)
  • 141. How to Read the Web in Many Languages?
  • 142. Read The Web in Portuguese 2012 Brazilian PhD Theses corpus •  .pdf files •  Many files are images :-( ClueWeb2009 •  WARC Format •  Expected to have +150M sentences ClueWeb2012 •  Focus on English web pages
  • 143. Read The Web in Portuguese 2013 Brazilian Corpus: “Brasileiro” 1 billion words http://corpusbrasileiro.pucsp.br/cb/Inicial.html ClueWeb2009 •  WARC Format •  Expected to have +150M sentences •  “NP x Context” all-pairs-data was started
  • 144. Read The Web in Portuguese 2014 POS-Parsed PortugueseClueWeb2009 •  “NP x Context” all-pairs-data •  “Nppair x Context” all-pairs-data Translating current ontology to Portuguese Running NELL in Portuguese
  • 145. NELL: Never-Ending Language Learner English Version
  • 146. NELL: Never-Ending Language Learner English NELL
  • 147. NELL: Never-Ending Language Learner English NELL Portuguese NELL
  • 148. NELL: Never-Ending Language Learner English NELL Portuguese NELL …
  • 149. NELL: Never-Ending Language Learner English NELL Portuguese NELL … French NELL Multilingual NELL
  • 150. NELL: Never-Ending Language Learner English NELL Portuguese NELL … French NELL Multilingual NELL
  • 151. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 152. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 153. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 154. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 155. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 156. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 157. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 158. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 159. Multilingual Reading The Web Picture Adapted from http://www.xlike.org
  • 160. http://rtw.ml.cmu.edu estevam.hruschka@gmail.com Thank you very much JSI! and thanks to Yahoo!, Google, NSF, DARPA, Intel, Microsoft, Fulbright, CNPq and FAPESP for partial funding and thanks to Carnegie Mellon University and thanks to Federal University of São Carlos
  • 161. References [Fern, 2008] Xiaoli Z. Fern, CS 434: Machine Learning and Data Mining, School of Electrical Engineering and Computer Science, Oregon State University, Fall 2008. [DARPA, 2012] DARPA Machine Reading Program, http://www.darpa.mil/Our_Work/I2O/Programs/ Machine_Reading.aspx. [Mitchell, 2006] Tom M. Mitchell, The Discipline of Machine Learning, my perspective on this research field, July 2006 (http://www.cs.cmu.edu/~tom/pubs/MachineLearning.pdf). [Mitchell, 1997] Tom M. Mitchell, Machine Learning. McGraw-Hill, 1997. [Etzioni et al., 2007] Oren Etzioni, Michele Banko, and Michael J. Cafarella, Machine Reading.The 2007 AAAI Spring Symposium. Published by The AAAI Press, Menlo Park, California, 2007. [Clark et al., 2007] Peter Clark, Phil Harrison, John Thompson, Rick Wojcik, Tom Jenkins, David Israel, Reading to Learn: An Investigation into Language Understanding. The 2007 AAAI Spring Symposium. Published by The AAAI Press, Menlo Park, California, 2007. [Norvig, 2007] Peter Norvig, Inference in Text Understanding. The 2007 AAAI Spring Symposium. Published by The AAAI Press, Menlo Park, California, 2007. [Wang    Cohen,  2007]  Richard  C.  Wang  and  William  W.  Cohen:  Language-­‐Independent  Set  Expansion  of  Named  EnFFes  using  the   Web.  In  Proceedings  of  IEEE  Interna1onal  Conference  on  Data  Mining  (ICDM  2007),  Omaha,  NE,  USA.  2007.   [Etzioni,  2008]  Oren Etzioni. 2008. Machine reading at web scale. In Proceedings of the international conference on Web search and web data mining (WSDM '08). ACM, New York, NY, USA, 2-2. [Banko,  et  al.,  2007]  Michele Banko, Michael J. Cafarella, Stephen Soderland, Matthew Broadhead, Oren Etzioni: Open Information Extraction from the Web. IJCAI 2007: 2670-2676
  • 162. References [Weikum et al., 2009] G. Weikum, G., Kasneci, M. Ramanath, F. Suchanek. DB IR methods for knowledge discovery. Communications of the ACM 52(4), 2009. [Theobald Weikum, 2012] Martin Theobald and Gerhard Weikum. From Information to Knowledge: Harvesting Entities and Relationships from Web Sources. Tutorial at PODS 2012 [Hoffart et al., 2012] Johannes Hoffart, Fabian Suchanek, Klaus Berberich, Gerhard Weikum. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia. Special issue of the Artificial Intelligence Journal, 2012 [Etzioni et al., 2011] Oren Etzioni, Anthony Fader, Janara Christensen, Stephen Soderland, and Mausam Open Information Extraction: the Second Generation“. Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI 2011). [Hady et al., 2011] Hady W. Lauw, Ralf Schenkel, Fabian Suchanek, Martin Theobald, and Gerhard Weikum, Semantic Knowledge Bases from Web Sources at IJCAI 2011, Barcelona, July 2011 [Fader et al., 2011] Anthony Fader, Stephen Soderland, and Oren Etzioni. Identifying Relations for Open Information Extraction”. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011) Settles, B.: Closing the loop: Fast, interactive semi-supervised annotation with queries on features and instances. In: Proc. of the EMNLP’11, Edinburgh, ACL (2011) 1467–1478 5. Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Jr., E.R.H., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: Proceedings of the Twenty-Fourth Conference on Artificial Intelligence (AAAI 2010). Pedro, S.D.S., Hruschka Jr., E.R.: Collective intelligence as a source for machine learning self-supervision. In: Proc. of the 4th International Workshop on Web Intelligence and Communities. WIC12, NY, USA, ACM (2012) 5:1–5:9
  • 163. References [Appel Hruschka Jr., 2011] Appel, A.P., Hruschka Jr., E.R.: Prophet – a link-predictor to learn new rules on Nell. In: Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops. pp. 917–924. ICDMW ’11, IEEE Computer Society, Washington, DC, USA (2011) [Mohamed et al., 2011] Mohamed, T.P., Hruschka, Jr., E.R., Mitchell, T.M.: Discovering relations between noun categories. In: Proceedings of the Conference on Empirical Methods in Nat- ural Language Processing. pp. 1447– 1455. EMNLP ’11, Association for Computa- tional Linguistics, Stroudsburg, PA, USA (2011) [Pedro Hruschka Jr., 2012] Saulo D.S. Pedro and Estevam R. Hruschka Jr., Conversing Learning: active learning and active social interaction for human supervision in never-ending learning systems. Xiii Ibero‐american Conference On Artificial Intelligence, IBERAMIA 2012, 2012. Krishnamurthy, J., Mitchell, T.M.: Which noun phrases denote which concepts. In: Proceedings of the Forty Ninth Annual Meeting of the Association for Compu- tational Linguistics (2011) Lao, N., Mitchell, T., Cohen, W.W.: Random walk inference and learning in a large scale knowledge base. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. pp. 529–539. Associa- tion for Computational Linguistics, Edinburgh, Scotland, UK. (July 2011), http://www.aclweb.org/anthology/D11-1049 E. R. Hruschka Jr. and M. C. Duarte and M. C. Nicoletti. Coupling as Strategy for Reducing Concept-Drift in Never-ending Learning Environments. Fundamenta Informaticae, IOS Press, 2012. Saulo D.S. Pedro, Ana Paula Appel, and Estevam R. Hruschka, Jr. Autonomously reviewing and validating the knowledge base of a never-ending learning system. In Proceedings of the 22nd international conference on World Wide Web companion (WWW '13 Companion), 1195-120, 2013. S. Verma and E. R. Hruschka Jr. Coupled Bayesian Sets Algorithm for Semi-supervised Learning and Information Extraction. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2012. Navarro, L. F. and Appel, A. P. and Hruschka Jr., E. R., GraphDB – Storing Large Graphs on Secondary Memory. In New Trends in Databases and Information. Advances in Intelligent Systems and Computing, Springer, 177-186, 2013.
  • 164. References Assuming Facts Are Expressed More Than Once. J. Betteridge, A. Ritter and T. Mitchell In Proceedings of the 27th International Florida Artificial Intelligence Research Society Conference (FLAIRS-27), 2014. Estimating Accuracy from Unlabeled Data. E. A. Platanios, A. Blum, T. Mitchell. In Uncertainty in Artificial Intelligence (UAI), 2014. CTPs: Contextual Temporal Profiles for Time Scoping Facts via Entity State Change Detection. D.T. Wijaya, N. Nakashole and T.M. Mitchell. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014. Incorporating Vector Space Similarity in Random Walk Inference over Knowledge Bases. M. Gardner, P. Talukdar, J. Krishnamurthy and T.M. Mitchell. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014. Scaling Graph-based Semi Supervised Learning to Large Number of Labels Using Count-Min Sketch P. P. Talukdar, and W. Cohen In 17th International Conference on Artificial Intelligence and Statistics (AISTATS, 2014. Programming with Personalized PageRank: A Locally Groundable First-Order Probabilistic Logic. W.Y. Wang, K. Mazaitis and W.W. Cohen. In Proceedings of the Conference on Information and Knowledge Management (CIKM), 2013. Improving Learning and Inference in a Large Knowledge-base using Latent Syntactic Cues. Matt Gardner, Partha Pratim Talukdar, Bryan Kisiel, and Tom Mitchell. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), 2013.