SlideShare a Scribd company logo
1 of 27
Download to read offline
Improving Correlation with Human
Judgments by Integrating Second-
Order Vectors with Semantic Similarity
Bridget T. McInnes, PhD
Virginia Commonwealth University
Ted Pedersen, PhD
University of Minnesota, Duluth
tpederse@d.umn.edu
http://www.d.umn.edu/~tpederse
Measuring Similarity & Relatedness
● Similarity != Relatedness (!!!)
● Assign scores to pairs of concepts
● Compare to scores decided on by humans
● Measure correlation
– Often by rank, because scales differ
– Spearman's rank correlation coefficient
● Think about ways to do better
3
Contribution of this Work?
● We show that integrating a similarity measure
into a 2nd
order measure of relatedness
improves correlation to human judgments
– Compare impact of various similarity measures
– Compare to other methods including word2vec
● Focus is on UMLS and medical concepts
although ideas apply more generally
4
Similar or Related?
● Similarity based on is-a relations
– How much is X like Y?
– Share ancestor in is-a hierarchy
● LCS : least common subsumer
● Closer / deeper the ancestor the more similar
● Tetanus and strep_throat are similar
– both are kinds-of bacterial infections
5
Least Common Subsumer (LCS)
6
Measures of Similarity
● Path based
– Is-a hierarchy
● Path + Depth
– Is-a hierarchy
● Feature
– Is-a hierarchy
● Information Content
– Is-a hierarchy + corpus
7
Similar or Related?
● Relatedness more general
– How much is X related to Y?
– Many ways to be related
● is-a, part-of, treats, affects, symptom-of, …
● Tetanus and puncture_wound are related but
they really aren't similar
– (puncture wounds can cause tetanus)
● All similar concepts are related, but not all
related concepts are similar
8
Measures of Relatedness
● Path based
– Hirst & St-Onge, 1998 (hso)
● Definition based
– Adapted Lesk / Extended Gloss Overlaps (lesk)
● Banerjee & Pedersen, 2003
● Definition + distributional
– 2nd
order Gloss vector (vector)
● Patwardhan & Pedersen, 2006
● Distributional
– Word2Vec, Mikolov et al., 2013
9
Definition Based Relatedness
● Related concepts defined using many of the
same terms
● Concepts don't need to be connected via
relations or paths to measure them
– Lesk, 1986
– Adapted Lesk, Banerjee & Pedersen, 2003
10
BUT! ...
● Definitions are brief, potentially inconsistent
– Alopecia : … a result of cancer_treatment
– Thrush : … a side_effect of chemotherapy
● Lesk matching won't recognize the similarity
between result and side_effect, or between
cancer_treatment and chemotherapy
– Will find alopecia and thrush totally unrelated
11
Gloss Vector Measure
● Rely on co-occurrences of terms
● Allows for a fuzzier notion of matching
● Exploits second order co-occurrences
– Friend of a friend relation
– Suppose cancer_treatment and chemotherapy
don't occur in text with each other. But,
suppose that “survival” occurs with each.
– cancer_treatment and chemotherapy are
second order co-occurrences via “survival”
12
Gloss Vector Measure
● Replace words or terms in definitions with
vector of co-occurrence counts from corpus
● Represent defined concept by the average of
all the vectors of the words in its definition
● Measure relatedness of concepts via cosine
between their respective vectors
● Patwardhan and Pedersen, 2006 (vector)
– Schutze, 1998
– Latent Semantic Analysis
Can We Improve Gloss Vector?
● Instead of constructing second order vectors
using frequency counts or measures of
association...
● Use semantic similarity measures!
– Not all pairs of concepts will have similarity
values, but some do!
– Weight co-occurrences based on how similar
they are...
Integrated 2nd
Order Vector
● Construct a co-occurrence matrix from an
external corpus
– NLM Medline Bigram data
– https://mbr.nlm.nih.gov
– Bigram counts from 2014 Medline baseline
● 44 million bigrams
● Replace co-occurrence counts in matrix with
similarity measure scores
– UMLS::Similarity
– http://umls-similarity.sourceforge.net
Integrated 2nd
Order Vector
● Build second order vector for each concept
● Obtain definitions of concept (from UMLS)
– Augment with definitions of parents (PAR),
children (CHD), broader than (RB), and
narrower than (RN) relations
– Look up vector for each word in definition
– Average these vectors together
– Resulting averaged vector represents the
concept
16
Measures of Similarity
● Path
– Rada et al., 1989 (path)
– Caviedes & Cimino, 2004 (cdist)
● Path + Depth
– Wu & Palmer, 1994 (wup)
– Leacock & Chodorow, 1998 (lch)
– Pekar & Staab, 2002 (pks)
– Zhong et al., 2002 (zhong)
– Nguyen & Al-Mubaid, 2006 (nam)
17
Measures of Similarity
● Feature
– Maedche & Staab, 2001 (cmatch)
– Batet & Valls, 2011 (batet)
● Information Content
– Resnik, 1995 (res)
– Jiang & Conrath, 1997 (jcn)
– Lin, 1998, (lin)
– Pirro & Euzenat, 2010 (faith)
Reference Standards
● UMNSRS : 587 pairs ranked for relatedness,
566 for similarity, both by medical residents
– We used subsets of 430 and 401 pairs
– ICC > .7
● MayoSRS : 101 pairs ranked by physicians
and (separately) by medical coders
– MiniMayoSRS – 30 pair subset
● http://www.people.vcu.edu/~btmcinnes/
Results
UMNSRS
similarity
UMNSRS
relatedness
MiniMayoSRS
MD
relatedness
MiniMayoSRS
coder
relatedness
1st order
distributional
.47 .36 .43 .54
Extended
Lesk
.49 .33 .52 .56
2nd order
gloss vector
.54 .45 .63 .59
Vector-faith .59 .42 .58 .63
Vector-res .58 .41 .58 .65
Thresholds
● Remove all similarity scores less than a given
threshold
● Experiments with thresholds with res and faith
showed that results improved significantly with
some threshold settings
Vector-res Results with Thresholds
T #
bigrams
UMN
sim
UMN
rel
Mayo
MD
Mayo
Coder
0 851k .58 .41 .58 .65
1 166k .56 .39 .60 .67
2 65k .64 .47 .56 .62
3 28k .60 .46 .62 .71
4 11k .56 .43 .75 .76
5 3k .26 .16 .36 .36
Discussion
● Information content measures fared well!
● Clear advantage to filtering low similarity
scores, but ...
– How low, and how do we set?
– Values of thresholds vary with measures
● With reference standard?
● With corpora used for co-occurrences?
Related work
● Various studies using word2vec
– UMNSRS, MayoSRS, and MiniMayoSRS
– CBow and / or skip-grams with various
different kinds of corpora
● Vector retrofitting (Yu et al., 2016) very related!
– Map terms to MeSH terms, build vectors
based on documents assigned those terms
– Include semantically related words from UMLS
– MiniMayoSRS
Word2vec UMNSRS comparisons
Similarity Relatedness
Vector-res (T=0) .58 (N=401) .41 (N=430)
Vector-res (T=0) .59 (N=566) .48 (N=587)
Vector-faith (T=0) .59 (N=401) .42 (N=430)
Vector-faith (T=0) .61 (N=566) .49 (N=587)
Sajadi et al., 2015 .39 (N=566) .39 (N=587)
Pakhomov et al., 2016 .62 (N=449) .58 (N=458)
Muneeb et al., 2015 .52 (N=462) .45 (N=465)
Chiu et al., 2016 .65 (N=???) .60 (N=???)
Future Work
● Why more improvement on similarity results?
● Regularize comparisons (!!!)
● Which corpora for co-occurrences?
● Which definitions to represent concepts?
● How can threshold be automatically set?
● What about WordNet and general English?
Relatedness dragon eats similarity tail
27
Resources
● UMLS::Similarity
– http://umls-similarity.sourceforge.net
● Corpus for co-occurrences
– https://mbr.nlm.nih.gov/
● Reference Standards
– http://www.people.vcu.edu/~btmcinnes/data.html

More Related Content

Similar to Improving Correlation with Human Judgments by Integrating Second-Order Vectors with Semantic Similarity

2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf
s_p2000
 
Correlational research
Correlational researchCorrelational research
Correlational research
Jijo G John
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
domsr
 
Cannonical Correlation
Cannonical CorrelationCannonical Correlation
Cannonical Correlation
domsr
 
Conducting a 3-Way ANOVAWhy ANOVA can be used to handle mult.docx
Conducting a 3-Way ANOVAWhy  ANOVA can be used to handle mult.docxConducting a 3-Way ANOVAWhy  ANOVA can be used to handle mult.docx
Conducting a 3-Way ANOVAWhy ANOVA can be used to handle mult.docx
maxinesmith73660
 
Evaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis EssayEvaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis Essay
Crystal Alvarez
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNASRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
​Iván Rodríguez
 

Similar to Improving Correlation with Human Judgments by Integrating Second-Order Vectors with Semantic Similarity (20)

Talk at UAB, April 12, 2013
Talk at UAB, April 12, 2013Talk at UAB, April 12, 2013
Talk at UAB, April 12, 2013
 
2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf2012-Nathans-PARE-RegressionGuidebook.pdf
2012-Nathans-PARE-RegressionGuidebook.pdf
 
Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...
 
Correlational research
Correlational researchCorrelational research
Correlational research
 
semeval2016
semeval2016semeval2016
semeval2016
 
P13 corley
P13 corleyP13 corley
P13 corley
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
 
Cannonical Correlation
Cannonical CorrelationCannonical Correlation
Cannonical Correlation
 
Chat bot using text similarity approach
Chat bot using text similarity approachChat bot using text similarity approach
Chat bot using text similarity approach
 
Dyadic Data Analysis
Dyadic Data AnalysisDyadic Data Analysis
Dyadic Data Analysis
 
Conducting a 3-Way ANOVAWhy ANOVA can be used to handle mult.docx
Conducting a 3-Way ANOVAWhy  ANOVA can be used to handle mult.docxConducting a 3-Way ANOVAWhy  ANOVA can be used to handle mult.docx
Conducting a 3-Way ANOVAWhy ANOVA can be used to handle mult.docx
 
Spatial_final
Spatial_finalSpatial_final
Spatial_final
 
Data Processing and Statistical Treatment: Spreads and Correlation
Data Processing and Statistical Treatment: Spreads and CorrelationData Processing and Statistical Treatment: Spreads and Correlation
Data Processing and Statistical Treatment: Spreads and Correlation
 
Relevance feature discovery for text mining
Relevance feature discovery for text miningRelevance feature discovery for text mining
Relevance feature discovery for text mining
 
Evaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis EssayEvaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis Essay
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
Networks and Natural Language Processing
Networks and Natural Language ProcessingNetworks and Natural Language Processing
Networks and Natural Language Processing
 
Aantekeningen uit referenties van Biostatistics.docx
Aantekeningen uit referenties van Biostatistics.docxAantekeningen uit referenties van Biostatistics.docx
Aantekeningen uit referenties van Biostatistics.docx
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNASRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Poster_Presentation_SACNAS
 
cannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfcannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdf
 

Recently uploaded

SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
CaitlinCummins3
 
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdfFinancial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
MinawBelay
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
中 央社
 
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
Krashi Coaching
 

Recently uploaded (20)

PSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxPSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptx
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
 
The Ball Poem- John Berryman_20240518_001617_0000.pptx
The Ball Poem- John Berryman_20240518_001617_0000.pptxThe Ball Poem- John Berryman_20240518_001617_0000.pptx
The Ball Poem- John Berryman_20240518_001617_0000.pptx
 
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
 
An Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge AppAn Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge App
 
Features of Video Calls in the Discuss Module in Odoo 17
Features of Video Calls in the Discuss Module in Odoo 17Features of Video Calls in the Discuss Module in Odoo 17
Features of Video Calls in the Discuss Module in Odoo 17
 
How to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryHow to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 Inventory
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptx
 
Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024
 
IPL Online Quiz by Pragya; Question Set.
IPL Online Quiz by Pragya; Question Set.IPL Online Quiz by Pragya; Question Set.
IPL Online Quiz by Pragya; Question Set.
 
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdfFinancial Accounting IFRS, 3rd Edition-dikompresi.pdf
Financial Accounting IFRS, 3rd Edition-dikompresi.pdf
 
An Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptxAn Overview of the Odoo 17 Discuss App.pptx
An Overview of the Odoo 17 Discuss App.pptx
 
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
Operations Management - Book1.p  - Dr. Abdulfatah A. SalemOperations Management - Book1.p  - Dr. Abdulfatah A. Salem
Operations Management - Book1.p - Dr. Abdulfatah A. Salem
 
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
 
How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
 
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
 

Improving Correlation with Human Judgments by Integrating Second-Order Vectors with Semantic Similarity

  • 1. Improving Correlation with Human Judgments by Integrating Second- Order Vectors with Semantic Similarity Bridget T. McInnes, PhD Virginia Commonwealth University Ted Pedersen, PhD University of Minnesota, Duluth tpederse@d.umn.edu http://www.d.umn.edu/~tpederse
  • 2. Measuring Similarity & Relatedness ● Similarity != Relatedness (!!!) ● Assign scores to pairs of concepts ● Compare to scores decided on by humans ● Measure correlation – Often by rank, because scales differ – Spearman's rank correlation coefficient ● Think about ways to do better
  • 3. 3 Contribution of this Work? ● We show that integrating a similarity measure into a 2nd order measure of relatedness improves correlation to human judgments – Compare impact of various similarity measures – Compare to other methods including word2vec ● Focus is on UMLS and medical concepts although ideas apply more generally
  • 4. 4 Similar or Related? ● Similarity based on is-a relations – How much is X like Y? – Share ancestor in is-a hierarchy ● LCS : least common subsumer ● Closer / deeper the ancestor the more similar ● Tetanus and strep_throat are similar – both are kinds-of bacterial infections
  • 6. 6 Measures of Similarity ● Path based – Is-a hierarchy ● Path + Depth – Is-a hierarchy ● Feature – Is-a hierarchy ● Information Content – Is-a hierarchy + corpus
  • 7. 7 Similar or Related? ● Relatedness more general – How much is X related to Y? – Many ways to be related ● is-a, part-of, treats, affects, symptom-of, … ● Tetanus and puncture_wound are related but they really aren't similar – (puncture wounds can cause tetanus) ● All similar concepts are related, but not all related concepts are similar
  • 8. 8 Measures of Relatedness ● Path based – Hirst & St-Onge, 1998 (hso) ● Definition based – Adapted Lesk / Extended Gloss Overlaps (lesk) ● Banerjee & Pedersen, 2003 ● Definition + distributional – 2nd order Gloss vector (vector) ● Patwardhan & Pedersen, 2006 ● Distributional – Word2Vec, Mikolov et al., 2013
  • 9. 9 Definition Based Relatedness ● Related concepts defined using many of the same terms ● Concepts don't need to be connected via relations or paths to measure them – Lesk, 1986 – Adapted Lesk, Banerjee & Pedersen, 2003
  • 10. 10 BUT! ... ● Definitions are brief, potentially inconsistent – Alopecia : … a result of cancer_treatment – Thrush : … a side_effect of chemotherapy ● Lesk matching won't recognize the similarity between result and side_effect, or between cancer_treatment and chemotherapy – Will find alopecia and thrush totally unrelated
  • 11. 11 Gloss Vector Measure ● Rely on co-occurrences of terms ● Allows for a fuzzier notion of matching ● Exploits second order co-occurrences – Friend of a friend relation – Suppose cancer_treatment and chemotherapy don't occur in text with each other. But, suppose that “survival” occurs with each. – cancer_treatment and chemotherapy are second order co-occurrences via “survival”
  • 12. 12 Gloss Vector Measure ● Replace words or terms in definitions with vector of co-occurrence counts from corpus ● Represent defined concept by the average of all the vectors of the words in its definition ● Measure relatedness of concepts via cosine between their respective vectors ● Patwardhan and Pedersen, 2006 (vector) – Schutze, 1998 – Latent Semantic Analysis
  • 13. Can We Improve Gloss Vector? ● Instead of constructing second order vectors using frequency counts or measures of association... ● Use semantic similarity measures! – Not all pairs of concepts will have similarity values, but some do! – Weight co-occurrences based on how similar they are...
  • 14. Integrated 2nd Order Vector ● Construct a co-occurrence matrix from an external corpus – NLM Medline Bigram data – https://mbr.nlm.nih.gov – Bigram counts from 2014 Medline baseline ● 44 million bigrams ● Replace co-occurrence counts in matrix with similarity measure scores – UMLS::Similarity – http://umls-similarity.sourceforge.net
  • 15. Integrated 2nd Order Vector ● Build second order vector for each concept ● Obtain definitions of concept (from UMLS) – Augment with definitions of parents (PAR), children (CHD), broader than (RB), and narrower than (RN) relations – Look up vector for each word in definition – Average these vectors together – Resulting averaged vector represents the concept
  • 16. 16 Measures of Similarity ● Path – Rada et al., 1989 (path) – Caviedes & Cimino, 2004 (cdist) ● Path + Depth – Wu & Palmer, 1994 (wup) – Leacock & Chodorow, 1998 (lch) – Pekar & Staab, 2002 (pks) – Zhong et al., 2002 (zhong) – Nguyen & Al-Mubaid, 2006 (nam)
  • 17. 17 Measures of Similarity ● Feature – Maedche & Staab, 2001 (cmatch) – Batet & Valls, 2011 (batet) ● Information Content – Resnik, 1995 (res) – Jiang & Conrath, 1997 (jcn) – Lin, 1998, (lin) – Pirro & Euzenat, 2010 (faith)
  • 18. Reference Standards ● UMNSRS : 587 pairs ranked for relatedness, 566 for similarity, both by medical residents – We used subsets of 430 and 401 pairs – ICC > .7 ● MayoSRS : 101 pairs ranked by physicians and (separately) by medical coders – MiniMayoSRS – 30 pair subset ● http://www.people.vcu.edu/~btmcinnes/
  • 19. Results UMNSRS similarity UMNSRS relatedness MiniMayoSRS MD relatedness MiniMayoSRS coder relatedness 1st order distributional .47 .36 .43 .54 Extended Lesk .49 .33 .52 .56 2nd order gloss vector .54 .45 .63 .59 Vector-faith .59 .42 .58 .63 Vector-res .58 .41 .58 .65
  • 20. Thresholds ● Remove all similarity scores less than a given threshold ● Experiments with thresholds with res and faith showed that results improved significantly with some threshold settings
  • 21. Vector-res Results with Thresholds T # bigrams UMN sim UMN rel Mayo MD Mayo Coder 0 851k .58 .41 .58 .65 1 166k .56 .39 .60 .67 2 65k .64 .47 .56 .62 3 28k .60 .46 .62 .71 4 11k .56 .43 .75 .76 5 3k .26 .16 .36 .36
  • 22. Discussion ● Information content measures fared well! ● Clear advantage to filtering low similarity scores, but ... – How low, and how do we set? – Values of thresholds vary with measures ● With reference standard? ● With corpora used for co-occurrences?
  • 23. Related work ● Various studies using word2vec – UMNSRS, MayoSRS, and MiniMayoSRS – CBow and / or skip-grams with various different kinds of corpora ● Vector retrofitting (Yu et al., 2016) very related! – Map terms to MeSH terms, build vectors based on documents assigned those terms – Include semantically related words from UMLS – MiniMayoSRS
  • 24. Word2vec UMNSRS comparisons Similarity Relatedness Vector-res (T=0) .58 (N=401) .41 (N=430) Vector-res (T=0) .59 (N=566) .48 (N=587) Vector-faith (T=0) .59 (N=401) .42 (N=430) Vector-faith (T=0) .61 (N=566) .49 (N=587) Sajadi et al., 2015 .39 (N=566) .39 (N=587) Pakhomov et al., 2016 .62 (N=449) .58 (N=458) Muneeb et al., 2015 .52 (N=462) .45 (N=465) Chiu et al., 2016 .65 (N=???) .60 (N=???)
  • 25. Future Work ● Why more improvement on similarity results? ● Regularize comparisons (!!!) ● Which corpora for co-occurrences? ● Which definitions to represent concepts? ● How can threshold be automatically set? ● What about WordNet and general English?
  • 26. Relatedness dragon eats similarity tail
  • 27. 27 Resources ● UMLS::Similarity – http://umls-similarity.sourceforge.net ● Corpus for co-occurrences – https://mbr.nlm.nih.gov/ ● Reference Standards – http://www.people.vcu.edu/~btmcinnes/data.html