SlideShare a Scribd company logo
A Hybrid Semantic Approach to Building 
Dynamic Maps of Research Communities 
Francesco Osborne, Beppe Scavo, Enrico Motta 
KMi, The Open University, United Kingdom 
November 27th 2014
Research communities 
The engine of research.
We need to understand how scientific communities 
adapt and cooperate to implement visions into 
concrete technologies.
Research communities 
Communities of academic authors are usually identified by using 
standard community detection algorithms, which typically 
exploit co-authorship or citation graphs.
Temporal topic-based communities (TTC) 
A different type of community we investigated is formed by the 
set of researchers who, at a given time, are following shared 
research trajectory, i.e. they are working on the same topics at 
the same time. 
Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based research 
communities by clustering shared research trajectories. In The Semantic Web: Trends and 
Challenges (pp. 114-129). Springer International Publishing.
Research Communities Map Builder 
• RCMB is able to automatically link diachronic topic-based 
communities over subsequent time intervals to 
identify significant events. 
• These include topic shifts within a community; the 
appearance and fading of a community; communities 
splitting, merging, spawning other communities; etc. 
• The output of RCMB is a map of research 
communities, annotated with the detected events, 
which provides a concise visual representation of the 
dynamics of a research area.
RCMB steps: 
1. Applies the Temporal Semantic Topic-Based 
Clustering (TST) algorithm to find Temporal topic-based 
communities in different time intervals; 
2. Detects Topic Shifts; 
3. Links Communities in different years; 
4. Detect Key Events;
RCMB steps: 
1. Applies the Temporal Semantic Topic-Based 
Clustering (TST) algorithm to find Temporal topic-based 
communities in different time intervals. 
2. Detects Topic Shifts in following years 
3. Links Communities in different years 
4. Detect Key Events 
Temporal Semantic Topic-Based Clustering 
Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based 
research communities by clustering shared research trajectories. In The 
Semantic Web: Trends and Challenges (pp. 114-129). Springer International 
Publishing.
TST in short 
1. It augments the topic semantically using an automatically 
generated OWL ontology and represent each author as a 
semantic topic distribution over subsequent years. 
2. It weighs each topic according to its relationship with the 
main topic, for highlighting the communities strongly 
related to the main topic. 
3. It clusters authors using the ATTS (Adjusted Temporal 
Topic Similarity), which is computed by averaging the 
cosine similarities of the topic vectors over progressively 
smaller intervals of time.
Detecting Topic Shifts 
We use a sliding window algorithm that checks for a topic shift 
by comparing the initial topic distribution in time t with the topic 
distributions in time t+1, t+2… t+n. 
Information Extraction/Semantic Annotation community 
2002 
Infor. Extraction: 26 % 
Natural Language: 17 % 
Named Entity: 12 % 
Machine Learning: 9 % 
Knowledge Base: 9 % 
2010 
Linked Data: 16 % 
Natural Language: 15 % 
Semantic Annotation: 15 % 
SW Technology: 10 % 
Information Retrieval: 10 % 
Knowledge Base: 9 % 
Semantic Wiki: 9 % 
2006 
Semantic Annotation: 25 % 
Knowledge Base: 15 % 
Semantic Wiki: 11 % 
Information Extraction: 10 % 
Semantic Information: 8 % 
Natural Language: 6 % 
Information Retrieval: 6 %
Detecting Topic Shifts 
We define a topic shift a statistically significant change (detected 
via chi-square test ) in the topic distribution of a community 
which occurred in a certain time interval. 
To detect which topics were the main protagonists of this shift, 
we applying the same test excluding each time a different topic, 
and selecting the topic whose absence yields the bigger 
increment in the p value.
Community linking 
We are interested in two different links between community: 
• The strong link is defined as a link that connects the same 
community in subsequent timeframes. 
• The weak link is defined as the link that connects community 
C1 with community C2 in a subsequent timeframe, if C1 has an 
impact over C2 in terms of migrating authors and/or topics.
Community linking
Community linking 
We take the minimum values of ts 
and tw that minimize the MEF using 
the Nelder-Mead algorithm.
Key Events detection 
If a community has no strong links with any precedent 
interval communities, we detect the appearance of a 
community. 
2006 2007 
C1 
C3 
C2 
C1 
C2
Key Events detection 
If a community has no strong links with any subsequent 
interval communities, we detect the fading of a community. 
2006 2007 
C1 
C2 
C3 
C1 
C2
Key Events detection 
If a community is linked to more than one community in the 
subsequent interval and one of the links is a strong one we 
detect the forking of one or more communities out of the 
community characterized by the strong link. 
2006 2007 
C1 C1 
C2
Key Events detection 
If a community is linked to more than one community in the 
subsequent interval and none of the links is a strong one we 
detect the splitting of a community into multiple communities. 
2006 2007 
C1 
C2 
C3
Key Events detection 
If two or more communities are linked to one community in 
the subsequent interval and one of the inlinks is a strong link, 
we detect the assimilation of one or more communities into 
the community C characterized by the strong link. 
2006 2007 
C1 C1 
C2 
If the communities fade after the event, they are labelled 
as absorbed to C.
Key Events detection 
If two or more communities are linked to one community in 
the subsequent interval and none of the inlinks is a strong 
link, we detect the merging of two or more communities in a 
new community C. 
2006 2007 
C1 
C3 
C2 
If the communities fade after the event, they are labelled 
as merged in C.
Evaluation: Cluster Compactness
Case study 
We applying RCMB to two research areas: World Wide Web 
(WWW) and Semantic Web (SW). 
Our study was based on a dataset built from data retrieved by 
means of the API provided by Microsoft Academic Search. 
We first retrieved authors and papers labelled with WWW 
and SW or with their first 150 co-occurring topics. We then 
run RCMB on WWW and SW in the 2000-2010 time interval 
with a granularity of 3. The average number of authors 
selected in each year was 932 for WWW and 646 for SW.
Semantic Web
WWW
Future Work 
• Automatically generate comprehensive explanations for 
the identified dynamics. 
• Forecasting topic shifts and key events, e.g., estimating 
the probability that a new topic will emerge in a certain 
community or that two communities will merge in the 
coming years.
Questions? 
Interested in scholarly data? 
SAVE-SD 2015 
Semantics, Analytics, Visualisation: Enhancing Scholarly Data 
Workshop at 24th International World Wide Web Conference 
May 19, 2015 - Florence, Italy 
Site: cs.unibo.it/save-sd

More Related Content

What's hot

Tutorial on Question Answering Systems
Tutorial on Question Answering Systems Tutorial on Question Answering Systems
Tutorial on Question Answering Systems
Saeedeh Shekarpour
 
Rostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic Networks
Rostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic NetworksRostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic Networks
Rostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic Networks
Witology
 
Complex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma TreComplex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma Tre
Matteo Moci
 
Semantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked DataSemantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked Data
Saeedeh Shekarpour
 
Disease spreading & control in temporal networks
Disease spreading & control in temporal networksDisease spreading & control in temporal networks
Disease spreading & control in temporal networks
Petter Holme
 
Bibliometric Study and Network Analysis of the Phenomenon of Self-Publishing
Bibliometric Study and Network Analysis of the Phenomenon of Self-PublishingBibliometric Study and Network Analysis of the Phenomenon of Self-Publishing
Bibliometric Study and Network Analysis of the Phenomenon of Self-Publishing
Technological Ecosystems for Enhancing Multiculturality
 
Optimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networks
Optimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networksOptimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networks
Optimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networks
Petter Holme
 
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatramanOdsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
venkatramanJ4
 
00 Social Influence Effects on Men's HIV Testing
00 Social Influence Effects on Men's HIV Testing00 Social Influence Effects on Men's HIV Testing
00 Social Influence Effects on Men's HIV Testing
Duke Network Analysis Center
 
IRJET- Fake News Detection and Rumour Source Identification
IRJET- Fake News Detection and Rumour Source IdentificationIRJET- Fake News Detection and Rumour Source Identification
IRJET- Fake News Detection and Rumour Source Identification
IRJET Journal
 
Automatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature ReviewAutomatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature Review
Dr. Amarjeet Singh
 
A Study of User Interaction with Context Aware Notifications from a Moodle Le...
A Study of User Interaction with Context Aware Notifications from a Moodle Le...A Study of User Interaction with Context Aware Notifications from a Moodle Le...
A Study of User Interaction with Context Aware Notifications from a Moodle Le...
Periquest Ltd
 
Data Science Education at JHSPH
Data Science Education at JHSPHData Science Education at JHSPH
Data Science Education at JHSPH
jtleek
 
WAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshopWAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshop
Sara Day Thomson
 
#CPLOL18 paper on #ResNetSLT community
#CPLOL18 paper on #ResNetSLT community#CPLOL18 paper on #ResNetSLT community
#CPLOL18 paper on #ResNetSLT community
Bronwyn Hemsley
 
Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media. Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media.
Mohsen Shahriari
 
Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...
Axel Bruns
 
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Dasha Herrmannova
 
A method to evaluate the reliability of social media data for social network ...
A method to evaluate the reliability of social media data for social network ...A method to evaluate the reliability of social media data for social network ...
A method to evaluate the reliability of social media data for social network ...
Derek Weber
 
How the information content of your contact pattern representation affects pr...
How the information content of your contact pattern representation affects pr...How the information content of your contact pattern representation affects pr...
How the information content of your contact pattern representation affects pr...
Petter Holme
 

What's hot (20)

Tutorial on Question Answering Systems
Tutorial on Question Answering Systems Tutorial on Question Answering Systems
Tutorial on Question Answering Systems
 
Rostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic Networks
Rostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic NetworksRostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic Networks
Rostislav Yavorsky - Research Challenges of Dynamic Socio-Semantic Networks
 
Complex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma TreComplex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma Tre
 
Semantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked DataSemantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked Data
 
Disease spreading & control in temporal networks
Disease spreading & control in temporal networksDisease spreading & control in temporal networks
Disease spreading & control in temporal networks
 
Bibliometric Study and Network Analysis of the Phenomenon of Self-Publishing
Bibliometric Study and Network Analysis of the Phenomenon of Self-PublishingBibliometric Study and Network Analysis of the Phenomenon of Self-Publishing
Bibliometric Study and Network Analysis of the Phenomenon of Self-Publishing
 
Optimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networks
Optimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networksOptimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networks
Optimizing
 sentinel
 surveillance 
in
 static
 and 
temporal 
networks
 
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatramanOdsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
Odsc 2018 detection_classification_of_fake_news_using_cnn_venkatraman
 
00 Social Influence Effects on Men's HIV Testing
00 Social Influence Effects on Men's HIV Testing00 Social Influence Effects on Men's HIV Testing
00 Social Influence Effects on Men's HIV Testing
 
IRJET- Fake News Detection and Rumour Source Identification
IRJET- Fake News Detection and Rumour Source IdentificationIRJET- Fake News Detection and Rumour Source Identification
IRJET- Fake News Detection and Rumour Source Identification
 
Automatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature ReviewAutomatic Hate Speech Detection: A Literature Review
Automatic Hate Speech Detection: A Literature Review
 
A Study of User Interaction with Context Aware Notifications from a Moodle Le...
A Study of User Interaction with Context Aware Notifications from a Moodle Le...A Study of User Interaction with Context Aware Notifications from a Moodle Le...
A Study of User Interaction with Context Aware Notifications from a Moodle Le...
 
Data Science Education at JHSPH
Data Science Education at JHSPHData Science Education at JHSPH
Data Science Education at JHSPH
 
WAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshopWAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshop
 
#CPLOL18 paper on #ResNetSLT community
#CPLOL18 paper on #ResNetSLT community#CPLOL18 paper on #ResNetSLT community
#CPLOL18 paper on #ResNetSLT community
 
Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media. Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media.
 
Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...
 
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
Semantometrics in Coauthorship Networks: Fulltext-based Approach for Analysin...
 
A method to evaluate the reliability of social media data for social network ...
A method to evaluate the reliability of social media data for social network ...A method to evaluate the reliability of social media data for social network ...
A method to evaluate the reliability of social media data for social network ...
 
How the information content of your contact pattern representation affects pr...
How the information content of your contact pattern representation affects pr...How the information content of your contact pattern representation affects pr...
How the information content of your contact pattern representation affects pr...
 

Viewers also liked

EKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
EKAW 2016 - TechMiner: Extracting Technologies from Academic PublicationsEKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
EKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
Francesco Osborne
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Francesco Osborne
 
Ekaw2014 - Inferring Semantic Relations by User Feedback
Ekaw2014 - Inferring Semantic Relations by User FeedbackEkaw2014 - Inferring Semantic Relations by User Feedback
Ekaw2014 - Inferring Semantic Relations by User Feedback
Francesco Osborne
 
Linked science presentation 25
Linked science presentation 25Linked science presentation 25
Linked science presentation 25
Francesco Osborne
 
Supporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic TechnologiesSupporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic Technologies
Francesco Osborne
 
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
Francesco Osborne
 
Klink-2: integrating multiple web sources to generate semantic topic networks
 Klink-2: integrating multiple web sources to generate semantic topic networks Klink-2: integrating multiple web sources to generate semantic topic networks
Klink-2: integrating multiple web sources to generate semantic topic networks
Francesco Osborne
 

Viewers also liked (7)

EKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
EKAW 2016 - TechMiner: Extracting Technologies from Academic PublicationsEKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
EKAW 2016 - TechMiner: Extracting Technologies from Academic Publications
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
Ekaw2014 - Inferring Semantic Relations by User Feedback
Ekaw2014 - Inferring Semantic Relations by User FeedbackEkaw2014 - Inferring Semantic Relations by User Feedback
Ekaw2014 - Inferring Semantic Relations by User Feedback
 
Linked science presentation 25
Linked science presentation 25Linked science presentation 25
Linked science presentation 25
 
Supporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic TechnologiesSupporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic Technologies
 
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
 
Klink-2: integrating multiple web sources to generate semantic topic networks
 Klink-2: integrating multiple web sources to generate semantic topic networks Klink-2: integrating multiple web sources to generate semantic topic networks
Klink-2: integrating multiple web sources to generate semantic topic networks
 

Similar to EKAW2014 - A Hybrid Semantic Approach to Building 
Dynamic Maps of Research Communities

Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 posterLife-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Václav Belák
 
Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams
Amparo Elizabeth Cano Basave
 
Feedback Effects Between Similarity And Social Influence In Online Communities
Feedback Effects Between Similarity And Social Influence In Online CommunitiesFeedback Effects Between Similarity And Social Influence In Online Communities
Feedback Effects Between Similarity And Social Influence In Online Communities
Paolo Massa
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
theijes
 
Mining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network Research
Marko Rodriguez
 
Notes on mining social media updated
Notes on mining social media updatedNotes on mining social media updated
Notes on mining social media updated
Gary Myers KMb Unit, York University
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
Angelo Salatino
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
Angelo Salatino
 
Measuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMeasuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online Communities
Matthew Rowe
 
1026 telling story from text 2
1026 telling story from text 21026 telling story from text 2
1026 telling story from text 2
Ke Jiang
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
Silvia Puglisi
 
Socialnetworkanalysis
SocialnetworkanalysisSocialnetworkanalysis
Socialnetworkanalysis
kcarter14
 
Did we become a community - A Literature Review
Did we become a community - A Literature ReviewDid we become a community - A Literature Review
Did we become a community - A Literature Review
Su-Tuan Lulee
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networks
eSAT Publishing House
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
dnac
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)
Duke Network Analysis Center
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
rezahk
 
Social networkanalysisfinal
Social networkanalysisfinalSocial networkanalysisfinal
Social networkanalysisfinal
kcarter14
 
02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview
Duke Network Analysis Center
 
Multi-level analysis on structures and dynamics of OSN
Multi-level analysis on structures and dynamics of OSNMulti-level analysis on structures and dynamics of OSN
Multi-level analysis on structures and dynamics of OSN
Haewoon Kwak
 

Similar to EKAW2014 - A Hybrid Semantic Approach to Building 
Dynamic Maps of Research Communities (20)

Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 posterLife-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
 
Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams
 
Feedback Effects Between Similarity And Social Influence In Online Communities
Feedback Effects Between Similarity And Social Influence In Online CommunitiesFeedback Effects Between Similarity And Social Influence In Online Communities
Feedback Effects Between Similarity And Social Influence In Online Communities
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
Mining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network Research
 
Notes on mining social media updated
Notes on mining social media updatedNotes on mining social media updated
Notes on mining social media updated
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
Measuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online CommunitiesMeasuring the Topical Specificity of Online Communities
Measuring the Topical Specificity of Online Communities
 
1026 telling story from text 2
1026 telling story from text 21026 telling story from text 2
1026 telling story from text 2
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
 
Socialnetworkanalysis
SocialnetworkanalysisSocialnetworkanalysis
Socialnetworkanalysis
 
Did we become a community - A Literature Review
Did we become a community - A Literature ReviewDid we become a community - A Literature Review
Did we become a community - A Literature Review
 
Current trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networksCurrent trends of opinion mining and sentiment analysis in social networks
Current trends of opinion mining and sentiment analysis in social networks
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Social networkanalysisfinal
Social networkanalysisfinalSocial networkanalysisfinal
Social networkanalysisfinal
 
02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview
 
Multi-level analysis on structures and dynamics of OSN
Multi-level analysis on structures and dynamics of OSNMulti-level analysis on structures and dynamics of OSN
Multi-level analysis on structures and dynamics of OSN
 

Recently uploaded

SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 

Recently uploaded (20)

SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 

EKAW2014 - A Hybrid Semantic Approach to Building 
Dynamic Maps of Research Communities

  • 1. A Hybrid Semantic Approach to Building Dynamic Maps of Research Communities Francesco Osborne, Beppe Scavo, Enrico Motta KMi, The Open University, United Kingdom November 27th 2014
  • 2. Research communities The engine of research.
  • 3. We need to understand how scientific communities adapt and cooperate to implement visions into concrete technologies.
  • 4. Research communities Communities of academic authors are usually identified by using standard community detection algorithms, which typically exploit co-authorship or citation graphs.
  • 5. Temporal topic-based communities (TTC) A different type of community we investigated is formed by the set of researchers who, at a given time, are following shared research trajectory, i.e. they are working on the same topics at the same time. Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based research communities by clustering shared research trajectories. In The Semantic Web: Trends and Challenges (pp. 114-129). Springer International Publishing.
  • 6. Research Communities Map Builder • RCMB is able to automatically link diachronic topic-based communities over subsequent time intervals to identify significant events. • These include topic shifts within a community; the appearance and fading of a community; communities splitting, merging, spawning other communities; etc. • The output of RCMB is a map of research communities, annotated with the detected events, which provides a concise visual representation of the dynamics of a research area.
  • 7. RCMB steps: 1. Applies the Temporal Semantic Topic-Based Clustering (TST) algorithm to find Temporal topic-based communities in different time intervals; 2. Detects Topic Shifts; 3. Links Communities in different years; 4. Detect Key Events;
  • 8. RCMB steps: 1. Applies the Temporal Semantic Topic-Based Clustering (TST) algorithm to find Temporal topic-based communities in different time intervals. 2. Detects Topic Shifts in following years 3. Links Communities in different years 4. Detect Key Events Temporal Semantic Topic-Based Clustering Osborne, F., Scavo, G., & Motta, E. (2014). Identifying diachronic topic-based research communities by clustering shared research trajectories. In The Semantic Web: Trends and Challenges (pp. 114-129). Springer International Publishing.
  • 9. TST in short 1. It augments the topic semantically using an automatically generated OWL ontology and represent each author as a semantic topic distribution over subsequent years. 2. It weighs each topic according to its relationship with the main topic, for highlighting the communities strongly related to the main topic. 3. It clusters authors using the ATTS (Adjusted Temporal Topic Similarity), which is computed by averaging the cosine similarities of the topic vectors over progressively smaller intervals of time.
  • 10. Detecting Topic Shifts We use a sliding window algorithm that checks for a topic shift by comparing the initial topic distribution in time t with the topic distributions in time t+1, t+2… t+n. Information Extraction/Semantic Annotation community 2002 Infor. Extraction: 26 % Natural Language: 17 % Named Entity: 12 % Machine Learning: 9 % Knowledge Base: 9 % 2010 Linked Data: 16 % Natural Language: 15 % Semantic Annotation: 15 % SW Technology: 10 % Information Retrieval: 10 % Knowledge Base: 9 % Semantic Wiki: 9 % 2006 Semantic Annotation: 25 % Knowledge Base: 15 % Semantic Wiki: 11 % Information Extraction: 10 % Semantic Information: 8 % Natural Language: 6 % Information Retrieval: 6 %
  • 11. Detecting Topic Shifts We define a topic shift a statistically significant change (detected via chi-square test ) in the topic distribution of a community which occurred in a certain time interval. To detect which topics were the main protagonists of this shift, we applying the same test excluding each time a different topic, and selecting the topic whose absence yields the bigger increment in the p value.
  • 12. Community linking We are interested in two different links between community: • The strong link is defined as a link that connects the same community in subsequent timeframes. • The weak link is defined as the link that connects community C1 with community C2 in a subsequent timeframe, if C1 has an impact over C2 in terms of migrating authors and/or topics.
  • 14. Community linking We take the minimum values of ts and tw that minimize the MEF using the Nelder-Mead algorithm.
  • 15. Key Events detection If a community has no strong links with any precedent interval communities, we detect the appearance of a community. 2006 2007 C1 C3 C2 C1 C2
  • 16. Key Events detection If a community has no strong links with any subsequent interval communities, we detect the fading of a community. 2006 2007 C1 C2 C3 C1 C2
  • 17. Key Events detection If a community is linked to more than one community in the subsequent interval and one of the links is a strong one we detect the forking of one or more communities out of the community characterized by the strong link. 2006 2007 C1 C1 C2
  • 18. Key Events detection If a community is linked to more than one community in the subsequent interval and none of the links is a strong one we detect the splitting of a community into multiple communities. 2006 2007 C1 C2 C3
  • 19. Key Events detection If two or more communities are linked to one community in the subsequent interval and one of the inlinks is a strong link, we detect the assimilation of one or more communities into the community C characterized by the strong link. 2006 2007 C1 C1 C2 If the communities fade after the event, they are labelled as absorbed to C.
  • 20. Key Events detection If two or more communities are linked to one community in the subsequent interval and none of the inlinks is a strong link, we detect the merging of two or more communities in a new community C. 2006 2007 C1 C3 C2 If the communities fade after the event, they are labelled as merged in C.
  • 22. Case study We applying RCMB to two research areas: World Wide Web (WWW) and Semantic Web (SW). Our study was based on a dataset built from data retrieved by means of the API provided by Microsoft Academic Search. We first retrieved authors and papers labelled with WWW and SW or with their first 150 co-occurring topics. We then run RCMB on WWW and SW in the 2000-2010 time interval with a granularity of 3. The average number of authors selected in each year was 932 for WWW and 646 for SW.
  • 24. WWW
  • 25. Future Work • Automatically generate comprehensive explanations for the identified dynamics. • Forecasting topic shifts and key events, e.g., estimating the probability that a new topic will emerge in a certain community or that two communities will merge in the coming years.
  • 26. Questions? Interested in scholarly data? SAVE-SD 2015 Semantics, Analytics, Visualisation: Enhancing Scholarly Data Workshop at 24th International World Wide Web Conference May 19, 2015 - Florence, Italy Site: cs.unibo.it/save-sd