Mining Research Publication Networks for Impact
PhD Topic Presentation

Drahomira Herrmannova
Knowledge Media Institute
The Open University

KMi Internal Seminar, November 2013

1 / 19
Table of Contents
1 Research Aim
• Motivation
• Problem statement
2 Literature review
• State of the art
• Limitations
3 Research objectives
• Research questions
• Selected approach
• Tasks and plans
4 Pilot study
5 References

2 / 19
The key question

“How to evaluate the quality of research publications?”

3 / 19
Who needs this anyway?
• Researchers
• How to select relevant literature for reading?
• Librarians
• How to select journal subscriptions?
• Universities, funding agencies and other institutions
• How to aid reviewers of funding and grant proposals, hiring
committees etc.?
• Publishers and editors
• How can publishers evaluate and promote their journals?
• Society
• How to evaluate the returns of research to society?

4 / 19
The growth of scholarly literature

Figure: Monthly submission rate (since 1991) for arXiv.org. Source: http://arxiv.org/

5 / 19
The growth of journal subscription costs

Figure: Expenditures in ARL libraries (1986–2009). Source: [1]

6 / 19
What’s being used

• Peer review
• Qualitative evaluation method
• Traditionally the main filter for controlling the quality of
published research
• Classical quantitative methods
• Typically based on citations and/or productivity
• Citation counts
• JIF
• h-index
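As an illustration of the citation-based indicators listed above, here is a minimal sketch of the h-index under its usual definition: the largest h such that an author has h papers with at least h citations each. The function name and the sample counts are hypothetical and are not taken from the slides.

```python
def h_index(citation_counts):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, citations in enumerate(ranked, start=1):
        if citations >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Hypothetical citation counts for one author's five papers.
example_counts = [10, 8, 5, 4, 3]
print(h_index(example_counts))
```

Note that the result depends only on the ranked citation counts, so one very highly cited paper moves the index no further than a moderately cited one.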

7 / 19
So, what’s the problem?

• Peer review
• Speed and cost
• Biased opinion
• Doesn’t limit the amount of published research
• Classical quantitative methods
• Quality vs. impact
• Reasons for citation
• Citation half-life
• Manipulation and gaming
• Author variability
• Field effects

8 / 19
Bibliometrics today

Two changes which influenced the evolution of bibliometrics
• creation of the Web and web-related developments
• growth of Open Access publishing

9 / 19
Bibliometrics today
Two ideas driving the current research
1 Development of new metrics (improvements and replacements of JIF)
• h-index
• Eigenfactor
• SJR
2 Concerns about the validity of using citations
• Methods using different data
• Patent analysis
• Webometrics
• Altmetrics
• Full-text analysis
• “Fixing” citations (field normalisation of indicators)

10 / 19
Limitations
• Limitations of citation-based metrics
• Citation bias
• Incomplete journal coverage
• Author variability
• Field effects
• Uncited publications
• Manipulation of metrics
• Using JIF for research evaluation
• Limitations of web-based metrics
• Gaming web-based and social metrics
• Problems of data collection
• Adoption of social media by users
• Accumulated advantage
• Limitations of text-based metrics
• Full-text not always available

11 / 19
Research questions

Question 1: What factors influence the quality of a research
publication (with regard to the publication type)?
Question 2: What is the relationship (if there is any) between the
impact of a publication, measured by the classical
bibliometric methods, and the quality of a
publication?
Question 3: How can we detect the factors influencing quality in
order to evaluate the quality of a research
publication?
Question 4: How can this evaluation be used in other disciplines?

12 / 19
Selected approach

• Single number vs. collection of metrics and indicators
• Analysis of full-text
• Until quite recently not easily available
• Full-text – the best indicator of publication quality
• For example
• Co-word analysis
• Analysis of citation context
• Semantic similarity of publications

• Additional indicators
• Famous author or collaboration with famous authors
• Cites or is cited by work outside its research area
• Paper published in a field-specific prestigious journal
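One simple way to approximate the semantic similarity of publications mentioned above is the cosine similarity of term-frequency vectors. The sketch below assumes plain-text input and a naive lowercase tokeniser. The function names and sample strings are hypothetical, and the slides do not state which similarity measure the project uses.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Build a bag-of-words term-frequency vector from lowercase alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    shared_terms = vec_a.keys() & vec_b.keys()
    dot = sum(vec_a[t] * vec_b[t] for t in shared_terms)
    norm_a = math.sqrt(sum(v * v for v in vec_a.values()))
    norm_b = math.sqrt(sum(v * v for v in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical snippets standing in for publication full texts.
paper_a = term_vector("Citation analysis of research publications")
paper_b = term_vector("Full-text analysis of research publications")
print(round(cosine_similarity(paper_a, paper_b), 3))
```

A real pipeline would probably add stop-word removal and term weighting such as tf-idf, which this sketch leaves out for brevity.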

13 / 19
Requirements for science evaluation methods
Source: [2]

1 Reliable and accurate, comparable or better than the peer review system
2 Easy to understand
3 Economical in terms of development and maintenance, time required to understand it, etc.
4 Faster than citations, at least comparable to the speed of peer review
5 Resistant to manipulation and gaming

14 / 19
Tasks and plans
Data collection

Task 1: Identify information sources that may provide relevant
publication data
• Mostly done
Task 2a: Investigate factors that influence the quality of research
publications
Task 2b: Using the identified information sources, develop various
relevant data structures such as:
• collaboration networks
• citation, co-citation and bibliographic coupling
networks
• clusters of semantically related publications
• clusters of publications corresponding to different
topics
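The citation-derived structures in Task 2b can be sketched from a plain list of citation edges. Two papers are bibliographically coupled when they cite the same reference, and co-cited when the same paper cites both. The function name, the edge format and the paper identifiers below are assumptions made for illustration only.

```python
from collections import defaultdict
from itertools import combinations


def coupling_and_cocitation(citations):
    """Count shared references and shared citers from (citing, cited) pairs."""
    references = defaultdict(set)  # paper -> set of papers it cites
    cited_by = defaultdict(set)    # paper -> set of papers citing it
    for citing, cited in citations:
        references[citing].add(cited)
        cited_by[cited].add(citing)

    # Bibliographic coupling: number of references two citing papers share.
    coupling = {}
    for a, b in combinations(sorted(references), 2):
        shared = len(references[a] & references[b])
        if shared:
            coupling[(a, b)] = shared

    # Co-citation: number of papers that cite both members of a pair.
    cocitation = {}
    for a, b in combinations(sorted(cited_by), 2):
        shared = len(cited_by[a] & cited_by[b])
        if shared:
            cocitation[(a, b)] = shared

    return coupling, cocitation


# Hypothetical citation edges: P1..P3 are citing papers, A and B are cited papers.
edges = [("P1", "A"), ("P1", "B"), ("P2", "A"), ("P2", "B"), ("P3", "B")]
coupling, cocitation = coupling_and_cocitation(edges)
print(coupling)
print(cocitation)
```

The pairwise loops are quadratic in the number of papers, so a corpus-scale version would more likely iterate over each paper's reference list or use sparse matrix products.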

15 / 19
Tasks and plans
Data analysis

Task 3a: Study the possibilities of application of NLP for the
evaluation of research publications
Task 3b: Investigate the developed data structures using graph
and network theory as well as bibliometric indicators

16 / 19
Tasks
Development of new methods

Task 4a: Analyse the possibilities of combining the studied
methods in order to design a set of new methods for
estimating quality
Task 4b: Evaluate the proposed methods against current
standards
Task 4c: Analyse the use of the new methods in other
disciplines

17 / 19
Task 1
Identification of data sources

Source: CSX, MAS, JSTOR, DBLP, CORE, ArXiv, KDD, iSearch, DBLP+C, ACM, OCC
MD: X X X X -
API: X X X X -
OAI-PMH: X X X -
dumps: X X X X X X X X X
cit.: X X X X X X X X X
FT: X * * * X X X X -

Table: Stars (*) mark sources that don’t store full-text but link to it where available. MD stands for multidisciplinary.

18 / 19
References

[1] Kyrillidou, Martha and Morris, Shaneka.
ARL Statistics 2008 - 2009.
Association of Research Libraries, Washington, DC, 2011.
[2] Taraborelli, Dario.
Soft peer review: Social software and distributed scientific
evaluation.
Proceedings of the 8th International Conference on the Design
of Cooperative Systems (COOP ’08), Carry-le-Rouet, France,
2008.

19 / 19
How many metrics?

Scientometrics: study of science and research
Bibliometrics: study of scientific literature
Informetrics: study of any type of information
Webometrics: informetric studies of the web
Cybermetrics: informetric studies of the whole Internet
Altmetrics: study of science and research using data from
social media

20 / 19

Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 

Mining Research Publication Networks for Impact -- KMi Internal Seminar

  • 1. Mining Research Publication Networks for Impact PhD Topic Presentation Drahomira Herrmannova Knowledge Media Institute The Open University KMi Internal Seminar, November 2013 1 / 19
  • 2. Table of Contents 1 Research Aim Motivation Problem statement 2 Literature review State of the art Limitations 3 Research objectives Research questions Selected approach Tasks and plans 4 Pilot study 5 References 2 / 19
  • 3. The key question “How to evaluate the quality of research publications?” 3 / 19
  • 4. Who needs this anyway? • Researchers • How to select relevant literature for reading? • Librarians • How to select journal subscriptions? • Universities, funding agencies and other institutions • How to aid reviewers of funding and grant proposals, hiring committees etc.? • Publishers and editors • How can publishers evaluate and promote their journals? • Society • How to evaluate the returns of research to the society? 4 / 19
  • 5. The growth of scholarly literature Figure: Monthly submission rate (since 1991) for Arxiv.org. Source: http://arxiv.org/ 5 / 19
  • 6. The growth of journal subscription costs Figure: Expenditures in ARL libraries (1986 – 2009). Source: [1] 6 / 19
  • 7. What’s being used • Peer review • Qualitative evaluation method • Traditionally the main filter for controlling the quality of published research • Classical quantitative methods • Typically based on citations and/or productivity • Citation counts • JIF • h-index 7 / 19
  • 8. So, what’s the problem? • Peer review • Speed and cost • Biased opinion • Doesn’t limit the amount of published research • Classical quantitative methods • Quality vs. impact • Reasons for citation • Citation half-life • Manipulation and gaming • Author variability • Field effects 8 / 19
  • 9. Bibliometrics today Two changes which influenced the evolution of bibliometrics • creation of the Web and web-related developments • growth of Open Access publishing 9 / 19
  • 10. Bibliometrics today Two ideas driving the current research 1 Development of new metrics (improvements and replacements of JIF) • h-index • Eigenfactor • SJR 2 Concerns about the validity of using citations • Methods using different data • Patent analysis • Webometrics • Altmetrics • Full-text analysis • “Fixing” citations (field normalisation of indicators) 10 / 19
  • 11. Limitations • Limitations of citation-based metrics • Citation bias • Incomplete journal coverage • Author variability • Field effects • Uncited publications • Manipulation of metrics • Using JIF for research evaluation • Limitations of web-based metrics • Gaming web-based and social metrics • Problems of data collection • Adoption of social media by users • Accumulated advantage • Limitations of text-based metrics • Full-text not always available 11 / 19
  • 12. Research questions Question 1: What factors influence the quality of a research publication (with regard to the publication type)? Question 2: What is the relationship (if there is any) between the impact of a publication, measured by the classical bibliometric methods, and the quality of a publication? Question 3: How can we detect the factors influencing quality in order to evaluate the quality of a research publication? Question 4: How can this evaluation be used in other disciplines? 12 / 19
  • 13. Selected approach • Single number vs. collection of metrics and indicators • Analysis of full-text • Until quite recently not easily available • Full-text – the best indicator of publication quality • For example • Co-word analysis • Analysis of citation context • Semantic similarity of publications • Additional indicators • Famous author or collaboration with famous authors • Cites or is cited outside its research area • Paper published in a field-specific prestigious journal 13 / 19
  • 14. Requirements for science evaluation methods Source: [2] 1 Reliable and accurate, comparable to or better than the peer review system 2 Easy to understand 3 Economical in terms of development and maintenance, time required to understand it, etc. 4 Faster than citations, at least comparable to the speed of peer review 5 Resistant to manipulation and gaming 14 / 19
  • 15. Tasks and plans Data collection Task 1: Identify information sources that may provide relevant publication data • Mostly done Task 2a: Investigate factors that influence the quality of research publications Task 2b: Using the identified information sources, develop various relevant data structures such as: • collaboration networks • citation, co-citation and bibliographic coupling networks • clusters of semantically related publications • clusters of publications corresponding to different topics 15 / 19
  • 16. Tasks and plans Data analysis Task 3a: Study the possibilities of application of NLP for the evaluation of research publications Task 3b: Investigate the developed data structures using graph and network theory as well as bibliometric indicators 16 / 19
  • 17. Tasks Development of new methods Task 4a: Analyse the possibilities of combining the studied methods in order to design a set of new methods for estimating quality Task 4b: Evaluate the proposed methods against current standards Task 4c: Analyse the use of the new methods in other disciplines 17 / 19
  • 18. Task 1 Identification of data sources Source CSX MAS JSTOR DBLP CORE ArXiv KDD iSearch DBLP+C ACM OCC MD X X X X - API X X X X - OAI-PMH X X X - dumps X X X X X X X X X cit. X X X X X X X X X FT X * * * X X X X - Table: Stars (*) mark sources that don't store full text but link to it where available. MD stands for multidisciplinary. 18 / 19
  • 19. References [1] Kyrillidou, Martha and Morris, Shaneka. ARL Statistics 2008 - 2009. Association of Research Libraries, Washington, DC, 2011. [2] Taraborelli, Dario. Soft peer review: Social software and distributed scientific evaluation. Proceedings of the 8th International Conference on the Design of Cooperative Systems (COOP ’08), Carry-le-Rouet, France, 2008. 19 / 19
  • 20. How many metrics? Scientometrics: study of science and research Bibliometrics: study of scientific literature Informetrics: study of any type of information Webometrics: informetric studies of the web Cybermetrics: informetric studies of the whole Internet Altmetrics: study of science and research using data from social media 20 / 19