Towards a Semantic Citation
Index for the German Social
Sciences
William Dinkel, Philipp Mayr,
Frank Sawitzky, Andreas Strotmann*
GESIS – Leibnizinstitut für Sozialwissenschaften, Köln
*alphabetic ordering of names
The Problem
● German sociology / political science research output /
impact coverage in SSCI
– SOLIS: ~ 1/3 each of books, journal articles, chapters
● Cover ~ 50% of German researchers' “relevant” output*
– ~1/3 of core journals covered in SSCI**
– So, ~10% of literature indexed there
– Very low percentage of cited literature indexed in SSCI***
● * Research rating exercise Sociology, Wissenschaftsrat
● ** compared to SOLIS “class A” journals
● *** Chi (IfQ) study of core German political science journals
The Problem (ctd.)
● Citation culture in the social sciences
– Citations are important
● Perhaps even more so than in the natural sciences
– Some authors are extremely highly cited (Weber, Marx...)
● Suspect very high(!!) Gini coefficient in distribution
● But: it is their books (not articles) that are highly cited!
– Significant fraction of citations are contrastive
– Datasets (survey results) highly mentioned, not cited
– Multilingual citation environment
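The suspected concentration of citations on a few authors can be quantified with the Gini coefficient mentioned above. The following is a minimal sketch, not taken from the slides; the citation counts are made-up illustrative values, and `gini` is a hypothetical helper name.

```python
def gini(counts):
    """Gini coefficient of non-negative values (0 = perfectly equal, close to 1 = highly concentrated)."""
    values = sorted(counts)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Standard formula on ascending-sorted data, with ranks i = 1..n:
    # G = 2 * sum(i * x_i) / (n * sum(x_i)) - (n + 1) / n
    weighted = sum(i * x for i, x in enumerate(values, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


# Hypothetical citation counts per author: one "classic" dominates.
citations = [0, 0, 1, 1, 2, 3, 5, 400]
print(round(gini(citations), 3))
```

A value near 1 would support the suspicion that a handful of authors receive most citations.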
The Need
● German social scientists & SSCI
– They consider their field inadequately
represented in “the” citation index
– But use it quite heavily anyway
● e.g. for research, evaluation
● Survey of sociologists and political scientists, GESIS
The Need (ctd.)
● We need a citation index for the (German)
social sciences
– Existing citation indexes frankly inadequate
● No reasonable effort in sight to resolve this
– Hence, we need to build our own
● If we want to do serious bibliometrics on SocSci
● If we want to provide a decent social science citation
index in, e.g., sociology or political science
The Need (ctd.)
● We need an open semantic citation index for the
(German) social sciences
– Incorporate referential semantics into search engine
● e.g., reliable hyperlinks to referenced articles
● e.g., equivalence or hierarchy relations for translations, aggregations
– Publish referential semantics as linked open data
● Allow other institutions to discover references to their holdings in our
database(s)
● Invite them to offer the same service to us, too
– Bibliometrics requires cleaned/disambiguated data!
The Long-Term Goal
A globally distributed open semantic citation index
● Based on digital full-text collections (cooperate with publishers)
– Semi-automatic / Computer-aided
– Algorithms + professional indexers (authority files) + crowd sourcing +...
● Reference extraction (with contexts)
– Enables sentiment analysis (important in social sciences)
● Reference matching
– Enables referential semantics
● Open reference semantics information exchange
– „<this> paper indexed in our collection cites <that> paper indexed in yours“
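One possible way to publish such a "this paper cites that paper" statement as linked open data is an RDF triple. The sketch below emits one N-Triples line using the `cites` property of the CiTO ontology; the choice of CiTO and the record URIs are assumptions for illustration, not something the slides specify.

```python
# CiTO "cites" property; using CiTO here is an assumption, not the authors' stated choice.
CITO_CITES = "http://purl.org/spar/cito/cites"


def cites_triple(citing_uri, cited_uri):
    """Return one N-Triples line stating that citing_uri cites cited_uri."""
    return f"<{citing_uri}> <{CITO_CITES}> <{cited_uri}> ."


# Hypothetical record URIs for a paper in "our" collection and one in "yours".
print(cites_triple("https://example.org/ours/rec/123",
                   "https://example.net/yours/rec/456"))
```

Another institution could scan such triples for object URIs in its own namespace to discover references to its holdings.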
Sowiport – German Social
Sciences Research Information
● GESIS' Sowiport portal: Single access point to 18 databases, including
– 6 Cambridge Scientific Abstracts databases on social sciences
– GESIS' own SOLIS (literature) and SOFIS (projects) RISs
– SSOAR (Social Science Open Access Repository) @ GESIS
● Goal: Extend to social science citation index
– CSA comes with cited refs for some docs
– SSOAR – extract refs from OA full text and index in Sowiport
– Extract links to data sets / surveys used but not cited from full texts
– Crawl Google Scholar for citations to “our” docs
– Link to/from RePEc (and other) data ...
First Steps: National CSA
Social Sciences Citation Index
● Cambridge Scientific Abstracts – Social Sciences
– 6 CSA databases offered & run by GESIS
● National research licence for Germany
– Include >8 million references
● A good starting point
● Recently activated in Sowiport
● ~25-30% refs found to link to other records
– Using simple matching algorithm
– Biased towards precision (>90%), not recall
First Steps:
CSA Reference Matching
Reference matching is much(!) harder in social sciences
● Social science publication culture
– Books & chapters, and articles
● Published in roughly equal numbers, books cited most
– Multilingual publishing
● English is not the only language
● Publications may be cited in translation, different editions
– Broad referencing behaviour
● Large proportion of references to non-source items
=> A first-try high-precision match rate of ~25-30% is an excellent result
● Close to expected rate of references to journal articles
CSA References in
GESIS' Sowiport Database
● Each full record contains „references“ and
„cited-by“ information
– Some with actionable links to full records
● Combines WoS/Scopus and Google
Scholar approaches to citation index
construction
First Steps: Citation Extraction
● SSOAR full texts
– First successful experiments to extract
references from full text
● Based on RePEc's ParsCit
● Extended to German citation styles
– First successful experiments to identify
acknowledgments of large surveys in text
Next Steps: “Haus der
Sozialwissenschaften”
● Goal: Digital Special Collection for German
Social Scientists
– Digital access to full literature in one place
● Large parts unfortunately only accessible in-house
● Collect existing digital versions from “all” sources
● Digitize “important” literature where necessary
● Full text of literature, survey data, project descriptions...
● Joint DFG application with Sondersammelgebiet
Sozialwissenschaften, Univ.- & Stadt-Bibl. Köln
Next Step: “GESIS Application
Laboratory Web 3.0”
● Full text collection and processing results available in toto to
visiting researchers
– Social scientists
– Computer scientists
– Computer linguists
– Bibliometricians: You are invited!!!
● Upgrade database
– e.g. disambiguation of authors, institutions, titles
– e.g. incorporation of external authority files / semantic web
Experiment: E-Traces
● Goal: Tracking ideas through the sociology literature
(“text re-use”)
– Experiment (ongoing): attempt to categorize citation contexts
as positive/neutral/negative (sentiment analysis)
– BMBF funded project with U Leipzig, U Göttingen
● Long term use: identify negative citations and contrastive
co-citations for social science citation index
Summary
● For GESIS' core covered social sciences (German sociology, political
science), traditional citation indexes are inadequate
● and Google Scholar only provides “cited by” info
● Yet, GESIS' core audience uses them
● and complains about their inadequacies
● Bibliometrics requires an adequate citation index for reliable results
(given typical distributions)
● but no improvements in sight for classic indexes
● Therefore, we need to build our own
● and we have the expertise at GESIS to succeed where others have failed
● and we have taken the first few steps in this direction
Summary (ctd.)
● In the long run, we would like
– A citation index that is
● Semantic (with explicit referential semantics)
● Distributed (each institution builds their own)
● Open (each institution shares semantics as LOD)
● Global (implemented world wide)
● Cooperative (indexers+researchers contribute)
● Computer-aided (software to get started, people to improve)
– Based on best practices we hope to develop
Thank You!
Two Models of Citation Graphs
Bipartite (Classic IR) Model:
Citing and Cited Partitions
• Citing nodes: full
bibliographic records
• Cited nodes: „keys“, e.g.
– First author name & initials
+ Year of publication
+ Journal key, + volume,
+number, +page
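A cited-node key of the kind listed above could be built roughly as follows. This is a minimal sketch under assumptions: the normalization rules (accent stripping, uppercasing, dropping punctuation) and the `|` separator are illustrative choices, and `match_key` is a hypothetical helper, not a format used by any particular database.

```python
import re
import unicodedata


def match_key(first_author, year, journal_key, volume, number, page):
    """Build a normalized key for a cited reference (a 'cited node' in the bipartite model)."""
    def norm(text):
        # Strip accents, uppercase, and drop everything except letters and digits.
        ascii_text = (unicodedata.normalize("NFKD", str(text))
                      .encode("ascii", "ignore").decode())
        return re.sub(r"[^A-Z0-9]", "", ascii_text.upper())

    parts = (first_author, year, journal_key, volume, number, page)
    return "|".join(norm(p) for p in parts)


# Two differently formatted citations of the same (hypothetical) article
# collapse to the same key.
print(match_key("Müller, H.-P.", 1998, "KZfSS", 50, 2, "245"))
print(match_key("MULLER HP", "1998", "kzfss", "50", "2", "245"))
```

References that share a key are treated as the same cited document; anything the key omits, such as the title, cannot help to tell two candidates apart.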
Uniform Model:
Interconnected Documents
• All nodes: bibliographic
records
– Citing nodes full records
– Cited nodes mostly simplified
records
– „Matched“ cited nodes have
full records
Citation Matching
• Goal: Citation network
–Unique nodes for documents
• Sub tasks:
–Match cited references to each other
–Match cited references to full records
–Match full records across databases
Matching Citations to Full Records
„Internal“ matching
● Direct access to
full database(s)
● Options: match
key based or
algorithmic
matching
„External“ matching
● Access only via
search engine
● Options: matching
against same or
different database
Scopus Citations
• Cited reference info contains
–Up to 8 author names (family+inits)
• Including last author
• Frequently as cited (not standardized or corrected)
–Publication year, title, journal name/vol./nr./p.
• Frequently as cited
–Reasonably well parsable, not normalized
Matching Scopus Citations to
Scopus Full Records
External matching: Scopus search engine
● „Algorithm“: parse Scopus reference into subfields,
construct complex search queries for Scopus engine,
download resulting full records, choose best fit
● High precision searches: complex searches allowed,
many searchable fields
– Improve recall by successively vaguer queries
● Small number of downloads allowed, so many queries
needed to construct sizable citation index
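The "successively vaguer queries" idea can be sketched as a loop over field subsets, from most to least specific. This is an illustration under assumptions, not the authors' implementation: the field names, the relaxation levels, and the rule of accepting only an unambiguous single hit (instead of choosing a best fit among several) are all hypothetical, and `search` stands in for whatever search engine client is available.

```python
def relaxed_queries(ref):
    """Yield field constraints from the most specific to the vaguest query."""
    levels = [
        ("title", "first_author", "year", "journal", "volume", "page"),
        ("title", "first_author", "year"),
        ("first_author", "year", "journal"),
        ("title",),
    ]
    for fields in levels:
        query = {f: ref[f] for f in fields if ref.get(f)}
        if query:
            yield query


def match_reference(ref, search):
    """Try successively vaguer queries; return the first unambiguous hit or None.

    `search` is a caller-supplied function (a stand-in for a search engine
    client) that takes a dict of field constraints and returns a list of records.
    """
    tried = []
    for query in relaxed_queries(ref):
        if query in tried:  # skip levels that collapse to an earlier query
            continue
        tried.append(query)
        hits = search(query)
        if len(hits) == 1:  # simplification: accept only a single hit
            return hits[0]
    return None
```

Starting with the strictest query keeps precision high; each relaxation step trades some precision for recall and costs one more request against a limited download quota.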
Matching Scopus Citations to
PubMed Full Records
CrossDB External Match: Scopus/Medline
● „Algorithm“: parse Scopus reference, construct
PubMed batch citation matcher queries, download
matched PubMed(!) records
– Only for biomedical fields
– Result is a citation network of PubMed records, not Scopus
– Requires matching of Scopus citing records as well
● Either direction (Scopus<->PubMed)
● Both include PubMed IDs
Matching Web of Science
References to WoS Full Records
WoS cited reference info contains
● First author (last name plus initials)
● Publication year
● Source title code
● Vol./num./page
● More and more frequently DOI
No title included!
Matching WoS Cited References
to WoS Record
External matching via WoS web search
● Only small queries supported
– Many downloads necessary
● Crucial search fields not supported (vol., num.)
– Therefore highly ambiguous results to be expected
● Requires translation of source title from code to full
● Requires algorithmic filtering of correct hit from long
result list
Matching WoS references to
WoS
● Internal Matching
● Kompetenzzentrum Bibliometrie has full local
copy of WoS data
● Experiment: good „match key“ to support
this?
– Dinkel (2011), ISSI
– Results in error estimates for references
Building a Citation Index for the
Social Sciences: CSA
● Basis: Cambridge Scientific Abstracts (Social Sciences)
– To be extended with additional sources of cited refs info
● Nationwide licensing scheme for Germany administered at
GESIS
● Six CSA/Proquest databases incorporated into GESIS'
„Sowiport“ social sciences portal
– Now including ~8.5 million cited references
● No matchings to full records provided by Proquest
● Early experimental results available on portal
– Focus on precision, not recall
Citation Matching in CSA
„Algorithm“:
● Internal matching
– However, across multiple CSA databases
● Parse references; construct search queries (Solr)
– exact title and year
– or fuzzy title and year and ISSN;
– choose first match
● Favors precision over recall
– Fuzzy match only for journal literature, for example
● Research to be continued!
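The two-pass rule above (exact title and year first, then fuzzy title with year and ISSN, taking the first match) might look like this in outline. The sketch works on in-memory lists with Python's `difflib` instead of Solr queries; the similarity threshold of 0.9, the title normalization, and the record field names are assumptions for illustration.

```python
from difflib import SequenceMatcher


def normalize(title):
    """Lowercase and keep only alphanumerics separated by single spaces."""
    cleaned = "".join(ch if ch.isalnum() else " " for ch in title.lower())
    return " ".join(cleaned.split())


def match_csa_reference(ref, records, threshold=0.9):
    """Return the first record matching ref, or None.

    Pass 1: exact (normalized) title and year.
    Pass 2: fuzzy title plus year and ISSN, so only journal literature qualifies.
    """
    ref_title = normalize(ref.get("title", ""))
    if not ref_title or not ref.get("year"):
        return None
    for rec in records:
        if rec.get("year") == ref["year"] and normalize(rec.get("title", "")) == ref_title:
            return rec
    if ref.get("issn"):
        for rec in records:
            if rec.get("year") != ref["year"] or rec.get("issn") != ref["issn"]:
                continue
            rec_title = normalize(rec.get("title", ""))
            if SequenceMatcher(None, ref_title, rec_title).ratio() >= threshold:
                return rec
    return None
```

Requiring an ISSN for the fuzzy pass is what restricts it to journal literature, which favors precision at the cost of recall for books and chapters.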
Experiments - Datasets
Caveat
● Scopus/PubMed and WoS experiments run on stem cell
research field (biomedical area)
– <100k citing docs, ~1 million references
– >95% refs are to journal articles
● CSA experiment run on social sciences databases
– ~1 million full records, ~10 million references
● Only recent records contain refs
● Many(!!) refs to non-journal articles
Some Rough Numbers
● Scopus ↔ PubMed full record matching
– >95% match rate
● Scopus references → Scopus/PubMed full record
– ~90% match rate „exact“ + ~5% fuzzy match
– ~1% false positives needed to be filtered out
● WoS references → WoS full record
– ~90% match rate
– >>50% false positives needed to be filtered out
● CSA references → CSA full record
– ~30% match rate
– ~1% false positives
CSA reference information
● Fields: citing ID, reference ID, authors, title, year, publisher,
source title/num/vol/p., ISSN
– Format changes, though
● Mostly automatically parsed, as fields frequently mis-assigned
● Example (book):
<CI>200601317</CI><CA>Voice UK</CA>
<CT>No More Abuse.</CT><CY>2000</CY>
<CZ>Derby: Voice UK</CZ>
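A tagged reference like the example above can be split into fields with a small regular expression. This is a minimal sketch; the readable field names are guesses from the example (the meaning of `CI` is left open because it could be either ID), and `parse_csa_reference` is a hypothetical helper.

```python
import re

# Readable names are guessed from the example record; unmapped tags
# (such as CI) are kept under their raw two-letter code.
FIELD_NAMES = {"CA": "authors", "CT": "title", "CY": "year", "CZ": "publisher"}


def parse_csa_reference(raw):
    """Extract <XX>value</XX> pairs from one raw reference string."""
    fields = {}
    for tag, value in re.findall(r"<([A-Z]{2})>(.*?)</\1>", raw, flags=re.DOTALL):
        fields[FIELD_NAMES.get(tag, tag)] = value.strip()
    return fields


raw = ("<CI>200601317</CI><CA>Voice UK</CA>\n"
       "<CT>No More Abuse.</CT><CY>2000</CY>\n"
       "<CZ>Derby: Voice UK</CZ>")
print(parse_csa_reference(raw))
```

Because fields are frequently mis-assigned in the source data, parsed values would still need plausibility checks (for instance, that the year is a four-digit number) before matching.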
Discussion
● Plenty of research opportunities to improve matching of
non-journal literature references to source records
– e.g. to GESIS' own SOLIS / SOFIS / SSOAR databases
– e.g. by crawling Google Scholar for reference links
– You are invited to try your hands at this, too!
● See below: GESIS Application Laboratory

More Related Content

What's hot

Political Science Beginning Research
Political Science Beginning ResearchPolitical Science Beginning Research
Political Science Beginning Researchannbee
 
Political Science Senior Seminar Fall 2011
Political Science Senior Seminar Fall 2011Political Science Senior Seminar Fall 2011
Political Science Senior Seminar Fall 2011annbee
 
W13 libr250 evaluating and citing websites1
W13 libr250 evaluating and citing websites1W13 libr250 evaluating and citing websites1
W13 libr250 evaluating and citing websites1lterrones
 
W13 libr250 do_iv_urlciations
W13 libr250 do_iv_urlciationsW13 libr250 do_iv_urlciations
W13 libr250 do_iv_urlciationslterrones
 
Poli Sci Fall2010
Poli Sci Fall2010Poli Sci Fall2010
Poli Sci Fall2010annbee
 
Web of Science
Web of ScienceWeb of Science
Web of Science
guest74bab9
 
W13 libr250 databases_scholarlyvs_popular
W13 libr250 databases_scholarlyvs_popularW13 libr250 databases_scholarlyvs_popular
W13 libr250 databases_scholarlyvs_popularlterrones
 
Referencing and zotero
Referencing and zoteroReferencing and zotero
Referencing and zotero
kevinwilsongold
 
Literature search and review
Literature search and reviewLiterature search and review
Literature search and review
Graça Gabriel
 
Google Scholar as a research and evaluation tool
Google Scholar as a research and evaluation toolGoogle Scholar as a research and evaluation tool
Google Scholar as a research and evaluation tool
Alvaro Cabezas Clavijo
 
How to find scholarly resources.updated 2020
How to find scholarly resources.updated 2020How to find scholarly resources.updated 2020
How to find scholarly resources.updated 2020
Zakir Hossain/ICS, Zurich
 
Psychology Grad Res08
Psychology Grad Res08Psychology Grad Res08
Psychology Grad Res08annbee
 
Research Metrics
Research Metrics Research Metrics
Research Metrics
Naz Torabi
 
FSA1201 Library tutorial
FSA1201 Library tutorialFSA1201 Library tutorial
FSA1201 Library tutorial
nuslibraries
 
Year 9 Research Success - BGS Libraries
Year 9 Research Success - BGS LibrariesYear 9 Research Success - BGS Libraries
Year 9 Research Success - BGS Libraries
BGS Library
 
Ocn 1010 special assignments (fall 2014)
Ocn 1010 special assignments (fall 2014)Ocn 1010 special assignments (fall 2014)
Ocn 1010 special assignments (fall 2014)
Rob_Sippel
 
MySearch Overview
MySearch OverviewMySearch Overview
MySearch Overview
BGS Library
 
Research Sources & Techniques
Research Sources & TechniquesResearch Sources & Techniques
Research Sources & TechniquesGina Singh
 
How to use EndNote for managing your references
How to use EndNote for managing your referencesHow to use EndNote for managing your references
How to use EndNote for managing your references
Md. Zahid Hossain Shoeb
 
Referencing mla style powerpoint
Referencing mla style powerpointReferencing mla style powerpoint
Referencing mla style powerpoint
marianogalan23
 

What's hot (20)

Political Science Beginning Research
Political Science Beginning ResearchPolitical Science Beginning Research
Political Science Beginning Research
 
Political Science Senior Seminar Fall 2011
Political Science Senior Seminar Fall 2011Political Science Senior Seminar Fall 2011
Political Science Senior Seminar Fall 2011
 
W13 libr250 evaluating and citing websites1
W13 libr250 evaluating and citing websites1W13 libr250 evaluating and citing websites1
W13 libr250 evaluating and citing websites1
 
W13 libr250 do_iv_urlciations
W13 libr250 do_iv_urlciationsW13 libr250 do_iv_urlciations
W13 libr250 do_iv_urlciations
 
Poli Sci Fall2010
Poli Sci Fall2010Poli Sci Fall2010
Poli Sci Fall2010
 
Web of Science
Web of ScienceWeb of Science
Web of Science
 
W13 libr250 databases_scholarlyvs_popular
W13 libr250 databases_scholarlyvs_popularW13 libr250 databases_scholarlyvs_popular
W13 libr250 databases_scholarlyvs_popular
 
Referencing and zotero
Referencing and zoteroReferencing and zotero
Referencing and zotero
 
Literature search and review
Literature search and reviewLiterature search and review
Literature search and review
 
Google Scholar as a research and evaluation tool
Google Scholar as a research and evaluation toolGoogle Scholar as a research and evaluation tool
Google Scholar as a research and evaluation tool
 
How to find scholarly resources.updated 2020
How to find scholarly resources.updated 2020How to find scholarly resources.updated 2020
How to find scholarly resources.updated 2020
 
Psychology Grad Res08
Psychology Grad Res08Psychology Grad Res08
Psychology Grad Res08
 
Research Metrics
Research Metrics Research Metrics
Research Metrics
 
FSA1201 Library tutorial
FSA1201 Library tutorialFSA1201 Library tutorial
FSA1201 Library tutorial
 
Year 9 Research Success - BGS Libraries
Year 9 Research Success - BGS LibrariesYear 9 Research Success - BGS Libraries
Year 9 Research Success - BGS Libraries
 
Ocn 1010 special assignments (fall 2014)
Ocn 1010 special assignments (fall 2014)Ocn 1010 special assignments (fall 2014)
Ocn 1010 special assignments (fall 2014)
 
MySearch Overview
MySearch OverviewMySearch Overview
MySearch Overview
 
Research Sources & Techniques
Research Sources & TechniquesResearch Sources & Techniques
Research Sources & Techniques
 
How to use EndNote for managing your references
How to use EndNote for managing your referencesHow to use EndNote for managing your references
How to use EndNote for managing your references
 
Referencing mla style powerpoint
Referencing mla style powerpointReferencing mla style powerpoint
Referencing mla style powerpoint
 

Viewers also liked

Efficient blocking method for a large scale citation matching
Efficient blocking method for a large scale citation matchingEfficient blocking method for a large scale citation matching
Efficient blocking method for a large scale citation matching
Mateusz Fedoryszak
 
Searching Chemical Abstracts print edition
Searching Chemical Abstracts print editionSearching Chemical Abstracts print edition
Searching Chemical Abstracts print editionForsyth Library
 
CAS: Transforming Discovery
CAS: Transforming DiscoveryCAS: Transforming Discovery
CAS: Transforming Discovery
CAS
 
Emerging Sources Citation Index – A new edition of Web Of Science
Emerging Sources Citation Index – A new edition of Web Of ScienceEmerging Sources Citation Index – A new edition of Web Of Science
Emerging Sources Citation Index – A new edition of Web Of Science
State Of Innovation
 
Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016
GESIS
 
Recent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsRecent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization Systems
GESIS
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsGESIS
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsGESIS
 
Establishing an Online Access Panel for Interactive Information Retrieval Res...
Establishing an Online Access Panel for Interactive Information Retrieval Res...Establishing an Online Access Panel for Interactive Information Retrieval Res...
Establishing an Online Access Panel for Interactive Information Retrieval Res...
GESIS
 
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
GESIS
 
Are topic-specific search term, journal name and author name recommendations ...
Are topic-specific search term, journal name and author name recommendations ...Are topic-specific search term, journal name and author name recommendations ...
Are topic-specific search term, journal name and author name recommendations ...
GESIS
 
Pennants for Descriptors
Pennants for DescriptorsPennants for Descriptors
Pennants for DescriptorsGESIS
 
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
GESIS
 
Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...
GESIS
 
Past, present and future of scientific information
Past, present and future of scientific informationPast, present and future of scientific information
Past, present and future of scientific information
GESIS
 
Opening Scholarly Communication in the Social Sciences
Opening Scholarly Communication in the Social SciencesOpening Scholarly Communication in the Social Sciences
Opening Scholarly Communication in the Social Sciences
GESIS
 
Opening Scholarly Communication in Social Sciences (OSCOSS)
Opening Scholarly Communication in Social Sciences (OSCOSS)Opening Scholarly Communication in Social Sciences (OSCOSS)
Opening Scholarly Communication in Social Sciences (OSCOSS)
GESIS
 
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshopIntroduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshopGESIS
 
Recent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalRecent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information Retrieval
GESIS
 

Viewers also liked (20)

Efficient blocking method for a large scale citation matching
Efficient blocking method for a large scale citation matchingEfficient blocking method for a large scale citation matching
Efficient blocking method for a large scale citation matching
 
Ssci
SsciSsci
Ssci
 
Searching Chemical Abstracts print edition
Searching Chemical Abstracts print editionSearching Chemical Abstracts print edition
Searching Chemical Abstracts print edition
 
CAS: Transforming Discovery
CAS: Transforming DiscoveryCAS: Transforming Discovery
CAS: Transforming Discovery
 
Emerging Sources Citation Index – A new edition of Web Of Science
Emerging Sources Citation Index – A new edition of Web Of ScienceEmerging Sources Citation Index – A new edition of Web Of Science
Emerging Sources Citation Index – A new edition of Web Of Science
 
Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016
 
Recent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsRecent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization Systems
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations Systems
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
 
Establishing an Online Access Panel for Interactive Information Retrieval Res...
Establishing an Online Access Panel for Interactive Information Retrieval Res...Establishing an Online Access Panel for Interactive Information Retrieval Res...
Establishing an Online Access Panel for Interactive Information Retrieval Res...
 
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
 
Are topic-specific search term, journal name and author name recommendations ...
Are topic-specific search term, journal name and author name recommendations ...Are topic-specific search term, journal name and author name recommendations ...
Are topic-specific search term, journal name and author name recommendations ...
 
Pennants for Descriptors
Pennants for DescriptorsPennants for Descriptors
Pennants for Descriptors
 
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
 
Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...
 
Past, present and future of scientific information
Past, present and future of scientific informationPast, present and future of scientific information
Past, present and future of scientific information
 
Opening Scholarly Communication in the Social Sciences
Opening Scholarly Communication in the Social SciencesOpening Scholarly Communication in the Social Sciences
Opening Scholarly Communication in the Social Sciences
 
Opening Scholarly Communication in Social Sciences (OSCOSS)
Opening Scholarly Communication in Social Sciences (OSCOSS)Opening Scholarly Communication in Social Sciences (OSCOSS)
Opening Scholarly Communication in Social Sciences (OSCOSS)
 
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshopIntroduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
 
Recent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalRecent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information Retrieval
 

Similar to Towards a Semantic Citation Index for the German Social Sciences

How to prepare a research paper and its evaluation tools
How to prepare a research paper and its evaluation toolsHow to prepare a research paper and its evaluation tools
How to prepare a research paper and its evaluation tools
Mohanapriya Suresh
 
Journal Impact Factors and Citation Analysis
Journal Impact Factors and Citation AnalysisJournal Impact Factors and Citation Analysis
Journal Impact Factors and Citation Analysisrepayne
 
Research impact metrics for librarians: calculation & context
Research impact metrics for librarians: calculation & contextResearch impact metrics for librarians: calculation & context
Research impact metrics for librarians: calculation & context
Library_Connect
 
Google Scholar as a research and evaluation tool
Google Scholar as a research and evaluation toolGoogle Scholar as a research and evaluation tool
Google Scholar as a research and evaluation tool
EC3metrics Spin-Off
 
British Library
British LibraryBritish Library
British Library
clarivate
 
Bibliometrics jul 2014
Bibliometrics jul 2014Bibliometrics jul 2014
Bibliometrics jul 2014
bradscifi
 
Citation Metrics and Journal Rankings
Citation Metrics and Journal RankingsCitation Metrics and Journal Rankings
Citation Metrics and Journal Rankings
ArielNeff
 
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Levine-Clark, Michael, “Citation Indexes,” Seminario Entre Pares, Puebla, Mex...
Michael Levine-Clark
 
Library connect-webinar---february-2020---slides 560401
Library connect-webinar---february-2020---slides 560401Library connect-webinar---february-2020---slides 560401
Library connect-webinar---february-2020---slides 560401
Ricardo Valls P. Geo., M. Sc.
 
Scholarly Metrics Bootcamp USAIN 2014 Pre-conference workshop
Scholarly Metrics Bootcamp USAIN 2014 Pre-conference workshopScholarly Metrics Bootcamp USAIN 2014 Pre-conference workshop
Scholarly Metrics Bootcamp USAIN 2014 Pre-conference workshop
Plethora121
 
Methodology ProjectThis project will be completed in steps wi.docx
Methodology ProjectThis project will be completed in steps wi.docxMethodology ProjectThis project will be completed in steps wi.docx
Methodology ProjectThis project will be completed in steps wi.docx
buffydtesurina
 
Raising Your Research Profile: Evidence of Exposure Measuring Your Research I...
Raising Your Research Profile: Evidence of Exposure Measuring Your Research I...Raising Your Research Profile: Evidence of Exposure Measuring Your Research I...
Raising Your Research Profile: Evidence of Exposure Measuring Your Research I...
NTU Library Research Team
 
Gaining Insights Through Bibliometric Analysis
Gaining Insights Through Bibliometric AnalysisGaining Insights Through Bibliometric Analysis
Gaining Insights Through Bibliometric Analysis
Elaine Lasda
 
Education_selecting key discovery tools for education research_v1_2021.pptx
Education_selecting key discovery tools for education research_v1_2021.pptxEducation_selecting key discovery tools for education research_v1_2021.pptx
Education_selecting key discovery tools for education research_v1_2021.pptx
ShivamChaturvedi67
 
Research.pptx
Research.pptxResearch.pptx
Research.pptx
VictorLucas76
 
Search Strategies - using the library catalogue.pptx
Search Strategies - using the library catalogue.pptxSearch Strategies - using the library catalogue.pptx
Search Strategies - using the library catalogue.pptx
National College of Art & Design Library
 
slide (2).pdf
slide (2).pdfslide (2).pdf
slide (2).pdf
shelememosisa
 
Bibliometrics presentation, Window on Research June 2010
Bibliometrics presentation, Window on Research June 2010Bibliometrics presentation, Window on Research June 2010
Bibliometrics presentation, Window on Research June 2010Jenny Delasalle
 
Critical reading skills
Critical reading skillsCritical reading skills
Critical reading skills
Hazel Hall
 


Towards a Semantic Citation Index for the German Social Sciences

  • 1. Towards a Semantic Citation Index for the German Social Sciences William Dinkel, Philipp Mayr, Frank Sawitzky, Andreas Strotmann* GESIS – Leibnizinstitut für Sozialwissenschaften, Köln *alphabetic ordering of names
  • 2. The Problem ● German sociology / political science research output / impact coverage in SSCI – SOLIS: ~ 1/3 each of books, journal articles, chapters ● Cover ~ 50% of German researchers' “relevant” output* – ~1/3 of core journals covered in SSCI** – So, ~10% of literature indexed there – Very low percentage of cited literature indexed in SSCI*** ● * Research rating exercise Sociology, Wissenschaftsrat ● ** compared to SOLIS “class A” journals ● *** Chi (IfQ) study of core German political science journals
  • 3. The Problem (ctd.) ● Citation culture in the social sciences – Citations are important ● Perhaps even more so than in the natural sciences – Some authors are extremely highly cited (Weber, Marx...) ● Suspect very high(!!) Gini coefficient in distribution ● But: it is their books (not articles) that are highly cited! – Significant fraction of citations are contrastive – Datasets (survey results) highly mentioned, not cited – Multilingual citation environment
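The suspected concentration of citations on a few authors can be quantified with the Gini coefficient. A minimal sketch, assuming a plain list of per-author citation counts; the function name `gini` and the sample numbers are invented for illustration and are not data from the talk:

```python
def gini(counts):
    """Gini coefficient of non-negative counts (0 = perfectly equal)."""
    xs = sorted(counts)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted sum over the ascending-sorted values (ranks start at 1).
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return (2 * weighted) / (n * total) - (n + 1) / n


# Hypothetical citation counts: every author cited equally often,
# versus a single author collecting all citations.
equal = gini([5, 5, 5, 5])
concentrated = gini([0, 0, 0, 10])
```

For n authors the value ranges from 0 (all equally cited) up to (n - 1) / n (one author receives every citation).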
  • 4. The Need ● German social scientists & SSCI – They consider their field inadequately represented in “the” citation index – But use it quite heavily anyway ● e.g. for research, evaluation ● Survey of sociologists and political scientists, GESIS
  • 5. The Need (ctd.) ● We need a citation index for the (German) social sciences – Existing citation indexes frankly inadequate ● No reasonable effort in sight to resolve this – Hence, we need to build our own ● If we want to do serious bibliometrics on SocSci ● If we want to provide a decent social science citation index in, e.g., sociology or political science
  • 6. The Need (ctd.) ● We need an open semantic citation index for the (German) social sciences – Incorporate referential semantics into search engine ● e.g., reliable hyperlinks to referenced articles ● e.g., equivalence or hierarchy relations for translations, aggregations – Publish referential semantics as linked open data ● Allow other institutions to discover references to their holdings in our database(s) ● Invite them to offer the same service to us, too – Bibliometrics requires cleaned/disambiguated data!
  • 7. The Long-Term Goal A globally distributed open semantic citation index ● Based on digital full-text collections (cooperate with publishers) – Semi-automatic / Computer-aided – Algorithms + professional indexers (authority files) + crowd sourcing +... ● Reference extraction (with contexts) – Enables sentiment analysis (important in social sciences) ● Reference matching – Enables referential semantics ● Open reference semantics information exchange – „<this> paper indexed in our collection cites <that> paper indexed in yours“
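One way to picture the "<this> paper cites <that> paper" exchange is as a single linked-data statement. The sketch below is an assumption, not the format proposed in the talk: it serialises the statement as an N-Triples line using the `cites` property of the CiTO ontology, and the two document URIs are hypothetical placeholders.

```python
# CiTO "cites" property; choosing CiTO here is an assumption of this sketch,
# the slides do not name a vocabulary.
CITO_CITES = "http://purl.org/spar/cito/cites"


def citation_triple(citing_uri: str, cited_uri: str) -> str:
    """Return one N-Triples line stating that citing_uri cites cited_uri."""
    return f"<{citing_uri}> <{CITO_CITES}> <{cited_uri}> ."


# Hypothetical identifiers for a record in "our" and in "your" collection.
line = citation_triple(
    "http://example.org/our-collection/doc/123",
    "http://example.org/your-collection/doc/456",
)
```

Published this way, a partner institution could look up its own identifiers in object position to discover references to its holdings.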
  • 8. Sowiport – German Social Sciences Research Information ● GESIS' Sowiport portal: Single access point to 18 databases, including – 6 Cambridge Scientific Abstracts databases on social sciences – GESIS' own SOLIS (literature) and SOFIS (projects) RISs – SSOAR (Social Science Open Access Repository) @ GESIS ● Goal: Extend to social science citation index – CSA comes with cited refs for some docs – SSOAR – extract refs from OA full text and index in Sowiport – Extract links to data sets / surveys used but not cited from full texts – Crawl Google Scholar for citations to “our” docs – Link to/from RePEc (and other) data ...
  • 9. First Steps: National CSA Social Sciences Citation Index ● Cambridge Scientific Abstracts – Social Sciences – 6 CSA databases offered & run by GESIS ● National research licence for Germany – Include >8 million references ● A good starting point ● Recently activated in Sowiport ● ~25-30% refs found to link to other records – Using simple matching algorithm – Biased towards accuracy (>90%), not recall
  • 10.
  • 11. First Steps: CSA Reference Matching Reference matching is much(!) harder in social sciences ● Social science publication culture – Books & chapters, and articles ● Published in roughly equal numbers, books cited most – Multilingual publishing ● English is not the only language ● Publications may be cited in translation, different editions – Broad referencing behaviour ● Large proportion of references to non-source items => A first-try high-precision match rate of ~25-30% is an excellent result ● Close to expected rate of references to journal articles
  • 12. CSA References in GESIS' Sowiport Database ● Each full record contains „references“ and „cited-by“ information – Some with actionable links to full records ● Combines WoS/Scopus and Google Scholar approaches to citation index construction
  • 13. First Steps: Citation Extraction ● SSOAR full texts – First successful experiments to extract references from full text ● Based on RePEc's ParsCit ● Extended to German citation styles – First successful experiments to identify acknowledgments of large surveys in text
  • 14. Next Steps: “Haus der Sozialwissenschaften” ● Goal: Digital Special Collection for German Social Scientists – Digital access to full literature in one place ● Large parts unfortunately only accessible in-house ● Collect existing digital versions from “all” sources ● Digitize “important” literature where necessary ● Full text of literature, survey data, project descriptions... ● Joint DFG application with Sondersammelgebiet Sozialwissenschaften, Univ.- & Stadt-Bibl. Köln
  • 15. Next Step: “GESIS Application Laboratory Web 3.0” ● Full text collection and processing results available in toto to visiting researchers – Social scientists – Computer scientists – Computer linguists – Bibliometricians: You are invited!!! ● Upgrade database – e.g. disambiguation of authors, institutions, titles – e.g. incorporation of external authority files / semantic web
  • 16. Experiment: E-Traces ● Goal: Tracking ideas through the sociology literature (“text re-use”) – Experiment (ongoing): attempt to categorize citation contexts as positive/neutral/negative (sentiment analysis) – BMBF funded project with U Leipzig, U Göttingen ● Long term use: identify negative citations and contrastive co-citations for social science citation index
  • 17. Summary ● For GESIS' core covered social sciences (German sociology, political science), traditional citation indexes are inadequate ● and Google Scholar only provides “cited by” info ● Yet, GESIS' core audience uses them ● and complains about their inadequacies ● Bibliometrics requires an adequate citation index for reliable results (given typical distributions) ● but no improvements in sight for classic indexes ● Therefore, we need to build our own ● and we have the expertise at GESIS to succeed where others have failed ● and we have taken the first few steps in this direction
  • 18. Summary (ctd.) ● In the long run, we would like – A citation index that is ● Semantic (with explicit referential semantics) ● Distributed (each institution builds their own) ● Open (each institution shares semantics as LOD) ● Global (implemented world wide) ● Cooperative (indexers+researchers contribute) ● Computer-aided (software to get started, people to improve) – Based on best practices we hope to develop
  • 20. Two Models of Citation Graphs Bipartite (Classic IR) Model: Citing and Cited Partitions • Citing nodes: full bibliographic records • Cited nodes: „keys“, e.g. – First author name & initials + Year of publication + Journal key, + volume, +number, +page Uniform Model: Interconnected Documents • All nodes: bibliographic records – Citing nodes full records – Cited nodes mostly simplified records – „Matched“ cited nodes have full records
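The "key" used for cited nodes in the bipartite model can be sketched as a normalised string built from the fields the slide lists. The normalisation rules, the separator, and the function names below are assumptions made for illustration; the sample reference is invented.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Lower-case, strip accents, and keep only ASCII letters and digits."""
    decomposed = unicodedata.normalize("NFKD", text)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    return re.sub(r"[^a-z0-9]", "", ascii_only.lower())


def match_key(first_author: str, initials: str, year: int, journal_key: str,
              volume: str = "", number: str = "", page: str = "") -> str:
    """Join the slide's key fields into one comparable string (format assumed)."""
    parts = [normalize(first_author), normalize(initials), str(year),
             normalize(journal_key), normalize(volume),
             normalize(number), normalize(page)]
    return "|".join(parts)


# The same (invented) article as it might appear in a cited reference
# and in a full record; both spellings collapse to the same key.
cited = match_key("Müller", "H.-P.", 1999, "Köln Z Soziol", "51", "2", "233")
record = match_key("Muller", "HP", 1999, "KOLN Z SOZIOL", "51", "2", "233")
```

A key of this kind only works where references carry these fields, which is one reason it suits journal articles better than books or chapters.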
  • 21. Citation Matching • Goal: Citation network – Unique nodes for documents • Sub tasks: – Match cited references to each other – Match cited references to full records – Match full records across databases
  • 22. Matching Citations to Full Records „Internal“ matching ● Direct access to full database(s) ● Options: match key based or algorithmic matching „External“ matching ● Access only via search engine ● Options: matching against same or different database
  • 23. Scopus Citations • Cited reference info contains – Up to 8 author names (family+inits) • Including last author • Frequently as cited (not standardized or corrected) – Publication year, title, journal name/vol./nr./p. • Frequently as cited – Reasonably well parsable, not normalized
  • 24. Matching Scopus Citations to Scopus Full Records External matching: Scopus search engine ● „Algorithm“: parse Scopus reference into subfields, construct complex search queries for Scopus engine, download resulting full records, choose best fit ● High precision searches: complex searches allowed, many searchable fields – Improve recall by successively vaguer queries ● Small number of downloads allowed, so many queries needed to construct sizable citation index
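The idea of improving recall with successively vaguer queries can be sketched as a loop over field sets of decreasing strictness. Everything below is an illustrative assumption: the field names, the order of relaxation, the rule of accepting only a unique hit (a simplification of "choose best fit"), and the in-memory `toy_search`, which stands in for a real search engine and is not the Scopus interface.

```python
from typing import Callable, Optional

# Field sets from strictest to vaguest; the order is an assumption.
RELAXATION_STEPS = [
    ("first_author", "year", "title", "journal", "volume", "page"),
    ("first_author", "year", "journal", "volume", "page"),
    ("first_author", "year", "title"),
    ("first_author", "year"),
]


def match_reference(ref: dict, search: Callable[[dict], list]) -> Optional[dict]:
    """Try each query in turn; accept the first one that yields a unique hit."""
    for fields in RELAXATION_STEPS:
        query = {f: ref[f] for f in fields if ref.get(f)}
        hits = search(query)
        if len(hits) == 1:
            return hits[0]
    return None


# Invented records used only to exercise the loop.
TOY_INDEX = [
    {"id": "A1", "first_author": "Smith J", "year": 2005,
     "title": "Stem cell niches", "journal": "Cell", "volume": "120", "page": "15"},
    {"id": "A2", "first_author": "Smith J", "year": 2005,
     "title": "Neural progenitors", "journal": "Nature", "volume": "433", "page": "7"},
]


def toy_search(query: dict) -> list:
    """Exact-match lookup standing in for a search engine."""
    return [r for r in TOY_INDEX if all(r.get(k) == v for k, v in query.items())]


# The title is misspelled "as cited", so the strictest query finds nothing
# and the second, title-free query identifies the record.
ref = {"first_author": "Smith J", "year": 2005, "title": "Stem cell nishes",
       "journal": "Cell", "volume": "120", "page": "15"}
hit = match_reference(ref, toy_search)
```

Each relaxation step costs another query, which is why a small download allowance makes a sizable citation index expensive to build this way.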
  • 25. Matching Scopus Citations to PubMed Full Records CrossDB External Match: Scopus/Medline ● „Algorithm“: parse Scopus reference, construct PubMed batch citation matcher queries, download matched PubMed(!) records – Only for biomedical fields – Result is a citation network of PubMed records, not Scopus – Requires matching of Scopus citing records as well ● Either direction (Scopus<->PubMed) ● Both include PubMed IDs
  • 26. Matching Web of Science References to WoS Full Records WoS cited reference info contains ● First author (last name plus initials) ● Publication year ● Source title code ● Vol./num./page ● More and more frequently DOI No title included!
  • 27. Matching WoS Cited References to WoS Record External matching via WoS web search ● Only small queries supported – Many downloads necessary ● Crucial search fields not supported (vol., num.) – Therefore highly ambiguous results to be expected ● Requires translation of source title from code to full ● Requires algorithmic filtering of correct hit from long result list
  • 28. Matching WoS references to WoS ● Internal Matching ● Kompetenzzentrum Bibliometrie has full local copy of WoS data ● Experiment: good „match key“ to support this? – Dinkel (2011), ISSI – Results in error estimates for references
  • 29. Building a Citation Index for the Social Sciences: CSA ● Basis: Cambridge Scientific Abstracts (Social Sciences) – To be extended with additional sources of cited refs info ● Nationwide licensing scheme for Germany administered at GESIS ● Six CSA/Proquest databases incorporated into GESIS' „Sowiport“ social sciences portal – Now including ~8.5 million cited references ● No matchings to full records provided by Proquest ● Early experimental results available on portal – Focus on precision, not recall
  • 30. Citation Matching in CSA „Algorithm“: ● Internal matching – However, across multiple CSA databases ● Parse references; construct search queries (Solr) – exact title and year – or fuzzy title and year and ISSN; – choose first match ● Favors precision over recall – Fuzzy match only for journal literature, for example ● Research to be continued!
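The two-pass rule on this slide (exact title and year first, then fuzzy title with year and ISSN, first match wins) can be sketched in plain Python. This is not the Solr-based implementation: `difflib.SequenceMatcher` stands in for Solr's fuzzy matching, and the 0.9 threshold, the field names, the sample records, and the ISSN are all made up for illustration.

```python
from difflib import SequenceMatcher
from typing import Optional


def norm(title: str) -> str:
    """Lower-case, collapse whitespace, and drop a trailing full stop."""
    return " ".join(title.lower().split()).rstrip(".")


def find_match(ref: dict, records: list, threshold: float = 0.9) -> Optional[dict]:
    title = norm(ref["title"])
    # Pass 1: exact (normalised) title and year.
    for rec in records:
        if norm(rec["title"]) == title and rec["year"] == ref["year"]:
            return rec
    # Pass 2: fuzzy title, but only with matching year and ISSN,
    # which in practice restricts it to journal literature.
    if ref.get("issn"):
        for rec in records:
            if rec["year"] == ref["year"] and rec.get("issn") == ref["issn"]:
                similarity = SequenceMatcher(None, norm(rec["title"]), title).ratio()
                if similarity >= threshold:
                    return rec
    return None


# Invented full records; the ISSN is a placeholder, not a real journal.
records = [
    {"id": 1, "title": "No More Abuse.", "year": 2000},
    {"id": 2, "title": "Citation matching in the social sciences",
     "year": 2010, "issn": "0000-0000"},
]

exact = find_match({"title": "no more abuse", "year": 2000}, records)
fuzzy = find_match({"title": "Citation matching in social sciences",
                    "year": 2010, "issn": "0000-0000"}, records)
no_issn = find_match({"title": "Citation matching in social sciences",
                      "year": 2010}, records)
```

Requiring the ISSN before any fuzzy comparison is what trades recall for precision: a slightly misquoted book title is left unmatched rather than risking a false positive.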
  • 31. Experiments - Datasets Caveat ● Scopus/PubMed and WoS experiments run on stem cell research field (biomedical area) – < 100k citing docs, ~1 million references – >95% refs are to journal articles ● CSA experiment run on social sciences databases – ~1 million full records, ~10 million references ● Only recent records contain refs ● Many(!!) refs to non-journal articles
  • 32. Some Rough Numbers ● Scopus ↔ PubMed full record matching – >95% match rate ● Scopus references → Scopus/PubMed full record – ~90% match rate „exact“ + ~5% fuzzy match – ~1% false positives needed to be filtered out ● WoS references → WoS full record – ~90% match rate – >>50% false positives needed to be filtered out ● CSA references → CSA full record – ~30% match rate – ~1% false positives
  • 33. CSA reference information ● Fields: citing ID, reference ID, authors, title, year, publisher, source title/num/vol/p., ISSN – Format changes, though ● Mostly automatically parsed, as fields frequently mis-assigned ● Example (book): <CI>200601317</CI><CA>Voice UK</CA> <CT>No More Abuse.</CT><CY>2000</CY> <CZ>Derby: Voice UK</CZ>
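The tagged example above can be split into fields with a small regular expression. A minimal sketch: the mapping from two-letter tags to field names is inferred from the slide's field list and the example values, so treat it as an assumption rather than the documented CSA format.

```python
import re

# Assumed meaning of the tags seen in the slide's example.
TAG_NAMES = {
    "CI": "citing_id",
    "CA": "authors",
    "CT": "title",
    "CY": "year",
    "CZ": "publisher",
}

# Matches <XX>value</XX> pairs for any two-letter upper-case tag.
FIELD_RE = re.compile(r"<(?P<tag>[A-Z]{2})>(?P<value>.*?)</(?P=tag)>", re.DOTALL)


def parse_reference(raw: str) -> dict:
    """Return a dict of field name to value; unknown tags keep their tag name."""
    fields = {}
    for m in FIELD_RE.finditer(raw):
        name = TAG_NAMES.get(m.group("tag"), m.group("tag"))
        fields[name] = m.group("value").strip()
    return fields


raw = ("<CI>200601317</CI><CA>Voice UK</CA> <CT>No More Abuse.</CT>"
       "<CY>2000</CY> <CZ>Derby: Voice UK</CZ>")
parsed = parse_reference(raw)
```

Because fields are frequently mis-assigned in the source data, a parser like this only recovers what the tags claim; checking that, say, the year field really holds a year would be a separate step.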
  • 34. Discussion ● Plenty of research opportunities to improve matching of non-journal literature references to source records – e.g. to GESIS' own SOLIS / SOFIS / SSOAR databases – e.g. by crawling Google Scholar for reference links – You are invited to try your hands at this, too! ● See below: GESIS Application Laboratory