SlideShare a Scribd company logo
1 of 28
Working with scholarly APIs:
Dimensions
NISO API Training Series - May 2022
Digital Science ecosystem
The Dimensions database and
platform
1-5 years from grant to publication immediate 2-3 years years years decades
Pre-publication Post publication
Grants
Research
Preprints
Data sets
Publications
Tweets Blogs
Citations
Clinical
Trials
Patents
Policy docs
We capture a more comprehensive view of the research
lifecycle
We enrich and organize the data
Categorization
Concept
extraction
Reference extraction
Institution identification
Researcher
disambiguation
Incoming data
Enrichment
Dimensions
enriched metadata
Concepts
Researchers
Organizations
Classifications
References Metrics
Publications
Policy
docs
Patents
Altmetric
Datasets Grants
Clinical
trials
We make the connections
1.5bn links
1.8m links
448k links
142m
Patents
402m links
Status: Jan 2022
6.1m
Grants
$1.9 trillion
in funding
743k
Policy
documents
681k
Clinical
trials
200m
Altmetric
data
points
124m
Publications
improved
metadata
of 88m
11m
Datasets
● Fast, large scale analyses;
dynamic dashboards
● Direct integration with BI tools
e.g. Tableau, Qlik, PowerBI
● Join with private & public data
● Ad hoc analysis & single data-
type analyses
● Full-text search & special
functions e.g. affiliation
extraction
● Product integrations e.g. CRIS
● Search & discovery;
top analytical use cases
● Dedicated UI, inbuilt
visualizations
● In the browser, no specialized
knowledge required
Web App API Google BigQuery
For everyone
For API users
+ data & analytics teams
For data & analytics teams
+ dashboards for everyone
Simplest access: https://app.dimensions.ai/dsl
API
Dimensions Search Language (DSL)
Powerful query language developed around
simple syntax, allowing users with various
levels of technical skills to use.
● Basic query based around two phrases
● More complexity can be included by
adding filters
● Returned data can be specified using
custom fieldsets
search publications
return publications
Dimensions Search Language (DSL)
Powerful query language developed around
simple syntax, allowing users with various
levels of technical skills to use.
● Basic query based around two phrases
● More complexity can be included by
adding filters
● Returned data can be specified using
custom fieldsets
search publications
return publications
Sources:
● Publications
● Grants
● Patents
● Clinical Trials
● Policy Documents
● Datasets
● Source Titles
● Reports
● Researchers
● Organizations
Dimensions Search Language (DSL)
Powerful query language developed around
simple syntax, allowing users with various
levels of technical skills to use.
● Basic query based around two phrases
● More complexity can be included by
adding filters
● Returned data can be specified using
custom fieldsets
search publications
for “leukemia”
where year in [2015:2017]
return publications
Dimensions Search Language (DSL)
Powerful query language developed around
simple syntax, allowing users with various
levels of technical skills to use.
● Basic query based around two phrases
● More complexity can be included by
adding filters
● Returned data can be specified using
custom fieldsets
search publications
for “leukemia”
where year in [2015:2017]
return publications [title+doi]
return researchers [id+last_name
return journal [title]
Special Functions
● Affiliations matching
○ from strings to GRID IDs (https://grid.ac/)
● Concepts extraction
○ from text to keywords
● Classification service
○ from text to ontologies (FOR, UOA, HRA, RCDC,
HRCS_HC, HRCS_RAC, SDG, BRA, ICRP_CSO
and ICRP_CT)
Dimensions Search Language (DSL)
extract_affiliations(
affiliation="university
of oxford, uk")
extract_concepts("text of
abstract")
classify(
title="..text..",
abstract="..text..",
system="FOR")
Python library, developed by Dimensions.
● Simplifies common API operations
○ Log in, log out
○ Querying, iterative queries
○ Extracting and transforming data e.g. to dataframes
● Includes a Command Line Interface (CLI)
○ Like a ‘query console’ with autocomplete and other handy functionalities
https://github.com/digital-science/dimcli
Dimcli
https://colab.research.google.com/github/digital-science/dimensions-api-
lab/blob/master/cookbooks/2-publications/General-statistics.ipynb
Example
Additional information
Product versions
Dimensions Analytics web app
Online search & discovery platform:
● Explore the data with customizable and
exportable visualizations.
● Search the full-text to gain more relevant
results. Plus, search via abstract or
concept.
● Filter with sophisticated research
classification systems and content-
specific options e.g. Inventor for patents.
● Identify trends and opportunities: grants,
research topics, open access, compliance
and more.
● Use directly from the browser with no
specialised knowledge required.
Dimensions APIs
Powerful APIs - designed to allow flexible use
of the enriched data:
● Integrate data into applications outside of
the web-app; e.g. admin systems.
● Enrich your own data/content using
special functions like affiliation extraction
and concept extraction.
● Query using full-text search. Ideal for ad
hoc analytical queries going a step further
than the web app.
● Use without constraints for internal
purposes.
● Easy-to-learn, human-readable querying
language.
Dimensions on Google BigQuery
Access to the full Dimensions database, via a direct
integration with BigQuery - Google’s cloud data
warehouse.
● Integrate Dimensions data into your existing reporting
and analysis infrastructure quickly, using out-of-the
box connectors.
● Analyze the full dataset with complete flexibility to
support multiple decision-making processes.
● Join Dimensions data to your own private data to
provide a global research context for benchmarking.
● Create custom dashboards and automated reports
on different topics e.g. funding opportunities,
collaboration.
● Share these analyses and dashboards with
stakeholders across the organization so that the
relevant insights are always at their fingertips.
This Covid-19 topic dashboard took 15 minutes to
build
Data in Dimensions
Dimensions uses an ‘inclusive approach’
● We do not exclude research outputs
based on a “behind closed doors”
content selection processes
● We believe the user should be able to
make inclusion/exclusion decisions
based on their own needs
● We provide the user with as much
information and tools as possible in order
to do so
Dimensions
empowers our users
● Journal articles, pre-prints, conference
proceedings, books/book chapters
● Full text searching available for ~70% of
publications
● 100M + records based on metadata
● Metadata derived from multiple available
databases
● Highly contextualized - related grants,
publication references, citing publications,
related trials, related patents, related policy
documents, Altmetric attention
● OA tagged
Publications
PUBLICATIONS
JOURNALS /
BOOKS
PRE-PRINT / OA
...and many
more!
● More than 8 million datasets
● Sourced from DataCite and Figshare
● Linked to publications, supporting grants
and funders
● Filters for research organizations, funders,
researchers and more
Datasets
DATASETS
DATACITE
FIGSHARE & FIGSHARE HOSTED REPOSITORIES
800+ more
70+ more
● Project funding
● 6 million grants from 600+ funders globally
● $2 trillion of funding
● Not limited to federal/national funding
● Sourcing
○ Direct relationships with funders
○ Data available via APIs
○ Data available via websites which we
crawl
Grants data
GRANTS
Patents data
● 134 million+ patent
documents
● Global coverage
● 100+ jurisdictions, including
but not limited to:
○ China
○ Japan
○ United States
○ Germany
○ European Union
○ South Korea
● ClinicalTrials.gov (United States)
● EU-CTR (EU)
● UMIN-CTR (Japan)
● ISRCTN (International)
● ANZCTR (Australia/New Zealand)
● CHICTR (China)
● NTR (Netherlands)
● GCTR (Germany)
● CTRI (India)
● CRIS (Korea)
● ICRT (Iran)
… and more are coming
Clinical trials
CLINICAL TRIALS
Over 700,000 policy document records,
linked to publications
Including but not limited to:
● World Health Organization
● World Bank
● Centers for Disease Control &
Prevention
● Government of the United Kingdom
● National Bureau of Economic
Research
Policy documents data
POLICY DOCUMENTS

More Related Content

Similar to Draux "Working with Scholarly APIs: A NISO Training Series, Session Four: Digital Science Dimensions"

ICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction GridlogiscICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction GridlogiscDr. Haxel Consult
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarFAIRDOM
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactElena Simperl
 
PatSeer Premier Overview
PatSeer Premier OverviewPatSeer Premier Overview
PatSeer Premier OverviewGridlogics
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
PatSeer Introduction
PatSeer IntroductionPatSeer Introduction
PatSeer IntroductionGridlogics
 
What's new at Crossref - Ed Pentz - London LIVE 2017
What's new at Crossref - Ed Pentz - London LIVE 2017What's new at Crossref - Ed Pentz - London LIVE 2017
What's new at Crossref - Ed Pentz - London LIVE 2017Crossref
 
CGSpace and PRMS Information Session
CGSpace and PRMS Information SessionCGSpace and PRMS Information Session
CGSpace and PRMS Information SessionILRI
 
Smart cities no ai without ia
Smart cities   no ai without iaSmart cities   no ai without ia
Smart cities no ai without iaFredric Landqvist
 
PatSeer Overview
PatSeer OverviewPatSeer Overview
PatSeer OverviewGridlogics
 
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceWebinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceLucidworks
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...AKSHAY BHAGAT
 
An Introduction and Applications of DOI
An Introduction and Applications of DOIAn Introduction and Applications of DOI
An Introduction and Applications of DOINader Ale Ebrahim
 

Similar to Draux "Working with Scholarly APIs: A NISO Training Series, Session Four: Digital Science Dimensions" (20)

ICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction GridlogiscICIC 2014 New Product Introduction Gridlogisc
ICIC 2014 New Product Introduction Gridlogisc
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management Webinar
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
PatSeer Premier Overview
PatSeer Premier OverviewPatSeer Premier Overview
PatSeer Premier Overview
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
Falk-Krzesinski, "Administrator (Institutional Use of the Data): Data-informe...
 
PatSeer Introduction
PatSeer IntroductionPatSeer Introduction
PatSeer Introduction
 
Shareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your ResearchShareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your Research
 
What's new at Crossref - Ed Pentz - London LIVE 2017
What's new at Crossref - Ed Pentz - London LIVE 2017What's new at Crossref - Ed Pentz - London LIVE 2017
What's new at Crossref - Ed Pentz - London LIVE 2017
 
Data, data, everywhere? Not nearly enough!
Data, data, everywhere? Not nearly enough!Data, data, everywhere? Not nearly enough!
Data, data, everywhere? Not nearly enough!
 
CGSpace and PRMS Information Session
CGSpace and PRMS Information SessionCGSpace and PRMS Information Session
CGSpace and PRMS Information Session
 
Smart cities no ai without ia
Smart cities   no ai without iaSmart cities   no ai without ia
Smart cities no ai without ia
 
PatSeer Overview
PatSeer OverviewPatSeer Overview
PatSeer Overview
 
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment PerformanceWebinar: Lucidworks + Thomson Reuters for Improved Investment Performance
Webinar: Lucidworks + Thomson Reuters for Improved Investment Performance
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
Hawkins, "Scitopia.org, A Discovery Tool Using Federated Search"
Hawkins, "Scitopia.org, A Discovery Tool Using Federated Search"Hawkins, "Scitopia.org, A Discovery Tool Using Federated Search"
Hawkins, "Scitopia.org, A Discovery Tool Using Federated Search"
 
An Introduction and Applications of DOI
An Introduction and Applications of DOIAn Introduction and Applications of DOI
An Introduction and Applications of DOI
 

More from National Information Standards Organization (NISO)

More from National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 

Recently uploaded

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 

Recently uploaded (20)

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 

Draux "Working with Scholarly APIs: A NISO Training Series, Session Four: Digital Science Dimensions"

  • 1. Working with scholarly APIs: Dimensions NISO API Training Series - May 2022
  • 3. The Dimensions database and platform
  • 4. 1-5 years from grant to publication immediate 2-3 years years years decades Pre-publication Post publication Grants Research Preprints Data sets Publications Tweets Blogs Citations Clinical Trials Patents Policy docs We capture a more comprehensive view of the research lifecycle
  • 5. We enrich and organize the data Categorization Concept extraction Reference extraction Institution identification Researcher disambiguation Incoming data Enrichment Dimensions enriched metadata Concepts Researchers Organizations Classifications References Metrics Publications Policy docs Patents Altmetric Datasets Grants Clinical trials
  • 6. We make the connections 1.5bn links 1.8m links 448k links 142m Patents 402m links Status: Jan 2022 6.1m Grants $1.9 trillion in funding 743k Policy documents 681k Clinical trials 200m Altmetric data points 124m Publications improved metadata of 88m 11m Datasets
  • 7. ● Fast, large scale analyses; dynamic dashboards ● Direct integration with BI tools e.g. Tableau, Qlik, PowerBI ● Join with private & public data ● Ad hoc analysis & single data- type analyses ● Full-text search & special functions e.g. affiliation extraction ● Product integrations e.g. CRIS ● Search & discovery; top analytical use cases ● Dedicated UI, inbuilt visualizations ● In the browser, no specialized knowledge required Web App API Google BigQuery For everyone For API users + data & analytics teams For data & analytics teams + dashboards for everyone
  • 9. Dimensions Search Language (DSL) Powerful query language developed around simple syntax, allowing users with various levels of technical skills to use. ● Basic query based around two phrases ● More complexity can be included by adding filters ● Returned data can be specified using custom fieldsets search publications return publications
  • 10. Dimensions Search Language (DSL) Powerful query language developed around simple syntax, allowing users with various levels of technical skills to use. ● Basic query based around two phrases ● More complexity can be included by adding filters ● Returned data can be specified using custom fieldsets search publications return publications Sources: ● Publications ● Grants ● Patents ● Clinical Trials ● Policy Documents ● Datasets ● Source Titles ● Reports ● Researchers ● Organizations
  • 11. Dimensions Search Language (DSL) Powerful query language developed around simple syntax, allowing users with various levels of technical skills to use. ● Basic query based around two phrases ● More complexity can be included by adding filters ● Returned data can be specified using custom fieldsets search publications for “leukemia” where year in [2015:2017] return publications
  • 12. Dimensions Search Language (DSL) Powerful query language developed around simple syntax, allowing users with various levels of technical skills to use. ● Basic query based around two phrases ● More complexity can be included by adding filters ● Returned data can be specified using custom fieldsets search publications for “leukemia” where year in [2015:2017] return publications [title+doi] return researchers [id+last_name return journal [title]
  • 13. Special Functions ● Affiliations matching ○ from strings to GRID IDs (https://grid.ac/) ● Concepts extraction ○ from text to keywords ● Classification service ○ from text to ontologies (FOR, UOA, HRA, RCDC, HRCS_HC, HRCS_RAC, SDG, BRA, ICRP_CSO and ICRP_CT) Dimensions Search Language (DSL) extract_affiliations( affiliation="university of oxford, uk") extract_concepts("text of abstract") classify( title="..text..", abstract="..text..", system="FOR")
  • 14. Python library, developed by Dimensions. ● Simplifies common API operations ○ Log in, log out ○ Querying, iterative queries ○ Extracting and transforming data e.g. to dataframes ● Includes a Command Line Interface (CLI) ○ Like a ‘query console’ with autocomplete and other handy functionalities https://github.com/digital-science/dimcli Dimcli
  • 18. Dimensions Analytics web app Online search & discovery platform: ● Explore the data with customizable and exportable visualizations. ● Search the full-text to gain more relevant results. Plus, search via abstract or concept. ● Filter with sophisticated research classification systems and content- specific options e.g. Inventor for patents. ● Identify trends and opportunities: grants, research topics, open access, compliance and more. ● Use directly from the browser with no specialised knowledge required.
  • 19. Dimensions APIs Powerful APIs - designed to allow flexible use of the enriched data: ● Integrate data into applications outside of the web-app; e.g. admin systems. ● Enrich your own data/content using special functions like affiliation extraction and concept extraction. ● Query using full-text search. Ideal for ad hoc analytical queries going a step further than the web app. ● Use without constraints for internal purposes. ● Easy-to-learn, human-readable querying language.
  • 20. Dimensions on Google BigQuery Access to the full Dimensions database, via a direct integration with BigQuery - Google’s cloud data warehouse. ● Integrate Dimensions data into your existing reporting and analysis infrastructure quickly, using out-of-the box connectors. ● Analyze the full dataset with complete flexibility to support multiple decision-making processes. ● Join Dimensions data to your own private data to provide a global research context for benchmarking. ● Create custom dashboards and automated reports on different topics e.g. funding opportunities, collaboration. ● Share these analyses and dashboards with stakeholders across the organization so that the relevant insights are always at their fingertips. This Covid-19 topic dashboard took 15 minutes to build
  • 22. Dimensions uses an ‘inclusive approach’ ● We do not exclude research outputs based on a “behind closed doors” content selection processes ● We believe the user should be able to make inclusion/exclusion decisions based on their own needs ● We provide the user with as much information and tools as possible in order to do so Dimensions empowers our users
  • 23. ● Journal articles, pre-prints, conference proceedings, books/book chapters ● Full text searching available for ~70% of publications ● 100M + records based on metadata ● Metadata derived from multiple available databases ● Highly contextualized - related grants, publication references, citing publications, related trials, related patents, related policy documents, Altmetric attention ● OA tagged Publications PUBLICATIONS JOURNALS / BOOKS PRE-PRINT / OA ...and many more!
  • 24. ● More than 8 million datasets ● Sourced from DataCite and Figshare ● Linked to publications, supporting grants and funders ● Filters for research organizations, funders, researchers and more Datasets DATASETS DATACITE FIGSHARE & FIGSHARE HOSTED REPOSITORIES 800+ more 70+ more
  • 25. ● Project funding ● 6 million grants from 600+ funders globally ● $2 trillion of funding ● Not limited to federal/national funding ● Sourcing ○ Direct relationships with funders ○ Data available via APIs ○ Data available via websites which we crawl Grants data GRANTS
  • 26. Patents data ● 134 million+ patent documents ● Global coverage ● 100+ jurisdictions, including but not limited to: ○ China ○ Japan ○ United States ○ Germany ○ European Union ○ South Korea
  • 27. ● ClinicalTrials.gov (United States) ● EU-CTR (EU) ● UMIN-CTR (Japan) ● ISRCTN (International) ● ANZCTR (Australia/New Zealand) ● CHICTR (China) ● NTR (Netherlands) ● GCTR (Germany) ● CTRI (India) ● CRIS (Korea) ● ICRT (Iran) … and more are coming Clinical trials CLINICAL TRIALS
  • 28. Over 700,000 policy document records, linked to publications Including but not limited to: ● World Health Organization ● World Bank ● Centers for Disease Control & Prevention ● Government of the United Kingdom ● National Bureau of Economic Research Policy documents data POLICY DOCUMENTS