SlideShare a Scribd company logo
Abstract: Connecting locally hosted data repositories to internationally hosted related articles has never
been easier. With APIs and other web services becoming standardized at the same time that new linking
standards, such as Datacite DOIs, are being adopted, new ways to distribute and mashup content are now
possible. This presentation will explore emerging trends in linking scholarly literature to data. Both entity
linking and data linking will be discussed. Examples will be presented demonstrating how these
technologies are being employed by publishers and A&I vendors in cooperation with local data repositories.
__________________________________________
Before I get started, I would like to take a minute to set some expectations for this talk. The examples used
will primarily be about hard sciences, my challenge to you is to figure out how to apply these technologies
and methods to the digital humanities.




                                                                                                            1
This is a theoretical framework for looking at the different ways that publications can be connected
to data.
This is also the agenda for the talk. I will first speak about the top left quadrant and then work my
way to the bottom right. This means starting from the easiest to apply to the humanities and
working through to the hardest.




                                                                                                        2
This quadrant is primarily about publications to supplemental data.




                                                                      3
Supplemental data submitted as a file with an article is the traditional way. It has its place, but that
is not what I am talking about today.




                                                                                                       4
Instead, new tools now enable display and direct manipulation of data in new and interesting ways.
This example is an application that displays KML files on a Google Map:
http://www.applications.sciverse.com/action/appDetail/298231?zone=main&pageOrigin=appGallery
&activity=display




                                                                                                 5
Next on the agenda is automating the connection between publications and whole supplementary
or related datasets.




                                                                                               6
One example of this is the PANGAEA app which searches PANGAEA apis by article DOI and
retrieves the coordinates of where supplementary data was collected and then charts these on a
Google map displayed directly on the ScienceDirect article page.




                                                                                                 7
This also works on Scopus record pages (so for lot’s of publishers and journals). From deciding to
put it on Scopus as well it took less than 24 hours for the PANGAEA developer to implement. This
was enabled by the SciVerse Applications platform.




                                                                                                 8
Users can link through to the main record for the dataset on PANGAEA. One thing I would like to
mention here is that there is also a DOI for the dataset. This was done through DataCite.




                                                                                                  9
So what is DataCite and why is it important? It is also very important for creating links to data in
repositories.




                                                                                                       10
Takeaway points: International DOI Foundation enables CrossRef to give out DOIs. DataCite
roughly equivalent to CrossRef. Learn more at the DataCite website. A central institution in Serbia
might want to become a Member Institute.




                                                                                                 11
So those were examples of linking to whole datasets and displaying them in new and interesting
ways. Next to discuss is linking to entities.




                                                                                                 12
Traditional linking involves an author marking up an entity such as a protein so that it can be easily
linked to additional information about that entity in a different database. While this is useful, it is
not what I wish to share with you today. Why make a user follow a link when…




                                                                                                     13
You can now embed a 3D interactive model of the protein directly in context in the article. In this
example the PDB Protein Viewer is embedded directly in the article.




                                                                                                      14
In this example an author adds key structures to the article and they are then embedded using
Reaxys information and software.




                                                                                                15
16
The last examples still required an Author to manually mark up entities. Through text analysis and
mining, this is no longer always necessary.




                                                                                                17
In this example, our partner NextBio automatically recognizes entities in the text of the
article.

Easily extendable to new / other entities
Works retrospectively on older content
Does create recall / precision errors




                                                                                            18
Not only can it display them in the sidebar, but the application framework enables adding links to
the entities in the text on the fly.




                                                                                                     19
A reader can then click those links for additional information form multiple databases.




                                                                                          20
1.   Colours & tags genes, proteins, molecule names
2.   Clicking shows a summary of features for the term (ie: sequence or 2D structure)
3.   User can click on links in the pop-up leading out to more information




                                                                                        21
22
* To summarize, we started with very traditional linking of datasets where an author submits the dataset with the
article. One example of how this can be improved was the Interactive map viewer that displays supplementary KML
files rather than simple attaching the files to the article.
* Next we discussed automated linking to datasets. This included the example of searching PANGAEA APIs for
related datasets and then displaying the locations the data was collected. This will be driven by new standards such as
DataCite.
* Third, authors manually mark up entities that can be linked to in other databases. Now it is possible to embed
content from other databases using APIs.
* Last, is totally automated entity recognition using text analysis and mining, Again, information from third party
databases can be embedded directly in the article itself.
* While I haven’t spoken too much about the technologies enabling these new ways of linking articles to data, one
example is the SciVerse Application Framework, which now enables all of the examples discussed today.
http://www.applications.sciverse.com/action/userhome




                                                                                                                      23
I would like to close with the same questions I opened with. Thank you.




                                                                          24

More Related Content

Similar to Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases

Mendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperMendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 Paper
William Gunn
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
Jessie Chuang
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Semantic Web Company
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database
dannyijwest
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
lagoze
 
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
IJCI JOURNAL
 
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
AI Publications
 
PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010
Andreas Blumauer
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
Neelam Rawat
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategies
Semantic Web Company
 
Linked Data to Improve the OER Experience
Linked Data to Improve the OER ExperienceLinked Data to Improve the OER Experience
Linked Data to Improve the OER Experience
The Open Education Consortium
 
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowNotes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
PeterWinstanley1
 
reegle - a new key portal for open energy data
reegle - a new key portal for open energy datareegle - a new key portal for open energy data
reegle - a new key portal for open energy data
reeep
 
moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016
Richard (Rick) Nelson
 
The "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedInThe "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedIn
Sam Shah
 
The “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedInThe “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedIn
Kun Le
 
Graph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4jGraph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4j
ijtsrd
 
Evaluation criteria for nosql databases
Evaluation criteria for nosql databasesEvaluation criteria for nosql databases
Evaluation criteria for nosql databases
Ebenezer Daniel
 
Jarrar: Introduction to Linked Data
Jarrar: Introduction to Linked DataJarrar: Introduction to Linked Data
Jarrar: Introduction to Linked Data
Mustafa Jarrar
 
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - BrusselsPoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
Martin Kaltenböck
 

Similar to Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases (20)

Mendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperMendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 Paper
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
 
Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database  Linked Data Generation for the University Data From Legacy Database
Linked Data Generation for the University Data From Legacy Database
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
EMPLOYING THE CATEGORIES OF WIKIPEDIA IN THE TASK OF AUTOMATIC DOCUMENTS CLUS...
 
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
Development of a Web based Shopping Cart using the Mongo DB Database for Huma...
 
PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010PoolParty Thesaurus Management - ISKO UK, London 2010
PoolParty Thesaurus Management - ISKO UK, London 2010
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
 
SKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategiesSKOS as the focal point of linked data strategies
SKOS as the focal point of linked data strategies
 
Linked Data to Improve the OER Experience
Linked Data to Improve the OER ExperienceLinked Data to Improve the OER Experience
Linked Data to Improve the OER Experience
 
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowNotes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
 
reegle - a new key portal for open energy data
reegle - a new key portal for open energy datareegle - a new key portal for open energy data
reegle - a new key portal for open energy data
 
moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016
 
The "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedInThe "Big Data" Ecosystem at LinkedIn
The "Big Data" Ecosystem at LinkedIn
 
The “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedInThe “Big Data” Ecosystem at LinkedIn
The “Big Data” Ecosystem at LinkedIn
 
Graph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4jGraph Databases and Graph Data Science in Neo4j
Graph Databases and Graph Data Science in Neo4j
 
Evaluation criteria for nosql databases
Evaluation criteria for nosql databasesEvaluation criteria for nosql databases
Evaluation criteria for nosql databases
 
Jarrar: Introduction to Linked Data
Jarrar: Introduction to Linked DataJarrar: Introduction to Linked Data
Jarrar: Introduction to Linked Data
 
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - BrusselsPoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
 

More from Michael Habib

Complexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery InterfacesComplexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery Interfaces
Michael Habib
 
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflowsUbiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
Michael Habib
 
Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...
Michael Habib
 
Application Platforms and Developer Communities - New software tools and app...
Application Platforms and Developer Communities -  New software tools and app...Application Platforms and Developer Communities -  New software tools and app...
Application Platforms and Developer Communities - New software tools and app...
Michael Habib
 
"New Technologies: Empowering the Research community for Better Outcomes", L...
"New Technologies:  Empowering the Research community for Better Outcomes", L..."New Technologies:  Empowering the Research community for Better Outcomes", L...
"New Technologies: Empowering the Research community for Better Outcomes", L...
Michael Habib
 
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Michael Habib
 
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use caseSNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
Michael Habib
 
From Academic Library 2.0 to (Literature) Research 2.0
From Academic Library 2.0  to (Literature) Research 2.0From Academic Library 2.0  to (Literature) Research 2.0
From Academic Library 2.0 to (Literature) Research 2.0
Michael Habib
 
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
Scholarly Reputation Management Online: The Challenges and Opportunities of ...Scholarly Reputation Management Online: The Challenges and Opportunities of ...
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
Michael Habib
 
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Michael Habib
 

More from Michael Habib (10)

Complexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery InterfacesComplexities in Open Access Discovery Interfaces
Complexities in Open Access Discovery Interfaces
 
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflowsUbiquitous Open Access: Changing culture by integrating OA into user workflows
Ubiquitous Open Access: Changing culture by integrating OA into user workflows
 
Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...
 
Application Platforms and Developer Communities - New software tools and app...
Application Platforms and Developer Communities -  New software tools and app...Application Platforms and Developer Communities -  New software tools and app...
Application Platforms and Developer Communities - New software tools and app...
 
"New Technologies: Empowering the Research community for Better Outcomes", L...
"New Technologies:  Empowering the Research community for Better Outcomes", L..."New Technologies:  Empowering the Research community for Better Outcomes", L...
"New Technologies: Empowering the Research community for Better Outcomes", L...
 
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
Scopus March 2012 release overview: New Document Details Pages, Interoperabil...
 
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use caseSNEAK PREVIEW Scopus Analyze Results: Overview and use case
SNEAK PREVIEW Scopus Analyze Results: Overview and use case
 
From Academic Library 2.0 to (Literature) Research 2.0
From Academic Library 2.0  to (Literature) Research 2.0From Academic Library 2.0  to (Literature) Research 2.0
From Academic Library 2.0 to (Literature) Research 2.0
 
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
Scholarly Reputation Management Online: The Challenges and Opportunities of ...Scholarly Reputation Management Online: The Challenges and Opportunities of ...
Scholarly Reputation Management Online : The Challenges and Opportunities of ...
 
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
Engaging a New Generation of Authors, Reviewers & Readers through Web 2.0
 

Recently uploaded

Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
nitinpv4ai
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
indexPub
 
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptxCapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapitolTechU
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
Payaamvohra1
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
giancarloi8888
 
Observational Learning
Observational Learning Observational Learning
Observational Learning
sanamushtaq922
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
Jyoti Chand
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
deepaannamalai16
 
Accounting for Restricted Grants When and How To Record Properly
Accounting for Restricted Grants  When and How To Record ProperlyAccounting for Restricted Grants  When and How To Record Properly
Accounting for Restricted Grants When and How To Record Properly
TechSoup
 
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
ImMuslim
 
skeleton System.pdf (skeleton system wow)
skeleton System.pdf (skeleton system wow)skeleton System.pdf (skeleton system wow)
skeleton System.pdf (skeleton system wow)
Mohammad Al-Dhahabi
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
deepaannamalai16
 
The basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptxThe basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptx
heathfieldcps1
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
PsychoTech Services
 
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
Nguyen Thanh Tu Collection
 
220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx
Kalna College
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
Kalna College
 
Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10
nitinpv4ai
 
How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17
Celine George
 

Recently uploaded (20)

Skimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S EliotSkimbleshanks-The-Railway-Cat by T S Eliot
Skimbleshanks-The-Railway-Cat by T S Eliot
 
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
THE SACRIFICE HOW PRO-PALESTINE PROTESTS STUDENTS ARE SACRIFICING TO CHANGE T...
 
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptxCapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
CapTechTalks Webinar Slides June 2024 Donovan Wright.pptx
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
NIPER 2024 MEMORY BASED QUESTIONS.ANSWERS TO NIPER 2024 QUESTIONS.NIPER JEE 2...
 
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdfREASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
REASIGNACION 2024 UGEL CHUPACA 2024 UGEL CHUPACA.pdf
 
Observational Learning
Observational Learning Observational Learning
Observational Learning
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
 
HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.HYPERTENSION - SLIDE SHARE PRESENTATION.
HYPERTENSION - SLIDE SHARE PRESENTATION.
 
Accounting for Restricted Grants When and How To Record Properly
Accounting for Restricted Grants  When and How To Record ProperlyAccounting for Restricted Grants  When and How To Record Properly
Accounting for Restricted Grants When and How To Record Properly
 
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
Geography as a Discipline Chapter 1 __ Class 11 Geography NCERT _ Class Notes...
 
skeleton System.pdf (skeleton system wow)
skeleton System.pdf (skeleton system wow)skeleton System.pdf (skeleton system wow)
skeleton System.pdf (skeleton system wow)
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
 
The basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptxThe basics of sentences session 7pptx.pptx
The basics of sentences session 7pptx.pptx
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
 
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
CHUYÊN ĐỀ ÔN TẬP VÀ PHÁT TRIỂN CÂU HỎI TRONG ĐỀ MINH HỌA THI TỐT NGHIỆP THPT ...
 
220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx220711130088 Sumi Basak Virtual University EPC 3.pptx
220711130088 Sumi Basak Virtual University EPC 3.pptx
 
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx78 Microsoft-Publisher - Sirin Sultana Bora.pptx
78 Microsoft-Publisher - Sirin Sultana Bora.pptx
 
Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10Haunted Houses by H W Longfellow for class 10
Haunted Houses by H W Longfellow for class 10
 
How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17How to Download & Install Module From the Odoo App Store in Odoo 17
How to Download & Install Module From the Odoo App Store in Odoo 17
 

Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases

  • 1. Abstract: Connecting locally hosted data repositories to internationally hosted related articles has never been easier. With APIs and other web services becoming standardized at the same time that new linking standards, such as Datacite DOIs, are being adopted, new ways to distribute and mashup content are now possible. This presentation will explore emerging trends in linking scholarly literature to data. Both entity linking and data linking will be discussed. Examples will be presented demonstrating how these technologies are being employed by publishers and A&I vendors in cooperation with local data repositories. __________________________________________ Before I get started, I would like to take a minute to set some expectations for this talk. The examples used will primarily be about hard sciences, my challenge to you is to figure out how to apply these technologies and methods to the digital humanities. 1
  • 2. This is a theoretical framework for looking at the different ways that publications can be connected to data. This is also the agenda for the talk. I will first speak about the top left quadrant and then work my way to the bottom right. This means starting from the easiest to apply to the humanities and working through to the hardest. 2
  • 3. This quadrant is primarily about publications to supplemental data. 3
  • 4. Supplemental data submitted as a file with an article is the traditional way. It has its place, but that is not what I am talking about today. 4
  • 5. Instead, new tools now enable display and direct manipulation of data in new and interesting ways. This example is an application that displays KML files on a Google Map: http://www.applications.sciverse.com/action/appDetail/298231?zone=main&pageOrigin=appGallery &activity=display 5
  • 6. Next on the agenda is automating the connection between publications and whole supplementary or related datasets. 6
  • 7. One example of this is the PANGAEA app which searches PANGAEA apis by article DOI and retrieves the coordinates of where supplementary data was collected and then charts these on a Google map displayed directly on the ScienceDirect article page. 7
  • 8. This also works on Scopus record pages (so for lot’s of publishers and journals). From deciding to put it on Scopus as well it took less than 24 hours for the PANGAEA developer to implement. This was enabled by the SciVerse Applications platform. 8
  • 9. Users can link through to the main record for the dataset on PANGAEA. One thing I would like to mention here is that there is also a DOI for the dataset. This was done through DataCite. 9
  • 10. So what is DataCite and why is it important? It is also very important for creating links to data in repositories. 10
  • 11. Takeaway points: International DOI Foundation enables CrossRef to give out DOIs. DataCite roughly equivalent to CrossRef. Learn more at the DataCite website. A central institution in Serbia might want to become a Member Institute. 11
  • 12. So those were examples of linking to whole datasets and displaying them in new and interesting ways. Next to discuss is linking to entities. 12
  • 13. Traditional linking involves an author marking up an entity such as a protein so that it can be easily linked to additional information about that entity in a different database. While this is useful, it is not what I wish to share with you today. Why make a user follow a link when… 13
  • 14. You can now embed a 3D interactive model of the protein directly in context in the article. In this example the PDB Protein Viewer is embedded directly in the article. 14
  • 15. In this example an author adds key structures to the article and they are then embedded using Reaxys information and software. 15
  • 16. 16
  • 17. The last examples still required an Author to manually mark up entities. Through text analysis and mining, this is no longer always necessary. 17
  • 18. In this example, our partner NextBio automatically recognizes entities in the text of the article. Easily extendable to new / other entities Works retrospectively on older content Does create recall / precision errors 18
  • 19. Not only can it display them in the sidebar, but the application framework enables adding links to the entities in the text on the fly. 19
  • 20. A reader can then click those links for additional information form multiple databases. 20
  • 21. 1. Colours & tags genes, proteins, molecule names 2. Clicking shows a summary of features for the term (ie: sequence or 2D structure) 3. User can click on links in the pop-up leading out to more information 21
  • 22. 22
  • 23. * To summarize, we started with very traditional linking of datasets where an author submits the dataset with the article. One example of how this can be improved was the Interactive map viewer that displays supplementary KML files rather than simple attaching the files to the article. * Next we discussed automated linking to datasets. This included the example of searching PANGAEA APIs for related datasets and then displaying the locations the data was collected. This will be driven by new standards such as DataCite. * Third, authors manually mark up entities that can be linked to in other databases. Now it is possible to embed content from other databases using APIs. * Last, is totally automated entity recognition using text analysis and mining, Again, information from third party databases can be embedded directly in the article itself. * While I haven’t spoken too much about the technologies enabling these new ways of linking articles to data, one example is the SciVerse Application Framework, which now enables all of the examples discussed today. http://www.applications.sciverse.com/action/userhome 23
  • 24. I would like to close with the same questions I opened with. Thank you. 24

Editor's Notes

  1. Title: Connecting Publications & Data: Raising visibility of local data collections through linking with international publication databases   Abstract: Connecting locally hosted data repositories to internationally hosted related articles has never been easier. With APIs and other web services becoming standardized at the same time that new linking standards, such as Datacite DOIs, are being adopted, new ways to distribute and mashup content are now possible. This presentation will explore emerging trends in linking scholarly literature to data. Both entity linking and data linking will be discussed. Examples will be presented demonstrating how these technologies are being employed by publishers and A&I vendors in cooperation with local data repositories. __________________________________________ Before I get started, I would like to take a minute to set some expectations for this talk. The examples used will primarily be about hard sciences, my challenge to you is to figure out how to apply these technologies and methods to the digital humanities.
  2. This is a theoretical framework for looking at the different ways that publications can be connected to data. This is also the agenda for the talk. I will first speak about the top left quadrant and then work my way to the bottom right. This means starting from the easiest to apply to the humanities and working through to the hardest.
  3. This quadrant is primarily about publications to supplemental data.
  4. Supplemental data submitted as a file with an article is the traditional way. It has its place, but that is not what I am talking about today.
  5. Instead, new tools now enable display and direct manipulation of data in new and interesting ways. This example is an application that displays KML files on a Google Map: http://www.applications.sciverse.com/action/appDetail/298231?zone=main&pageOrigin=appGallery&activity=display
  6. Next on the agenda is automating the connection between publications and whole supplementary or related datasets.
  7. One example of this is the PANGAEA app which searches PANGAEA apis by article DOI and retrieves the coordinates of where supplementary data was collected and then charts these on a Google map displayed directly on the ScienceDirect article page.
  8. This also works on Scopus record pages (so for lot’s of publishers and journals). From deciding to put it on Scopus as well it took less than 24 hours for the PANGAEA developer to implement. This was enabled by the SciVerse Applications platform.
  9. Users can link through to the main record for the dataset on PANGAEA. One thing I would like to mention here is that there is also a DOI for the dataset. This was done through DataCite.
  10. So what is DataCite and why is it important? It is also very important for creating links to data in repositories.
  11. Takeaway points: International DOI Foundation enables CrossRef to give out DOIs. DataCite roughly equivalent to CrossRef. Learn more at the DataCite website. A central institution in Serbia might want to become a Member Institute.
  12. So those were examples of linking to whole datasets and displaying them in new and interesting ways. Next to discuss is linking to entities.
  13. Traditional linking involves an author marking up an entity such as a protein so that it can be easily linked to additional information about that entity in a different database. While this is useful, it is not what I wish to share with you today. Why make a user follow a link when…
  14. You can now embed a 3D interactive model of the protein directly in context in the article. In this example the PDB Protein Viewer is embedded directly in the article.
  15. In this example an author adds key structures to the article and they are then embedded using Reaxys information and software.
  16. The last examples still required an Author to manually mark up entities. Through text analysis and mining, this is no longer always necessary.
  17. In this example, our partner NextBio automatically recognizes entities in the text of the article. Easily extendable to new / other entities Works retrospectively on older content Does create recall / precision errors
  18. Not only can it display them in the sidebar, but the application framework enables adding links to the entities in the text on the fly.
  19. A reader can then click those links for additional information form multiple databases.
  20. Colours & tags genes, proteins, molecule names Clicking shows a summary of features for the term (ie: sequence or 2D structure) User can click on links in the pop-up leading out to more information
  21. Colours & tags genes, proteins, molecule names Clicking shows a summary of features for the term (ie: sequence or 2D structure) User can click on links in the pop-up leading out to more information
  22. To summarize, we started with very traditional linking of datasets where an author submits the dataset with the article. One example of how this can be improved was the Interactive map viewer that displays supplementary KML files rather than simple attaching the files to the article. Next we discussed automated linking to datasets. This included the example of searching PANGAEA APIs for related datasets and then displaying the locations the data was collected. This will be driven by new standards such as DataCite. Third, authors manually mark up entities that can be linked to in other databases. Now it is possible to embed content from other databases using APIs. Last, is totally automated entity recognition using text analysis and mining, Again, information from third party databases can be embedded directly in the article itself. While I haven’t spoken too much about the technologies enabling these new ways of linking articles to data, one example is the SciVerse Application Framework, which now enables all of the examples discussed today. http://www.applications.sciverse.com/action/userhome
  23. I would like to close with the same questions I opened with. Thank you.