SlideShare a Scribd company logo
1 of 29
Download to read offline
Scholars@Cornell:	A	Journey	from	
Rest	in	Peace	to	Data	in	Use
June	7,	2018
Muhammad Javed
Email: mj495@cornell.edu
Twitter: @mjaved495
Data
Tech Lead (Scholars@Cornell)
Data
Collection
Data
Cleaning
Data
Transformation
Data
Storage
01
02
03
04
- Person and Position Data (Human Resource)
- Publications data (Citation Data Sources)
- Grants Data (Office of Sponsored Programs)
- Incomplete Data, Incorrect Data, Duplicate Data
- CSV to RDF Conversion
- XML to RDF Conversion
- Loading RDF Data to VIVO Triplestore
Data
Access
Data
Analysis
Data
Visualization
Data
Prediction
VIVO Data Life Cycle
Scope of the Presentation
@Cornell(2009 - 2016)
Data Reuse
Data Analysis
Overview
Achievements &
Challenges
AGENDA
Data Reuse
Data Analysis
Overview
Achievements &
Challenges
AGENDA
Advance the visibility and accessibility of
Cornell scholarship and creative expression
(May 2016)
&
Explore the scholarship of Cornell from the
perspective of what the scholarly record itself
can tell us.
(August 2017)
‘Us’ who?
Explore the scholarship of Cornell from the perspective of what the
scholarly record itself can tell us.
Knowledge Delivery
Lorem ipsum Lorem ipsum Lorem ipsum Lorem ipsum
Faculty Admin
Librarian
Student
Faculty is interested in global
visibility of their research and
an authoritative source of data
about their scholarship and
how it relates to other
scholars.
Faculty
Academic deans and
department chairs need
macro views of research
output, global impact and
for faculty reporting.
Admin
Librarians need data to
support strategies to address
issues such as prioritization
and selection of library
resources and collections.
Librarians Students
Students need means
to discover faculty,
domain experts and
their research
contributions.
who are the stakeholders
Explore the scholarship of Cornell from the perspective of what the scholarly
record itself can tell us.
About whom?
FEED
MACHINE
publications
Database
Graph
Scholars@Cor1ell - “System of Systems”
g;ants
&
cont;acts
people
& positions academic
units
Manual
Data Analysis Module
SPARQL JAVA
JSON, RDF
Push
t;iples in the g;aph
Push
JSON files
QuerE
Database
Process
QuerE ResultSets
Process
outHut files
Scholars@Cornell
Data Reuse
Data Analysis
Overview
Achievements &
Challenges
AGENDA
Image taken from http://www.servervitalsigns.com/liberate-domino-data/
Linked Open Data
Yes,
in RDF ForLat
Can I reuse it?
YesNo, May be
RDFOWL
Ontology EngineersSoftware Developers
JSJSON
Can I access it ?
[LOD]VIVO records data as
Image taken from http://www.servervitalsigns.com/liberate-domino-data/
Data
Yes,
in RDF ForLat
Can I reuse it?
YesNo, May be
RDFOWL
Ontology EngineersSoftware Developers
JSJSON
Can I access it ?
We need Linked OpenReusable
[LO D]R
RDFOWL
Ontology Engineers Software Developers
JSJSON
1) Data Access
2) Data Reuse
Unlock your Data
3) Infographics Reuse
https://cul-it.github.io/vivo-data-distribution-api/
RDF to JSON: Data Distribution API forVIVO
Data Reuse
Data Analysis
Overview
Achievements &
Challenges
AGENDA
The Data Maturity Model
http://www.b-eye-network.com/view/15105
Stage One: No Usable Data
Stage Two: Big Data
Stage Three: The Right Data
Stage Four: Predictions
Stage Five: Strategy
With little or no useful data.
Can’t run metrics.
Doesn’t fully understand data.
No information-backed insights.
Have access to large data.
Steady flow of data from multiple sources.
But few tools to turn data into information.
Spent more time looking than analyzing.
Have access to high quality data.
Apply context and relevance to data models.
Explain data in meaningful ways.
Cornell units accept responsibility
for being Content Creators.
Can conduct historical & predictive analysis.
What is likely to happen tomorrow and beyond.
Can predict customer behavior and market
demand.
Entire business model is built around
its analytical models.
Predictive analysis is integrated into
core business processes.
Say No to long
VIVO Lists
From Counting
to Connecting
Courtesy: Pedro Parraguez (DTU)
KNOWKNOWLEDGE
Linked Data
Data is merely a record & VIVO is just a (linked) data warehouse
unless we pull the knowledge out of it.
From Data to Knowledge
How do we SHARE the KNOWLEDGE in
Scholars@Cornell ?
Access and Reuse
- List of Publications for a Faculty.
https://scholars.cornell.edu/api/dataRequest/listPublications?person=http://scholars.cornell.edu/individual/mjh78
- List of Grants
- of a Person
- of an Academic Unit
- List of Current Faculty
- of an Academic Unit
- List of Academic / Industry-level CoAuthorships *
- of an Academic Unit
- of an Person
Future Plans
- and more …
- in JSON Format
- in RDF Format
via Data Distribution API
Knowledge Share via Infographics
Global Collaborations
Grants
Keyword Cloud
Co-authorships
Inter-departmental
co-authorships
Research Interests
(in a department)
Journals/Proceedings
in Scholars@Cornell
Image taken from: https://marketingland.com/breadcrumb-links-good-user-experience-yes-97848
User Experience
DEMO
The Data Maturity Model
http://www.b-eye-network.com/view/15105
Stage One: No Usable Data
Stage Two: Big Data
Stage Three: The Right Data
Stage Four: Predictions
Stage Five: Strategy
With little or no useful data.
Can’t run metrics.
Doesn’t fully understand data.
No information-backed insights.
Have access to large data.
Steady flow of data from multiple sources.
But few tools to turn data into information.
Spent more time looking than analyzing.
Have access to high quality data.
Apply context and relevance to data models.
Explain data in meaningful ways.
Cornell Units accept responsibility
for being Content Creators.
Can conduct historical & predictive analysis.
What is likely to happen tomorrow and beyond.
Can predict customer behavior and market
demand.
Entire business model is built around
its analytical models.
Predictive analysis is integrated into
core business processes.
VIVO Cornell
Scholars@Cornell
Data Reuse
Data Analysis
Overview
Achievements &
Challenges
AGENDA
July 2017 June 2018
1.College of Engineering
2.Johnson Graduate School of Management
3.BoyceThompson Institute
1.College of Engineering
2.Johnson Graduate School of Management
3.BoyceThompson Institute
4.College of Agriculture and Life Sciences
5.Law School (Pilot)
6.College of Veterinary Sciences (Pilot)
PARTNERS
DATA SOURCES 1.Upstream sources of Symplectic Elements
1.Upstream sources of Symplectic Elements
2.Activity Insight (Digital Measures)
3.Institutional Repository (Digital Commons)
DATATYPES 1.Journal Articles 1.Journal Articles
2.Conference Papers
DATA ACCESS 1.Data Download
i. RDF
1.Data Download
i. RDF
ii. JSON (using Date Distr. API)
iii.CSV (VIZ Data)
iv.SVG (VIZ)
2.VIZ Embed
Achievements
Cornell - International Academic Partners
OPERA - Open Research Analytics
Denmark’s Electronic Research Library Project
Challenges
The	infrastructure	for	publica=on	data	(and	visualiza=ons)	
reuse	is	in	place.	Next	step	is	to	engage	academic	units	to	
demonstrate	benefits	of	data	reuse	and	actually	make	it	
work.
DATA ACCESSIBILITY AND DATA REUSE
Past, Present and Future…
Cornell Units
“VIVO 2 ORCID” Data Push
THE NEXT
BIGTHING
Jim	Blake
- Envisioning	an	emerging	
collabora=ve	data	exchange/
share	hub:
Scholars@Cornell
an	institutional	investment	!
MANAGEMENT
TEAMWORK
• Faculty,		
• Department	Chair,	College	Deans,	
• Admin	Staff	(responsible	for	faculty	reporting	and	faculty	data	mgmt./websites)		
• Librarians,		
• Students,		
• Provost	OfFice	
• OfFice	of	Research
Sandy	Payette
Muhammad	Javed
Mary	Beth	Martini-Lyons
George	Kozak
Meet	the	Team
Jim	Blake
Tim	Worrall
Joe	McEnerney
Travis	Beach
Lylla	Younes
Amit	Mizrahi
Scholars@Cornell
@ScholarsCornell
Dean	B.	Krafft
Acknowledgments
A number of 24Slides free templates were used to prepare the slides
Parraguez, P. and Maier, A.M. (2016), “Using network science to support design research: From counting to connecting”, in Cash, P., Stankovic, T. and Storga,
M. (Eds.),Experimental Design Research: Approaches, perspectives, applications, Springer, Cham, pp. 153–172.
Counting to Connecting Images: Wehrli, U., Born, G. and Spehr, D. (2013), The art of clean up: life made neat and tidy, Chronicle Books, San Francisco.
Few other images were taken from Google-Images.

More Related Content

What's hot

Self adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofSelf adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofNurfadhlina Mohd Sharef
 
Predicting Online News Popularity
Predicting Online News Popularity Predicting Online News Popularity
Predicting Online News Popularity Ke Feng
 
Dotnet ranking on data manifold with sink points
Dotnet  ranking on data manifold with sink pointsDotnet  ranking on data manifold with sink points
Dotnet ranking on data manifold with sink pointsEcway Technologies
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?Elena Simperl
 
Paper id 26201475
Paper id 26201475Paper id 26201475
Paper id 26201475IJRAT
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Elena Simperl
 
Data Quality and the FAIR principles
Data Quality and the FAIR principlesData Quality and the FAIR principles
Data Quality and the FAIR principlesAmrapali Zaveri, PhD
 
Workshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in WikidataWorkshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in WikidataAmrapali Zaveri, PhD
 
Uphrading the Scholarly Infrastructure
Uphrading the Scholarly InfrastructureUphrading the Scholarly Infrastructure
Uphrading the Scholarly InfrastructureBjörn Brembs
 
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWUSING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWNellore Harilakshmi
 
ACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectStuart Chalk
 
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...NASIG
 
Recommendations for selection process automation in systematic reviews
Recommendations for selection process automation in systematic reviewsRecommendations for selection process automation in systematic reviews
Recommendations for selection process automation in systematic reviewsFaisal Razzak
 

What's hot (18)

Hahn "Wikidata as a hub to library linked data re-use"
Hahn "Wikidata as a hub to library linked data re-use"Hahn "Wikidata as a hub to library linked data re-use"
Hahn "Wikidata as a hub to library linked data re-use"
 
Self adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation ofSelf adaptive based natural language interface for disambiguation of
Self adaptive based natural language interface for disambiguation of
 
Predicting Online News Popularity
Predicting Online News Popularity Predicting Online News Popularity
Predicting Online News Popularity
 
Webometrics report
Webometrics reportWebometrics report
Webometrics report
 
Altmetrics
Altmetrics Altmetrics
Altmetrics
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Dotnet ranking on data manifold with sink points
Dotnet  ranking on data manifold with sink pointsDotnet  ranking on data manifold with sink points
Dotnet ranking on data manifold with sink points
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Paper id 26201475
Paper id 26201475Paper id 26201475
Paper id 26201475
 
Data science
Data science Data science
Data science
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?
 
Data Quality and the FAIR principles
Data Quality and the FAIR principlesData Quality and the FAIR principles
Data Quality and the FAIR principles
 
Workshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in WikidataWorkshop on Data Quality Management in Wikidata
Workshop on Data Quality Management in Wikidata
 
Uphrading the Scholarly Infrastructure
Uphrading the Scholarly InfrastructureUphrading the Scholarly Infrastructure
Uphrading the Scholarly Infrastructure
 
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEWUSING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
 
ACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData ProjectACS 248th Paper 104 ChemData Project
ACS 248th Paper 104 ChemData Project
 
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
Capturing and Analyzing Publication, Citation and Usage Data for Contextual C...
 
Recommendations for selection process automation in systematic reviews
Recommendations for selection process automation in systematic reviewsRecommendations for selection process automation in systematic reviews
Recommendations for selection process automation in systematic reviews
 

Similar to Scholars@Cornell: From Data in Peace to Data in Use. (VIVO'18)

Singapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docxSingapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docxjennifer822
 
Singapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docxSingapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docxedgar6wallace88877
 
Research Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon PorterResearch Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon PorterCASRAI
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Clare Dean
 
Adopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projectsAdopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projectsCranfield University
 
Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseRinke Hoekstra
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data ManagementCarole Goble
 
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748Jyotindra Zaveri
 
Get 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed Space
Get 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed SpaceGet 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed Space
Get 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed SpaceNikki DeMoville
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott LibraryRebekah Cummings
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Bradley Allen
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPISteven Miller
 
Ross Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy DevelopmentsRoss Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy DevelopmentsWiley
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale Bernadette Hyland-Wood
 
Introduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleIntroduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleDr. Radhey Shyam
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of DataA Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of DataLaurens De Vocht
 
A Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesA Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesDr. Amarjeet Singh
 

Similar to Scholars@Cornell: From Data in Peace to Data in Use. (VIVO'18) (20)

Singapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docxSingapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docx
 
Singapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docxSingapore Management UniversityInstitutional Knowledge at Si.docx
Singapore Management UniversityInstitutional Knowledge at Si.docx
 
Research Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon PorterResearch Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon Porter
 
McGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and ScalingMcGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and Scaling
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Adopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projectsAdopting a situated learning framework for (big) data projects
Adopting a situated learning framework for (big) data projects
 
Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS case
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
Knowledge graphs dedicated to the memory of amrapali zaveri 3388748
 
Get 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed Space
Get 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed SpaceGet 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed Space
Get 'em in, Get 'em out: Finding a Road from Turnaway Data to Repurposed Space
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
 
Ross Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy DevelopmentsRoss Wilkinson - Data Publication: Australian and Global Policy Developments
Ross Wilkinson - Data Publication: Australian and Global Policy Developments
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
Introduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleIntroduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycle
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of DataA Framework Concept for Profiling Researchers on Twitter using the Web of Data
A Framework Concept for Profiling Researchers on Twitter using the Web of Data
 
A Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: ChallengesA Survey on Big Data Analytics: Challenges
A Survey on Big Data Analytics: Challenges
 

More from Muhammad Javed

Open Harvester - Search publications for a researcher from CrossRef, PubMed a...
Open Harvester - Search publications for a researcher from CrossRef, PubMed a...Open Harvester - Search publications for a researcher from CrossRef, PubMed a...
Open Harvester - Search publications for a researcher from CrossRef, PubMed a...Muhammad Javed
 
Extending Local Data: "Where to start from"
Extending Local Data: "Where to start from"Extending Local Data: "Where to start from"
Extending Local Data: "Where to start from"Muhammad Javed
 
VIZ-VIVO: Towards Visualizations-driven Linked Data Navigation
VIZ-VIVO: Towards Visualizations-driven Linked Data NavigationVIZ-VIVO: Towards Visualizations-driven Linked Data Navigation
VIZ-VIVO: Towards Visualizations-driven Linked Data NavigationMuhammad Javed
 
Scholars@Cornell: Visualizing the Scholarship Data
Scholars@Cornell: Visualizing the Scholarship DataScholars@Cornell: Visualizing the Scholarship Data
Scholars@Cornell: Visualizing the Scholarship DataMuhammad Javed
 
Scholars@Cornell: An Envision - My unfulfilled Dream.
Scholars@Cornell: An Envision  - My unfulfilled Dream.Scholars@Cornell: An Envision  - My unfulfilled Dream.
Scholars@Cornell: An Envision - My unfulfilled Dream.Muhammad Javed
 
VIVO for visualization and analysis
VIVO for visualization and analysisVIVO for visualization and analysis
VIVO for visualization and analysisMuhammad Javed
 
VIVO: A Community-driven Research Information Management System: Challenges a...
VIVO: A Community-driven Research Information Management System: Challenges a...VIVO: A Community-driven Research Information Management System: Challenges a...
VIVO: A Community-driven Research Information Management System: Challenges a...Muhammad Javed
 
Scholars@Cornell: Visualizing the Scholarship data
Scholars@Cornell: Visualizing the Scholarship dataScholars@Cornell: Visualizing the Scholarship data
Scholars@Cornell: Visualizing the Scholarship dataMuhammad Javed
 
Scholars@Cornell: Visualizing the scholarly record
Scholars@Cornell: Visualizing the scholarly recordScholars@Cornell: Visualizing the scholarly record
Scholars@Cornell: Visualizing the scholarly recordMuhammad Javed
 

More from Muhammad Javed (9)

Open Harvester - Search publications for a researcher from CrossRef, PubMed a...
Open Harvester - Search publications for a researcher from CrossRef, PubMed a...Open Harvester - Search publications for a researcher from CrossRef, PubMed a...
Open Harvester - Search publications for a researcher from CrossRef, PubMed a...
 
Extending Local Data: "Where to start from"
Extending Local Data: "Where to start from"Extending Local Data: "Where to start from"
Extending Local Data: "Where to start from"
 
VIZ-VIVO: Towards Visualizations-driven Linked Data Navigation
VIZ-VIVO: Towards Visualizations-driven Linked Data NavigationVIZ-VIVO: Towards Visualizations-driven Linked Data Navigation
VIZ-VIVO: Towards Visualizations-driven Linked Data Navigation
 
Scholars@Cornell: Visualizing the Scholarship Data
Scholars@Cornell: Visualizing the Scholarship DataScholars@Cornell: Visualizing the Scholarship Data
Scholars@Cornell: Visualizing the Scholarship Data
 
Scholars@Cornell: An Envision - My unfulfilled Dream.
Scholars@Cornell: An Envision  - My unfulfilled Dream.Scholars@Cornell: An Envision  - My unfulfilled Dream.
Scholars@Cornell: An Envision - My unfulfilled Dream.
 
VIVO for visualization and analysis
VIVO for visualization and analysisVIVO for visualization and analysis
VIVO for visualization and analysis
 
VIVO: A Community-driven Research Information Management System: Challenges a...
VIVO: A Community-driven Research Information Management System: Challenges a...VIVO: A Community-driven Research Information Management System: Challenges a...
VIVO: A Community-driven Research Information Management System: Challenges a...
 
Scholars@Cornell: Visualizing the Scholarship data
Scholars@Cornell: Visualizing the Scholarship dataScholars@Cornell: Visualizing the Scholarship data
Scholars@Cornell: Visualizing the Scholarship data
 
Scholars@Cornell: Visualizing the scholarly record
Scholars@Cornell: Visualizing the scholarly recordScholars@Cornell: Visualizing the scholarly record
Scholars@Cornell: Visualizing the scholarly record
 

Recently uploaded

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 

Recently uploaded (20)

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 

Scholars@Cornell: From Data in Peace to Data in Use. (VIVO'18)

  • 2. Data Collection Data Cleaning Data Transformation Data Storage 01 02 03 04 - Person and Position Data (Human Resource) - Publications data (Citation Data Sources) - Grants Data (Office of Sponsored Programs) - Incomplete Data, Incorrect Data, Duplicate Data - CSV to RDF Conversion - XML to RDF Conversion - Loading RDF Data to VIVO Triplestore Data Access Data Analysis Data Visualization Data Prediction VIVO Data Life Cycle Scope of the Presentation @Cornell(2009 - 2016)
  • 5. Advance the visibility and accessibility of Cornell scholarship and creative expression (May 2016) & Explore the scholarship of Cornell from the perspective of what the scholarly record itself can tell us. (August 2017)
  • 6. ‘Us’ who? Explore the scholarship of Cornell from the perspective of what the scholarly record itself can tell us. Knowledge Delivery Lorem ipsum Lorem ipsum Lorem ipsum Lorem ipsum Faculty Admin Librarian Student Faculty is interested in global visibility of their research and an authoritative source of data about their scholarship and how it relates to other scholars. Faculty Academic deans and department chairs need macro views of research output, global impact and for faculty reporting. Admin Librarians need data to support strategies to address issues such as prioritization and selection of library resources and collections. Librarians Students Students need means to discover faculty, domain experts and their research contributions. who are the stakeholders
  • 7. Explore the scholarship of Cornell from the perspective of what the scholarly record itself can tell us. About whom?
  • 8. FEED MACHINE publications Database Graph Scholars@Cor1ell - “System of Systems” g;ants & cont;acts people & positions academic units Manual Data Analysis Module SPARQL JAVA JSON, RDF Push t;iples in the g;aph Push JSON files QuerE Database Process QuerE ResultSets Process outHut files Scholars@Cornell
  • 10. Image taken from http://www.servervitalsigns.com/liberate-domino-data/ Linked Open Data Yes, in RDF ForLat Can I reuse it? YesNo, May be RDFOWL Ontology EngineersSoftware Developers JSJSON Can I access it ? [LOD]VIVO records data as
  • 11. Image taken from http://www.servervitalsigns.com/liberate-domino-data/ Data Yes, in RDF ForLat Can I reuse it? YesNo, May be RDFOWL Ontology EngineersSoftware Developers JSJSON Can I access it ? We need Linked OpenReusable [LO D]R
  • 12. RDFOWL Ontology Engineers Software Developers JSJSON 1) Data Access 2) Data Reuse Unlock your Data 3) Infographics Reuse https://cul-it.github.io/vivo-data-distribution-api/ RDF to JSON: Data Distribution API forVIVO
  • 14. The Data Maturity Model http://www.b-eye-network.com/view/15105 Stage One: No Usable Data Stage Two: Big Data Stage Three: The Right Data Stage Four: Predictions Stage Five: Strategy With little or no useful data. Can’t run metrics. Doesn’t fully understand data. No information-backed insights. Have access to large data. Steady flow of data from multiple sources. But few tools to turn data into information. Spent more time looking than analyzing. Have access to high quality data. Apply context and relevance to data models. Explain data in meaningful ways. Cornell units accept responsibility for being Content Creators. Can conduct historical & predictive analysis. What is likely to happen tomorrow and beyond. Can predict customer behavior and market demand. Entire business model is built around its analytical models. Predictive analysis is integrated into core business processes.
  • 15. Say No to long VIVO Lists
  • 16. From Counting to Connecting Courtesy: Pedro Parraguez (DTU)
  • 17. KNOWKNOWLEDGE Linked Data Data is merely a record & VIVO is just a (linked) data warehouse unless we pull the knowledge out of it. From Data to Knowledge
  • 18. How do we SHARE the KNOWLEDGE in Scholars@Cornell ?
  • 19. Access and Reuse - List of Publications for a Faculty. https://scholars.cornell.edu/api/dataRequest/listPublications?person=http://scholars.cornell.edu/individual/mjh78 - List of Grants - of a Person - of an Academic Unit - List of Current Faculty - of an Academic Unit - List of Academic / Industry-level CoAuthorships * - of an Academic Unit - of an Person Future Plans - and more … - in JSON Format - in RDF Format via Data Distribution API
  • 20. Knowledge Share via Infographics Global Collaborations Grants Keyword Cloud Co-authorships Inter-departmental co-authorships Research Interests (in a department) Journals/Proceedings
  • 21. in Scholars@Cornell Image taken from: https://marketingland.com/breadcrumb-links-good-user-experience-yes-97848 User Experience DEMO
  • 22. The Data Maturity Model http://www.b-eye-network.com/view/15105 Stage One: No Usable Data Stage Two: Big Data Stage Three: The Right Data Stage Four: Predictions Stage Five: Strategy With little or no useful data. Can’t run metrics. Doesn’t fully understand data. No information-backed insights. Have access to large data. Steady flow of data from multiple sources. But few tools to turn data into information. Spent more time looking than analyzing. Have access to high quality data. Apply context and relevance to data models. Explain data in meaningful ways. Cornell Units accept responsibility for being Content Creators. Can conduct historical & predictive analysis. What is likely to happen tomorrow and beyond. Can predict customer behavior and market demand. Entire business model is built around its analytical models. Predictive analysis is integrated into core business processes. VIVO Cornell Scholars@Cornell
  • 24. July 2017 June 2018 1.College of Engineering 2.Johnson Graduate School of Management 3.BoyceThompson Institute 1.College of Engineering 2.Johnson Graduate School of Management 3.BoyceThompson Institute 4.College of Agriculture and Life Sciences 5.Law School (Pilot) 6.College of Veterinary Sciences (Pilot) PARTNERS DATA SOURCES 1.Upstream sources of Symplectic Elements 1.Upstream sources of Symplectic Elements 2.Activity Insight (Digital Measures) 3.Institutional Repository (Digital Commons) DATATYPES 1.Journal Articles 1.Journal Articles 2.Conference Papers DATA ACCESS 1.Data Download i. RDF 1.Data Download i. RDF ii. JSON (using Date Distr. API) iii.CSV (VIZ Data) iv.SVG (VIZ) 2.VIZ Embed Achievements Cornell - International Academic Partners OPERA - Open Research Analytics Denmark’s Electronic Research Library Project
  • 26. “VIVO 2 ORCID” Data Push THE NEXT BIGTHING Jim Blake
  • 27. - Envisioning an emerging collabora=ve data exchange/ share hub: Scholars@Cornell an institutional investment ! MANAGEMENT TEAMWORK • Faculty, • Department Chair, College Deans, • Admin Staff (responsible for faculty reporting and faculty data mgmt./websites) • Librarians, • Students, • Provost OfFice • OfFice of Research
  • 29. Acknowledgments A number of 24Slides free templates were used to prepare the slides Parraguez, P. and Maier, A.M. (2016), “Using network science to support design research: From counting to connecting”, in Cash, P., Stankovic, T. and Storga, M. (Eds.),Experimental Design Research: Approaches, perspectives, applications, Springer, Cham, pp. 153–172. Counting to Connecting Images: Wehrli, U., Born, G. and Spehr, D. (2013), The art of clean up: life made neat and tidy, Chronicle Books, San Francisco. Few other images were taken from Google-Images.