SlideShare a Scribd company logo
1 of 64
NISO/DCMI Webinar:
Semantic Mashups Across
Large, Heterogeneous Institutions:
Experiences from the VIVO Service
May 22, 2013
Speaker:
John Fereira,
Senior Programmer/Analyst and
Technology Strategist at Cornell University
http://www.niso.org/news/events/2013/dcmi/vivo
Semantic mashups across
large, heterogeneous
institutions: experiences
from the VIVO service
John Fereira
Cornell University
Overview
• What is VIVO?
• History of VIVO
• High level Overview
• Ingesting Data into VIVO
• Exposing Data in Vivo
What is VIVO?
• VIVO is not an acronym
• A semantic web application that enables the discovery of
research and scholarship across disciplines in an
institution.
• VIVO enables collaboration and understanding across an
institution and among institutions – and not just for
scientists.
• A powerful search/browse functionality for locating people
and information within or across institutions.
What is VIVO?
• An ontology editor. Vivo includes a “vivo” ontology
with can be modified and extended
• An instance editor. Instances of classes such as a
Person, Organization, Event, etc. can be created,
modified, and deleted
• Content can also be brought into VIVO in automated
ways from local systems of record, such as HR,
grants, course, and faculty activity databases, or
from database providers such as publication
aggregators and funding agencies.
What is VIVO?
• VIVO is a content disseminator
• Views of People, Organizations, etc. can be highly
customized
• VIVO provides visualizations such as topic maps, co-
authorship networks
• Open data means other applications can use it
A brief History of VIVO
• 2003 – Vivo created for local use at Cornell University
for life sciences collaboration
• 2007 - Reimplemented using RDF, OWL, Jena and
SPARQL
• 2007 – Implemented at Cornell and University of
Florida as “production” systems
A brief History of VIVO
• 2009 - seven institutions received $12.2 million in
funding from the National Center for Research
Resources of the NIH to enable a national network of
scientists
• 2010 – Version 1.0 released as open source
• 2013 – Now at version 1.5.1
• 2013 – Transitioning from funded project to a
sustainable community open source project
A high level Overview
• Core ideas
• Searching/browsing
• Self editing
Core ideas
• Research and researchers should be discoverable
independently of administrative hierarchies
• Relationships are as interesting as the facts
• It’s the network, not just the nodes
• Static data models are too confining
• Granular data management allows multiple views and
re-purposing
• Discovery is improved by linking pages to surrounding
context
VIVO and Linked Open Data
• VIVO enables authoritative data about researchers to become
part of the Linked Open Data (LOD) cloud
Tim Berners-Lee, http://www.w3.org/2009/Talks/0204-ted-tbl
Linked Data principles
Tim Berners-Lee:
▫ Use URIs as names for things
▫ Use HTTP URIs so that people can look up those names
▫ When someone looks up a URI, provide useful
information, using the standards (RDF, SPARQL)
▫ Include links to other URIs so that people can discover
more things
http://linkeddata.org
VIVO in the LOD cloud
Searching and Browsing
• Triple store indexed into a SOLR instance
• Searches are against SOLR
• Instance data comes from triplestore
• An example…
Food security
Self Editing
• Users can edit their own profile
• System can delegate editing to “proxy” editors
• Some data can be locked
• An example
Editable and non-editable fields
Most text fields support “rich text”
External Concepts for “terms”
Data Ingest (harvesting)
VIVO harvests much of its data automatically from
verified sources
•Reduces the need for manual input of data
•Provides an integrated and flexible source of publicly
visible data at an institutional level
Data, data, data
Individuals may also edit and customize their profiles to
suit their professional needs
External data
sources
Internal data
sources
Ingesting data with the Vivo Harvester
• A pipeline of tools
• Tools are written java, using Jena APIs
• Can fetch data from a variety of data formats
• Data can be sanitized and disambiguated
• Data is ingested directly to the triple store…does not
require VIVO web app to be running
Harvesting Pipeline
• Fetcher/Parser
• Translate: maps rdf to “vivo” RDF
• Transfer to local triple store (Jena TDB)
• Disambiguate using Scoring/Matching
• Changenamespace (mint unique URIs)
• Diff with previous model to create subtractions
• Transfer to VIVO triple store
Fetching and Parsing
• Fetches data from a URL, Database, local file
• Many different types of fetchers
▫ CSV fetcher
▫ JDBC fetcher
▫ SimpleXMLFetcher
▫ JSONFetcher
• Output is intermediate RDF Format, one file per
record
• “Fake” namespace used
<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:node-person="http://vivo.example.com/harvest/aims_users/fields/person/"
xml:base="http://vivo.example.com/harvest/aims_users/person">
<rdf:Description rdf:ID="node_-_0">
<rdf:type rdf:resource="http://vivo.example.com/harvest/aims_users/types#person"/>
<node-person:Picture>http://aims.fao.org/sites/default/files/profiles/profile_image_108074.jpg</node-person:Picture>
<node-person:Website>http://www.valeriapesce.name</node-person:Website>
<node-person:Nid>108074</node-person:Nid>
<node-person:Profile>In the last six years at the Global Forum on Agricultural research (GFAR) I have worked extensively on
metad
ata standards and protocols for managing and exchanging information between systems, in strict collaboration with the OEKCS
group in
FAO.</node-person:Profile>
<node-person:Organization>Food and Agriculture Organization of the United Nations (FAO)</node-person:Organization>
<node-person:Expertise>Information management tools, information systems, information architectures</node-person:Expertise>
<node-person:LastName>Pesce</node-person:LastName>
<node-person:Country>Italy</node-person:Country>
<node-person:Email>valeria.pesce@fao.org</node-person:Email>
<node-person:geolocation>http://aims.fao.org/aos/geopolitical.owl#Italy</node-person:geolocation>
<node-person:Profile_URL>http://aims.fao.org/node/108074</node-person:Profile_URL>
<node-person:Username>valeria.pesce</node-person:Username>
<node-person:FirstName>Valeria</node-person:FirstName>
<node-person:Role>Information Management Specialist</node-person:Role>
<node-person:Interests>agINFRA, AgriDrupal, AgriFeeds, AgriVIVO, authority control, automatic indexing, CIARD Content
Management
Task Force, CIARD RING, cloud services, CMS - Content Management Systems, data exchange, Drupal, IAALD - International
Association of
Agricultural Information Specialists, information management, institutional repository software, interoperability, Linked Open Data
- LOD, RDF - Resource Description Framework, Semantic Web</node-person:Interests>
</rdf:Description>
</rdf:RDF>
Translate
• Map “fake” namespace to VIVO classes and
properties
• Uses XSLT transform
• Unique ID for each record
• node-person:Organization becomes
foaf:Organization
• Relationships created
Translated RDF
<rdf:Description rdf:about="http://vivo.example.com/harvest/aims_users/person/uid-108074">
<rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Person"/>
<rdfs:label>Pesce, Valeria</rdfs:label>
<core:currentMemberOf rdf:resource="http://vivo.example.com/harvest/aims_users/org/aims"/>
<foaf:firstName>Valeria</foaf:firstName>
<foaf:lastName>Pesce</foaf:lastName>
<core:primaryEmail>valeria.pesce@fao.org</core:primaryEmail>
<core:positionInOrganization
rdf:resource="http://vivo.example.com/harvest/aims_users/org/Food%20and%20Agriculture%20Organization%20of%20the%20
United%20Nations%20(FAO)"/>
</rdf:Description>
<rdf:Description
rdf:about="http://vivo.example.com/harvest/aims_users/org/Food%20and%20Agriculture%20Organization%20of%20the%20Uni
ted%20Nations%20(FAO)">
<rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Organization"/>
<rdfs:label>Food and Agriculture Organization of the United Nations (FAO)</rdfs:label>
<core:organizationForPosition
rdf:resource="http://vivo.example.com/harvest/aims_users/position/positionFor108074inFood%20and%20Agriculture%20Organ
ization%20of%20the%20United%20Nations%20(FAO)"/>
<core:hasGeographicLocation rdf:resource="http://aims.fao.org/aos/geopolitical.owl#Italy"/>
</rdf:Description>
Transfer
• Load RDF into TDB triplestore
• Duplicate URIs are not loaded
• Further operations are made in the triple store
Scoring/Match
• Disambiguates People, Organizations, etc. based
upon property values
• Supports Equality, NameCompare,
NormalizedLevenshteinDifference, Soundex
algorithms
• Each property is weighted
▫ firstName: 0.5
▫ lastName: 0.5
▫ Email: 1.0
• MatchThreshHold: 1.0
Matching
• Determines what should be done with a record
which matches another record based upon it’s
“score”
▫ Replace old record
▫ Merge records
▫ Ignore record
ChangeNameSpace
• Match old namespace pattern in configuration file
http://vivo.example.com/harvest/aims_users/person/
• Specify namespace in VIVO
http://agrivivodev.mannlib.cornell.edu/vivo/individual/
• Mint a new URI in the vivo namespace
http://agrivivodev.mannlib.cornell.edu/vivo/individual/n123456
Diff of previous harvest
• Compare TDB model with previous harvest
• Generate vivo-additions.rdf
• Generate vivo-substractions.rdf
Final Transfer
• Load vivo-subtractions.rdf file into SDB
• Load vivo-additions.rdf file into SDB
Data Ingest alternatives
• Karma: an information integration tool which
provides a GUI for modeling data into an ontology
• Google Refine: Good for one time ingests and has a
VIVO RDF plugin
• VIVO admin tools can load RDF
Exposing Data in VIVO
• Vivo web pages
• View data as RDF
• Query a Sparql Endpoint and transform results
• Drupal front end
Default VIVO theme
Cornell VIVO
Griffiths University
Melbourne Find an Expert
Visualization
• Completed Work
▫ Co-Author visualization
▫ Sparklines
▫ VIVO world activity map
VIVO 1.0 source code was publicly released on April 14, 2010
87 downloads by June 11, 2010. 917 downloads on July 16, 2o10.
The more institutions adopt VIVO, the more high quality data will be available to understand, navigate,
manage, utilize, and communicate progress in science and technology.
06/2010
View RDF from profile page
Requesting RDF using an Accept Header
• curl -H "Accept: application/rdf+xml" -X GET
http://vivo.ufl.edu/display/n25562
Retrieving data with SPARQL
• Fuseki sparql endpoint installed (not included)
• Callable with a SPARQL Client
• Semantic Services
▫ Manages custom sparql queries
▫ Exposes URL for external sites
▫ Can ask for output as html, xml, json
Semantic Services application
Hector Abruna in VIVO
Hector Abruna on Chemistry Site
Viewing VIVO data with Drupal
• Import data with Feeds module and Linked Data
Importer
• Examples
Cals Impact Statements
Agrivivo Home Page
Agrivivo map page
AgriVivo
VivoSearch: search across multiple
vivo sites
Vivo SearchLight bookmarklet
Vivo Searchlight
Some Links
• Vivoweb
▫ http://vivoweb.org
• Vivoweb on Sourceforge
▫ http://www.sourceforge.net/projects/vivo
• VivoSearch
▫ http://vivosearch.org
• Vivo Wiki on Duraspace
▫ https://wiki.duraspace.org/display/VIVO
• Mailing Lists
▫ http://sourceforge.net/p/vivo/sfx-list/
Thank you
NISO/DCMI Webinar
Semantic Mashups Across Large, Heterogeneous
Institutions: Experiences from the VIVO Service
NISO/DCMI Webinar • May 22, 2013
Questions?
All questions will be posted with presenter answers on
the NISO website following the webinar:
http://www.niso.org/news/events/2013/dcmi/vivo
Thank you for joining us today.
Please take a moment to fill out the brief online survey.
We look forward to hearing from you!
THANK YOU

More Related Content

What's hot

Metadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentMetadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentDiane Hillmann
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveJanifer Gatenby
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataMarcia Zeng
 
Better Search With Structured Knowledge
Better Search With Structured KnowledgeBetter Search With Structured Knowledge
Better Search With Structured KnowledgeMichel Dumontier
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparqlDhavalkumar Thakker
 
Corrib.org - OpenSource and Research
Corrib.org - OpenSource and ResearchCorrib.org - OpenSource and Research
Corrib.org - OpenSource and Researchadameq
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Hong (Jenny) Jing
 
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...DuraSpace
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...eswcsummerschool
 
Best Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingBest Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingOCLC
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Janifer Gatenby
 
Islandora and Linked Open Data
Islandora and Linked Open Data Islandora and Linked Open Data
Islandora and Linked Open Data eohallor
 

What's hot (20)

Metadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentMetadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data Environment
 
Thompson 6-jun15-final
Thompson 6-jun15-finalThompson 6-jun15-final
Thompson 6-jun15-final
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspective
 
Metadata Workshop
Metadata WorkshopMetadata Workshop
Metadata Workshop
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library Data
 
Library Linked Data and the Future of Bibliographic Control
Library Linked Data and the Future of Bibliographic ControlLibrary Linked Data and the Future of Bibliographic Control
Library Linked Data and the Future of Bibliographic Control
 
Better Search With Structured Knowledge
Better Search With Structured KnowledgeBetter Search With Structured Knowledge
Better Search With Structured Knowledge
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparql
 
Corrib.org - OpenSource and Research
Corrib.org - OpenSource and ResearchCorrib.org - OpenSource and Research
Corrib.org - OpenSource and Research
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
NISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to RealityNISO Webinar: Library Linked Data: From Vision to Reality
NISO Webinar: Library Linked Data: From Vision to Reality
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)
 
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...
4.2.15 Slides, “Hydra: many heads, many connections. Enriching Fedora Reposit...
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
Best Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingBest Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web Archiving
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19
 
Snac webinar v3
Snac webinar v3Snac webinar v3
Snac webinar v3
 
Islandora and Linked Open Data
Islandora and Linked Open Data Islandora and Linked Open Data
Islandora and Linked Open Data
 
NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...
NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...
NISO Webinar: Back From the Endangered List: Using Authority Data to Enhance ...
 

Similar to NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Institutions: Experiences from the VIVO Service

FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...Carole Goble
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
New member
New member New member
New member Crossref
 
Integrating with others: Stable VIVO URIs for local authority records; linkin...
Integrating with others: Stable VIVO URIs for local authority records; linkin...Integrating with others: Stable VIVO URIs for local authority records; linkin...
Integrating with others: Stable VIVO URIs for local authority records; linkin...Violeta Ilik
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsJohn Breslin
 
VIVO at the University of Idaho
VIVO at the University of IdahoVIVO at the University of Idaho
VIVO at the University of Idahoanniegaines
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Richard Urban
 
Metadata for researchers
Metadata for researchers Metadata for researchers
Metadata for researchers Getaneh Alemu
 
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFWho's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFSimeon Warner
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in RomaniaVlad Posea
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...DuraSpace
 
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021Crossref
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love Kristi Holmes
 
New member webinar 052418
New member webinar 052418New member webinar 052418
New member webinar 052418Crossref
 
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...Valeria Pesce
 

Similar to NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Institutions: Experiences from the VIVO Service (20)

Alamw15 VIVO
Alamw15 VIVOAlamw15 VIVO
Alamw15 VIVO
 
Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...
Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...
Kristi Holmes. A bird’s-eye view of scholarship at the individual, institutio...
 
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
New member
New member New member
New member
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
Integrating with others: Stable VIVO URIs for local authority records; linkin...
Integrating with others: Stable VIVO URIs for local authority records; linkin...Integrating with others: Stable VIVO URIs for local authority records; linkin...
Integrating with others: Stable VIVO URIs for local authority records; linkin...
 
Options for online profiles
Options for online profilesOptions for online profiles
Options for online profiles
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social Semantics
 
VIVO at the University of Idaho
VIVO at the University of IdahoVIVO at the University of Idaho
VIVO at the University of Idaho
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 
Metadata for researchers
Metadata for researchers Metadata for researchers
Metadata for researchers
 
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFWho's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in Romania
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
 
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 
New member webinar 052418
New member webinar 052418New member webinar 052418
New member webinar 052418
 
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
 

More from National Information Standards Organization (NISO)

More from National Information Standards Organization (NISO) (20)

Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
 
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
 

Recently uploaded

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 

Recently uploaded (20)

POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 

NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Institutions: Experiences from the VIVO Service

  • 1. NISO/DCMI Webinar: Semantic Mashups Across Large, Heterogeneous Institutions: Experiences from the VIVO Service May 22, 2013 Speaker: John Fereira, Senior Programmer/Analyst and Technology Strategist at Cornell University http://www.niso.org/news/events/2013/dcmi/vivo
  • 2. Semantic mashups across large, heterogeneous institutions: experiences from the VIVO service John Fereira Cornell University
  • 3. Overview • What is VIVO? • History of VIVO • High level Overview • Ingesting Data into VIVO • Exposing Data in Vivo
  • 4. What is VIVO? • VIVO is not an acronym • A semantic web application that enables the discovery of research and scholarship across disciplines in an institution. • VIVO enables collaboration and understanding across an institution and among institutions – and not just for scientists. • A powerful search/browse functionality for locating people and information within or across institutions.
  • 5. What is VIVO? • An ontology editor. Vivo includes a “vivo” ontology with can be modified and extended • An instance editor. Instances of classes such as a Person, Organization, Event, etc. can be created, modified, and deleted • Content can also be brought into VIVO in automated ways from local systems of record, such as HR, grants, course, and faculty activity databases, or from database providers such as publication aggregators and funding agencies.
  • 6. What is VIVO? • VIVO is a content disseminator • Views of People, Organizations, etc. can be highly customized • VIVO provides visualizations such as topic maps, co- authorship networks • Open data means other applications can use it
  • 7. A brief History of VIVO • 2003 – Vivo created for local use at Cornell University for life sciences collaboration • 2007 - Reimplemented using RDF, OWL, Jena and SPARQL • 2007 – Implemented at Cornell and University of Florida as “production” systems
  • 8. A brief History of VIVO • 2009 - seven institutions received $12.2 million in funding from the National Center for Research Resources of the NIH to enable a national network of scientists • 2010 – Version 1.0 released as open source • 2013 – Now at version 1.5.1 • 2013 – Transitioning from funded project to a sustainable community open source project
  • 9. A high level Overview • Core ideas • Searching/browsing • Self editing
  • 10. Core ideas • Research and researchers should be discoverable independently of administrative hierarchies • Relationships are as interesting as the facts • It’s the network, not just the nodes • Static data models are too confining • Granular data management allows multiple views and re-purposing • Discovery is improved by linking pages to surrounding context
  • 11. VIVO and Linked Open Data • VIVO enables authoritative data about researchers to become part of the Linked Open Data (LOD) cloud Tim Berners-Lee, http://www.w3.org/2009/Talks/0204-ted-tbl
  • 12. Linked Data principles Tim Berners-Lee: ▫ Use URIs as names for things ▫ Use HTTP URIs so that people can look up those names ▫ When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL) ▫ Include links to other URIs so that people can discover more things http://linkeddata.org
  • 13. VIVO in the LOD cloud
  • 14. Searching and Browsing • Triple store indexed into a SOLR instance • Searches are against SOLR • Instance data comes from triplestore • An example…
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Self Editing • Users can edit their own profile • System can delegate editing to “proxy” editors • Some data can be locked • An example
  • 22. Most text fields support “rich text”
  • 23. External Concepts for “terms”
  • 25. VIVO harvests much of its data automatically from verified sources •Reduces the need for manual input of data •Provides an integrated and flexible source of publicly visible data at an institutional level Data, data, data Individuals may also edit and customize their profiles to suit their professional needs External data sources Internal data sources
  • 26. Ingesting data with the Vivo Harvester • A pipeline of tools • Tools are written java, using Jena APIs • Can fetch data from a variety of data formats • Data can be sanitized and disambiguated • Data is ingested directly to the triple store…does not require VIVO web app to be running
  • 27. Harvesting Pipeline • Fetcher/Parser • Translate: maps rdf to “vivo” RDF • Transfer to local triple store (Jena TDB) • Disambiguate using Scoring/Matching • Changenamespace (mint unique URIs) • Diff with previous model to create subtractions • Transfer to VIVO triple store
  • 28. Fetching and Parsing • Fetches data from a URL, Database, local file • Many different types of fetchers ▫ CSV fetcher ▫ JDBC fetcher ▫ SimpleXMLFetcher ▫ JSONFetcher • Output is intermediate RDF Format, one file per record • “Fake” namespace used
  • 29. <?xml version="1.0"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:node-person="http://vivo.example.com/harvest/aims_users/fields/person/" xml:base="http://vivo.example.com/harvest/aims_users/person"> <rdf:Description rdf:ID="node_-_0"> <rdf:type rdf:resource="http://vivo.example.com/harvest/aims_users/types#person"/> <node-person:Picture>http://aims.fao.org/sites/default/files/profiles/profile_image_108074.jpg</node-person:Picture> <node-person:Website>http://www.valeriapesce.name</node-person:Website> <node-person:Nid>108074</node-person:Nid> <node-person:Profile>In the last six years at the Global Forum on Agricultural research (GFAR) I have worked extensively on metad ata standards and protocols for managing and exchanging information between systems, in strict collaboration with the OEKCS group in FAO.</node-person:Profile> <node-person:Organization>Food and Agriculture Organization of the United Nations (FAO)</node-person:Organization> <node-person:Expertise>Information management tools, information systems, information architectures</node-person:Expertise> <node-person:LastName>Pesce</node-person:LastName> <node-person:Country>Italy</node-person:Country> <node-person:Email>valeria.pesce@fao.org</node-person:Email> <node-person:geolocation>http://aims.fao.org/aos/geopolitical.owl#Italy</node-person:geolocation> <node-person:Profile_URL>http://aims.fao.org/node/108074</node-person:Profile_URL> <node-person:Username>valeria.pesce</node-person:Username> <node-person:FirstName>Valeria</node-person:FirstName> <node-person:Role>Information Management Specialist</node-person:Role> <node-person:Interests>agINFRA, AgriDrupal, AgriFeeds, AgriVIVO, authority control, automatic indexing, CIARD Content Management Task Force, CIARD RING, cloud services, CMS - Content Management Systems, data exchange, Drupal, IAALD - International Association of Agricultural Information Specialists, information management, institutional repository software, interoperability, Linked Open Data - LOD, RDF - Resource Description Framework, Semantic Web</node-person:Interests> </rdf:Description> </rdf:RDF>
  • 30. Translate • Map “fake” namespace to VIVO classes and properties • Uses XSLT transform • Unique ID for each record • node-person:Organization becomes foaf:Organization • Relationships created
  • 31. Translated RDF <rdf:Description rdf:about="http://vivo.example.com/harvest/aims_users/person/uid-108074"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Person"/> <rdfs:label>Pesce, Valeria</rdfs:label> <core:currentMemberOf rdf:resource="http://vivo.example.com/harvest/aims_users/org/aims"/> <foaf:firstName>Valeria</foaf:firstName> <foaf:lastName>Pesce</foaf:lastName> <core:primaryEmail>valeria.pesce@fao.org</core:primaryEmail> <core:positionInOrganization rdf:resource="http://vivo.example.com/harvest/aims_users/org/Food%20and%20Agriculture%20Organization%20of%20the%20 United%20Nations%20(FAO)"/> </rdf:Description> <rdf:Description rdf:about="http://vivo.example.com/harvest/aims_users/org/Food%20and%20Agriculture%20Organization%20of%20the%20Uni ted%20Nations%20(FAO)"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Organization"/> <rdfs:label>Food and Agriculture Organization of the United Nations (FAO)</rdfs:label> <core:organizationForPosition rdf:resource="http://vivo.example.com/harvest/aims_users/position/positionFor108074inFood%20and%20Agriculture%20Organ ization%20of%20the%20United%20Nations%20(FAO)"/> <core:hasGeographicLocation rdf:resource="http://aims.fao.org/aos/geopolitical.owl#Italy"/> </rdf:Description>
  • 32. Transfer • Load RDF into TDB triplestore • Duplicate URIs are not loaded • Further operations are made in the triple store
  • 33. Scoring/Match • Disambiguates People, Organizations, etc. based upon property values • Supports Equality, NameCompare, NormalizedLevenshteinDifference, Soundex algorithms • Each property is weighted ▫ firstName: 0.5 ▫ lastName: 0.5 ▫ Email: 1.0 • MatchThreshHold: 1.0
  • 34. Matching • Determines what should be done with a record which matches another record based upon it’s “score” ▫ Replace old record ▫ Merge records ▫ Ignore record
  • 35. ChangeNameSpace • Match old namespace pattern in configuration file http://vivo.example.com/harvest/aims_users/person/ • Specify namespace in VIVO http://agrivivodev.mannlib.cornell.edu/vivo/individual/ • Mint a new URI in the vivo namespace http://agrivivodev.mannlib.cornell.edu/vivo/individual/n123456
  • 36. Diff of previous harvest • Compare TDB model with previous harvest • Generate vivo-additions.rdf • Generate vivo-substractions.rdf
  • 37. Final Transfer • Load vivo-subtractions.rdf file into SDB • Load vivo-additions.rdf file into SDB
  • 38. Data Ingest alternatives • Karma: an information integration tool which provides a GUI for modeling data into an ontology • Google Refine: Good for one time ingests and has a VIVO RDF plugin • VIVO admin tools can load RDF
  • 39. Exposing Data in VIVO • Vivo web pages • View data as RDF • Query a Sparql Endpoint and transform results • Drupal front end
  • 44. Visualization • Completed Work ▫ Co-Author visualization ▫ Sparklines ▫ VIVO world activity map
  • 45.
  • 46. VIVO 1.0 source code was publicly released on April 14, 2010 87 downloads by June 11, 2010. 917 downloads on July 16, 2o10. The more institutions adopt VIVO, the more high quality data will be available to understand, navigate, manage, utilize, and communicate progress in science and technology. 06/2010
  • 47. View RDF from profile page
  • 48. Requesting RDF using an Accept Header • curl -H "Accept: application/rdf+xml" -X GET http://vivo.ufl.edu/display/n25562
  • 49. Retrieving data with SPARQL • Fuseki sparql endpoint installed (not included) • Callable with a SPARQL Client • Semantic Services ▫ Manages custom sparql queries ▫ Exposes URL for external sites ▫ Can ask for output as html, xml, json
  • 52. Hector Abruna on Chemistry Site
  • 53. Viewing VIVO data with Drupal • Import data with Feeds module and Linked Data Importer • Examples
  • 58. VivoSearch: search across multiple vivo sites
  • 61. Some Links • Vivoweb ▫ http://vivoweb.org • Vivoweb on Sourceforge ▫ http://www.sourceforge.net/projects/vivo • VivoSearch ▫ http://vivosearch.org • Vivo Wiki on Duraspace ▫ https://wiki.duraspace.org/display/VIVO • Mailing Lists ▫ http://sourceforge.net/p/vivo/sfx-list/
  • 63. NISO/DCMI Webinar Semantic Mashups Across Large, Heterogeneous Institutions: Experiences from the VIVO Service NISO/DCMI Webinar • May 22, 2013 Questions? All questions will be posted with presenter answers on the NISO website following the webinar: http://www.niso.org/news/events/2013/dcmi/vivo
  • 64. Thank you for joining us today. Please take a moment to fill out the brief online survey. We look forward to hearing from you! THANK YOU

Editor's Notes

  1. Authoritative data, diverse formats, filter out private informationTalk about verified dataTalking points: Much of the data in VIVO profiles is ingested from authoritative sources so it is accurate and current, reducing the need for manual input.Private or sensitive information is never imported into VIVO. Only public information will be stored and displayed. Data is housed and maintained at the local institutions. There it can be updated on a regular basis. There are three ways to get data: internal, external, individuals. Internal is authoritative!The rich information in VIVO profiles can be repurposed and shared with other institutional web pages and consumers, reducing cost and increasing efficiencies across the institution.
  2. Co-author visAn at-a-glance view of an individual&apos;s collaboration space. Who do they collaborate with most often? Do they always work with the same people, or do they work with multiple separate communities?Links increase in size and color with more frequent collaboration. Co-authors are clustered into communities. Users can explore the social network by traveling to co-authors pages.
  3. Since VIVO stores profile information drawn from a variety of sources in a single, flexible format, it can be easily “re-skinned” or “re-purposed” to present specialized views into the institution.
  4. Since VIVO stores profile information drawn from a variety of sources in a single, flexible format, it can be easily “re-skinned” or “re-purposed” to present specialized views into the institution.