BioPortal is a repository of biomedical ontologies—the largest such repository, with more than 300 ontologies to date. This set includes ontologies that were developed in OWL, OBO and other languages, as well as a large number of medical terminologies that the US National Li- brary of Medicine distributes in its own proprietary format. We have published the RDF based serializations of all these ontologies and their meta- data at sparql.bioontology.org.
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
This is a thesis presentation about interlinking educational data to Web of Data. I explain how I used the Linked Data approach to expose and interlink educational data to the Linked Open Data cloud
Jean-Claude Bradley presents on "Peer Review and Science2.0: blogs, wikis and social networking sites" as a guest lecturer for the “Peer Review Culture in Scholarly Publication and Grantmaking” course at Drexel University. The main thrust of the presentation is that peer review alone is not capable of coping with the increasing flood of scientific information being generated and shared. Arguments are made to show that providing sufficient proof for scientific findings does scale and weakens the tragedy of the trusted source cascade.
Using Architectures for Semantic Interoperability to Create Journal Clubs for...James Powell
In certain types of _slow burn_ emergencies, careful accumulation and evaluation of information can offer a crucial advantage. The SARS outbreak in the first decade of the 21st century was such an event, and ad hoc
journal clubs played a critical role in assisting scientific and technical responders in identifying and developing various strategies for halting what could have become a dangerous pandemic. This paper describes a process for leveraging emerging semantic web and digital library architectures and standards to (1) create a focused collection of bibliographic metadata, (2) extract semantic information, (3) convert it to the Resource Description Framework /Extensible Markup Language (RDF/XML),
and (4) integrate it so that scientific and technical responders can share and explore critical information in the collections.
systems.
Talk about Exploring the Semantic Web, and particularly Linked Data, and the Rhizomer approach. Presented August 14th 2012 at the SRI AIC Seminar Series, Menlo Park, CA
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
Tony Burdett's slides from his talk at Connected Data London. Tony is a Senior Software Engineer at The European Bioinformatics Institute. He presented the complexity of data at the EMBL-EBI and what is their solution to make sense of all this data.
Project Website: http://www.researchobject.org/
researchobjects.org is a community project that has developed an approach to describe and package up all resources used as part of an investigation as Research Objects (RO’s).
RO’s - provide two main features; a manifest - a consistent way to provide a well-typed, structured description of the resources used in an investigation; and a ‘bundle’ - a mechanism for packaging up manifests with resources as a single, publishable unit.
RO’s therefore carry the research context of an experiment - data, software, standard operating procedures (SOPs), models etc - and gather together the components of an experiment so that they are findable, accessible, interoperable and reproducible (FAIR). RO’s combine software and data into an aggregative data structure consisting of well described reconstructable parts.
RO’s have the potential to address a number of challenges pertinent to open research including: a) supporting interoperability between infrastructures by using ROs as a primary mechanism for exchange and publication b) supporting the evolution of research objects as a living collection, enabling provenance tracking c) providing the ability to pivot research object components (data, software, models) that are not restricted to the traditional publication.
Here we present work towards the development and adoption of ROs:
(i) A series of specifications and conventions, using community standards, for the RO manifest and RO bundles.
(ii) Implementations of Java, Python and Ruby APIs and tooling against those specifications;
(iii) Examples of representations of the RO models in various languages (e.g. JSON-LD, RDF, HTML).
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
More Related Content
Similar to Using SPARQL to Query BioPortal Ontologies and Metadata
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
This is a thesis presentation about interlinking educational data to Web of Data. I explain how I used the Linked Data approach to expose and interlink educational data to the Linked Open Data cloud
Jean-Claude Bradley presents on "Peer Review and Science2.0: blogs, wikis and social networking sites" as a guest lecturer for the “Peer Review Culture in Scholarly Publication and Grantmaking” course at Drexel University. The main thrust of the presentation is that peer review alone is not capable of coping with the increasing flood of scientific information being generated and shared. Arguments are made to show that providing sufficient proof for scientific findings does scale and weakens the tragedy of the trusted source cascade.
Using Architectures for Semantic Interoperability to Create Journal Clubs for...James Powell
In certain types of _slow burn_ emergencies, careful accumulation and evaluation of information can offer a crucial advantage. The SARS outbreak in the first decade of the 21st century was such an event, and ad hoc
journal clubs played a critical role in assisting scientific and technical responders in identifying and developing various strategies for halting what could have become a dangerous pandemic. This paper describes a process for leveraging emerging semantic web and digital library architectures and standards to (1) create a focused collection of bibliographic metadata, (2) extract semantic information, (3) convert it to the Resource Description Framework /Extensible Markup Language (RDF/XML),
and (4) integrate it so that scientific and technical responders can share and explore critical information in the collections.
systems.
Talk about Exploring the Semantic Web, and particularly Linked Data, and the Rhizomer approach. Presented August 14th 2012 at the SRI AIC Seminar Series, Menlo Park, CA
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
Tony Burdett's slides from his talk at Connected Data London. Tony is a Senior Software Engineer at The European Bioinformatics Institute. He presented the complexity of data at the EMBL-EBI and what is their solution to make sense of all this data.
Project Website: http://www.researchobject.org/
researchobjects.org is a community project that has developed an approach to describe and package up all resources used as part of an investigation as Research Objects (RO’s).
RO’s - provide two main features; a manifest - a consistent way to provide a well-typed, structured description of the resources used in an investigation; and a ‘bundle’ - a mechanism for packaging up manifests with resources as a single, publishable unit.
RO’s therefore carry the research context of an experiment - data, software, standard operating procedures (SOPs), models etc - and gather together the components of an experiment so that they are findable, accessible, interoperable and reproducible (FAIR). RO’s combine software and data into an aggregative data structure consisting of well described reconstructable parts.
RO’s have the potential to address a number of challenges pertinent to open research including: a) supporting interoperability between infrastructures by using ROs as a primary mechanism for exchange and publication b) supporting the evolution of research objects as a living collection, enabling provenance tracking c) providing the ability to pivot research object components (data, software, models) that are not restricted to the traditional publication.
Here we present work towards the development and adoption of ROs:
(i) A series of specifications and conventions, using community standards, for the RO manifest and RO bundles.
(ii) Implementations of Java, Python and Ruby APIs and tooling against those specifications;
(iii) Examples of representations of the RO models in various languages (e.g. JSON-LD, RDF, HTML).
Similar to Using SPARQL to Query BioPortal Ontologies and Metadata (20)
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Using SPARQL to Query BioPortal Ontologies and Metadata
1. Using SPARQL to Query BioPortal
Ontologies and Metadata
Manuel Salvadores, Matthew Horridge, Paul R. Alexander,
Ray W. Fergerson, Mark A. Musen, and Natasha F. Noy
Stanford Center for Biomedical Informatics Research (BMIR)
Stanford University
ISWC 2012
sparql.bioontology.org Boston, US
1
Tuesday, November 13, 12
3. The main entry point to BioPortal data are
the REST APIs.
3
Tuesday, November 13, 12
4. The main entry point to BioPortal data are
the REST APIs.
Via REST services we cannot offer answers
to queries that require fine access to the
data.
3
Tuesday, November 13, 12
5. The main entry point to BioPortal data are
the REST APIs.
Via REST services we cannot offer answers
to queries that require fine access to the
data.
SPARQL finer data access
3
Tuesday, November 13, 12
6. The main entry point to BioPortal data are
the REST APIs.
Via REST services we cannot offer answers
to queries that require fine access to the
data.
SPARQL finer data access
Challenges, opportunities and lessons
learnt with sparql.bioontology.org
3
Tuesday, November 13, 12
8. Our SPARQL endpoint is different from others
because our data are primarily ontologies
themselves and not data about individuals.
Still lessons learnt apply to other domains, we have
to deal with:
performance
scalability
heterogeneity
query articulation
5
Tuesday, November 13, 12
10. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
6
Tuesday, November 13, 12
11. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
6
Tuesday, November 13, 12
12. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
6
Tuesday, November 13, 12
13. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
6
Tuesday, November 13, 12
14. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
6
Tuesday, November 13, 12
15. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
6
Tuesday, November 13, 12
16. BioPortal Data
• Ontology Content
• OBO Format
• Rich Release Format (RRF)
• OWL
• Ontology Metadata
• Mapping Data
7
Tuesday, November 13, 12
17. BioPortal Data
• Ontology Content
• OBO Format
• Rich Release Format (RRF) RDF
• OWL
• Ontology Metadata
• Mapping Data
7
Tuesday, November 13, 12
18. BioPortal Data
• Ontology Content
• OBO Format
• Rich Release Format (RRF) RDF
• OWL
• Ontology Metadata
• Mapping Data
Triple Store
7
Tuesday, November 13, 12
19. BioPortal Data
• Ontology Content
• OBO Format
• Rich Release Format (RRF) RDF
• OWL
• Ontology Metadata SPARQL
• Mapping Data
Triple Store
7
Tuesday, November 13, 12
20. RDF - Ontology Metadata
omv:Ontology
meta:VirtualOntology version
name
date
ontology meta:hasVersion ontology
format
/1353 /46896
meta:hasVersion
(....)
ontology
meta:hasVersion /46116 meta:hasDataGraph
ontology <http://bioportal.bioontology.org/ontologies/SNOMED>
/42122
8
Tuesday, November 13, 12
21. RDF - Mappings
<http://purl.bioontology.org/mapping/2767e8e0-001b-012e-749f-005056bd0010>
maps:has_process_info <.../procinfo/2008-04-23-38138> ;
maps:comment "Manual mappings between Mouse anatomy and NCIT." ;
maps:relation skos:closeMatch ;
maps:target <http://purl.org/obo/owl/MA#MA_0001096> ;
maps:source <http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#Olfactory_Nerve> ;
maps:source_ontology_id <http://bioportal.bioontology.org/ontologies/1032> ;
maps:target_ontology_id <http://bioportal.bioontology.org/ontologies/1000> ;
a maps:One_To_One_Mapping . Mapping
<http://purl.bioontology.org/mapping/nonloom/procinfo/2008-04-23-38138>
maps:date "2008-04-23T19:21:45Z"^^xsd:dateTime ;
maps:mapping_source "Organization" ;
maps:mapping_source_contact_info "http://www.nlm.nih.gov" ;
maps:mapping_source_name "NLM" ;
maps:mapping_source_site <http://www.nlm.nih.gov> ;
maps:mapping_type "Manual" ;
maps:submitted_by 38138 .
Noy, N.F., Griffith, N., Musen, M.A.: Collecting community-based mappings in an ontology
repository. In: International Semantic Web Conference. pp. 371–386 (2008)
9
Tuesday, November 13, 12
22. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
10
Tuesday, November 13, 12
23. Common Attributes in BP Ontologies
taxonomies
preferred labels
synonyms
definitions
11
Tuesday, November 13, 12
24. Common Attributes in BP Ontologies
taxonomies rdfs:subClassOf
preferred labels
synonyms
definitions
11
Tuesday, November 13, 12
25. Common Attributes in BP Ontologies
taxonomies rdfs:subClassOf
preferred labels
synonyms rdfs:subPropertyOf
definitions
11
Tuesday, November 13, 12
26. Common Attributes in BP Ontologies
taxonomies rdfs:subClassOf
preferred labels
synonyms rdfs:subPropertyOf
definitions
12
Tuesday, November 13, 12
27. BP Taxonomies
Almost every ontology in BioPortal uses
rdfs:subClassOf to record class hierarchies.
We offer rdfs:subClassOf reasoning to collect
hierarchy closures.
13
Tuesday, November 13, 12
28. BP Taxonomies
Almost every ontology in BioPortal uses
rdfs:subClassOf to record class hierarchies.
We offer rdfs:subClassOf reasoning to collect
hierarchy closures.
backward-chain
off by default
13
Tuesday, November 13, 12
29. BP Taxonomies
2 use cases and their challenges
hierarchies with mappings
partial traversal
14
Tuesday, November 13, 12
30. hierarchies and mappings
With mappings one can continue browsing a taxonomy
beyond the boundaries of a certain ontology.
"malignant hyperthermia"
Human Disease Ontology
15
Tuesday, November 13, 12
31. hierarchies and mappings
With mappings one can continue browsing a taxonomy
beyond the boundaries of a certain ontology.
15
Tuesday, November 13, 12
32. hierarchies and mappings
With mappings one can continue browsing a taxonomy
beyond the boundaries of a certain ontology.
15
Tuesday, November 13, 12
33. hierarchies and mappings
With mappings one can continue browsing a taxonomy
beyond the boundaries of a certain ontology.
15
Tuesday, November 13, 12
34. partial traversal
Some applications need to traverse the
hierarchy for a fixed number of steps.
16
Tuesday, November 13, 12
35. partial traversal
Some applications need to traverse the
hierarchy for a fixed number of steps.
16
Tuesday, November 13, 12
36. Common Attributes in BP Ontologies
taxonomies rdfs:subClassOf
preferred labels
synonyms rdfs:subPropertyOf
definitions
17
Tuesday, November 13, 12
37. Common Attributes in BP Ontologies
taxonomies rdfs:subClassOf
preferred labels
synonyms rdfs:subPropertyOf
definitions
18
Tuesday, November 13, 12
38. BP preferred labels,
synonyms and definitions
34 ontologies record preferred labels,
synonyms and definitions using their
own predicates.
When ontology authors upload ontologies into BioPortal they have
to choose what are the predicates that represent these attributes.
19
Tuesday, November 13, 12
39. rdfs:label
SKOS
skos:prefLabel skos:altLabel skos:definition
pref. label alt. label definition user
predicates predicates predicates defined
We provide uniform access to these
proper ties by linking these different
properties to the standard SKOS properties
using rdfs:subPropertyOf.
Tuesday, November 13, 12
40. By including the
rdfs:subPropertyOf links in
“globals” we do not need to
know what property is used
in NIF-RTH to retrieve
preferred labels.
21
Tuesday, November 13, 12
41. By including the
rdfs:subPropertyOf links in
“globals” we do not need to
know what property is used
in NIF-RTH to retrieve
preferred labels.
21
Tuesday, November 13, 12
42. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
22
Tuesday, November 13, 12
43. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
22
Tuesday, November 13, 12
44. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
22
Tuesday, November 13, 12
46. obo:VO_0000001
a owl:Class ;
rdfs:label "vaccine" ;
rdfs:seeAlso "MeSH: D014612" ;
obo:IAO_0000115 "A vaccine is a processed (...) " ;
obo:IAO_0000116 "Many vaccines are developed (...) " ;
obo:IAO_0000117 "YH, BP, BS, MC, LC, XZ, RS" ;
rdfs:subClassOf obo:OBI_0000047 ;
owl:equivalentClass [
a owl:Class ;
owl:intersectionOf (obo:OBI_0000047
[ Example of a
a owl:Restriction ;
owl:onProperty obo:BFO_0000085 ; relatively complex
owl:someValuesFrom [
a owl:Class ; OWL construction
owl:intersectionOf (obo:VO_0000278
[
a owl:Restriction ;
from the Vaccine
owl:onProperty obo:BFO_0000054 ;
owl:someValuesFrom obo:VO_0000494
Ontology
]
)
]
]
[
a owl:Restriction ;
owl:onProperty obo:OBI_0000312 ;
owl:someValuesFrom obo:VO_0000590
]
)
].
24
Tuesday, November 13, 12
47. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
25
Tuesday, November 13, 12
48. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
25
Tuesday, November 13, 12
49. Challenges, opportunities and lessons learn with
sparql.bioontology.org
1. Retrieval of common attributes and how simple
reasoning can help.
2. Complexity of query articulation when targeting
OWL complex constructions.
3. Best practices in using an open shared endpoint:
I. Selective queries work better.
II. Careful with large result sets. Paginate.
III. How the client reads matter.
25
Tuesday, November 13, 12
50. Best practices in using a
shared SPARQL endpoint
selective queries work better
control size of result sets - pagination
how clients read matters
26
Tuesday, November 13, 12
54. control size of result sets - pagination
28
Tuesday, November 13, 12
55. control size of result sets - pagination
while len(results) == LIMIT
OFFSET += LIMIT
28
Tuesday, November 13, 12
56. Use libraries that parse the
result set on demand
how clients read matters
60
retrieval of all preferred labels from
NCBI Taxonomy (500K solutions)
45
130.0
30
97.5
65.0
15
32.5
0
0 JSON+Python JSON+CJSON XML+Jena ARQ XML+Sesame
XML JSON
output size in MB parsing time in seconds
29
Tuesday, November 13, 12
57. Undefined 1 (2009) 1–5 1
IOS Press
Using SPARQL to Query BioPortal
Ontologies and Metadata
BioPortal as a Dataset of Linked Biomedical
Ontologies and Terminologies in RDF.
Manuel Salvadores, Matthew Horridge, Paul R. Alexander,
Ray W. Fergerson, Mark A. Musen, and Natalya F. Noy Manuel Salvadores, a,⇤ Paul R. Alexander, a Mark A. Musen a and Natalya F. Noy a
a
Stanford Center for Biomedical Informatics Research
Stanford Center for Biomedical Informatics Research Stanford University, US
Stanford University, US E-mail: {manuelso, palexander, musen, noy}@stanford.edu,
{manuelso,matthew.horridge,palexander,ray.fergerson,musen,noy}@stanford.
edu
Abstract. BioPortal is a repository of biomedical ontologies—the largest such repository, with more than 300 ontologies to
Abstract. BioPortal is a repository of biomedical ontologies—the largest date. This set includes ontologies that were developed in OWL, OBO and other formats, as well as a large number of medical
such repository, with more than 300 ontologies to date. This set includes terminologies that the US National Library of Medicine distributes in its own proprietary format. We have published the RDF
ontologies that were developed in OWL, OBO and other languages, as version of all these ontologies at http://sparql.bioontology.org. This dataset contains 190M triples, representing
well as a large number of medical terminologies that the US National Li- both metadata and content for the 300 ontologies. We use the metadata that the ontology authors provide and simple RDFS
brary of Medicine distributes in its own proprietary format. We have pub- reasoning in order to provide dataset users with uniform access to key properties of the ontologies, such as lexical properties for
lished the RDF based serializations of all these ontologies and their meta- the class names and provenance data. The dataset also contains 9.8M cross-ontology mappings of different types, generated both
data at sparql.bioontology.org. This dataset contains 203M triples, manually and automatically, which come with their own metadata.
representing both content and metadata for the 300+ ontologies; and 9M Keywords: biomedical ontologies, BioPortal, RDF, linked data
mappings between terms. This endpoint can be queried with SPARQL
which opens new usage scenarios for the biomedical domain. This paper
presents lessons learned from having redesigned several applications that
today use this SPARQL endpoint to consume ontological data. 1. Introduction numerous requests from users for the SPARQL end-
In our laboratory, we have developed BioPortal, a point, which would enable them to query and analyze
community-based ontology repository for biomedical the data in much more precise and application-specific
Keywords: Ontologies, SPARQL, RDF, Biomedical, Linked Data
ontologies [20,1]. Users can publish their ontologies ways than our set of REST APIs allowed.
to BioPortal, submit new versions, browse the ontolo- This paper describes the Linked Data aspects of the
1 SPARQL In Use In BioPortal: gies, and access the ontologies and their components BioPortal’s ecosystem and the structure of our linked
through a set of REST services, SPARQL and de- datasets in RDF. In addition, we describe the process
Overview of Opportunities and Challenges that we used to transform different ontology formats
referenceable URIs.
Over the past four years, as BioPortal grew in popu- into RDF and the mappings between ontologies. We
Ontology repositories act as a gateway for users who need to find ontologies for larity, research institutions and corporations have used describe several issues with using the shared SPARQL
their applications. Ontology developers submit their ontologies to these reposi- our REST APIs extensively. The use of the REST ser- endpoint elsewhere [10]. This discussion includes the
tories in order to promote their vocabularies and to encourage inter-operation. vices has experienced outstanding growth in 2011. The details on retrieving common attributes from multi-
In biomedicine, cultural heritage, and other domains, many of the ontologies and average number of hits per month grew from 3M hits ple ontologies, articulating complex queries, and the
vocabularies are extremely large, with tens of thousands of classes. in 2010 to 122M hits in 2011.Our users have incorpo- lessons that we have learned on the best practices of
In our laboratory, we have developed BioPortal, a community-based ontology rated these services in applications that perform drug using a shared SPARQL endpoint.
repository for biomedical ontologies [11]. Users can publish their ontologies to surveillance, gene annotation, enrichment and clas- 2. Biomedical Ontologies in BioPortal
BioPortal, submit new versions, browse the ontologies, and access the ontologies sification of scientific literature, and other tasks. In
December 2011, we released a public SPARQL end- Researchers and practitioners in the Semantic Web
and their components through a set of REST services. BioPortal provides search normally deal with two types of data: (1) ontologies,
across all ontologies in its collection, a repository of automatically and manually point, http://sparql.bioontology.org, to
provide direct access to our datasets in RDF. We had vocabularies or TBoxes; and (2) instance data or sim-
generated mappings between classes in di↵erent ontologies, ontology reviews, ply data. It is important to clarify that BioPortal’s con-
new term requests, and discussions generated by the ontology users in the com- tent is almost exclusively ontologies and related arti-
munity. BioPortal contains metadata about each ontology and its versions as * Corresponding author. E-mail: manuelso@stanford.edu. facts. By contrast, most other datasets of the Linked
well as mappings between terms in di↵erent ontologies.
0000-0000/09/$00.00 c 2009 – IOS Press and the authors. All rights reserved
30
Tuesday, November 13, 12
58. Conclusions
• Our use of SPARQL is different from many other use cases
because our data are primarily ontologies themselves and
not data about individuals.
• SPARQL and a small amount of reasoning can be
particularly powerful in providing easy access to common
attributes.
• Exposing OWL through a SPARQL endpoint poses a
number of challenges.
• There are challenges in running a shared open SPARQL
endpoint. We can overcome these challenges if we
encourage developers to conform to a set of simple best
practices.
31
Tuesday, November 13, 12
59. Thank you
Questions
32
Tuesday, November 13, 12