Converting GHO to RDF
Upcoming SlideShare
Loading in...5

Converting GHO to RDF






Total Views
Slideshare-icon Views on SlideShare
Embed Views



1 Embed 1 1


Upload Details

Uploaded via as Adobe PDF

Usage Rights

CC Attribution-ShareAlike LicenseCC Attribution-ShareAlike License

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
Post Comment
Edit your comment

    Converting GHO to RDF Converting GHO to RDF Presentation Transcript

    • Converting WHO’s GlobalHealth Observatory Data to RDF Amrapali Zaveri, PhD student August  27,  2012 1
    • Outline• Background• What is the RDF Data Cube Vocabulary?• Semi-automated approach• OntoWikis CSVImport plug-in• RDFized GHO data• Limitations and Future Work 2
    • Background• Biomedical statistical data • Published as Excel sheets• Advantage • Readable by humans• Disadvantages • Cannot be queried efficiently • Difficult to integrate with other data (in different formats)• Our approach • Converting data into a single data model - RDF • Using the RDF Data Cube Vocabulary* • designed particularly to represent multidimensional statistical data using RDF. * 3
    • What is the RDF Data CubeVocabulary? 4
    • What is the RDF Data CubeVocabulary? • Dimensions • Attributes • Measures • Observations 5
    • Semi-automated approach• Transforming CSV to RDF in a fully automated way is not feasible. • Dimensions may often be encoded in heading or label of a sheet• Our semi-automatic approach: • As a plug-in in OntoWiki# • a semantic collaboration platform developed by the AKSW research group. • A CSV file is converted into RDF using the RDF Data Cube Vocabulary: Stats2RDF # Sören Auer, Sebastian Tramp (geb. Dietzold), Jens Lehmann, and Thomas Riechert: OntoWiki: A Tool for Social Semantic Collaboration In: Proceedings of the Workshop on Social and Collaborative Construction of Structured Knowledge CKC 2007 at the 16th International World Wide Web Conference WWW2007 Banff, Canada, May 8, 2007 6
    • 1. Create Knowledge Base 7
    • 2. Import a CSV file 8
    • 3. Define dimensions 9
    • 4. Define data range 10
    • 5. Save template, extract triples 11
    • 6. Re-use template for similar files 12
    • 7. View resources 13
    • RDFized GHO datagho:Country rdfs:subClassOf qb:DimensionProperty; rdf:type rdfs:Class; rdfs:label "Country" .gho:Disease rdfs:subClassOf qb:DimensionProperty; rdf:type rdfs:Class; rdfs:label "Disease" .gho: Afghanistan rdf:type ex:Country; rdfs:label "Afghanistan" .gho:Tuberculosis rdf:type ex:Disease; rdfs:label "Tuberculosis" .gho:c1-r6 rdf:type qb:Observation; rdf:value "127"^^xsd:integer; qb:dimension gho:Afghanistan; qb:dimension gho:Tuberculosis . 14
    • RDFized GHO data• Available at• 50 datasets• ~ 8 million triples• Paper published at SWJ Call for Dataset descriptions: 15
    • Limitations and Future Work • Conversion • Coherence • Temporal Comparability • Exploring GHO 16
    • Thank You! Questions? 17