World bank 2011-05
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

World bank 2011-05

on

  • 894 views

Presentation at Worldbank

Presentation at Worldbank

Statistics

Views

Total Views
894
Views on SlideShare
894
Embed Views
0

Actions

Likes
0
Downloads
12
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • the chart on the homepage representing the distribution of services across "service types" (http://ring.ciard.net) (implemented with support from John Fereira); the geographic map on the homepage representing the geographic distribution of services;
  • a first attempt to provide some aggregated data on the number of contents / resources potentially reachable through the services registered in the RING: http://ring.ciard.net/totals
  • Whatdoesthismean in practice? I will show thiswithanexamplefrom the BBC. The biggestconsumers (and producers) of LOD are as I know the BBC and the New York times (Butnowalso the US government)
  • During the Web 1.0 phase, Webpageswerecomposedbyhumans. Todaymostwebpages are drivenbydatabasesthat can bedynamicallyqueried. Theycontainthrough RSS feedsalso data fromotherwebsitesThis BBC webpageis a big jumpfurther. I hasnotbeencomposedbyhumans and itisnotfromone database generated. Itisgeneratedfromdifferentdatasourcesthatwerepresentaslinked open data, linkedonlythrough common URIs
  • Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
  • Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
  • Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
  • Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
  • How does this work: A resource is connected with each concept URI in the web. The concepts between three vocabularies are having same literal which is connected with owl:sameAS/exactMatch relationship. As we are speakingaboutthesauri and notontologieswekept the relation tobechosenpurposelyvague. The conceptscouldbematchedwithowl:sameAS or the termscouldbematcheswith SKOS:exactMatch. A lotofdiscussion on thisisongoing
  • The mainintegrationworksthroughcommonsemanticsCore ofagINFRAtechnologyisaLODstoreofsharedencodedknowledgeorganizationsystemsan automaticmarkupto link structuredandunstructureddatasourcesthroughthissharedKnowledgeOrganizationsystemsSharing withinthe R.I.N.G.Partner registertheirservices, notechnicallimitationLOD – Wrapper for all participatingInstitutionsFor all registered services a „triplificationwrapper“ will besetupThe triplifierworkswith „agConceptsandagIdentities“ tocreatelinkeddataSteadilygrowing LOD ecosystemThe agINFRA LOD ecosystemoffers Webservices forthewww
  • http://193.190.8.15/agri3/user: agri@oceanpassword: agrihttp://aims.fao.org/lode/bd/subjecthttp://202.73.13.50:58301/AgroTagger/
  • Note: we identified outlinks to RAMAEU and GEMET, and they have taken them as inlinks to their own thesaurus.
  • - All links are checked by a domain expert.
  • Oneof the groundbreakingenterprises in this area isThomsonReuters “Open Calais”. Thisis a webservicethatprovidessemanticmark up foranyunstructured text thatyoufeedintotheir service The service is free ofCharge. Why? I will show youlater.
  • My team in collaborationwith the IndianInstituteofTechnology in Kanpur isdeveloping a similar service foroursubject area.
  • Wehavehere a text from 1964 without a bibliographic record at handabout a plantprotectionissue
  • Open Calais isverygood in thoseareas, in whichtheyhavetheirownelaboratedconceptschemeagainstwhich the texts are analyzed: “Places”, “Persons”, “Business Processes” , “IndustryTerms”, butitisweak in the specifictopicanalysis, whattheycall “social tags”
  • AgroTaggerstilllacksmanyof the sophisticated featuresof “Open Calais” ,butismuch, muchbetter in the subjectanalysisof the text

World bank 2011-05 Presentation Transcript

  • 1. The CIARD (Coherence in Information for Agricultural Research for development) initiative and a global infrastructure for linked open data
    Dr. Johannes Keizer
    Office ofKnowledge Exchange, Research and Extension
    Food andAgricultureOrganizationofthe UN
    Talk atWorldbank, 2011, May 17
  • 2. We will promote research for food and agriculture, including research to adapt to, and mitigate climate change, and access to research results and technologies at national, regional and international levels.
    We will reinvigorate national research systems and will share information and best practices.
    We will improve access to knowledge.
    worldfoodsummit 2009
  • 3. FAO has been engaged for decades in making agricultural development information more easily accessible and sharable among it's stakeholders.  These efforts reach back to the early 70s when FAO set up the AGRIS program.  Since the advent of the Internet the AIMS team at FAO HQ is working to make distributed data and information repositories interoperable. This work has been backed up on the institutional level by the CIARD  (Coherence in Information for Agricultural Research for Development) initiative, in which FAO, GFAR, the CGIAR and many national partners collaborate. Technically FAO has underpinned this with the further development of the Agricultural Thesaurus AGROVOC and with initiatives on shared metadata sets (AGRIS AP) and ontologies. The paradigm and technology of linked open data, proposed by Tim Berners Lee some years ago, now provides a practical possibility to apply standard vocabularies and semantics to link distributed data that is published in a non proprietary format. The presentation will show the CIARD RING,  ("routemap to information nodes and gateways"), demonstrate the AGROVOC LOD, will talk about the use of LOD in federating document repositories and will outline an Infrastructure for Information interoperability in Agricultural research and innovation
  • 4. http://www.ciard.net
  • 5. The Community
    and growing…..
    Founding Partners
  • 6. The Vision and Manifesto
    “To make public domain agricultural research information and knowledge truly accessible to all”
    • All organizations that create and possess public agricultural research information disseminate and share it more widely
    • 7. CIARD partners will (a) coordinate their efforts, (b) promote common formats, (c) adopt open systems
    • 8. Create a global network of public collections of information
  • Coherence in Information
    forAgriculturalResearchforDevelopment
    A new global movement to provide a platform for coherence
    between information-related initiatives
    to make public domain agricultural research information and knowledge
    truly accessible to all
    2009
    2007
    2008
    2005
    2010
    2012
    2011
    1st IISAST Consultation
    TASK FORCES
    CIARD Initiative launched
    (15 founding partners)
    Regional Consultations
    70 countries 150 info prof.
    2nd IISAST Consultation
    GCARD 2012
    e-Consultation & Beijing Consultation
    + Regional Workshops
    CIARD endorsed (GCARD and FARA)
    +112 partners and growing…
  • 9. Contribution and Participation in Science
    Territory size shows proportion of scientific papers published in 2001 by authors living there.
    Copyright SASI Group (University of Sheffield) and Mark Newman (University of Michigan)
  • 10. RING - Charts and numbers
  • 11. RING – Numbers
    http://ring.ciard.net/totals
    Number of documents potentially reachable through the services registered in the RING.
    Types of service considered: document repositories and bibliographic databases.
  • 12. Information InfrastructureforAgriculturalResearch and Innovation
  • 13. DistributedRepositories
    • stats
    • 14. gene banks
    • 15. gis data
    • 16. blogs,
    • 17. journals
    • 18. open archives
    • 19. raw data
    • 20. technologies
    • 21. learningobjects
    • 22. ………..
  • Problem 1: makingservices
    ?
    ?
    ?
  • 23. Problem 2: gettingknowledge
    ?
    ?
    ?
  • 24.
  • 25. Example: BBC Wildlife Finder
  • 26. Humboldt Squid page, pulled together from a diversity of Linked Data sources
    BBC TV Documentary
    BBC News item
    Wikipedia
    Animal Diversity Web:Nocturnal way of life
  • 27.
    • http://www.w3.org/2007/Talks/0221-Bangalore-IH/
    RDF as a common format for merging data
  • 28. The role of vocabularies in linking data sets
  • 29.
  • 30. http://aims.fao.org/aos/agrovoc/c_7825
  • 31. http://eurovoc.europa.eu/218754
    http://aims.fao.org/aos/agrovoc/c_7825
  • 32. http://eurovoc.europa.eu/218754
    http://aims.fao.org/aos/agrovoc/c_7825
  • 33. http://eurovoc.europa.eu/218754
    http://agclass.nal.usda.gov/nalt/2011.xml#1780
    http://aims.fao.org/aos/agrovoc/c_7825
  • 34. Linking data through common URIs
    TOXIC SUBSTANCES
    http://www.agnic.org/search/CAT85822953
    UNBIS
    AGROVOC
    NALT
    http://aims.fao.org/aos/agrovoc/c_7825
    http://agclass.nal.usda.gov/nalt/2011.xml#1780
    http://eurovoc.europa.eu/218754
    Eurovoc
    http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026
    http://unbisnet.un.org:8080/ipac20/ipac.jsp?session=128F308557F34.283092&profile=bib&uri=full=3100001~!685149~!1&ri=1&aspect=subtab124&menu=search&source=~!horizon
    http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF
    http://aims.fao.org/aos/agrovoc/c_12332 owl:sameAshttp://eurovoc.europa.eu/219871
    skos: exact match UNBIS: Toxic Substances
  • 35. If all institutions, which publish about toxic wastes would:
    • - Index their publications with URIs from AGROVOC,GEMET, NALT, LCSH or EUROVOC
    • 36. (many do – low hanging fruit!)
    • 37. - Publish their metadata as LOD
    • 38. (quite easy to do, bibData map well to RDF
    Then
    Everyone who knows to write SparqlQeries could get all these publications with one shot for a new website on toxic wastes
  • 39. Vocabularies and LOD
    Simply publishing your data as RDF does not link them to other data sets 
    Creating this links by humans is interesting in detail, but unrealistic as mass processing
    Linking 2 standard vocabularies can link 200 datasets which use these standard vocabularies
  • 40. RING
    routemapto information nodes and gateways
    VocBench
    concepts and entitiesreferencetriples
    Cloud
    storagefor RDF data triples
    Tools
    LOD
    enabled software
    LOD Generator
    triplifier,
    concept and entityidentifier
    Data Services
    Webservices + APIsto triple stores
    agINFRA - the elements
  • 41. http://aims.fao.org
  • 42. ….views into the construction site
    VocBench
    AGROVOC LOD on VocBench 1.1
    LOD Generator
    Do you know openCalais?
    AgroTagger Testing Site
    LODE-BD
    The RING: http://ring.ciard.net
    Tools
    AgriDrupal
    AgriOceanDspace : http://193.190.8.15/agri3/
  • 43. AGROVOC
  • 44. AGROVOC
    A multilingual agricultural vocabulary organized as concept scheme in 20 languages
    Covers agriculture, forestry, fisheries and related themes (food security, land use, environment, etc.)
    Organized in sub-vocabularies, e.g. chemicals, fisheries terms, scientific/common names of organisms
    Maintained by a global community (e.g. librarians, terminologists, information managers) using VocBench
  • 45. AGROVOC - Statistics
  • 46. Top concepts
  • 47. Relationships (examples)
  • 48.
  • 49.
  • 50.
  • 51.
  • 52. Thesauri into the AGROVOC LOD Cloud
    • 18000 outlinks
    • 53. 2000 inlinks
    EUROVOC
    NALT
    AGROVOC
    RAMEAU
    GEMET
    STW
    LCSH
  • 54. AGROVOC Links after 3 weeks LOD
    Outlinks:
    GEMET-AGROVOC 1,198
    RAMEAU-AGROVOC  :700
    Total Outlinks: 1898
    Inlinks:
    AGROVOC-EUROVOC:1,297
    AGROVOC-GEMET:1,198
    AGROVOC-LCSH :1,093
    AGROVOC-NAL: 13,390
    AGROVOC-STW:1136
    AGROVOC-RAMEAU:700
    Total Inlinks:18,814
  • 55. Europe:(It is better to use this example during the presentation)http://aims.fao.org/aos/agrovoc/c_2724From the Top concept:Ref:  http://aims.fao.org/aos/agrovoc/c_7644Vocbench (Production)Ref:   http://agrovoc.mimos.my/vocbenchv1.1i/VocBench(Sandbox)Ref:http://agrovoc.mimos.my/vocbenchv1.1i/
  • 56. The VocBench
  • 57. The VocBench
    VocBench
    concepts and entitiestriples
  • 58. VocBench Features
    • Domain independent
    • 59. Structure independent (i.e. thesauri, Glossaries, etc)
    • 60. Supports RDF (SKOS, SKOS-XL), OWL
    • 61. Supports collaborative editing
    • 62. Supports editorial workflow, with user roles
    • 63. Simple and advanced search
    • 64. Supports data export: SKOS, Relational format (MySQL)
  • 65. AgroTagger
    And
    OpenCalais
  • 66.
  • 67.
    • Does Concept identification in unstructured texts
    • 68. Uses Agrovoc as a controlled vocabulary
    • 69. Prototype under testing with excellent results (entire repository of ICARDA indexed)
    • 70. Will produce in future Structured RDF files that can be used to link data like “open Calais”
    AgroTagger
  • 71.
  • 72.
  • 73.
  • 74. Thank You!
    http://www.ciard.net
    http://ring.ciard.net
    http://aims.fao.org
    http://agris.fao.org