Enrich, Link, Search

The lean approach for advanced search applications over linked data. Presentation of Spinque at the Semantics conference 2015, http://semantics.cc/

Published in: Data & Analytics

  1. 1. ENRICH > LINK > SEARCH: The lean approach for advanced search applications over linked data. Michiel Hildebrand, Semantics Conference, Vienna 2015
  2. 2. 2
  3. 3. Do you see value in open data?
  4. 4. Do you think that open data could improve the access to your own data?
  5. 5. Have you integrated open data with your own data?
  6. 6. Have you created an application on top of your integrated data?
  7. 7. The billion $ Open Data example
  8. 8. Cultural Heritage: advanced access through (Open) Data. Multi-lingual, location-based, recommendation, personalization, advanced ranking, analytics. http://www.getty.edu/research/tools/vocabularies/aat/
  9. 9. Cultural Heritage: advanced access through (Open) Data. Multi-lingual, location-based, recommendation, personalization, advanced ranking, analytics. http://www.vistory.nl/
  10. 10. Cultural Heritage: advanced access through (Open) Data. Multi-lingual, location-based, recommendation, personalization, advanced ranking, analytics. Query logs, content-based.
  11. 11. Cultural Heritage: advanced access through (Open) Data. Multi-lingual, location-based, recommendation, personalization, advanced ranking, analytics. http://manovich.net/
  12. 12. Historic newsreels and photographs
  13. 13. Demo: Linked Open Images. http://link.spinque.com/openbeelden
  14. 14. Can we build this in a day?
  15. 15. Factory metaphor. PUSH (make to stock): output and efficiency oriented; exact needs of user secondary. PULL (make to order): user needs oriented; production costly.
  16. 16. How can we reduce the time and cost? Data factory. PUSH: make to stock. PULL: make to order. How good is the data for your application?
  17. 17. The lean approach. Your data; Integrate; Access; Deploy API; Enrich.
  18. 18. Open Data Node platform (http://opendatanode.org/). Methodology for publishing Open Data (http://www.comsode.eu/index.php/deliverables/). Moving from one-off to sustainable data publishing. http://unifiedviews.eu/
  19. 19. Key requirements for integration step: sustainable; quality control. Your data; Integrate; Access; Deploy API; Enrich.
  20. 20. Integrating historic newsreels with photographs. GTAA thesaurus (SKOS); NIOD subject terms (SKOS).
  21. 21. Preferred label = preferred label. NIOD subject terms (preferred label): antisemitisme, spionage, amnestie, ... GTAA thesaurus (preferred label): antisemitisme, spionage, amnestie, ...
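The exact-match step on this slide can be sketched in a few lines of Python. This is only an illustration, not the Spinque LINK implementation: the concept identifiers (`niod:1`, `gtaa:10`, ...) and the function names are invented for the example, and only the labels come from the slide.

```python
from typing import Dict, List, Tuple

# Hypothetical sample data: concept id -> preferred label.
# The ids are made up; the labels are the ones shown on the slide.
niod = {"niod:1": "antisemitisme", "niod:2": "spionage", "niod:3": "amnestie"}
gtaa = {
    "gtaa:10": "antisemitisme",
    "gtaa:11": "spionage",
    "gtaa:12": "amnestie",
    "gtaa:13": "parades",
}


def normalize(label: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not block a match."""
    return " ".join(label.lower().split())


def match_pref_labels(source: Dict[str, str], target: Dict[str, str]) -> List[Tuple[str, str]]:
    """Link every source concept to the target concepts with the same preferred label."""
    index: Dict[str, List[str]] = {}
    for target_id, label in target.items():
        index.setdefault(normalize(label), []).append(target_id)
    links = []
    for source_id, label in source.items():
        for target_id in index.get(normalize(label), []):
            links.append((source_id, target_id))
    return links


links = match_pref_labels(niod, gtaa)
for source_id, target_id in links:
    print(source_id, "->", target_id)
```

Building an index on the target labels first keeps the matching linear in the size of both vocabularies, which matters once the thesauri grow beyond toy size.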
  22. 22. Preferred label = alternative label: introduces ambiguity. NIOD subject terms (preferred label): agenten, parades. GTAA thesaurus (preferred label / alternative label): politieagenten / agenten; militaire parades / parades; optochten / parades.
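A small sketch of why matching on alternative labels introduces ambiguity, using the labels from this slide. The data layout and function name are assumptions made for the example; a source term that matches more than one target concept is flagged for review rather than linked automatically.

```python
from typing import Dict, List

# GTAA preferred label -> alternative labels (values taken from the slide).
gtaa_alt_labels = {
    "politieagenten": ["agenten"],
    "militaire parades": ["parades"],
    "optochten": ["parades"],
}
niod_pref_labels = ["agenten", "parades"]


def candidates_by_alt_label(terms: List[str], target_alt: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """For each source term, list the target concepts that carry it as an alternative label."""
    index: Dict[str, List[str]] = {}
    for pref, alts in target_alt.items():
        for alt in alts:
            index.setdefault(alt.lower(), []).append(pref)
    return {term: index.get(term.lower(), []) for term in terms}


candidates = candidates_by_alt_label(niod_pref_labels, gtaa_alt_labels)
# A term with more than one candidate is ambiguous and needs a further decision.
ambiguous = {term: found for term, found in candidates.items() if len(found) > 1}
print(candidates)
print(ambiguous)
```

Here `agenten` has a single candidate, while `parades` maps to both `militaire parades` and `optochten`.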
  23. 23. Singular label = plural label (stemming): introduces errors. NIOD subject terms (preferred label): dodenherdenking, hamsteren. GTAA thesaurus (preferred label): dodenherdenkingen, hamsters.
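The stemming pitfall on this slide can be reproduced with a deliberately naive suffix stripper. The `naive_stem` function below is a toy written for this example, not a real Dutch stemmer and not what Spinque LINK uses. It correctly joins `dodenherdenking` with its plural, but it also joins `hamsteren` (to hoard) with `hamsters` (the animals), which is the kind of error the slide warns about.

```python
from typing import Dict, List, Tuple


def naive_stem(word: str) -> str:
    """Toy stemmer (an assumption for illustration): strip a trailing 'en' or 's'."""
    w = word.lower()
    for suffix in ("en", "s"):
        # Keep at least three characters so very short words are left alone.
        if w.endswith(suffix) and len(w) - len(suffix) >= 3:
            return w[: len(w) - len(suffix)]
    return w


niod_labels = ["dodenherdenking", "hamsteren"]
gtaa_labels = ["dodenherdenkingen", "hamsters"]


def match_on_stem(source: List[str], target: List[str]) -> List[Tuple[str, str]]:
    """Pair labels whose stems are identical."""
    index: Dict[str, List[str]] = {}
    for label in target:
        index.setdefault(naive_stem(label), []).append(label)
    return [(s, t) for s in source for t in index.get(naive_stem(s), [])]


pairs = match_on_stem(niod_labels, gtaa_labels)
for source_label, target_label in pairs:
    print(source_label, "~", target_label)
```

Both pairs are returned, so a stemming step is best followed by a manual check of the resulting subset.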
  24. 24. Subject ≠ location (noise): filter sources. NIOD subject terms (preferred label): dieren, graven. GTAA thesaurus (preferred label / concept scheme): dieren / subject terms; dieren / geographical names; graven / subject terms; grave / geographical names.
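Filtering the target vocabulary by concept scheme before matching can be sketched as follows. The identifiers and the `allowed_schemes` parameter are hypothetical; the labels and scheme names are those listed on the slide.

```python
from typing import List, Optional, Set, Tuple

# (concept id, preferred label, concept scheme); ids are invented for the example.
gtaa = [
    ("gtaa:1", "dieren", "subject terms"),
    ("gtaa:2", "dieren", "geographical names"),
    ("gtaa:3", "graven", "subject terms"),
    ("gtaa:4", "grave", "geographical names"),
]
niod_labels = ["dieren", "graven"]


def match_labels(
    source: List[str],
    target: List[Tuple[str, str, str]],
    allowed_schemes: Optional[Set[str]] = None,
) -> List[Tuple[str, str]]:
    """Exact label match, optionally restricted to target concepts in the given schemes."""
    links = []
    for label in source:
        for concept_id, pref, scheme in target:
            if allowed_schemes is not None and scheme not in allowed_schemes:
                continue  # skip concepts from schemes we filtered out
            if pref.lower() == label.lower():
                links.append((label, concept_id))
    return links


unfiltered = match_labels(niod_labels, gtaa)
filtered = match_labels(niod_labels, gtaa, allowed_schemes={"subject terms"})
print(unfiltered)
print(filtered)
```

Without the filter, the subject `dieren` is also linked to the geographical name `dieren`; restricting the target to subject terms removes that noise.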
  25. 25. Other alignment techniques: fuzzy string matching; join matches on multiple attributes; similarity in the hierarchy (skos:broader); select best candidate (most generic/specific term); ...
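As one possible reading of "fuzzy string matching", the sketch below uses Python's standard `difflib` module to pick the closest target label above a similarity cutoff. The cutoff of 0.8 and the helper name `best_fuzzy_match` are assumptions chosen for the example, not values from the talk.

```python
from difflib import SequenceMatcher, get_close_matches
from typing import List, Optional, Tuple

gtaa_labels = ["dodenherdenkingen", "hamsters", "spionage"]


def best_fuzzy_match(label: str, candidates: List[str], cutoff: float = 0.8) -> Optional[Tuple[str, float]]:
    """Return the most similar candidate and its similarity ratio, or None below the cutoff."""
    matches = get_close_matches(label.lower(), candidates, n=1, cutoff=cutoff)
    if not matches:
        return None
    best = matches[0]
    score = SequenceMatcher(None, label.lower(), best).ratio()
    return best, score


print(best_fuzzy_match("dodenherdenking", gtaa_labels))
print(best_fuzzy_match("xyz", gtaa_labels))
```

A higher cutoff trades recall for precision, so in practice the threshold would be tuned by inspecting the matched subset.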
  26. 26. Demo Spinque LINK. http://cultuurlink.beeldengeluid.nl/app/#/tutorial/tutorial_niod_start
  27. 27. Key requirements integration step checked. Quality control: • Model link strategy out of (simple) building blocks • Iterative process (trial and error) • Exploration of the source data • Direct access to the results • Evaluate the subsets. Sustainable: • Export links and link strategy • Provenance of the process is explicit in the strategy • Rerun after update of datasets.
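The idea of a link strategy built from simple blocks, with provenance recorded and the option to rerun, might look roughly like this. Everything here is a hypothetical sketch: the block functions, the `run_strategy` helper, and the JSON export layout are invented for illustration and do not describe the Spinque LINK or CultuurLINK export format.

```python
import json
from typing import Callable, Dict, List, Tuple

Block = Callable[[Dict[str, str], Dict[str, str]], List[Tuple[str, str]]]


def exact_block(source: Dict[str, str], target: Dict[str, str]) -> List[Tuple[str, str]]:
    """Link concepts whose labels are identical (case-insensitive)."""
    index: Dict[str, List[str]] = {}
    for tid, label in target.items():
        index.setdefault(label.lower(), []).append(tid)
    return [(sid, tid) for sid, label in source.items() for tid in index.get(label.lower(), [])]


def prefix_block(source: Dict[str, str], target: Dict[str, str]) -> List[Tuple[str, str]]:
    """Toy fallback: link when one label is a prefix of the other."""
    return [
        (sid, tid)
        for sid, s in source.items()
        for tid, t in target.items()
        if s.lower().startswith(t.lower()) or t.lower().startswith(s.lower())
    ]


def run_strategy(steps: List[Tuple[str, Block]], source: Dict[str, str], target: Dict[str, str]) -> List[dict]:
    """Run the blocks in order; each link records which step produced it (provenance)."""
    links: List[dict] = []
    remaining = dict(source)
    for name, block in steps:
        found = block(remaining, target)
        links.extend({"source": s, "target": t, "step": name} for s, t in found)
        matched = {s for s, _ in found}
        # Only unmatched source concepts are passed on to the next block.
        remaining = {k: v for k, v in remaining.items() if k not in matched}
    return links


strategy = [("exact", exact_block), ("prefix", prefix_block)]
niod = {"niod:1": "spionage", "niod:2": "dodenherdenking"}
gtaa = {"gtaa:1": "spionage", "gtaa:2": "dodenherdenkingen"}

links = run_strategy(strategy, niod, gtaa)
# Export the strategy (as step names) together with the links it produced.
export = json.dumps({"strategy": [name for name, _ in strategy], "links": links}, indent=2)
print(export)
```

Because the strategy is just an ordered list of blocks, it can be rerun unchanged after either dataset is updated.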
  28. 28. Dutch National Strategy Digital Heritage
  29. 29. CultuurLINK: a free service for the cultural heritage domain. http://cultuurlink.beeldengeluid.nl/
  30. 30. Rijksmuseum Amsterdam: integrated multilingual vocabularies. http://www.rijksmuseum.nl/nl/collectie/BK-NM-1010 http://www.getty.edu/research/tools/vocabularies/aat/
  31. 31. Key requirements for access step: model complex access (search); combine graph queries and ranking. Your data; Integrate; Access; Deploy API; Enrich.
  32. 32. Already three types of search in a simple app: keyword search, location-based search, recommendation.
  33. 33. Advanced search applications with Spinque. Multilingual, location-based, recommendation, personalization, ranking, analytics. Probabilistic Graph Database; Building blocks (SPINQL); Search by Strategy.
  34. 34. Demo Spinque Search
  35. 35. Key requirements access step checked. Model complex search problems: • Search strategy out of (simple) building blocks • No programming required. Combine graph queries and ranking: • Integrated triple store and search index • Probabilistic graph database • Building blocks for graph queries • Building blocks for search and ranking.
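To make "combine graph queries and ranking" concrete, here is a minimal sketch of scores flowing along weighted graph edges. It is not SPINQL and makes no claim about how Spinque's probabilistic graph database works internally: the edge list, the probabilities, the Jaccard keyword score, and all names are assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (subject, predicate, object, probability); hypothetical data.
edges = [
    ("item:1", "subject", "concept:parades", 1.0),
    ("item:2", "subject", "concept:parades", 0.6),
    ("item:2", "subject", "concept:agenten", 1.0),
]
labels = {"concept:parades": "parades", "concept:agenten": "agenten"}


def keyword_scores(query: str, concept_labels: Dict[str, str]) -> Dict[str, float]:
    """Ranking step: score each concept by word overlap (Jaccard) with the query."""
    q = set(query.lower().split())
    scores = {}
    for concept_id, label in concept_labels.items():
        words = set(label.lower().split())
        overlap = len(q & words)
        if overlap:
            scores[concept_id] = overlap / len(q | words)
    return scores


def rank_items(query: str) -> List[Tuple[str, float]]:
    """Graph step: push concept scores over 'subject' edges and sum them per item."""
    concept_scores = keyword_scores(query, labels)
    item_scores: Dict[str, float] = defaultdict(float)
    for subj, pred, obj, prob in edges:
        if pred == "subject" and obj in concept_scores:
            item_scores[subj] += concept_scores[obj] * prob
    return sorted(item_scores.items(), key=lambda kv: kv[1], reverse=True)


print(rank_items("parades"))
print(rank_items("agenten parades"))
```

The same pattern extends to longer paths: each traversal step multiplies in an edge weight, so uncertain links lower the rank of an item instead of excluding it.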
  36. 36. The lean approach. Your data; Enrich; Link strategy; API; Deploy; Search strategy.
  37. 37. Breakout: What kind of functionality would you like to provide to your users? 1. What kind of data do you want to make accessible in a richer way? 2. What additional (open) data can you use for this enriched access? 3. What type of (search) functionality is required?
  38. 38. Other applications: Restaurant inspections
  39. 39. Other applications: Community platform
