Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

On Data quality

708 views

Published on

Talk at the 1st DBpedia Community Meeting

Published in: Technology, Business
  • Be the first to comment

On Data quality

  1. 1. ON DATA QUALITY Marco Fossati fossati@spaziodati.eu AMSTERDAM 30TH JANUARY 2014
  2. 2. MAPPING PARADIGM Problem Solution A. TEMPLATE-DEPENDENT ERROR-PRONE HETEROGENEOUS USAGE MACHINE LEARNINGBASED METHODS TYPE INFERENCE CONFIDENCE SCORE MAPPING ASSISTANT ! B. FULLY MANUAL COSTLY 2
  3. 3. ONTOLOGY Problem Solution COMMUNITY-BASED SHALLOW SEMANTICS LACK OF COVERAGE UNBALANCED • TOO GENERIC • REDUNDANT A. CONSISTENCY CHECK CLASS USAGE B. DATA-DRIVEN SCHEMA WIKIPEDIA CATEGORIES 3
  4. 4. LINKING MULTIMEDIA DATA SOURCES PHOTO AUDIO VIDEO 4
  5. 5. PHOTO ! FLICKR WRAPPER MAINTENANCE? UPDATES? 5
  6. 6. AUDIO BANDCAMP GROOVESHARK RDIO SOUNDCLOUD 6
  7. 7. VIDEO IMDB ROTTEN TOMATOES VIMEO YOUTUBE 7
  8. 8. THANKS FOR YOUR ATTENTION! Marco Fossati fossati@spaziodati.eu

×