Peak cloud based data - linked data

A description of using linked data to create clear and unambiguous systems across the Internet or within your enterprise.

Published in: Technology

Transcript

  1. Dealing with the “new” data in the “Cloud” – Linked Data. London - New York - Dubai - Mumbai, 2011
  2. Table of Contents: Definitions; History; The Modigliani Test; Linked Data; Raw Data; Resource Description Framework; Linked Data Principles; Publishing Linked Data; Faceted Browsers; On-the-fly Mashups; SPARQL; What is a Linked Data Application; Characteristics of a Linked Data Application; Contact Us
  3. Definitions. RDF: The RDF data model is similar to classic conceptual modelling approaches such as Entity-Relationship or Class diagrams, as it is based upon the idea of making statements about resources (in particular Web resources) in the form of subject-predicate-object expressions. These expressions are known as triples in RDF terminology. The subject denotes the resource, and the predicate denotes traits or aspects of the resource and expresses a relationship between the subject and the object. For example, one way to represent the notion "The sky has the colour blue" in RDF is as the triple: a subject denoting "the sky", a predicate denoting "has the colour", and an object denoting "blue". RDF is an abstract model with several serialization formats (i.e., file formats), and so the particular way in which a resource or triple is encoded varies from format to format.
  4. Definitions. SPARQL (SPARQL Protocol and RDF Query Language, pronounced "sparkle") is an RDF query language. Linked Data: Linked Data describes a method of publishing structured data, so that it can be interlinked and become more useful. It builds upon standard Web technologies, such as HTTP and URIs, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried.
  5. History: Linked Data Design Issues by Tim Berners-Lee (July 2006); Linked Open Data Project (WWW2007); First LOD Cloud (May 2007); BBC publishes Linked Data (2008); NY Times announcement (SemTech2009 - ISWC09); Data.gov.uk publishes Linked Data (2010)
  6. May 2007
  7. Mar 2008
  8. Sept 2008
  9. Mar 2009
  10. July 2009
  11. The Modigliani Test: "Show me all the locations of all the original paintings of Modigliani." Daniel Koller (@dakoller) showed that you can find this with a SPARQL query on DBpedia.
  12. So what is Linked Data?
  13. Do you SEARCH or do you FIND?
  14. Search for football players who went to the University of Texas at Austin and played for the Dallas Cowboys as cornerback
  15. Why can’t we just FIND it…
  16. Using the current Web (= internet + links + docs) is terribly inefficient
  17. So what is the problem? We aren’t always interested in documents. • We are interested in THINGS • These THINGS might be in documents. We can read an HTML document rendered in a browser and find what we are searching for. • This is hard for computers. It’s typically based on guesswork from some primitive NLP engine, or simple keyword search
  18. What do we need to do? Make it easy for computers/software to find THINGS
  19. How can we do that? • Besides publishing documents on the web - which computers can’t understand easily • Let’s publish something that computers can understand
  20. RAW DATA! But don’t we already publish raw data in RDBMS, XML, CSV, etc?
  21. Yes! But it’s not in a consistent format, and very difficult to integrate (or “link”).
  22. For example, how do I know that the Wael Elrifai in Facebook is the same as Wael Elrifai in Twitter?
  23. Don’t we already have a standard way of publishing on the web?
  24. We have a standardized way of publishing documents on the web, right? HTML
  25. Then why can’t we have a standard way of publishing data on the Web?
  26. In fact, we do have one.
  27. Resource Description Framework (RDF). A data model: • A way to model data • e.g. relational databases use the relational data model. RDF is a triple data model (a labeled graph): Subject, Predicate, Object. <Wael> <was born in> <Beirut>; <Beirut> <is part of> <the Lebanon>; <Wael> <likes> <the Semantic Web>
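The triple model on this slide can be sketched in a few lines of Python. This is only an illustration, assuming plain strings stand in for resources; the `match` helper is a hypothetical name, not part of any RDF library.

```python
# Each RDF statement is one (subject, predicate, object) triple.
# The three triples below are the ones shown on the slide.
triples = [
    ("Wael", "was born in", "Beirut"),
    ("Beirut", "is part of", "the Lebanon"),
    ("Wael", "likes", "the Semantic Web"),
]


def match(triples, s=None, p=None, o=None):
    """Return the triples that fit a pattern; None acts as a wildcard."""
    return [
        (ts, tp, to)
        for (ts, tp, to) in triples
        if (s is None or ts == s)
        and (p is None or tp == p)
        and (o is None or to == o)
    ]


# Everything the graph says about Wael.
about_wael = match(triples, s="Wael")
```

Because the object of one triple ("Beirut") is the subject of another, the list already behaves like a labeled graph that can be walked edge by edge.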
  28. RDF can be serialized in different ways: RDF/XML, RDFa (RDF in HTML), N3, Turtle, JSON
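To make "serialization" concrete, here is a minimal sketch that writes triples as one-statement-per-line text of the form `<subject> <predicate> object .`, the simplest style accepted by Turtle. The `http://example.org/` namespace and the predicate names are assumptions made for the example, and the helper names are hypothetical.

```python
EX = "http://example.org/"  # hypothetical namespace, not from the slides


def term(value):
    """Render a URI in angle brackets, anything else as a quoted literal."""
    if value.startswith(("http://", "https://")):
        return "<" + value + ">"
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'


def to_turtle_lines(triples):
    """Serialize (subject, predicate, object) triples, one statement per line."""
    return "\n".join(f"{term(s)} {term(p)} {term(o)} ." for s, p, o in triples)


data = [
    (EX + "Wael", EX + "wasBornIn", EX + "Beirut"),
    (EX + "Wael", EX + "likes", "the Semantic Web"),
]
doc = to_turtle_lines(data)
```

The same in-memory triples could equally be written out as RDF/XML or RDFa; only the text encoding changes, not the data model.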
  29. So does that mean that I have to publish my data in RDF now?
  30. You don’t have to… but it sure would be nice.
  31. Document on the Web
  32. Databases back up documents. THINGS have PROPERTIES: a Book has a Title, an Author, … This is a THING: a book titled “Programming the Semantic Web” by Toby Segaran. Book table: Isbn = 978-0-596-15381-6; Title = Programming the Semantic Web; Author = Toby Segaran; PublisherID = 1; ReleasedData = July 2009. Publisher table: PublisherID = 1; PublisherName = O’Reilly Media.
  33. Let’s represent the data in RDF. The same Book and Publisher rows, drawn as a graph: book -title-> Programming the Semantic Web; book -author-> Toby Segaran; book -isbn-> 978-0-596-15381-6; book -publisher-> Publisher; Publisher -name-> O’Reilly
  34. Remember that we are on the web. Everything on the web is identified by a URL
  35. And now let’s link the data to other data: http://…/isbn978 -title-> Programming the Semantic Web; http://…/isbn978 -author-> Toby Segaran; http://…/isbn978 -isbn-> 978-0-596-15381-6; http://…/isbn978 -publisher-> http://…/publisher1; http://…/publisher1 -name-> O’Reilly
  36. And now consider the data from Revyu.com: http://…/isbn978 -hasReview-> http://…/review1; http://…/review1 -description-> Awesome Book; http://…/review1 -reviewer-> http://…/reviewer; http://…/reviewer -name-> Wael Elrifai
  37. Let’s start to link data. The Revyu.com resource http://…/isbn978 is connected by sameAs to the book resource http://…/isbn978, joining the two graphs: http://…/isbn978 -hasReview-> http://…/review1; http://…/review1 -description-> Awesome Book; http://…/review1 -hasReviewer-> http://…/reviewer; http://…/reviewer -name-> Wael Elrifai; http://…/isbn978 -title-> Programming the Semantic Web; http://…/isbn978 -author-> Toby Segaran; http://…/isbn978 -isbn-> 978-0-596-15381-6; http://…/isbn978 -publisher-> http://…/publisher1; http://…/publisher1 -name-> O’Reilly
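The step from table rows to URL-identified triples might look like the sketch below. The slides elide the real hosts, so the `http://example.org/` base and the URL layout are assumptions, and the function names are hypothetical.

```python
BASE = "http://example.org/"  # hypothetical base; the slides elide the host

# Rows as they appear in the Book and Publisher tables on the slides.
book_row = {
    "Isbn": "978-0-596-15381-6",
    "Title": "Programming the Semantic Web",
    "Author": "Toby Segaran",
    "PublisherID": 1,
}
publisher_row = {"PublisherID": 1, "PublisherName": "O'Reilly Media"}


def publisher_uri(publisher_id):
    """Mint a URL for a publisher from its primary key."""
    return BASE + "publisher/" + str(publisher_id)


def book_to_triples(row):
    """Turn one Book row into triples; the foreign key becomes a link."""
    subject = BASE + "isbn/" + row["Isbn"]
    return [
        (subject, "title", row["Title"]),
        (subject, "author", row["Author"]),
        (subject, "isbn", row["Isbn"]),
        (subject, "publisher", publisher_uri(row["PublisherID"])),
    ]


def publisher_to_triples(row):
    """Turn one Publisher row into triples."""
    return [(publisher_uri(row["PublisherID"]), "name", row["PublisherName"])]


graph = book_to_triples(book_row) + publisher_to_triples(publisher_row)
```

The foreign key `PublisherID` turns into an ordinary edge between two URLs, which is what allows data held elsewhere to point at the same book or publisher.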
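One way to picture what the sameAs link buys you: two independently published sets of triples collapse into a single graph once equivalent URLs are treated as one node. The hosts `store.example` and `revyu.example` are placeholders for the URLs the slides elide, and `merge` is a hypothetical helper, not a standard API.

```python
# Two datasets published separately (placeholder hosts).
store = [
    ("http://store.example/isbn978", "title", "Programming the Semantic Web"),
    ("http://store.example/isbn978", "author", "Toby Segaran"),
]
revyu = [
    ("http://revyu.example/isbn978", "hasReview", "http://revyu.example/review1"),
    ("http://revyu.example/review1", "description", "Awesome Book"),
]
# The link that states both URLs name the same book.
links = [("http://revyu.example/isbn978", "sameAs", "http://store.example/isbn978")]


def merge(datasets, links):
    """Combine datasets, rewriting every aliased URL to its sameAs target."""
    alias = {s: o for s, p, o in links if p == "sameAs"}

    def canon(node):
        seen = set()
        while node in alias and node not in seen:  # guard against cycles
            seen.add(node)
            node = alias[node]
        return node

    return [(canon(s), p, canon(o)) for ds in datasets for s, p, o in ds]


merged = merge([store, revyu], links)
```

After the merge, the title from one source and the review from the other hang off the same subject, so a single query can reach both.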
  38. Data on the Web that is in RDF and is linked to other RDF data is LINKED DATA
  39. Linked Data Principles 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up (dereference) those names. 3. When someone looks up a URI, provide useful information. 4. Include links to other URIs so that they can discover more things.
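Principles 2 and 3 amount to an ordinary HTTP GET on the name itself. A common convention, assumed here rather than stated on the slide, is for the client to ask for an RDF format through the `Accept` header. The URL below is a placeholder and the function names are hypothetical; the sketch builds the request without sending it.

```python
import urllib.request

# Media types for Turtle and RDF/XML; the preference order is an assumption.
RDF_TYPES = "text/turtle, application/rdf+xml;q=0.9"


def build_lookup(uri):
    """Prepare (but do not send) a GET that asks for RDF instead of HTML."""
    if not uri.startswith(("http://", "https://")):
        raise ValueError("Linked Data names should be HTTP URIs: " + uri)
    return urllib.request.Request(uri, headers={"Accept": RDF_TYPES})


def dereference(uri, timeout=10):
    """Look the name up on the network and return the response body as text."""
    with urllib.request.urlopen(build_lookup(uri), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


request = build_lookup("http://example.org/isbn978")  # placeholder URL
```

Whatever comes back should itself contain further URLs (principle 4), so a client can repeat the same lookup on each of them.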
  40. Linked Data makes the web appear as a single global database! The same can be done inside your company!
  41. What if you wanted to know your company’s EBITDA for Catalonia in 2010? You could have an EDW pre-aggregate and distribute the data, an analyst calculate it on the spot, or…
  42. Linked data in your internal semantic web could relate all transactions to a linked financial formula! You ask the question, tell your system where to look (as part of the question; this can be prebuilt) and voilà!
  43. I can query a database with SQL. Is there a way to query Linked Data with a query language?
  44. Yes! There is actually a standardized language for that.
  45. FIND all the reviews on the book “Programming the Semantic Web” by people who live in London
  46. Graph for the query: http://…/isbn978 -hasReview-> http://…/review1; http://…/review1 -description-> Awesome Book; http://…/review1 -hasReviewer-> http://…/reviewer; http://…/reviewer -name-> Wael Elrifai; http://…/reviewer -sameAs-> http://waelworldwide.com; http://waelworldwide.com -name-> Wael Elrifai; http://waelworldwide.com -livesIn-> http://dbpedia.org/London; http://…/isbn978 -sameAs-> http://…/isbn978 (the book data); http://…/isbn978 -title-> Programming the Semantic Web; http://…/isbn978 -author-> Toby Segaran; http://…/isbn978 -isbn-> 978-0-596-15381-6; http://…/isbn978 -publisher-> http://…/publisher1; http://…/publisher1 -name-> O’Reilly
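The FIND request can be read as a chain of joins over the linked graph. The sketch below performs those joins by hand in Python rather than in SPARQL. Short labels such as `book` and `review1` stand in for the elided URLs, and the second review is invented purely to show the filter rejecting something.

```python
LONDON = "http://dbpedia.org/London"  # URL as written on the slide

graph = [
    ("book", "title", "Programming the Semantic Web"),
    ("book", "hasReview", "review1"),
    ("review1", "description", "Awesome Book"),
    ("review1", "hasReviewer", "reviewer"),
    ("reviewer", "name", "Wael Elrifai"),
    ("reviewer", "sameAs", "http://waelworldwide.com"),
    ("http://waelworldwide.com", "livesIn", LONDON),
    # Hypothetical second review, not on the slide.
    ("book", "hasReview", "review2"),
    ("review2", "description", "Too dense"),
    ("review2", "hasReviewer", "reviewer2"),
    ("reviewer2", "livesIn", "http://dbpedia.org/Paris"),
]


def objects(graph, s, p):
    """All objects o such that (s, p, o) is in the graph."""
    return [o for (ts, tp, o) in graph if ts == s and tp == p]


def subjects(graph, p, o):
    """All subjects s such that (s, p, o) is in the graph."""
    return [s for (s, tp, to) in graph if tp == p and to == o]


def reviews_by_residents(graph, title, city):
    """Descriptions of reviews of `title` written by people living in `city`."""
    found = []
    for book in subjects(graph, "title", title):
        for review in objects(graph, book, "hasReview"):
            for reviewer in objects(graph, review, "hasReviewer"):
                # Follow sameAs so facts stored under another URL count too.
                identities = [reviewer] + objects(graph, reviewer, "sameAs")
                if any(city in objects(graph, i, "livesIn") for i in identities):
                    found.extend(objects(graph, review, "description"))
    return found


london_reviews = reviews_by_residents(graph, "Programming the Semantic Web", LONDON)
```

The reviewer's city is known only through the sameAs hop to a different site, which is exactly the kind of cross-dataset link the query depends on.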
  47. This looks cool, but let’s be realistic. What is the incentive to publish Linked Data?
  48. What was your incentive to publish an HTML (Intranet) page in 1990?
  49. 1) Share data in documents 2) Because your neighbor was doing it
  50. So why should we publish Linked Data in 2011?
  51. 1) Share data as data 2) Because your neighbor is doing it
  52. You’ll be among good company…
  53. Linked Data Publishers: UK Government, US Government, BBC, Open Calais – Thomson Reuters, Freebase, NY Times, Best Buy, CNET, DBpedia
  54. How can I publish Linked Data?
  55. Publishing Linked Data • Legacy data in relational databases: D2R Server, Virtuoso, Triplify, Ultrawrap • CMS: Drupal 7 • Native RDF stores, i.e. databases for RDF (triple stores): AllegroGraph, Jena, Sesame, Virtuoso; Talis Platform (Linked Data in the Cloud) • In HTML with RDFa
  56. Consuming Linked Data by Humans
  57. HTML Browsers: RDF can be serialized in RDFa. Have you heard of • Yahoo’s Search Monkey • Google Rich Snippets? They are consuming RDFa. But WHY?
  58. Because there is life beyond ten blue links
  59. Google and Yahoo are starting to crawl RDFa! The Semantic Web is a reality!
  60. The Reality • Yahoo is crawling data that is in RDFa and Microformats under specific vocabularies: • FOAF • GoodRelations • Google is crawling RDFa and Microformats that use the Google vocabulary
  61. Linked Data Browsers: Tabulator • http://www.w3.org/2005/ajar/tab; OpenLink • http://ode.openlinksw.com/; Zitgist Dataviewer • http://dataviewer.zitgist.com/; Marbles • http://www5.wiwiss.fu-berlin.de/marbles/; Explorator • http://www.tecweb.inf.puc-rio.br/explorator
  62. Faceted Browsers
  63. http://dbpedia.neofonie.de
  64. http://dev.semsol.com/2010/semtech/
  65. On-the-fly Mashups
  66. http://sig.ma
  67. What’s next?
  68. Time to create new and innovative ways to interact with Linked Data
  69. This may be one of the Killer Apps that we have all been waiting for http://en.wikipedia.org/wiki/File:Mosaic_browser_plaque_ncsa.jpg
  70. Where can I find SPARQL Endpoints? DBpedia: http://dbpedia.org/sparql; Musicbrainz: http://dbtune.org/musicbrainz/sparql; U.S. Census: http://www.rdfabout.com/sparql; Semantic Crunchbase: http://cb.semsol.org/sparql. More endpoints: http://esw.w3.org/topic/SparqlEndpoints
  71. • Querying a single dataset is quite boring compared to issuing SPARQL queries over multiple datasets • How can you do this? 1. Issue follow-up queries to different endpoints 2. Query a central collection of datasets 3. Build a store with copies of relevant datasets 4. Use a query federation system
  72. Follow-up Queries • Idea: issue follow-up queries over other datasets based on results from previous queries • Substituting placeholders in query templates
  73. Getting Started • Finding URIs • Finding Additional Data • Finding SPARQL Endpoints
  74. What is a Linked Data application? A software system that makes use of data on the web from multiple datasets AND that benefits from links between the datasets
  75. Characteristics of Linked Data Applications • Consume data that is published on the web following the Linked Data principles • Discover further information by following the links between different data sources • Combine the consumed linked data with data from other sources (not necessarily Linked Data) • Expose the combined data back to the web following the Linked Data principles • Offer value to end-users
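An endpoint such as the ones listed is queried over plain HTTP: the SPARQL protocol lets a client send the query text as a `query` parameter on a GET request. The sketch below only builds that URL and sends nothing. The query is a generic placeholder and `sparql_get_url` is a hypothetical helper name.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

DBPEDIA = "http://dbpedia.org/sparql"  # endpoint URL as listed on the slide

# A deliberately generic query: any five triples from the dataset.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def sparql_get_url(endpoint, query):
    """Build the GET URL that carries the query in the `query` parameter."""
    return endpoint + "?" + urlencode({"query": query})


url = sparql_get_url(DBPEDIA, QUERY)

# Decoding the URL again recovers the original query text.
round_trip = parse_qs(urlsplit(url).query)["query"][0]
```

Fetching `url` with any HTTP client would return the result set; the result format depends on the endpoint and on the `Accept` header sent.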
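The placeholder idea can be sketched with Python's `string.Template`: results from a first query supply URIs, and each URI is substituted into a template to produce the next query for another endpoint. The first-query rows here are made up for illustration, and the `reviewer` variable name is an assumption.

```python
from string import Template

# Template for the follow-up query; $uri is the placeholder to fill in.
FOLLOW_UP = Template("SELECT ?p ?o WHERE { <$uri> ?p ?o }")


def follow_up_queries(first_results, var):
    """Build one follow-up query per row of an earlier result set."""
    queries = []
    for row in first_results:
        uri = row[var]
        # Refuse values that could break out of the <...> URI syntax.
        if any(ch in uri for ch in "<>\" \n\t"):
            raise ValueError("not a safe URI: " + repr(uri))
        queries.append(FOLLOW_UP.substitute(uri=uri))
    return queries


# Hypothetical rows returned by a first query against one endpoint.
first_results = [
    {"reviewer": "http://waelworldwide.com"},
    {"reviewer": "http://example.org/reviewer2"},
]
queries = follow_up_queries(first_results, "reviewer")
```

Each generated query can then be sent to whichever endpoint is expected to know more about that URI.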
  76. Examples • http://data-gov.tw.rpi.edu/wiki • http://dbrec.net/ • http://fanhu.bz/ • http://data.nytimes.com/schools/schools.html • http://sig.ma • http://visinav.deri.org/semtech2010/
  77. Hot Research Topics • Interlinking Algorithms • Provenance and Trust • Dataset Dynamics • UI • Distributed Query
  78. Contact: PEAK Consulting. Headquarters: 90 Long Acre, Covent Garden, London WC2E 9RZ, United Kingdom; Tel: +44 (0)207 849 3422; Fax: +44 (0)207 990 9478. United States: 11 Penn Plaza, 5th floor, New York, NY 1000, United States; Tel: +1 (212) 946 4824; Fax: +1 (212) 946 2801. United Arab Emirates: Unit P12 Rimal, The Walk, PO Box 487 177, Dubai, United Arab Emirates; Tel: +44 (0)207 849 3422; Fax: +44 (0)207 990 9478. http://www.peakconsulting.eu info@peakconsulting.eu
