George James :: Querying The Web

Published in: Business, Technology

  1. Querying the Web Out of the Slipstream :: September 27, 2007
  2. Querying the Web
      • “Information wants to be free”
        • Stewart Brand, Whole Earth Catalogue
        • May 1985
      • “If the new computer set up allowed folks inside to be more creative and independent, why not open it up to outsiders, too?”
        • Jeff Bezos, Amazon
        • March 2002
      • “Data is the Next Intel Inside”
        • Tim O’Reilly
        • September 2005
      • Open Source has commoditized software
      • Creative Commons will commoditize information
      • Which leaves servers, services and service…
  3. General Medical Council
  4. General Medical Council
  5. General Medical Council
  6. Freebase
  7. Freebase
  8. Freebase
  9. Freebase
  10. Freebase
      • Metaweb Query Language
      • Request:
          { "type" : "/medicine/physician",
            "name" : "Michael Maher" }
      • Response:
          { "code": "/api/status/ok",
            "result": { "type": "/medicine/physician",
                        "name": "Michael Maher",
                        "gender": "Male",
                        "education": "Leeds University" }
          }
      • JSON
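
The request and response on the Freebase slide can be exercised offline. The following is a minimal sketch, assuming only the JSON shown on the slide: it builds the query, then unpacks the response text. The helper name `extract_result` is hypothetical, and the transport to the Freebase service is deliberately left out because the slide does not show it.

```python
import json

# Query copied from the slide: find a physician by name.
query = {"type": "/medicine/physician", "name": "Michael Maher"}
request_body = json.dumps(query)

# Response text as shown on the slide (not fetched from a live service).
response_text = """
{ "code": "/api/status/ok",
  "result": { "type": "/medicine/physician",
              "name": "Michael Maher",
              "gender": "Male",
              "education": "Leeds University" } }
"""


def extract_result(text):
    """Return the "result" object if the status code reports success."""
    reply = json.loads(text)
    if reply.get("code") != "/api/status/ok":
        raise ValueError("query failed: %r" % reply.get("code"))
    return reply["result"]


result = extract_result(response_text)
print(request_body)
print(result["gender"], result["education"])
```

The query is a template: the properties supplied in the request constrain the match, and the response carries the matching object inside a status envelope.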
  11. GMC vs Freebase
      • GMC
        • Authoritative
        • Website based search
        • Static
        • Restrictive license
        • Even if you pay for the data you still cannot use it, legally.
        • Periodic updates
      • Freebase
        • User sourced content
        • API
        • Extensible, dynamic
        • Creative Commons / PD
        • Automatic right to use
        • Stepwise refinement
  12. REST
      • REpresentational State Transfer
      • Less rigorous equivalent of SOAP
      • Data are considered to be resources
      • Every resource has a unique address
      • Layered over HTTP:
        • Client/Server separation
        • Stateless
        • Cacheable
      • Request:
        • GET http://rest.georgejames.com/product/Serenji/
      • Response:
        • Name=Serenji
        • Price=195.00
        • OrderCode=H1001
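
As an illustration of the REST slide, here is a minimal sketch of a client for that request. It assumes the response body is plain `Key=Value` lines exactly as the slide shows; the function names `fetch_resource` and `parse_key_value_body` are hypothetical, and the fetch is defined but not called because the example URL may not be live.

```python
from urllib.request import urlopen


def fetch_resource(url):
    """Issue a plain HTTP GET for a resource URL and return the body as text.

    Assumes a UTF-8 body; defined for illustration and not called below.
    """
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def parse_key_value_body(body):
    """Turn lines of the form Key=Value into a dictionary."""
    fields = {}
    for line in body.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition("=")
        fields[key] = value
    return fields


# Body as shown on the slide for GET .../product/Serenji/
sample_body = "Name=Serenji\nPrice=195.00\nOrderCode=H1001\n"
product = parse_key_value_body(sample_body)
print(product["Name"], float(product["Price"]), product["OrderCode"])
```

Because the resource has its own address and the request carries no session state, the same GET can be repeated or cached without side effects.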
  13. Amazon S3
      • S3 :: Simple Storage Service
      • Online storage space
      • $0.15 per Gbyte per month for storage
      • ~ $0.20 per Gbyte data transfer
      • Storage request:
        • PUT http://s3.amazonaws.com/[bucket-name]/[key-name]
      • Retrieval request:
        • GET http://s3.amazonaws.com/[bucket-name]/[key-name]
      • EC2 :: Elastic Compute Cloud
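
The two S3 request shapes above differ only in the HTTP method. The sketch below composes them from a bucket and key; the helper names and the sample bucket and key are hypothetical, and it only builds the request line. Real requests to non-public buckets also need a signed authentication header, which is omitted here.

```python
from urllib.parse import quote

S3_ENDPOINT = "http://s3.amazonaws.com"  # endpoint as written on the slide


def s3_object_url(bucket, key):
    """Build the path-style URL [endpoint]/[bucket-name]/[key-name]."""
    # Keys may contain "/" as a separator; other unsafe characters are escaped.
    return "%s/%s/%s" % (S3_ENDPOINT, quote(bucket, safe=""), quote(key, safe="/"))


def describe_request(method, bucket, key):
    """Return the request line for storing (PUT) or retrieving (GET) an object."""
    if method not in ("PUT", "GET"):
        raise ValueError("expected PUT or GET, got %r" % method)
    return "%s %s" % (method, s3_object_url(bucket, key))


print(describe_request("PUT", "example-bucket", "slides/querying the web.ppt"))
print(describe_request("GET", "example-bucket", "slides/querying the web.ppt"))
```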
  14. Microformats
  15. Microformats
      • Without Microformats:
          <div class='opaque'>
          Out of the Slipstream is a one-day conference on Thursday 27 September 2007 at Brooklands Museum, Surrey, UK.
          </div>
      • With Microformats:
          <div class='opaque vevent'>
          <span class='summary'>Out of the Slipstream</span> is a one-day conference on
          <abbr class="dtstart" title="20070927">
          Thursday 27 September 2007
          </abbr>
          at Brooklands Museum, Surrey, UK.
          </div>
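
To show why the extra class names matter, the following sketch reads the marked-up version with Python's standard `html.parser`. The class `VEventParser` is hypothetical and handles only the two properties used on the slide (`summary` and `dtstart`); it assumes the summary element contains no nested tags.

```python
from html.parser import HTMLParser


class VEventParser(HTMLParser):
    """Collect the summary text and dtstart value from vevent markup."""

    def __init__(self):
        super().__init__()
        self.event = {}
        self._in_summary = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if "summary" in classes:
            self._in_summary = True
            self.event["summary"] = ""
        if "dtstart" in classes and "title" in attrs:
            # The machine-readable date lives in the title attribute.
            self.event["dtstart"] = attrs["title"]

    def handle_endtag(self, tag):
        # Simplification: the summary is assumed to hold text only,
        # so any closing tag ends it.
        self._in_summary = False

    def handle_data(self, data):
        if self._in_summary:
            self.event["summary"] += data


markup = """
<div class='opaque vevent'>
<span class='summary'>Out of the Slipstream</span> is a one-day conference on
<abbr class="dtstart" title="20070927">Thursday 27 September 2007</abbr>
at Brooklands Museum, Surrey, UK.
</div>
"""

parser = VEventParser()
parser.feed(markup)
print(parser.event["summary"].strip(), parser.event["dtstart"])
```

Run against the version without microformats, the same parser finds nothing, which is the point of the slide: the text is identical for a human reader, but only the annotated version can be queried.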
  16. Microformats
  17. Astoria
  18. Astoria in action
      • Request:
        • http://astoria.sandbox.live.com/northwind/northwind.rse/Categories
      • Response:
  19. Astoria in action
      • Request:
        • http://astoria.sandbox.live.com/northwind/northwind.rse/Customers
      • Response:
  20. Astoria in action
      • Request:
        • /Customers[FRANK]
      • Response:
  21. Astoria in action
      • Request:
        • /Customers[FRANK]/Orders
      • Response:
  22. Astoria in action
      • A variety of response formats:
        • POX
        • Web3S (Web, Structured, Schema’d and Searchable)
        • ATOM
        • JSON
      • JSON request:
        • /Customers[FRANK]?$format=json
      • Response:
  23. Astoria is still evolving
      • Ongoing discussion about the format of requests:
        • /Customers!'FRANK'
        • /Customers!'FRANK'/Orders!10267
        • /Customers!CustomerID='FRANK'
        • /Customers('FRANK')
        • /Customers('FRANK')/Orders(10267)
      • Qualifiers control the response format:
        • /Customers('FRANK')/CustomerName
        • /Customers('FRANK')/CustomerName/$value
        • /Customers('FRANK')/$format=json
        • /Customers/$skip=30&$take=10
      • Currently being Microsoftened…
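
The addressing scheme on the Astoria slides can be sketched as a small URL builder. This is an illustration only: it uses the parenthesised key form, `/Customers('FRANK')`, which is just one of the candidate syntaxes listed, and the query-string form of `$format` from the earlier JSON request. The helper names are hypothetical and none of this reflects a finalised Astoria API.

```python
# Sandbox root as it appears in the request URLs on the slides.
BASE = "http://astoria.sandbox.live.com/northwind/northwind.rse"


def key_literal(value):
    """Render a key: integers bare, everything else in single quotes."""
    if isinstance(value, int):
        return str(value)
    return "'%s'" % value


def entity_path(*segments):
    """Join (entity_set, key) pairs into a path; a key of None selects the whole set."""
    parts = []
    for name, key in segments:
        if key is None:
            parts.append(name)
        else:
            parts.append("%s(%s)" % (name, key_literal(key)))
    return "/" + "/".join(parts)


def with_format(path, fmt):
    """Append the response-format qualifier as a query string."""
    return "%s?$format=%s" % (path, fmt)


orders = entity_path(("Customers", "FRANK"), ("Orders", 10267))
print(BASE + orders)
print(BASE + with_format(entity_path(("Customers", "FRANK")), "json"))
```

Each path segment narrows the previous one, so a URL reads as a query: a customer, then that customer's orders, then one order.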
  24. Where is all this information going to come from?
  25. Crowdsourcing
      • Jeff Howe, Wired Magazine, June 2006
      • Delegating an activity to a large number of unidentified individuals
      • Small finite tasks
      • Quantity more important than quality
      • The sum is greater than the parts
      • Examples:
        • Wikipedia
  26. Crowdsourcing
  27. Crowdsourcing
  28. Google Maps
  29. Google Maps
  30. Crowdsourcing
      • Jeff Howe, June 2006, Wired Magazine
      • Delegating an activity to a large number of unidentified individuals
      • Small finite tasks
      • Quantity more important than quality
      • The sum is greater than the parts
      • Examples:
        • Wikipedia
        • Galaxy Zoo
        • Amazon Mechanical Turk
        • Google route planner
      • Consequences:
        • Drives down the cost of data
        • Owners may not be the traditional incumbents
        • Client / user needs to discriminate
  31. The Power of Information Review
      • Commissioned by the Cabinet Office, published in June 2007, to review and advise on the use of public sector information.
      • Recommendation 9:
        • By Budget 2008, government should commission and publish an independent review of the costs and benefits of the current trading fund charging model for the re-use of public sector information, including the role of the five largest trading funds, the balance of direct versus downstream economic revenue, and the impact on the quality of public sector information.
      • US: Public Domain
      • UK: Crown Copyright
  32. AND - Automotive Navigation Data
      • Press release:
        • July 4, 2007
        • Rotterdam - AND Automotive Navigation Data has agreed ... to donate digital maps of the Netherlands, China and India to the community.
  33. More ways of querying the web
      • Google Search
      • Google Events
      • Google Base
      • Yahoo! Pipes
      • RSS – Really Simple Syndication
      • KML
      • BBC Backstage
  34. The Internet is the Database
  35. Thank you
      • Questions?
