Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Seek and you will find?


Published on

Yes... but probably not what you were expecting! Presentation given to NetIKX at the British Dental Association, 18th March 2015

Published in: Internet
  • Be the first to comment

Seek and you will find?

  1. 1. Seek and you will find? Yes ... but probably not what you were expecting NetIKX, 18th March 2015, #netikx72 British Dental Association, 64 Wimpole Street, London W1G 8YS Karen Blakeman RBA Information Services Presentation available for a short time at: This work is licensed under a Creative Commons Attribution-Share-Alike License.
  2. 2. EU - so called “right to be forgotten” ruling 2 Mario Costeja Gonzalez Edition of Monday, January 19, 1998, page 23 - Newspaper - 98/01/19/pagina-23/33842001/pdf.html EU Court of Justice ruled that Google is a “data controller” under Data Protection legislation and must remove, if requested, links to information that is “inadequate, irrelevant .... or excessive” from search results on a person’s name. Applies to search engines in the EU, Norway, Lichtenstein, Iceland and Switzerland 17/03/2015
  3. 3. 3 Scale of EU 'right to be forgotten' rules revealed by Google forgotten-rules-revealed-Google-says-forced-delete-260-000-links-legislation- criticised-protecting-terrorists-criminals.html 17/03/2015
  4. 4. 4 Spanish Newspapers Suddenly Regret Forcing Google Out Of Spain - minds-and-are-now-begging-google-news-to-stay-2014-12 How Google News Lives On In Spain Despite Being Closed 17/03/2015
  5. 5. Oh joy - NOT! 5 More UK information vanishes into GOV.UK vanishes-into-gov-uk/ 17/03/2015
  6. 6. Where’s the information gone to? List of departments, agencies and public bodies at “Home pages” on GOV.UK Data and information may still be on the old websites Data may have been moved to Information may have been sent to Or information may have been “lost” Don’t rely on just GOV.UK search – use Google/Bing site: command combined with filetype: if appropriate 617/03/2015
  7. 7. 717/03/2015
  8. 8. "Yes Minister" The Skeleton in the Cupboard (TV Episode 1982) - Quotes - IMDb James Hacker: [reads memo] This file contains the complete set of papers, except for a number of secret documents, a few others which are part of still active files, some correspondence lost in the floods of 1967... James Hacker: Was 1967 a particularly bad winter? Sir Humphrey Appleby: No, a marvellous winter. We lost no end of embarrassing files. James Hacker: [reads] Some records which went astray in the move to London and others when the War Office was incorporated in the Ministry of Defence, and the normal withdrawal of papers whose publication could give grounds for an action for libel or breach of confidence or cause embarrassment to friendly governments. James Hacker: That's pretty comprehensive. How many does that normally leave for them to look at? James Hacker: How many does it actually leave? About a hundred?... Fifty?... Ten?... Five?... Four?... Three?... Two?... One?... *Zero?* Sir Humphrey Appleby: Yes, Minister. [Add “transfer to GOV.UK” to the list of excuses] 17/03/2015
  9. 9. 917/03/2015
  10. 10. Wayback Machine 1017/03/2015
  11. 11. Remember - Google knows best Google very kindly.... 1. Goes to great lengths to personalise your results according to your search history, contacts, location, device, phase of the moon, the train, bus or tram you take to work and anything else it can think of 2. Rewrites your search for you by leaving out some of your terms and looking for weird and wonderful alternatives 3. Doesn’t bother you with everything that might be relevant 4. Changes its algorithms on a regular basis to keep you on your toes 5. Constantly conducts experiments on you to ensure that you don’t feel forgotten 1117/03/2015
  12. 12. Google no longer looks at keywords in isolation Tries to make “sense” of your search and put it into context, natural language queries, uses what others have searched and clicked on Constantly changing – all bets are off when it comes to predicting what your results will look like How you ask your question is taken into account, device you are using is taken into account Providing Quick Answers and “facts”, extracts from websites giving you the “answer” 1217/03/2015
  13. 13. 1317/03/2015
  14. 14. 1417/03/2015
  15. 15. 17/03/2015 15
  16. 16. 1617/03/2015
  17. 17. 1717/03/2015
  18. 18. 1817/03/2015
  19. 19. 19 What could possibly go wrong? 17/03/2015
  20. 20. 2017/03/2015
  21. 21. 21 And then on another day... 17/03/2015
  22. 22. knowledge-graph-gets-confused.html 2217/03/2015
  23. 23. One of many wrong Quick Answers submitted to me by a delegate at a recent conference 23 Many thanks to Philip Stirups for the example. About 24 hours after taking this screen shot Google corrected the error. 17/03/2015
  24. 24. 24 Google "Henry VIII wives": Jane Seymour reveals search engine's blind spots jane_seymour_reveals_search_engine_s_blind_spots.html Image courtesy of Will Oremus 17/03/2015
  25. 25. 2517/03/2015
  26. 26. Waitrose Caversham opening times New Year’s Day 26 Google used the standard opening times in its answer, not the seasonal opening times 17/03/2015
  27. 27. 27 knowledge-graph-203293 But Google’s choice of “basic factual data” may be wrong! 17/03/2015
  28. 28. 28 Google wants to rank websites based on facts not links - 28 February 2015 - New Scientist wants-to-rank-websites-based-on-facts-not-links.html 17/03/2015
  29. 29. Or maybe not.... 29 Google: We Are Not Using Facts For Search Engine Ranking Now 19979.html 17/03/2015
  30. 30. Artificial intelligence 30 Artificial Intelligence machine plays video games like a pro - CBBC Newsround Google buys UK artificial intelligence startup Deepmind for £400m acquires-uk-artificial-intelligence-startup-deepmind Google buys two more UK artificial intelligence startups intelligence-startups-machine-learning-dark-blue-labs-vision-factory 17/03/2015
  31. 31. 31 But official data is OK isn’t it? 17/03/2015
  32. 32. Google Public Data Explorer Minimum Wage 32 Some countries are missing e.g. Germany 17/03/2015
  33. 33. Eurostat - Minimum Wage 3317/03/2015
  34. 34. 3417/03/2015
  35. 35. - uses Land Registry data 35 Land Registry data often goes missing. I know that 10 months ago the sold price for number 90 was listed as £185,000 and for 2012. 17/03/2015
  36. 36. 36 Data doesn’t show up via the Land Registry Open Data interface either. 17/03/2015
  37. 37. Missing data 37 Error report filed with the Land Registry - still waiting for a response Why might a property/price paid not appear in the data? Seems not that uncommon according to discussion boards – usually data entry error (but the above example was in the open data sets until a few months ago) Absence of price – gift of property or purchase of a share Impractical to calculate price e.g. bulk purchase of properties Commercial transactions index-and-price-paid-data Raw data files downloaded and searched and data for number 90 is missing 17/03/2015
  38. 38. Title document 38 Data IS present in the title document. 17/03/2015
  39. 39. 39 Free Companies House data to boost UK economy - Press releases - GOV.UK uk-economy 17/03/2015
  40. 40. Companies House free data Bulk data – all or nothing Large daily files available as zipped files No support provided – you’re on your own! 4017/03/2015
  41. 41. Companies House free data 41 Each file within the zipped file is a separate document. Note the uninformative file names! 17/03/2015
  42. 42. 42 Variable Pitch Uses public electricity micro-generation data 17/03/2015
  43. 43. 43 Variable Pitch Virginia Station is the hydroelectric installation at Windsor Castle – no data! 17/03/2015
  44. 44. FoI request generation data for Virginia Station 44 e-news/royal-family-granted-new-right-of- secrecy-2179148.html 17/03/2015
  45. 45. And finally..... Per capita consumption of cheese (US) correlates with Number of people who died by becoming tangled in their bedsheets 4517/03/2015