Seek and you will find?
Yes ... but probably not what you were expecting
NetIKX, 18th March 2015, #netikx72
British Dental Association, 64 Wimpole Street, London W1G 8YS
RBA Information Services
Presentation available for a short time at: http://www.rba.co.uk/as/
This work is licensed under a Creative Commons Attribution-Share-Alike License.
EU - so called “right to be forgotten” ruling
Mario Costeja Gonzalez
Edition of Monday, January 19, 1998, page 23 -
Newspaper - Lavanguardia.es
EU Court of Justice ruled that Google is a “data
controller” under Data Protection legislation
and must remove, if requested, links to
information that is “inadequate, irrelevant .... or
excessive” from search results on a person’s
Applies to search engines in the EU, Norway,
Lichtenstein, Iceland and Switzerland
Scale of EU 'right to be forgotten' rules revealed by Google
Spanish Newspapers Suddenly Regret Forcing Google Out Of Spain -
How Google News Lives On In Spain Despite Being Closed
Oh joy - NOT!
More UK information vanishes into GOV.UK
Where’s the information gone to?
List of departments, agencies and public bodies at
“Home pages” on GOV.UK
Data and information may still be on the old websites
Data may have been moved to http://data.gov.uk/
Information may have been sent to
Or information may have been “lost”
Don’t rely on just GOV.UK search – use Google/Bing site:
command combined with filetype: if appropriate
"Yes Minister" The Skeleton in the Cupboard (TV Episode 1982) -
Quotes - IMDb http://www.imdb.com/title/tt0751825/quotes
James Hacker: [reads memo] This file contains the complete set of papers,
except for a number of secret documents, a few others which are part of still
active files, some correspondence lost in the floods of 1967...
James Hacker: Was 1967 a particularly bad winter?
Sir Humphrey Appleby: No, a marvellous winter. We lost no end of embarrassing
James Hacker: [reads] Some records which went astray in the move to London
and others when the War Office was incorporated in the Ministry of Defence,
and the normal withdrawal of papers whose publication could give grounds for
an action for libel or breach of confidence or cause embarrassment to friendly
James Hacker: That's pretty comprehensive. How many does that normally leave
for them to look at?
James Hacker: How many does it actually leave? About a hundred?... Fifty?...
Ten?... Five?... Four?... Three?... Two?... One?... *Zero?*
Sir Humphrey Appleby: Yes, Minister.
[Add “transfer to GOV.UK” to the list of excuses]
Remember - Google knows best
Google very kindly....
1. Goes to great lengths to personalise your results according
to your search history, contacts, location, device, phase of
the moon, the train, bus or tram you take to work and
anything else it can think of
2. Rewrites your search for you by leaving out some of your
terms and looking for weird and wonderful alternatives
3. Doesn’t bother you with everything that might be relevant
4. Changes its algorithms on a regular basis to keep you on
5. Constantly conducts experiments on you to ensure that you
don’t feel forgotten
Google no longer looks at keywords in isolation
Tries to make “sense” of your search and put it into context,
natural language queries, uses what others have searched
and clicked on
Constantly changing – all bets are off when it comes to
predicting what your results will look like
How you ask your question is taken into account, device you
are using is taken into account
Providing Quick Answers and “facts”, extracts from websites
giving you the “answer”
One of many wrong Quick Answers submitted to me by a
delegate at a recent conference
Many thanks to Philip Stirups for the example. About 24 hours after taking this
screen shot Google corrected the error.
Google "Henry VIII wives": Jane Seymour reveals search engine's blind spots
Image courtesy of Will Oremus
Waitrose Caversham opening times New Year’s Day
Google used the standard opening
times in its answer, not the
seasonal opening times
But Google’s choice of “basic factual data” may be wrong!
Google wants to rank websites based on facts not links - 28 February 2015 -
New Scientist http://www.newscientist.com/article/mg22530102.600-google-
Or maybe not....
Google: We Are Not Using Facts For Search Engine Ranking Now
Artificial Intelligence machine plays video games like a pro - CBBC
Google buys UK artificial intelligence startup Deepmind for £400m
Google buys two more UK artificial intelligence startups
But official data is OK isn’t it?
Google Public Data Explorer Minimum Wage
Some countries are missing e.g.
http://www.rightmove.co.uk/ - uses Land Registry data
Land Registry data
missing. I know
that 10 months
ago the sold price
for number 90 was
listed as £185,000
and for 2012.
Data doesn’t show
up via the Land
Registry Open Data
Error report filed with the Land Registry - still waiting for a response
Why might a property/price paid not appear in the data?
Seems not that uncommon according to discussion boards – usually data entry error
(but the above example was in the open data sets until a few months ago)
Absence of price – gift of property or purchase of a share
Impractical to calculate price e.g. bulk purchase of properties
Raw data files downloaded and searched
and data for number 90 is missing
Data IS present in
the title document.
Free Companies House data to boost UK economy - Press releases - GOV.UK
Companies House free data
Bulk data – all or nothing
Large daily files available as zipped files
No support provided – you’re on your own!
Companies House free data
Each file within the zipped file is a separate document. Note the uninformative
Variable Pitch http://www.variablepitch.co.uk/stations/1310/
Uses public electricity
Variable Pitch http://www.variablepitch.co.uk/stations/2580/
Virginia Station is the
Windsor Castle – no
FoI request generation data for Virginia Station
Per capita consumption of cheese (US) correlates with Number of
people who died by becoming tangled in their bedsheets