The four Es: Doing more with metadata

4,594 views
3,915 views

Published on

For CAARA Residential School, 10 November 2010

0 Comments
5 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
4,594
On SlideShare
0
From Embeds
0
Number of Embeds
83
Actions
Shares
0
Downloads
23
Comments
0
Likes
5
Embeds 0
No embeds

No notes for slide

The four Es: Doing more with metadata

  1. The four Es Doing more with metadata Tim Sherratt (@wragge)
  2. Archives know the value of metadata.
  3. A metadata fetish?
  4. Metadata is just data about data.
  5. We value it according to our needs.
  6. Once we get past the fetishistic allure, we can see...
  7. Metadata is everywhere.
  8. The four Es
  9. The four Es • Extraction • Enhancement • Extension • Experimentation
  10. Extraction
  11. Extraction Liberate the metadata trapped within existing processes and systems.
  12. Extraction • Where is it? • What is it? • How do I get it out?
  13. Extraction • Inside • Outside • Neither in nor out Where is it?
  14. Extraction – where is it? • Records • Descriptive systems • Research • Websites • Usage statistics Inside
  15. Extraction – where is it? • Research • Publications • Social media Outside
  16. Extraction – where is it? • Cloud services (eg Flickr) Neither in nor out
  17. Extraction • People • Places • Subjects • Dates • Structure What is it?
  18. Extraction • Text mining • Natural language processing • Web services • Crowdsourcing How do I get it out?
  19. Extraction – examples Old Weather Where? ● Ships’ logs
  20. Extraction – examples Old Weather What? ● Ship movements ● Weather observations
  21. Extraction – examples Old Weather How? ● Crowdsourcing
  22. Extraction – examples Mapping our Anzacs Where? Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie ● Collection database
  23. Extraction – examples Mapping our Anzacs What? Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie ● People
  24. Extraction – examples Mapping our Anzacs What? Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie ● Places
  25. Extraction – examples Mapping our Anzacs What? Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie ● Relationships
  26. Extraction – examples Mapping our Anzacs What? Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie ● Other
  27. Extraction – examples Mapping our Anzacs How? ● Text mining Corrigan James : SERN 5308 : POB Aberfeldie VIC : POE Melbourne VIC : NOK S Corrigan Maggie
  28. Extraction – examples Reference blog Where? ● Reference inquiries http://itech.dickinson.edu/archives/
  29. Extraction – examples Reference blog What? ● People ● Places ● Subjects ● Access points! http://itech.dickinson.edu/archives/
  30. Extraction – examples Reference blog How? ● Workflow app ● Blog/Drupal http://itech.dickinson.edu/archives/
  31. Extraction – examples Factsheet explorer Where? ● Website
  32. Extraction – examples Factsheet explorer What? ● Subjects ● Collection references http://discontents.com.au/shed/fs/fs_explorer.php
  33. Extraction – examples Factsheet explorer What? ● Subjects ● Collection references http://discontents.com.au/shed/fs/fs_explorer.php
  34. Extraction – examples Factsheet explorer How? ● Screen scraping ● ‘See also’ links http://discontents.com.au/shed/fs/fs_explorer.php
  35. Extraction – examples JSTOR Where? ● Footnotes
  36. Extraction – examples JSTOR What? ● Collection references
  37. Extraction – examples JSTOR What? ● People
  38. Extraction – examples JSTOR What? ● Dates
  39. Extraction – examples JSTOR What? ● Detailed description!
  40. Extraction – examples JSTOR How? ● Screen scraping ● XML from http://dfr.jstor.org/
  41. Extraction – examples Flickr context harvester Where? ● Flickr http://userscripts.org/scripts/show/56135
  42. Extraction – examples Flickr context harvester What? ● Comments ● Tags ● Links http://userscripts.org/scripts/show/56135
  43. Extraction – examples Flickr context harvester How? ● Flickr API ● Javascript or...? ● ‘See also’ links? http://userscripts.org/scripts/show/56135
  44. Extraction – examples Zotero Where? ● Research databases ● Zotero groups
  45. Extraction – examples Zotero What? ● Notes ● Tags ● Collections ● Gems and strays ● Annotations
  46. Extraction – examples Zotero How? ● Zotero everywhere ● Web API ● Integrate into apps
  47. Enhancement
  48. Enhancement Add structure, meaning, value or context.
  49. Enhancement Not just what you do, but also what you don’t do.
  50. Enhancement Following a name ● Entity extraction (eg Open Calais, AlchemyAPI) ‘I say emphatically that the climate has changed’, Henry Hodgson told the Argus in 1928. The experience of seventy-eight years brooked no denial, summers were milder, and thunderstorms were fewer. ‘It is no use telling me that weather bureau statistics do not bear this out’, he added defiantly. ‘You can do anything with statistics, but no statistics will convince me that the climate has not changed radically.’ Henry Hodgsonperson But then what?
  51. Enhancement Following a name ● Use once and throw away? http://mysite.com/search?q=Henry+Hodgson
  52. Enhancement Following a name ● Store as a subject? Subjects: thunderstorms weather memory Henry Hodgson
  53. Enhancement Following a name ● Store as a person? Subjects: thunderstorms weather memory People: Henry Hodgson
  54. Enhancement Following a name ● Add some structure? <people> <person> <firstname>Henry</firstname> <surname>Hodgson</surname> </person> </people>
  55. Enhancement Following a name ● What about the text? ‘I say emphatically that the climate has changed’, <span typeof=”foaf:person”>Henry Hodgson</span> told the Argus in 1928. The experience of seventy- eight years brooked no denial, summers were milder, and thunderstorms were fewer. ‘It is no use telling me that weather bureau statistics do not bear this out’, he added defiantly. ‘You can do anything with statistics, but no statistics will convince me that the climate has not changed radically.’
  56. Enhancement Following a name ● Disambiguation? People: Henry Hodgson (1889-1956) Henry H Hodgson (1902-1974)
  57. Enhancement Following a name ● Name authorities? <people> <person> <firstname>Henry</firstname> <surname>Hodgson</surname> <id>http://nla.gov.au/nla.party-590379</id> </person> </people>
  58. Enhancement The way you store and structure your metadata will affect possibilities for reuse.
  59. Enhancement Geocoding ● Putting places on a map Canberra, ACT, Australia -35.28346 / 149.12807
  60. Enhancement Geocoding services ● Google maps ● Yahoo Placemaker (includes entity extraction) ● GeoNames ● Geoscience Australia (under construction) ● and more...
  61. Enhancement NMA collection map http://labs.nma.gov.au/collection/map/ ● Two days work ● Used GeoNames ● 57% success (2142 places) ● Scotland is not a country
  62. Enhancement NLA photos map http://www.paulhagon.com/playground/nla/geo/ ● 35,000+ images located ● Used Yahoo Placemaker ● 80% success ● See Paul Hagon’s blog
  63. Enhancement Topic modelling ● Understanding what it all means ‘I say emphatically that the climate has changed’, Henry Hodgson told the Argus in 1928. The experience of seventy-eight years brooked no denial, summers were milder, and thunderstorms were fewer. ‘It is no use telling me that weather bureau statistics do not bear this out’, he added defiantly. ‘You can do anything with statistics, but no statistics will convince me that the climate has not changed radically.’ Weather forecasting
  64. Enhancement Topic modelling ● Web services (AlchemyAPI) ● MALLET (trainable)
  65. Enhancement Crowdsourcing ● Harnessing the wisdom of the crowd ● Seeking specialised knowledge ● Gathering additional context
  66. Enhancement Mapping our Anzacs ● Scrapbook ● Adding context to records ● More structure?
  67. Enhancement Archives Outside ● Gathering information ● Blog / Twitter / Flickr
  68. Extension
  69. Extension Push your metadata beyond its boundaries.
  70. Extension New contexts ● Visualisation ● Mashups
  71. Extension Visible Archive ● Seeing everything http://visiblearchive.blogspot.com/
  72. Extension History Wall ● Endless ● Ephemeral ● Serendipitous http://visiblearchive.blogspot.com/ http://labs.nma.gov.au/wall/
  73. Extension Making connections ● Record linkage ● Authority records
  74. Extension People Australia ● Disambiguation ● Aggregating identities ● Assigning identifiers http://nla.gov.au/nla.party-479364 me
  75. Extension People Australia ● Contribute! ● Use identifiers! ● See the wiki
  76. Extension Identity browser ● Bookmarklet enhanced ● Enriched with RDFa ● Machine tags http://wraggelabs.com/identities/
  77. Extension FMTC ● Crowdsource connections ● Semantic linkages ● Harvest metadata back http://wraggelabs.com/fmtc/
  78. Extension Setting it free ● Open data ● APIs ● Linked Data
  79. Extension Linked Open Data ● Become part of the semantic web ● Expose your metadata to the world ● Get started with good URLs and RDFa
  80. Extension Linked Open Data
  81. Experimentation
  82. Experimentation Build spaces to play, learn, create and fail.
  83. Experimentation Share ideas, examples, recipes, tools and code.
  84. Experimentation TNA Labs http://labs.nationalarchives.gov.uk/wordpress/
  85. Experimentation DigitalNZ http://www.digitalnz.org/
  86. Experimentation NMA Labs http://labs.nma.gov.au/
  87. Experimentation Don’t wait for permission. now
  88. Experimentation Do it. now
  89. Experimentation It’s easier than you think. now
  90. Homework ● Make good urls ● Use identifiers ● Fix citation standards ● Expose structures (RDFa) ● Use NLA party ids now
  91. Where to find me: @wragge words – discontents.com.au experiments – wraggelabs.com work – labs.nma.gov.au now

×