Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Museum Data Exchange:  Making hay with harvestable data G ünter Waibel Program Officer OCLC Research 12 November 2009 MCN ...
Museum Data Exchange <ul><li>Harvard University Art Museums </li></ul><ul><li>Metropolitan Museum of Art </li></ul><ul><li...
Open Archives Initiative (OAI) Protocol for Metadata Harvesting Museums OCLC Research TMS COBOAT OAICatMuseum Research Agg...
Extracting data and harvesting Data extraction 6 out of 9 museums used COBOAT 3 used alternative means: XSLT, Custom, Vend...
Research Aggregation 9 Museums 887,572 Records
Analyzing the Research Aggregation <ul><li>My data </li></ul><ul><li>Are CDWA Lite required fields present? </li></ul><ul>...
Available at: http://hangingtogether.org/wp-content/uploads/2009/04/methodology_03-30-2009.pdf
Use of CDWA Lite 131 CDWA Lite units of information Museums 54% not used 8% used by all
Conformance to CDWA Lite Required/Highly recommended elements Records 90% consistency for 7 of 17 elements
Impact of COBOAT mapping <ul><li>based on museum mapping documents </li></ul><ul><li>uses 32 units of information of the 1...
Controlled Vocabularies match rates < 42%
Data Values correlated to records Top 100 <objectWorkType> All the Institution’s 100K+ Records AAT Match 27 No AAT  Match ...
Data Values correlated to records Top 100 <nameCreator> All the Institution’s 200K+ records ULAN Match 49 No ULAN  Match 5...
What now? <ul><li>“We must create a single information system which embraces all museum holdings in the United States” </l...
publishing your collections
reaching educators
reaching higher education
sharing infrastructure
projecting authority
national collaboration
Thank you! <ul><li>Thanks also to my colleagues and collaborators Bruce Washburn and Ralph LeVan. </li></ul><ul><li>Museum...
Upcoming SlideShare
Loading in …5
×

Museum Data Exchange

1,969 views

Published on

A presentation focusing on the data analysis OCLC Research performed on 900K museum records, plus next steps for the nine project museums who now have the capacity to share standards-based records.

Published in: Education, Technology
  • Be the first to comment

Museum Data Exchange

  1. 1. Museum Data Exchange: Making hay with harvestable data G ünter Waibel Program Officer OCLC Research 12 November 2009 MCN Portland
  2. 2. Museum Data Exchange <ul><li>Harvard University Art Museums </li></ul><ul><li>Metropolitan Museum of Art </li></ul><ul><li>National Gallery of Art </li></ul><ul><li>Princeton University Art Museum </li></ul><ul><li>Yale University Art Gallery </li></ul><ul><li>Cleveland Museum of Art </li></ul><ul><li>Victoria & Albert Museum (UK) </li></ul><ul><li>National Gallery of Canada (CA) </li></ul><ul><li>Minneapolis Institute of Art </li></ul><ul><li>funded by Andrew W. Mellon Foundation </li></ul>Create Tools Extract CDWA Lite XML Publish via OAI-PMH Harvest Data Test tools Create Research Aggregation Analyze Data Standards compliance? Interoperability? What now?
  3. 3. Open Archives Initiative (OAI) Protocol for Metadata Harvesting Museums OCLC Research TMS COBOAT OAICatMuseum Research Aggregation Museums Basic Interface
  4. 4. Extracting data and harvesting Data extraction 6 out of 9 museums used COBOAT 3 used alternative means: XSLT, Custom, Vendor support (MIMSY XG) Data harvest 6 out of 9 museums used OAI-PMH add’l 2 wanted to use OAI-PMH 3 used alternative means: SQL dumps, FTP Data validation 2 out of 9 harvests completely validated (Mostly) minor errors for the rest
  5. 5. Research Aggregation 9 Museums 887,572 Records
  6. 6. Analyzing the Research Aggregation <ul><li>My data </li></ul><ul><li>Are CDWA Lite required fields present? </li></ul><ul><li>What is the state of CCO compliance of the data? </li></ul><ul><li>Who uses which controlled vocabularies, and does their use demonstrably aid retrieval? </li></ul><ul><li>Suggest strategies to work around the inevitable inconsistencies in institutional data </li></ul><ul><li>Aggregate data </li></ul><ul><li>Which CDWA Lite fields are used by all institutions? </li></ul><ul><li>Do queries across the aggregate return meaningful results? </li></ul><ul><li>How is my cataloging different from other institution’s cataloging? </li></ul><ul><li>Suggest strategies to work around the inevitable inconsistencies in aggregate data </li></ul>
  7. 7. Available at: http://hangingtogether.org/wp-content/uploads/2009/04/methodology_03-30-2009.pdf
  8. 8. Use of CDWA Lite 131 CDWA Lite units of information Museums 54% not used 8% used by all
  9. 9. Conformance to CDWA Lite Required/Highly recommended elements Records 90% consistency for 7 of 17 elements
  10. 10. Impact of COBOAT mapping <ul><li>based on museum mapping documents </li></ul><ul><li>uses 32 units of information of the 131 CDWA Lite defined </li></ul><ul><li>all required/recommended elements and attributes are in the default mapping, except for <subjectTerm> </li></ul><ul><li>4 out of 6 COBOAT contributors used, in various combinations, 11 additional elements and attributes </li></ul>
  11. 11. Controlled Vocabularies match rates < 42%
  12. 12. Data Values correlated to records Top 100 <objectWorkType> All the Institution’s 100K+ Records AAT Match 27 No AAT Match 73 73% 99%
  13. 13. Data Values correlated to records Top 100 <nameCreator> All the Institution’s 200K+ records ULAN Match 49 No ULAN Match 51 25% 50%
  14. 14. What now? <ul><li>“We must create a single information system which embraces all museum holdings in the United States” </li></ul><ul><li>- MCN/IBM 1969 </li></ul>“ Universal access to cultural heritage will likely soon become a reality, but museums may be losing their role as key players.” - Nicholas Crofts 2008
  15. 15. publishing your collections
  16. 16. reaching educators
  17. 17. reaching higher education
  18. 18. sharing infrastructure
  19. 19. projecting authority
  20. 20. national collaboration
  21. 21. Thank you! <ul><li>Thanks also to my colleagues and collaborators Bruce Washburn and Ralph LeVan. </li></ul><ul><li>Museum Data Exchange </li></ul><ul><li>http://www.oclc.org/research/activities/museumdata/default.htm </li></ul><ul><li>- report forthcoming - </li></ul><ul><li>OAICatMuseum 1.0 </li></ul><ul><li>http://www.oclc.org/research/activities/oaicatmuseum/default.htm </li></ul><ul><li>COBOAT </li></ul><ul><li>http://www.oclc.org/research/activities/coboat/default.htm </li></ul><ul><li>G ünter Waibel </li></ul><ul><li>waibelg @oclc.org </li></ul><ul><li>twitter: guWa </li></ul><ul><li>blog: hangingtogether.org </li></ul>

×