Processing and Delivering Email Archives in Special Collections Using MUSE Personal Digital Archiving 2012 Peter Chan, Digital Archivist
RESHAPING THE REPOSITORY: THECHALLENGE OF EMAIL ARCHIVING
Challenge• Copyrights / Privacy – No web delivery? Reading room only?• Sensitive information – SS#, student records, etc.• Description – No. of emails, recipients, folders, contents?• Delivering – Same as original or more?
Emails in Our Collections• Robert Creeley – 50,000+• Richard Fikes – 100,000• Stephen H. Schneider• John McCarthy• Mandelbrot• Arlene Blum• Harrison• ………………………………….
Access for Emails converted to XML• Build search index• Build access control• Merge with existing search and discovery portal• Big development effort.
Groups Over Time
Topic of the Month
Processing Emails - MUSE• Edit the pre-built lexicon – Screen for sensitive information (SS#, student records, etc.) and mark for restriction. – Group by known projects, conference, etc.• Standard MUSE functions – Group, topics, attachment wall, communication chart, browsing lens.
Delivering Emails - MUSE• Web – Summary information – Sentiment, group, topics, communication links – Browsing lens (preview only)• Reading room – Individual emails – Attachments – Browsing lens (link to emails)
Use MUSE - Gap• More sophisticated search – Pattern search (SS#, credit card#, etc.) – Full text, thesaurus, misspell, etc.• Original views (folders)• Delivery mode for metadata (package for download or server at Stanford).• Multiple lexicons• Foreign languages (long term)
Future• More people to collaborate in the development.• More people to use the software.• Funding • Contact me (firstname.lastname@example.org) if you are interested in any of the above.