1. Diaries on the Web: A Practical Guide Joanne Riley Associate University Librarian Joseph P. Healey Library University of Massachusetts Boston [email_address] Mass. History Conference, June 7, 2010
2.
3. Scanned Page Images Elizabeth Cowperthwaite 1857 - 1858 Source: Special Collections, University of Pennsylvania Library
22. 8. Paste the transcribed text and add page image links into a blog post “ Pocket Diary Data Entry Template” file on Docs.Google.com Wordpress blog – Add New Post screen
23. 8. Paste the transcribed text and image links into a blog post – a shortcut! “ Pocket Diary Data Entry Template” file on Docs.Google.com Wordpress blog – Add New Post screen
Image only – just a picture of a page. Which can be useful for research, etc. But cannot search for particular words or phrases – simply a digital safe and widely accessible way of paging through the manuscript.
Not ideal to have only transcribed text, with no page image, but sometime necessary. Sometimes the originals have disappeared and you’re left with someone’s typescript transcription, or it may not be possible to digitize the images, or not yet. Harder too to put into a structured file like a spreadsheet (which I’m going to be recommending for many of these diaries) when you have layout like “written in opposite direction)
They photographed the transcription and page image side-by-side. Also full text of the transcripton for searching
No pages images, but has enormous depth and richness - BUT requires knowledge of relational databases, PHP, mySQL, a lot of attention to programming, harvesting from wikipedia and google maps – a tour-de-force, a real labor of love by a non-professional. This is also true of the Martha Ballard diary presentationon dohistory.org ================================ The site is built around Movable Type with some additional custom code. There are five Movable Type weblogs: The Diary All the entries of the diary are entries in a weblog with a little tweaking to work better with the diary's old dates. The "Also on this day" items are stored separately in a MySQL database and displayed with PHP . The Encyclopedia Every topic in the Encyclopedia is a separate entry in a weblog, organised into a large number of categories. Some entries include content from Wikipedia which is fetched and cached locally using PHP. Some entries include Google Maps which are displayed using Mapstraction . The latitude, longitude and zoom levels are stored in Movable Type using Custom Fields . The larger maps are also from Google, with additional PdMarker code, and the pages are generated using PHP to extract the location data from the Movable Type database. In-Depth Articles , Site News and The story so far These are all straightforward Movable Type weblogs. The Recent Activity page is an aggregation of the latest activity from all the Movable Type weblogs.
Short entries – 140 characters – daybook, pocket diaries.
Pros win. Of the cons, the date problem is the worst. But even so, a blog is an excellent way to PRESENT (not to store, or preserve!) an historic diary.
Criteria for this suggested process:
Good prospect is legally publishable and engaging – not necessarily wordy, but with evocative text. For me, my interest and experience is in short-entry diaries – daybooks, pocket diaries that can be treated as snippets, little windows into the past. Known diarist, unpublished, unregistered manuscript: Life of the author + 70 years Anonymous diarist, unpublished, unregistered manuscript: Creation date + 120 years FAIR USE Fair Use checklist http://www.copyright.com/Services/copyrightoncampus/basics/fairuse_list.html Link to Four Factors for Determining Fair Use is on the top of that page.
Can also make thumbnails, and name accordingly. Naming conventions should be logical, extensible and relate all versions of one image to each other. Master Files High resolution scans allow for multiple uses (print, zoom, etc.) Large file size Often stored on CDs, DVDs, external drives, etc. Maintain over time: refresh/migrate Working Files - For printing or detailed viewing on the web 300 DPI
Most people will choose to upload to the global network called the World Wide Web. You can upload to your own org’s server, or to a collaborating university or org, or to a public site. Remember this is NOT your repository for this stuff, just a storage place for the project.
Most people will choose to upload to the global network called the World Wide Web. You can upload to your own org’s server, or to a collaborating university or org, or to a public site. Remember this is NOT your repository for this stuff, just a storage place for the project.
Here’s where your Google account comes in handy
Master data file – like the master image file – if you do it right, you can use that file to repurposes and move to other formats. Storing diary data in a structured master file is: Flexible Re-purposable Stable Safe Alas, OCR won’t help with manuscripts Unless it’s an old typescript transcription Abbyy Finereader excellent OCR software
You can add the diary entries in any order you want because you can sort them by date! This is helpful for having workgroups collaborating on transcribing a diary
Categories can have unique names. Tags need to be known names. Categories don’t help search engines find information. Tags help search engines and tag directories catalog your site. Categories help visitors find related information on your site. Tags help visitors find related information on your site and on other sites.
As of June 2010 – may change!
There are many ways to approach this challenge of sharing short entry diaries online – I’ll be very interested to hear of your experiences and thoughts on all this. I guess I just want to leave you with the encouragement to put your data into a structured master format to save yourselves trouble and complications in the long run. Once you do that, many variations on format and style in presentation and functionality are possible. These are all different presentations of the very same diary data, entered once into a database.