or
“Diamonds in the rough”



Richard M. Davis, ULCC
 Maureen Pennock, BL
or
“Set a blog to catch a blog”



 Richard M. Davis, ULCC
  Maureen Pennock, BL
or
“Open-source Blog Archive Management Application”



             Richard M. Davis, ULCC
              Maureen Pennock,...
Blogs
•   Simple diary/journal/log format

•   Readers can (usually) comment

•   Quick and easy to set up

•   Available ...
Web 2.0 recap
Web 1.0                      Web 2.0
Britannica Online            Wikipedia

personal websites            bl...
Value of blogs

•   Primary source, first-hand, personal account
    (diary, log, journal)

•   Research, discussion and ou...
Blogs are evolving and being used for many valuable
activities (here we highlight scholarship). Some bloggers
spend hours ...
New genres of publications are becoming increasingly
important to participants. For example, blogs are cited as a
good win...
In my opinion, if Peter Suber’s focus were the peer–
reviewed article rather than blogging, we would be waiting
longer for...
Blogs, scholarly or otherwise
“But the web is being archived isn’t it?”
JISC: Preservation of Web
       Resources (PoWR)
•   Web preservation awareness and activities in HE
    institutions - v...
Institutional web archiving:
             a wish-list
•   Application:
    •   Quick and easy to install and maintain
    ...
Institutional blog archives

•   Part of the institutional record

•   Cumulative, automated

•   Selection - explicitly s...
“Use the feeds...”




http
    ://w
        ww. CC:B
           flick      Y
                r.co -NC-S
                  ...
Blogs as data
Blogs as data
I rarely see any blog’s design, since I read through
NetNewsWire, so I’m inclined to think blogs represent
an area where t...
ArchivePress: aims

•   JISC-funded, 6 months (June-November 2009)

•   Enable easy creation of institutional (or thematic...
ArchivePress: approach

•   Demonstrator/testbed using existing WordPress
    features and 3rd party plugins

•   Test on ...
ArchivePress: challenges

•   Non-invasive

•   Boot-strapping

•   Comments

•   Embedded content

•   Versions

•   Meta...
AP1 Demonstrator
ArchivePress: possibilities

•   Thematic collections (e.g. local history projects)

•   Other newsfeed-oriented content (...
References
•   Marshall, C. (2008). From writing and analysis to the repository: taking the scholars' perspective on
    s...
http://archivepress.ulcc.ac.uk/
Upcoming SlideShare
Loading in …5
×

ArchivePress Presentation (BL 21/7/2009)

894 views

Published on

Presentation on ArchivePress project for JISC/DPC/UKWAC workshop: "Missing Links: The Enduring Web" at British Library, 21st July 2009

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
894
On SlideShare
0
From Embeds
0
Number of Embeds
30
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

ArchivePress Presentation (BL 21/7/2009)

  1. 1. or “Diamonds in the rough” Richard M. Davis, ULCC Maureen Pennock, BL
  2. 2. or “Set a blog to catch a blog” Richard M. Davis, ULCC Maureen Pennock, BL
  3. 3. or “Open-source Blog Archive Management Application” Richard M. Davis, ULCC Maureen Pennock, BL
  4. 4. Blogs • Simple diary/journal/log format • Readers can (usually) comment • Quick and easy to set up • Available from many public hosts as free or freemium services (Blogger, WordPress, LiveJournal, Typepad...) • Personal and institutional hosted installations • Different platforms, all support newsfeeds (Atom, RSS)
  5. 5. Web 2.0 recap Web 1.0 Web 2.0 Britannica Online Wikipedia personal websites blogging screen scraping web services publishing participation content management systems wikis directories (taxonomy) tagging ("folksonomy") Tim O'Reilly (2006)
  6. 6. Value of blogs • Primary source, first-hand, personal account (diary, log, journal) • Research, discussion and outputs (notebook, lab- book) • E-learning, reflection and discussion (notes, essays, exercise books) • Project activities - centralised, distributed (reports)
  7. 7. Blogs are evolving and being used for many valuable activities (here we highlight scholarship). Some bloggers spend hours or more on a post. Bill Hooker has an incredible set of statistics about the cost of Open Access and Toll Access publications, page charges, etc. Normally that would get published in a journal no-one reads [...] So I tend to work out my half-baked ideas in public. Some people do their early science in the Open. Some are activists. Some review the current landscape, etc. Peter Murray Rust (2009)
  8. 8. New genres of publications are becoming increasingly important to participants. For example, blogs are cited as a good window into what expert practitioners are doing. This material is not duplicated in traditional sources, yet it is important to consult: “This guy has a fantastic blog. He's actually a software architect at Microsoft… and he writes about a lot of issues in data centers...” Catherine Marshall (2008)
  9. 9. In my opinion, if Peter Suber’s focus were the peer– reviewed article rather than blogging, we would be waiting longer for less knowledge about open access. It is hard to keep up with Open Access News; however, without this blog, it would be even harder ... Heather Morrison (2007)
  10. 10. Blogs, scholarly or otherwise
  11. 11. “But the web is being archived isn’t it?”
  12. 12. JISC: Preservation of Web Resources (PoWR) • Web preservation awareness and activities in HE institutions - variable • Institutional web archiving - perceived barriers: • Cost of implementation • Complexity of available tools • Blog archiving • Backup and migration • Individual’s responsibility http://jiscpowr.jiscinvolve.org/
  13. 13. Institutional web archiving: a wish-list • Application: • Quick and easy to install and maintain • Free, open-source, open-standards • Widely used, supported, documented, understood • Reliable • Content: • Easy to ingest, manage, access, preserve
  14. 14. Institutional blog archives • Part of the institutional record • Cumulative, automated • Selection - explicitly seeded with blogs of note • Support authenticity and fixity through automation • Persistent - within institutional domain • Citable - through persistent URI for posts • Agnostic about location or platform used by original blog
  15. 15. “Use the feeds...” http ://w ww. CC:B flick Y r.co -NC-S m/c A hris top/ 134 550 576 /
  16. 16. Blogs as data
  17. 17. Blogs as data
  18. 18. I rarely see any blog’s design, since I read through NetNewsWire, so I’m inclined to think blogs represent an area where the content is primary and design secondary. Chris Rusbridge (2009)
  19. 19. ArchivePress: aims • JISC-funded, 6 months (June-November 2009) • Enable easy creation of institutional (or thematic) blog post archives by using a WordPress database and installation to • monitor and gather feed content automatically • store gathered posts and comments • provide access (search, author, date, keyword, etc.) • be the focus of ongoing preservation activity
  20. 20. ArchivePress: approach • Demonstrator/testbed using existing WordPress features and 3rd party plugins • Test on existing corpora of institutional blogs (DCC, Lincoln University, UKOLN): different platforms, complexity and issues • Analyse results (metadata, content, sig. props, usability) • Develop methodology and new plugins as necessary
  21. 21. ArchivePress: challenges • Non-invasive • Boot-strapping • Comments • Embedded content • Versions • Metadata • Scalability
  22. 22. AP1 Demonstrator
  23. 23. ArchivePress: possibilities • Thematic collections (e.g. local history projects) • Other newsfeed-oriented content (e.g. Twitter) • Islands of linked blog archives • Incorporation of semantic metadata, linking, microformats • Text-bases for text-mining • E-anthologies of blogs, posts, discussions • Integration with finding aids and discovery tools
  24. 24. References • Marshall, C. (2008). From writing and analysis to the repository: taking the scholars' perspective on scholarly archiving. In: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries, June 16-20, 2008, Pittsburgh PA, USA [doi>10.1145/1378889.1378930]. • Morrison, H. (2007). Rethinking collections — Libraries and librarians in an open age: A theoretical view. First Monday, Volume 12 Number 10 - 1 October 2007. Available from: http:// firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/viewArticle/1965/1841 [Accessed 20th July 2009]. • Murray Rust, P. (2009) Effective digital preservation is (almost) impossible; so Disseminate instead. Peter MR’s Blog, Unilvever Cambridge Centre For Molecular Informatics, University of Cambridge. Available from: http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2159 [Accessed 20th July 2009]. • O'Reilly, T. (2005). What Is Web 2.0? Available from: http://www.oreillynet.com/pub/a/oreilly/ tim/news/2005/09/30/what-is-web-20.html [Accessed: December 18th, 2006]. • Rusbridge, C. (2009) Comment on Preservation for scholarly blogs. Gavin Baker: A Journal of Insignificant Inquiry. Available from: http://www.gavinbaker.com/2009/03/30/preservation- for-scholarly-blogs/ [Accessed 20th July 2009].
  25. 25. http://archivepress.ulcc.ac.uk/

×