Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Web Preservation in a Web 2.0 Environment


Published on

Slides for a talk on "Web Preservation in a Web 2.0 Environment" given by Brian Kelly, UKOLN at a Web site preservation workshop at the UKOLN IWMW 2008 event.

Published in: Education, Business

Web Preservation in a Web 2.0 Environment

  1. Web Preservation in a Web 2.0 Environment Brian Kelly UKOLN University of Bath Bath, UK Email [email_address] UKOLN is supported by: About This Talk Will use of Web 2.0 services lead to new preservation concerns? And how should we respond to these new challenges? This work is licensed under a Attribution-NonCommercial-ShareAlike 2.5 licence (but note caveat) Resources bookmarked using ‘ iwmw2008 ' tag
  2. Is Web 2.0 Different? <ul><li>How does Web site preservation differ for Web 2.0: </li></ul><ul><ul><li>Use of 3 rd party services </li></ul></ul><ul><ul><li>Emphasis on collaboration and communication, rather than access to resources </li></ul></ul><ul><ul><li>More complex IPR issues </li></ul></ul><ul><ul><li>Richer diversity of services </li></ul></ul><ul><li>Let’s look at: </li></ul><ul><ul><li>Case study 1 - wikis </li></ul></ul><ul><ul><li>Case study 2 – blogs </li></ul></ul><ul><ul><li>Case study 3 – reusing data </li></ul></ul><ul><ul><li>Case study 4 – comms tools (disposable data) </li></ul></ul><ul><ul><li>Case study 5 – recording events </li></ul></ul>Web 2.0
  3. Group Exercises <ul><li>Speed exercises: </li></ul><ul><ul><li>You will be given an example to consider. Give an initial response, in a single sentence (or word) in less than a minute! </li></ul></ul><ul><li>Small group exercise: </li></ul><ul><ul><li>In small groups chose an example of interest. </li></ul></ul><ul><ul><li>Give a more considered response to the preservation challenges </li></ul></ul>Web 2.0
  4. Case Study 1: A Public Wiki <ul><li>WetPaint wiki used to support various workshops </li></ul><ul><li>Approaches taken: </li></ul><ul><ul><li>Open access to all prior to & during event (to minimise barriers to creating content) </li></ul></ul><ul><ul><li>Access restricted to WetPaint users after event </li></ul></ul><ul><ul><li>Access later restricted to event organisers </li></ul></ul>Web 2.0 Many aspects of Web site curation are to do with implementing such best practices, rather than implementing technical solutions See JISC PoWR blog post
  5. Case Study 1: A Public Wiki <ul><li>WetPaint provides an option for backing up data. </li></ul><ul><li>A zipped file of the pages can be saved for storing on a locally managed service. </li></ul>Web 2.0 There are limitations in this particular service (poor quality HTML, internal links don’t work, …) But this does illustrate an approach which can be taken.
  6. Case Study 2: Blog Migration <ul><li>How might you migrate the contents of a blog (e.g. you’re leaving college)? </li></ul><ul><li>This question was raised by Casey Leaver, shortly before leaving Warwick University </li></ul>Web 2.0
  7. Case Study 2: Blog Migration <ul><li>She migrated her blog from blogs at Warwick Univ to Wordpress </li></ul>Web 2.0 Note, though, that not all data was transferred (e.g. title, but not contents) so there’s a need to check transfer mechanisms
  8. Case Study 2: Blog Migration <ul><li>A backup of UK Web Focus blog is available on Vox: </li></ul><ul><ul><li>Manual migration of new posts every few weeks </li></ul></ul><ul><ul><li>Only migrates text </li></ul></ul><ul><ul><li>Doesn’t migrate images, embedded videos, internal links, comments, … </li></ul></ul>Web 2.0 Migration of blogs, wikis, etc. is not currently an easy task  But advice is available
  9. Case Study 3: Reusing Data <ul><li>Blog post in Facebook. Possible concerns: </li></ul><ul><ul><li>It’s not sustainable </li></ul></ul><ul><ul><li>You’ve given ownership to Facebook </li></ul></ul>Web 2.0 <ul><li>Response: </li></ul><ul><ul><li>The post is managed in WordPress; Fb displays copy (to new audience) </li></ul></ul><ul><ul><li>Fb don’t claim ownership – they claim rights to make money (e.g. through ads) </li></ul></ul>
  10. Case Study 4: Disposable Data <ul><li>Twitter – example of a micro-blogging application </li></ul><ul><li>Facebook status messages is another related example </li></ul>Web 2.0 <ul><li>Issues: </li></ul><ul><ul><li>Will Twitter be sustain-able over a long period? </li></ul></ul><ul><ul><li>What will happen to the data? </li></ul></ul><ul><ul><li>What about the IPR for ‘tweets’? </li></ul></ul><ul><ul><li>What about institutional uses? </li></ul></ul>
  11. Case Study 4: Disposable Data <ul><li>Many twitterers regard their tweets as disposal </li></ul><ul><li>I tend to use Twitter as a ‘virtual water cooler’ – sharing gossip, jokes and occasional work-related information with (mainly) people I know </li></ul>Web 2.0 You could make use of clients which manage your tweets (e.g. treat like email) But you should develop your policies first, prior to exploring technologies
  12. Case Study 4: Disposable Data <ul><li>Skype (or your preferred VoIP application) are growing in popularity </li></ul>Web 2.0 <ul><li>Issues: </li></ul><ul><ul><li>Is the digital data (the call) preserved? </li></ul></ul><ul><ul><li>What about the video and the IM chats? </li></ul></ul><ul><li>Possible Responses: </li></ul><ul><ul><li>Am I bovvered? </li></ul></ul><ul><ul><li>I didn’t bother with analogue phones, why should I worry now? </li></ul></ul>
  13. Case Study 5: Digitized Talks <ul><li>Seminar on Open Science given at UKOLN in Feb 2008. </li></ul><ul><li>Video clip of opening 10 mins taken & uploaded to YouTube </li></ul><ul><li>Issues: </li></ul><ul><ul><li>Privacy </li></ul></ul><ul><ul><li>Quality </li></ul></ul><ul><ul><li>Benefits </li></ul></ul><ul><ul><li>Long term access </li></ul></ul>Benefits identified – now how do we seek to deploy recordings of seminars, conferences, etc. on a more systematic basis? This is work in progress – but see IWMW 2007 videos
  14. Case Study 6: Slideshare <ul><li>What happens to your slides if Slideshare disappears? </li></ul><ul><li>My approach: </li></ul><ul><ul><li>Master copy held on managed environment </li></ul></ul><ul><ul><li>Info on master on title slide and metadata </li></ul></ul><ul><ul><li>CC licence & download available – many copies </li></ul></ul>
  15. Case Study 7: Social Networks <ul><li>University of Wales, Newport and University of Bradford have set up Ning networks for supporting their students: </li></ul><ul><ul><li>Bradford: Closed (Bradford email address needed to access) </li></ul></ul><ul><ul><li>Newport: Open Intended for students about to arrive at institution </li></ul></ul>What does preservation mean in this context? Answers to this question will be left as an exercise for the participants 
  16. Role Of The Internet Archive <ul><li>Can we leave everything to the Internet Archive? </li></ul><ul><ul><li>Has role to play in Web 1.0 </li></ul></ul><ul><ul><li>Seems to archive some public blogs </li></ul></ul><ul><ul><li>May not access images or other embedded content </li></ul></ul><ul><ul><li>Still has limitations (cf. UCE/BCU) </li></ul></ul><ul><li>Can’t (currently) access Facebook pages, for example </li></ul>Web 2.0
  17. Role Of The Internet Archive <ul><li>The Open University has a presence in Facebook. </li></ul><ul><li>In June 2008: </li></ul><ul><ul><li>7,551 fans </li></ul></ul><ul><ul><li>983 wall posts </li></ul></ul><ul><ul><li>82 discussion topics </li></ul></ul><ul><li>Is anyone: </li></ul><ul><ul><li>Recording the history? </li></ul></ul><ul><ul><li>Curating the data </li></ul></ul><ul><ul><li>Managing possible risks? </li></ul></ul>Web 2.0
  18. The Research Challenges <ul><li>Some thoughts: </li></ul><ul><ul><li>Preservation of Web sites in known to be difficult </li></ul></ul><ul><ul><li>Additional difficulties in a Web 2.0 world </li></ul></ul><ul><ul><li>Complexities include technical challenges and business issues </li></ul></ul><ul><li>However: </li></ul><ul><ul><li>Is avoiding Web 2.0 a realistic answer? </li></ul></ul><ul><ul><li>There may be some simple processes which may help </li></ul></ul>Web 2.0
  19. Accessibility & Preservation <ul><li>The parallels: </li></ul><ul><ul><li>We can’t release xxx: it breaks accessibility guidelines; we’ll be sued </li></ul></ul><ul><ul><li>The xxx service provides benefits to many – we’ll see what reasonable adjustments are needed to enhance access </li></ul></ul><ul><li>Approaches needed: </li></ul><ul><ul><li>Clarification of the purpose of the service </li></ul></ul><ul><ul><li>Risk assessment of loss of service / record of service / record of look-and-feel </li></ul></ul><ul><ul><li>Documented policy </li></ul></ul>
  20. Questions