HARD CONTENT, FAB FRONT-END
Archiving websites of the Dutch Public Broadcasters
23-5-2014
Lotte Belice Baltussen | R&D, Ne...
Nederlands Instituut voor
Beeld en Geluid
Sound and Vision
• 70% of Dutch AV heritage
• > 850,000 hours
• 2M photos
•20,00...
“The Archive as a Laboratory”
Web archiving since 2008 (LiWA, several pilots) with various objectives
NTR PILOT
(2013-2014)
23-5-2014
WHY:
• Saving websites selected to be taken offline
• Getting insights in user requirement...
WEBSITES
23-5-2014
CRAWLING ISSUES
ACCESS ISSUES
USER REQUIREMENTS, PT. 1
Phase 1: Focus group
USER REQUIREMENTS SUMMARY
• Communication and information
e.g. “As a user, I can suggest a website that should be archived...
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
USER REQUIREMENTS, PT. 2
Phase 2: Usability tests
think-aloud, 60-90 minutes
x 2:
• 37, PostDoc web archive research proje...
LESSONS-LEARNED
UI/UX
+ Clean, visual look
- More functionality explanations
COMMUNICATION
+ FAQ contains good info about
...
FUTURE WORK WEB ARCHIVES:
CONTEXT COLLECTIONS
“Public broadcaster web archives will help you learn where you come from”
--...
Thanks!
@lottebelice | lbbaltussen@beeldengeluid.nl
@benglabs
Hard Content, Fab Front-end @ IIPC 2014
Hard Content, Fab Front-end @ IIPC 2014
Hard Content, Fab Front-end @ IIPC 2014
Hard Content, Fab Front-end @ IIPC 2014
Upcoming SlideShare
Loading in …5
×

Hard Content, Fab Front-end @ IIPC 2014

408 views
325 views

Published on

Presentation at the International Internet Preservation Consortium conference 2014 in Paris on the web archiving project the Netherlands Institute for Sound and Vision did together with Dutch public broadcaster NTR.

Published in: Technology, Design
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
408
On SlideShare
0
From Embeds
0
Number of Embeds
8
Actions
Shares
0
Downloads
2
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Hard Content, Fab Front-end @ IIPC 2014

  1. 1. HARD CONTENT, FAB FRONT-END Archiving websites of the Dutch Public Broadcasters 23-5-2014 Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision IIPC | 21 May 2014 | BnF, Paris
  2. 2. Nederlands Instituut voor Beeld en Geluid Sound and Vision • 70% of Dutch AV heritage • > 850,000 hours • 2M photos •20,000 objects • Large paper archives
  3. 3. “The Archive as a Laboratory” Web archiving since 2008 (LiWA, several pilots) with various objectives
  4. 4. NTR PILOT (2013-2014) 23-5-2014 WHY: • Saving websites selected to be taken offline • Getting insights in user requirements • Create great front and back-end • Provide public access • Shape future plans
  5. 5. WEBSITES 23-5-2014
  6. 6. CRAWLING ISSUES
  7. 7. ACCESS ISSUES
  8. 8. USER REQUIREMENTS, PT. 1 Phase 1: Focus group
  9. 9. USER REQUIREMENTS SUMMARY • Communication and information e.g. “As a user, I can suggest a website that should be archived” • Metadata e.g. “As a user, I can see the crawl date for each archived URL” • Searching e.g. “As a user, I can search full-text through a single archived website” • Visualisation e.g. “As a user, I can see side-by-side comparisons of the same URL that was archived at different moments in time”
  10. 10. FRONT-END AND BACK-END DEVELOPMENT
  11. 11. FRONT-END AND BACK-END DEVELOPMENT
  12. 12. FRONT-END AND BACK-END DEVELOPMENT
  13. 13. FRONT-END AND BACK-END DEVELOPMENT
  14. 14. FRONT-END AND BACK-END DEVELOPMENT
  15. 15. USER REQUIREMENTS, PT. 2 Phase 2: Usability tests think-aloud, 60-90 minutes x 2: • 37, PostDoc web archive research project • 58, Multimedia editor at a Dutch public broadcaster x 3: • 44, Crawl engineer • 50, Manager digital projects at a Dutch public broadcaster • 58, Freelance (archive) researcher & journalist
  16. 16. LESSONS-LEARNED UI/UX + Clean, visual look - More functionality explanations COMMUNICATION + FAQ contains good info about web archiving - Info about status + plans / More info about scope and size of web archive METADATA + Overview of outgoing links - TMI / Creation + last change of website SEARCHING + Fast! + Thumbnail previews - Search by URL - More filtering options - Relevance ranking VISUALISATION / More stats, e.g., % text - Highlight differences crawls USERS & USAGE + Current groups representative - No av-streaming big loss for all / Add more fine-grained subgroups
  17. 17. FUTURE WORK WEB ARCHIVES: CONTEXT COLLECTIONS “Public broadcaster web archives will help you learn where you come from” -- Usability test participant • We need to be more dynamic than the websites we archive • We can and must achieve public access • We are moving from pilot to standard practice • Connect crawls to catalogue • Increase public broadcaster cooperation
  18. 18. Thanks! @lottebelice | lbbaltussen@beeldengeluid.nl @benglabs

×