Web-Harvesting: concepts, issues, and prospects

  • 740 views
Uploaded on

Paper presented at PAARL seminar (Villa Escudero, San Pablo City, 27 October 2004) by Vivian del Castillo-Sy

Paper presented at PAARL seminar (Villa Escudero, San Pablo City, 27 October 2004) by Vivian del Castillo-Sy

More in: Technology , Design
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
740
On Slideshare
0
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
0
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Web-Harvesting
  • 2. Web-Harvesting
    • Concept
    • Issues
    • Prospects
  • 3. Web-Harvesting
    • Concept
    • Issues
    • Prospects
  • 4. Concept Web resource Web resource Web resource Web resource Web resource Web resource Web resource Web resource Web resource
  • 5. Reference to Web Resource
    • Harnad, Stevan (2004). “The Self-Archiving Initiative.” http://www.ecs.soton.ac.uk/ ~harnad/Tp/Nature4.htm . Accessed last 15 September 2004.
  • 6. The Web
  • 7. The Web
  • 8. The Web Harvester
  • 9. 3 Major Activities
    • Imaging
    • Digitization
    • Storage
    • Retrieval
    • Web Archiving
    • Migration
    • Storage
    • Retrieval
    • Storage
    • Migration
    • Retrieval
  • 10. Storage
  • 11. Migration
  • 12. Retrieval
  • 13. Developments in Web Archiving
    • Internet Archive
    • NEDLIB
    • Nordic Web Archive
    • Amiga Realm Internet Archive
    • WebArchivist.org
    • September 11 Web Archive
    • Eprints.org
  • 14. Web-Harvesting Concept
    • WWW - publishing venue
    • Web resources – non-permanent
    • Web harvester - to store, migrate, retrieve web resources
  • 15. Web-Harvesting
    • Concept
    • Issues
    • Prospects
  • 16. Web-Harvesting
    • Concept
    • Issues
    • Prospects
  • 17. Issues – Storage
    • Legal justification
    • Non-permanency of materials
      • Daily changes
      • Checksum
    • No consistency in citations
    • Refinement of criteria
  • 18. Issues – Storage
    • Several systems providing information
      • Continued development
      • Inaccessibility of data in databases
    • Several information formats
    • Overload of information
    • Sufficient storage space
  • 19. Issues – Migration
    • Developments in information formats
    • Developments in hardware, operating systems, and software
  • 20. Issues – Retrieval
    • Need for registries
    • Completeness of metadata
    • Commercial vendor or not?
    • Legal or illegal?
  • 21. Web-Harvesting
    • Concept
    • Issues
    • Prospects
  • 22. Web-Harvesting
    • Concept
    • Issues
    • Prospects
  • 23. Prospects
    • Future harvesters will be more powerful
      • Overflow, duplication
    • Current options:
      • Self-archiving by universities
      • Self-archiving by authors
      • Burning of cited web pages
      • Printing of cited web pages
  • 24. Have a nice day!