Greenstone Digital Library

3,893 views
3,728 views

Published on

Greenstone Digital Library

Published in: Education
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
3,893
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
181
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Greenstone Digital Library

  1. 1. Greenstone Digital Library Software: An Overview Imran Mansuri Project Assistant (Library Science) INFLIBNET Centre 7 March 2011 Prepared by Imran Mansuri 1
  2. 2. Agenda• Introduction : Digital Library Software (DL)• Greenstone Digital Library Software (GSDL)• Introduction• History• Versions• Features• Unique Features• Technology used• Example Sites• Example Collections 7 March 2011 Prepared by Imran Mansuri 2
  3. 3. Digital Library Software• The term “Digital Library” refers to a library in which collections are stored in digital formats (as opposed to print, microform, or other media) and accessible by computers• The digital content may be stored locally or accessed remotely via computer networks• Access the books, images are in digital format• Using Net access to information from anywhere 7 March 2011 Prepared by Imran Mansuri 3
  4. 4. Digital Libraries : Features Dynamic Electronic Information Systems Increase Portability Efficiency of Access Flexibility Availability 7 March 2011 Prepared by Imran Mansuri 4
  5. 5. Digital Library Software Dspace Fedora Eprints Resource Space Greenstone 7 March 2011 Prepared by Imran Mansuri 5
  6. 6. Greenstone Digital Library Software• The Greenstone Digital Library Software (GSDL) provides a way of building and distributing digital library collections, opening up new possibilities for organizing information and making it available over the Internet or on CD-ROM• Developed by the New Zealand Digital Library Project (www.nzdl.org) at the University of Waikato• Distributed in co-operation with UNESCO and Humanities Library Project, Romania 7 March 2011 Prepared by Imran Mansuri 6
  7. 7. GSDL : Some Facts• Current version: 2.82 and 3.03 Available from http://www.greenstone.org• Software suite for building, maintaining, and distributing digital library collections• Comprehensive, open-source• Distribution and promotion partners:o UNESCOo Human Info NGO, Belgium 7 March 2011 Prepared by Imran Mansuri 7
  8. 8. GSDL : History 1995 - Digital library of Computer Science Technical Reports. Its established by New Zealand Digital Library 1997 - Decision to use the GPL (General Public License ); name : Greenstone adopted ; Work with Human Info NGO to produce humanitarian CD-ROMs 1998 Apr - First CD-ROM collection released: Humanity Development Library 1998 Aug - Greenstone.org website established 1999 BBC - Collection established 7 March 2011 Prepared by Imran Mansuri 8
  9. 9.  2000 Apr - Greenstone mailing list started Aug - Formally established cooperative effort with UNESCO and Human Info NGO Nov - Distribute software on SourceForge 2002 Apr - Development of Greenstone3 Mar - Official opening of the Niupepa collection, development of the Greenstone Librarian Interface Jun - First UNESCO Greenstone CD-ROM 7 March 2011 Prepared by Imran Mansuri 9
  10. 10. 2003 - A Java development that became known as the Greenstone Librarian Interface2005 Nov - Initial release of Greenstone32006 Apr - Greenstone Support Group for South Asia launched 7 March 2011 Prepared by Imran Mansuri 10
  11. 11. GSDL : Version 2000 Feb - gsdl 2.12 Apr - gsdl 2.21 Dec - gsdl 2.30 2001 Feb – gsdl 2.31 2002 Jan – gsdl 2.38 2003 Jun - gsdl 2.40 2004 Feb – gsdl 2.50 2005 Apr – gsdl 2.60 and in November - gsdl 3.00 2006 Mar – gsdl 2.70 2007 Apr – gsdl 2.80 2008 gsdl 3.03 Current release gsdl 2.82 7 March 2011 Prepared by Imran Mansuri 11
  12. 12. GSDL : Features Multi S/W Platform Multi Lingual Support Structured Metadata in XML using DC Metadata Extraction Plug-ins for Documents Full-text mirroring Text Level Penetration Concurrent & Dynamic Content Development Uniform Presentation 7 March 2011 Prepared by Imran Mansuri 12
  13. 13. Collection Building• Web and command line mode• Input collections:• GSDL server (files)• Remote (FTP - files, HTTP – website pages)• Collection input: batch mode, NOT interactive• Document formats: HTML, PDF, Text, Word• (Doc, RTF), PS, e-mail, bibliographic 7 March 2011 Prepared by Imran Mansuri 13
  14. 14. • Support for full text tagging for hierarchical document browsing• Automatic text extraction and indexing ‘Plugins’ for different document formats (HTMLPlug, PDFPlug, etc.) May fail for some documents! XML representation – conversion to HTML for Display Native document format – storage and display (via browser plugins, helper applications)• Data compression support 7 March 2011 Prepared by Imran Mansuri 14
  15. 15. • MetadataAutomatic extraction of simple metadata (e.g. Title, date)Explicit metadata via ‘Classifiers’ Hierarchical (e.g. Subject) List (e.g. Organization, Author)Used for browsing and field-based searchingMulti-language support via Unicode 7 March 2011 Prepared by Imran Mansuri 15
  16. 16. Collection Browse and Search• Full text search• Metadata (field) search and browse• Boolean• Ranked• Multi-language support for browse/ search interface• Search history, search term• highlighting… 7 March 2011 Prepared by Imran Mansuri 16
  17. 17. Collection Presentation• Search results formatting Format strings in the configuration file• Home page customization Using macros 7 March 2011 Prepared by Imran Mansuri 17
  18. 18. GSDL : Features Easy Installation Easy Maintenance Hierarchy Structure Interface Customization – Front Page Design, Header for the Digital Library, Collection Icon, Cover Images Collection Configuration (Collect.cfg) File Scalability, Flexibility 7 March 2011 Prepared by Imran Mansuri 18
  19. 19. Collection Distribution• Web• CD-ROM Publish created collections to the CD-ROM Windows only Two possibilities:o Install GSDL software to HDD and access content on CDo Run GSDL search engine out of the CD! 7 March 2011 Prepared by Imran Mansuri 19
  20. 20. GSDL : Unique Features Incremental Collection Building Content Development in 3 different ways Good Documentation and Active Mailing List Variety of Plug-ins for different document Types Publishing on CD-ROMs Data Compression 7 March 2011 Prepared by Imran Mansuri 20
  21. 21. GSDL : Technology Used• Technology used in the current version– Java 1.6 (Higher)– Image Magic– Application Server : Apache 2.2– GSDL_Linux 2.82 and Win 7 March 2011 Prepared by Imran Mansuri 21
  22. 22. GSDL : Example SitesIndia: Archives of Indian Labour 7 March 2011 Prepared by Imran Mansuri 22
  23. 23. United States: New York Botanical Garden 7 March 2011 Prepared by Imran Mansuri 23
  24. 24. International: Global Library Services Network 7 March 2011 Prepared by Imran Mansuri 24
  25. 25. 7 March 2011 Prepared by Imran Mansuri 25
  26. 26. 7 March 2011 Prepared by Imran Mansuri 26
  27. 27. 7 March 2011 Prepared by Imran Mansuri 27
  28. 28. Some ObservationsStrengths: Configurability: content extraction for indexing, presentation layout, metadata for browsing and field- based searching (little difficult though!) Extensibility: Plugins for content extraction, Unicode for multilanguage support, source code availability Fulltext search on variety of document formats XML, Unicode, Dublin Core support Data compression CD-ROM publishing 7 March 2011 Prepared by Imran Mansuri 28
  29. 29. Limitations: Interactive content updating and management not possible No duplicate identification Metadata handling appears to be little complex Linux version seems to be more robust than Windows Hangs while processing some documents during collection building – no way to gracefully handle this 7 March 2011 Prepared by Imran Mansuri 29
  30. 30. Current Status Strong development work – CS department at University of Waikato, NZ Z39.50 experimental interface now available Promoted by UNESCO Beginning to be used worldwide Can be expected to reach CDS/ISIS like popularity (particularly in developing countries) 7 March 2011 Prepared by Imran Mansuri 30
  31. 31. Documentation and Help• Available at: http://www.greenstone.org – Software – Demo collections – FAQ – Tutorial materials• Documentation: Installer’s Guide, User’s Guide, Developer’sGuide, and other reading materials 7 March 2011 Prepared by Imran Mansuri 31
  32. 32. • Mailing lists: – Greenstone Users List – Greenstone Developers List• Greenstone Documentation Wiki http://wiki.greenstone.org/wiki/index.php/Gr eenstoneWiki 7 March 2011 Prepared by Imran Mansuri 32
  33. 33. 7 March 2011 Prepared by Imran Mansuri 33

×