The BHL InfrastructureBHL Africa WorkshopPretoria National Botanical GardenPretoria, South AfricaJune 13th, 2012
A brief history…
New Partners and Geographies
Different models around the World
Global Replication & ServingReplicated Data Center Portal Application
Sharing
SharingBHL shares data through:APIsData ExportOpenURLOAI-PMH
SharingScan Requests –Strategy for handling and managing scan requestsDeduplication –Avoiding duplication of scanning.Feed...
Repatriation of Information
Tools• Scanlist• Monograph Deduper• Portal• Macaw• Gemini & Feedback form
`
Macawhttp://macawup01.up.ac.za
Viewing Activity
Viewing Activity
Loading Activity
Uploading images via browser
Uploading images via browser
Uploading images via browser
Reviewing Metadata
Reviewing Metadata
Uploading to the Archive• Need to get set up with an account atIA first• Account at IA needs access to thebiodiversity col...
Thank youWilliam UlateGlobal BHL CoordinatorBHL US/UK Technical DirectorSr. Project Manager, Missouri Botanical Gardenwill...
US/UKAustraliaBrazilChinaEgyptEuropeAfrica?
BHL-Europe• Different technologies• Mirroring data from US/UK• New content: BHL-Europe developedtheir own portal, which ag...
Vision and mission European biodiversity knowledge freely availableglobally to everyone.BHL-Europe mission statement Mob...
BHL-Europe organisationBHL-Europe server
Best Practice Guide
Architecture simplified
http://www.biodiversityexhibition.com/
Europeana
OCR - IMPACT Standard tools not suitable for heterogeneous content Page type separation helpful to improve performance(S...
What is BHL?Access to literature is particularly important totaxonomic researchers Source: Biodiversity Heritage Library f...
BHL-China• BHL-China staff visited SIL and MOBOT onearly November last year to discuss projectstatus and future collaborat...
BHL-Australia• Museum Victoria in Melbourne & otherLibraries later• Improved User Interface.• Visited MOBOT in November la...
BHL-Australia• Joined development of Macaw with BHLto accommodate needs of institutions.• Accessing content from Internet ...
BibliothecaAlexandrina• The Digital Assets Repository (DAR) is a systemdeveloped at the Bibliotheca Alexandrina, the Libra...
BHL-SciELO Brazil• Has been contributing content to BHLvia the citation & article repositoryCitebank• Has now installed tw...
BHL-SciELO Brazil• A network of ten libraries with some major content onbiodiversity will have 5 scanners moving to anothe...
BHL-SciELO Brazil• Initial plan: a technician at SciELO would receiveimages and a librarian would work out thebibliographi...
BHL-SciELO Brazil1. Biodiversity Journals Collection. Starting with SciELOBrazil and all methodology and program developme...
OutreachLife and Literature, Nov. 2011
Multilingual Outreach
Multilingual OutreachFor BHL-Europe Coordinator interview:• In English:– For those wondering whats Global BHL project abou...
The BHL Infrastructure
Upcoming SlideShare
Loading in...5
×

The BHL Infrastructure

140
-1

Published on

Presentation on the BHL Infrastructure, presented by William Ulate at the BHL-Africa launch and workshop, April 16, 2013. Pretoria, South Africa.

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
140
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Demo Account        macaw.joelrichard.com        User: demo Password: demohttp://macawup01.up.ac.zai macaw.mobot.org
  • Viewing Activity        Dashboard with some basic statistics (01-dashboard.png)        List of items that are in progress, filterable (02-list-of-items.png)
  • Viewing Activity        Dashboard with some basic statistics (01-dashboard.png)        List of items that are in progress, filterable (02-list-of-items.png)
  • Loading Items        Creating by hand (03-add-item.png)        Uploading via CSV file        Creating automatically through a custom import        All items still need MARC XML data
  • Uploading images via the browser (04-upload-pages.png)        Importing images to Macaw (05-import-pages.png)Importing of JPG and PDF files is supportedReviewing Metadata        Thumbnail view of the pages (06-enter-metadata.png)        Large version of the image has a magnifier        List view to see more metadata at once (07-metadata.png)        Standard Metadata is suitable for BHL use        No additional metadata modules are neededUploading to the Archive        Need to get set up with an account at IA first        Account at IA needs access to the biodiversity collection        Uploading of completed items is done via scheduled job or the command lineDemo Account        macaw.joelrichard.com        User: demo Password: demo
  • Uploading images via the browser (04-upload-pages.png)        Importing images to Macaw (05-import-pages.png)Importing of JPG and PDF files is supportedReviewing Metadata        Thumbnail view of the pages (06-enter-metadata.png)        Large version of the image has a magnifier        List view to see more metadata at once (07-metadata.png)        Standard Metadata is suitable for BHL use        No additional metadata modules are neededUploading to the Archive        Need to get set up with an account at IA first        Account at IA needs access to the biodiversity collection        Uploading of completed items is done via scheduled job or the command lineDemo Account        macaw.joelrichard.com        User: demo Password: demo
  • Uploading images via the browser (04-upload-pages.png)        Importing images to Macaw (05-import-pages.png)Importing of JPG and PDF files is supportedReviewing Metadata        Thumbnail view of the pages (06-enter-metadata.png)        Large version of the image has a magnifier        List view to see more metadata at once (07-metadata.png)        Standard Metadata is suitable for BHL use        No additional metadata modules are neededUploading to the Archive        Need to get set up with an account at IA first        Account at IA needs access to the biodiversity collection        Uploading of completed items is done via scheduled job or the command lineDemo Account        macaw.joelrichard.com        User: demo Password: demo
  • Reviewing Metadata        Thumbnail view of the pages (06-enter-metadata.png)        Large version of the image has a magnifier        List view to see more metadata at once (07-metadata.png)        Standard Metadata is suitable for BHL use        No additional metadata modules are neededUploading to the Archive        Need to get set up with an account at IA first        Account at IA needs access to the biodiversity collection        Uploading of completed items is done via scheduled job or the command lineDemo Account        macaw.joelrichard.com        User: demo Password: demo
  • Reviewing Metadata        Thumbnail view of the pages (06-enter-metadata.png)        Large version of the image has a magnifier        List view to see more metadata at once (07-metadata.png)        Standard Metadata is suitable for BHL use        No additional metadata modules are neededUploading to the Archive        Need to get set up with an account at IA first        Account at IA needs access to the biodiversity collection        Uploading of completed items is done via scheduled job or the command lineDemo Account        macaw.joelrichard.com        User: demo Password: demo
  • Uploading to the Archive        Need to get set up with an account at IA first        Account at IA needs access to the biodiversity collection        Uploading of completed items is done via scheduled job or the command lineDemo Account        macaw.joelrichard.com        User: demo Password: demo
  • Our Brazilian colleagues explained how a network of ten libraries with some major content on biodiversity will have 5 scanners moving to another location after they have scanned their collections. Some of them, would even get 2 scanners, because of their collection sizes. One of these scanners will be held at the SciELO facilities to test and document the experience on the digitization process that is developed and improve the digitization process. There will be 5 people, one working in with each scanner hired by the BHL-SciELO project and they will work 8 hours per day, with a technician at SciELO to receive the images and a librarian to work out the bibliographic metadata and pagination. Jose Eugenio Grillo will be supervising the whole operation. The librarians within the participant Units will be supporting the process.
  • Our Brazilian colleagues explained how a network of ten libraries with some major content on biodiversity will have 5 scanners moving to another location after they have scanned their collections. Some of them, would even get 2 scanners, because of their collection sizes. One of these scanners will be held at the SciELO facilities to test and document the experience on the digitization process that is developed and improve the digitization process. There will be 5 people, one working in with each scanner hired by the BHL-SciELO project and they will work 8 hours per day, with a technician at SciELO to receive the images and a librarian to work out the bibliographic metadata and pagination. Jose Eugenio Grillo will be supervising the whole operation. The librarians within the participant Units will be supporting the process.
  • Our Brazilian colleagues explained how a network of ten libraries with some major content on biodiversity will have 5 scanners moving to another location after they have scanned their collections. Some of them, would even get 2 scanners, because of their collection sizes. One of these scanners will be held at the SciELO facilities to test and document the experience on the digitization process that is developed and improve the digitization process. There will be 5 people, one working in with each scanner hired by the BHL-SciELO project and they will work 8 hours per day, with a technician at SciELO to receive the images and a librarian to work out the bibliographic metadata and pagination. Jose Eugenio Grillo will be supervising the whole operation. The librarians within the participant Units will be supporting the process.
  • The BHL Infrastructure

    1. 1. The BHL InfrastructureBHL Africa WorkshopPretoria National Botanical GardenPretoria, South AfricaJune 13th, 2012
    2. 2. A brief history…
    3. 3. New Partners and Geographies
    4. 4. Different models around the World
    5. 5. Global Replication & ServingReplicated Data Center Portal Application
    6. 6. Sharing
    7. 7. SharingBHL shares data through:APIsData ExportOpenURLOAI-PMH
    8. 8. SharingScan Requests –Strategy for handling and managing scan requestsDeduplication –Avoiding duplication of scanning.Feedback –How to coordinate feedback (issues) between subregions?
    9. 9. Repatriation of Information
    10. 10. Tools• Scanlist• Monograph Deduper• Portal• Macaw• Gemini & Feedback form
    11. 11. `
    12. 12. Macawhttp://macawup01.up.ac.za
    13. 13. Viewing Activity
    14. 14. Viewing Activity
    15. 15. Loading Activity
    16. 16. Uploading images via browser
    17. 17. Uploading images via browser
    18. 18. Uploading images via browser
    19. 19. Reviewing Metadata
    20. 20. Reviewing Metadata
    21. 21. Uploading to the Archive• Need to get set up with an account atIA first• Account at IA needs access to thebiodiversity collection• Uploading of completed items is done viascheduled job or the command line
    22. 22. Thank youWilliam UlateGlobal BHL CoordinatorBHL US/UK Technical DirectorSr. Project Manager, Missouri Botanical Gardenwilliam.ulate@mobot.orgSkype: william_ulate_rCredits:Martin Kalfatovic, Chris Freeland, BHL-Europe,BHL-Australia and so many other BHL Colleagueswhose valuable contributions make BHL what it is!
    23. 23. US/UKAustraliaBrazilChinaEgyptEuropeAfrica?
    24. 24. BHL-Europe• Different technologies• Mirroring data from US/UK• New content: BHL-Europe developedtheir own portal, which aggregatescontent from multiple European librarymembers
    25. 25. Vision and mission European biodiversity knowledge freely availableglobally to everyone.BHL-Europe mission statement Mobilising and preserving digital Europeanbiodiversity heritage literature and facilitating the openaccess to this literature through a multilingual community portal, the Global Reference Index to Biodiversity, the Biodiversity Library Exhibition, Europeana.BHL-Europe vision statement
    26. 26. BHL-Europe organisationBHL-Europe server
    27. 27. Best Practice Guide
    28. 28. Architecture simplified
    29. 29. http://www.biodiversityexhibition.com/
    30. 30. Europeana
    31. 31. OCR - IMPACT Standard tools not suitable for heterogeneous content Page type separation helpful to improve performance(SCAPE) High quality scanning operations and good QA isimportant Language information important to be in the metadata Font type information important to be in the metadata Tesseract 3.0x competitive OCR tool Crowdsourcing options still to be investigated
    32. 32. What is BHL?Access to literature is particularly important totaxonomic researchers Source: Biodiversity Heritage Library for Europe, http://www.youtube.com/watch?v=bJUMH9z91UQ
    33. 33. BHL-China• BHL-China staff visited SIL and MOBOT onearly November last year to discuss projectstatus and future collaborations anddevelopment• Continued digitization of Chinese materialsand has now 900 booksavailable in InternetArchive - 5,000 pending• Copied content sentfrom Cluster in WoodsHole.
    34. 34. BHL-Australia• Museum Victoria in Melbourne & otherLibraries later• Improved User Interface.• Visited MOBOT in November last year• Share Metadata DB between systems
    35. 35. BHL-Australia• Joined development of Macaw with BHLto accommodate needs of institutions.• Accessing content from Internet Archive• Started to digitize and upload contentto Internet Archive.• Joint portal development with US• Will keep using a team of volunteers to dothe scan.
    36. 36. BibliothecaAlexandrina• The Digital Assets Repository (DAR) is a systemdeveloped at the Bibliotheca Alexandrina, the Libraryof Alexandria, to create and maintain the Librarysdigital collections.• Bibliotheca Alexandrina has copied content from disksand downloaded the rest (serving 15,938 today). Willdownload the rest from Internet Archive.• Currently determining what books they cancontribute to BHL through Internet Archive.• A team has started to setup a BHL portal (Arabic).• Promoted synchronization and tools in Global BHLTechnical Meeting last week in Berlin.• http://edition.cnn.com/video/#/video/bestoftv/2012/06/06/inside-middle-east-alexandria-library.cnn
    37. 37. BHL-SciELO Brazil• Has been contributing content to BHLvia the citation & article repositoryCitebank• Has now installed two digitization outof 5 stations• Visited US to familiarize with tools anddeterminebestpractices.
    38. 38. BHL-SciELO Brazil• A network of ten libraries with some major content onbiodiversity will have 5 scanners moving to anotherlocation after they have scanned their collections.• Some of them, would even get 2 scanners, because oftheir collection sizes. One held at SciELO facilities.• 5 people, one with each scanner hired byBHL-SciELO project will work 8 hours per day.
    39. 39. BHL-SciELO Brazil• Initial plan: a technician at SciELO would receiveimages and a librarian would work out thebibliographic metadata and pagination.• There will be a supervisor the whole operation. Thelibrarians within the participant Units will besupporting the process.• Equipment owned by FAP UNIFESP from theUniversity of Sao Paulo.• FAPESP, the Foundation for Research Support in SãoPaulo provides some of the funding to FAP UNIFESP atthe Federal University of São Paulo, to execute the BHL-SciELO project.
    40. 40. BHL-SciELO Brazil1. Biodiversity Journals Collection. Starting with SciELOBrazil and all methodology and program development is doneconsidering the applicability and migration to other countries.2. Article Repository development. Initially populated with thecontents developed by the BIOTA project, another projectfinanced by FAPESP.3. Biodiversity Thesaurus development. There are alreadyseveral thesaurus available that will support the BiodiversityThesaurus development. One of the subproducts related tothis point is the creation of a list of species, starting withsnakes.4. Identification and markup of relevant scientific terms within thecontent.
    41. 41. OutreachLife and Literature, Nov. 2011
    42. 42. Multilingual Outreach
    43. 43. Multilingual OutreachFor BHL-Europe Coordinator interview:• In English:– For those wondering whats Global BHL project about, check thisinterview to our colleague, Dr. Henning Scholz, BHL-EuropeCoordinator! http://www.bhl-europe.eu/webfm_send/113• In Spanish:– Para quienes se preguntan ¿de qué se trata el proyecto BHL Global?¡Vean esta entrevista a nuestro colega, el Dr. Henning Scholz,Coordinador de BHL-Europa! http://www.bhl-europe.eu/webfm_send/113• In French:– Pour ceux qui se demandent ce que cest le projet "BHL Globale",consultez linterview de notre collègue, le Doc. Henning Scholz,coordinateur de BHL-Europe! http://www.bhl-europe.eu/webfm_send/113• In Portuguese:– Pra quem esta se perguntando o que é o projeto BHL Global? Olhemesta entrevista ao nosso colega, o Dr. Henning Scholz, CoordinadorBHL Europa http://www.bhl-europe.eu/webfm_send/113
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×