EURISCO demo installations of IPT, at GBIF EU Nodes meeting in Alicante (11 March 2010)


Regional GBIF NODES meeting of Europe in March 2010. Presentation of current activities from the NordGen NODE. Implementations of the GBIF IPT toolkit for genebanks in Europe. Upgrade for selected genebanks from the BioCASE publishing toolkit to the IPT. First step of a scheduled larger implementation planned to start in 2011 as part of the EuroGeneBank application pending EU funding decision. NordGen IPT EURISCO

Published in: Technology
  • Contract overview
  • Darwin core
  • Technology overview – IPT role
  • Java: DarwinCore Archive (DwC-A): (DBMS):
  • TAPIR (2004 ->); (International Crop Information System): http://www.icis.cgiar.org; BioMOBY (2001); (2002, not active) http://digir.sourceforge.net; BioCASE (2003, PyWrapper v2) http://www.biocase.org; EURISCO (2003, tab delimited text) http://eurisco.ecpgr.org; PyWrapper 3 (2006, not active) http://trac.pywrapper.org; TapirLink (2007); Provider Toolkit (2009)
  • Transcript

    • 1. European GBIF Nodes Meeting 2010, March 10th-12th Alicante, Spain
      Dag Endresen, Nordic Genetic Resources Center, NordGen
      GBIF IPT installations for EURISCO
      GBIF Tools and Darwin Core extension for germplasm
      Cartoon by Sasha Kopf (Creative Commons)
    • 2. Topics for this session
      • GBIF IPT installation for EURISCO
      • Overview of the project
      • Darwin Core extension for germplasm
      • GBIF informatics tools
      • Integrated Publishing Toolkit (IPT)
      • IPT installations for EURISCO
      • Possible PGR network model
    • Darwin Core extension for Germplasm (presented at TDWG 2009)
      Opened up for use of new GBIF technology in gene banking world
      Proposal to implement GBIF technology as a test in the European gene banking community
    • 9. From the contract between NordGen and GBIF:
      “... a feasibility study aimed at demonstrating the practical implementation of the GBIF decentralised architecture strategy and in particular in the context of the EURISCO Network.”
      “... focused on the adoption of the IPT by selected gene banks in Europe, the publishing of richer content using the Darwin Core germplasm extension and the indexing of these published resources by the EURISCO platform.”
      “... implemented in the context of EURISCO and therefore in close collaboration with the EURISCO Coordinator.”
    • 10. GBIF Informatics Suite
      • GBIF tools to empower decentralized thematic or regional networks
      • Darwin Core extension for germplasm makes these tools usable for crop gene banks.
    • Darwin Core
      The purpose of DwC terms is to facilitate data sharing
      • a well-defined standard core vocabulary
      • a flexible framework to maximize re-usability
      The Darwin Core can be extended by adding new terms to share additional information.
      TDWG standard 2009
      “The Darwin Core is primarily based on taxa, their occurrence in nature as documented by observations, specimens, and samples, and related information.”
    • 13. DwC star schema model
    • 14. DwC extension for Germplasm
      DwC Germplasm : DRAFT 0.1 : August 26, 2009
      • “MCPD in Darwin Core”
      • Maintained by gene banks worldwide
      • Additional terms to describe germplasm samples
      • Includes the new terms for crop trait experiments developed as part of the European EPGRIS3 project
      • Includes a few additional terms for new international crop treaty regulations
    • 19. DwC Germplasm (1)
    • 20. DwC Germplasm (2)
    • 21. DwC Germplasm (3)
    • 22. DwC Germplasm (4)
    • 23. DwC Germplasm (5)
    • 24. DwC Germplasm (6)
      Perhaps add new terms to facilitate the reporting of germplasm distribution for the ITPGRFA (International Treaty on Plant Genetic Resources for Food and Agriculture)
      The Millennium Seed Bank (Kew) has contributed feedback to the DwC-G modeling and proposed to include a number of seed management descriptors.
      • Seed processing terms
      • Seed cleaning
      • Seed germination testing
      Suggested by ENSCONET - threat status for populations in situ
    • 27. Mapping of DwC-G terms to the MCPD descriptors
    • 28. Mapping of DwC-G terms to the MCPD descriptors (continued)
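A mapping of this kind can be pictured as a lookup table from MCPD descriptor names to Darwin Core term names. The sketch below is only an illustration: the five pairs are assumptions chosen for the example and are not the official DwC-G mapping from the slides, the function name `mcpd_to_dwc` is hypothetical, and the sample record is made up.

```python
# Illustrative lookup from MCPD descriptor names to Darwin Core term names.
# These pairs are assumptions for this sketch, not the official DwC-G mapping.
MCPD_TO_DWC = {
    "INSTCODE": "institutionCode",
    "ACCENUMB": "catalogNumber",
    "COLLNUMB": "recordNumber",
    "GENUS": "genus",
    "SPECIES": "specificEpithet",
}


def mcpd_to_dwc(record):
    """Rename known MCPD keys to DwC term names.

    Returns a pair (mapped, unmapped). Descriptors without a DwC
    counterpart in the table are returned separately, as candidates
    for an extension such as the germplasm extension.
    """
    mapped, unmapped = {}, {}
    for key, value in record.items():
        if key in MCPD_TO_DWC:
            mapped[MCPD_TO_DWC[key]] = value
        else:
            unmapped[key] = value
    return mapped, unmapped


if __name__ == "__main__":
    # Made-up passport record, for illustration only.
    sample = {
        "INSTCODE": "XXX000",
        "ACCENUMB": "ACC 1",
        "GENUS": "Hordeum",
        "SPECIES": "vulgare",
        "STORAGE": "13",
    }
    core_terms, extension_candidates = mcpd_to_dwc(sample)
    print(core_terms)
    print(extension_candidates)
```

Keeping the unmapped descriptors in a second dictionary mirrors the idea of the extension: what the core vocabulary cannot express is carried by additional terms.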
    • 29. MCPD -> ABCD 2.06 (2004)
      National Inventory Code
      Institute Code
      Accession Number
      Collecting Number
      Collecting Institute Code
      Species Authority
      „Subtaxa“ Authority
      Common Crop Name
      Accession Name
      Acquisition Date
      Country of Origin
      Location of Collection Site
      Latitude of CS
      Longitude of CS
      Elevation of CS
      Collecting Date of Sample
      Breeding Institute Code
      Biological Status of Accession
      Ancestral Data
      Collecting/Acquisition Source
      Donor Institute Code
      Donor Accession Number
      Other Identification (Number) associated with the accession
      Location of Safety Duplicates
      Type of Germplasm Storage
      Decoded Collecting Institute
      Decoded Breeding Institute
      Decoded Donor Institute
      Decoded Safety Duplication Location
      Accession URL
      Helmut Knüpffer
      IPK Gatersleben
      Descriptors marked red did not match the earlier versions of ABCD
       ABCD was extended by a PGR section [W. Berendsohn, H. Knüpffer]
      Walter Berendsohn
    • 30. Home:
      Primary developers: Markus Döring, Tim Robertson, John Wieczorek
      Source code: Java
      Released: 2009
      DEMO at
      Genebank Example at
    • 31. Integrated Publishing Toolkit (IPT)
      A tool in support of data publishers.
      A simple and straightforward mechanism to share primary biodiversity data following the Darwin Core standard.
      Open source, Java based web application.
      Provides a local tool for data quality assessment.
    • 32. GBIF Integrated Publishing Toolkit (IPT)
      • Java 1.5 or higher is required
      • Apache Tomcat is recommended (1 GB RAM+)
      • GBIF IPT is provided as a WAR archive (for easy deployment)
      • GeoServer is included for web mapping (OGC-compliant, WFS, WMS, etc.)
      • H2 Embedded Java Database (with JDBC interface and web console)
      • Hibernate (object relational mapping)
    • IPT Interfaces
      • REST XML
      • TAPIR
      • DwC Archive
      • OGC (WFS, WMS, Web Mapping)
      • EML (Ecological Metadata Language)
    • Darwin Core Archive (DwC-A)
      • DwC-A publishes DwC records, including extensions
      • Simple text-based format
      • Zipped single-file archive
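The three points above can be sketched in a few lines of Python: a core text file, one extension text file linked to it by the core record id (the star schema), and a `meta.xml` descriptor, all zipped into one file. This is a minimal sketch under stated assumptions, not the IPT's own output. The germplasm `rowType` and the `storageType` term URI are placeholders under `example.org`, the file names and helper functions are hypothetical, and the sample rows are made up.

```python
import csv
import io
import zipfile

# Descriptor for one core file and one extension file (star schema).
# The germplasm rowType and term URI are placeholders, not published identifiers.
META_XML = """<?xml version="1.0" encoding="UTF-8"?>
<archive xmlns="http://rs.tdwg.org/dwc/text/">
  <core rowType="http://rs.tdwg.org/dwc/terms/Occurrence"
        fieldsTerminatedBy="\\t" linesTerminatedBy="\\n" ignoreHeaderLines="1">
    <files><location>occurrence.txt</location></files>
    <id index="0"/>
    <field index="1" term="http://rs.tdwg.org/dwc/terms/institutionCode"/>
    <field index="2" term="http://rs.tdwg.org/dwc/terms/catalogNumber"/>
    <field index="3" term="http://rs.tdwg.org/dwc/terms/scientificName"/>
  </core>
  <extension rowType="http://example.org/germplasm/GermplasmSample"
        fieldsTerminatedBy="\\t" linesTerminatedBy="\\n" ignoreHeaderLines="1">
    <files><location>germplasm.txt</location></files>
    <coreid index="0"/>
    <field index="1" term="http://example.org/germplasm/storageType"/>
  </extension>
</archive>
"""


def tsv(rows):
    """Serialise rows (lists of strings) as tab-delimited text."""
    buf = io.StringIO()
    csv.writer(buf, delimiter="\t", lineterminator="\n").writerows(rows)
    return buf.getvalue()


def parse_tsv(text):
    """Parse tab-delimited text and drop the header line."""
    return list(csv.reader(io.StringIO(text), delimiter="\t"))[1:]


def build_archive(core_rows, ext_rows):
    """Return the bytes of a zip holding meta.xml, the core file and the extension file."""
    out = io.BytesIO()
    with zipfile.ZipFile(out, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("meta.xml", META_XML)
        zf.writestr("occurrence.txt", tsv(core_rows))
        zf.writestr("germplasm.txt", tsv(ext_rows))
    return out.getvalue()


def read_joined(archive_bytes):
    """Pair each core row with the extension rows that share its id."""
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as zf:
        core = parse_tsv(zf.read("occurrence.txt").decode("utf-8"))
        ext = parse_tsv(zf.read("germplasm.txt").decode("utf-8"))
    by_id = {}
    for row in ext:
        by_id.setdefault(row[0], []).append(row[1:])
    return [(row, by_id.get(row[0], [])) for row in core]


if __name__ == "__main__":
    # Made-up rows, for illustration only.
    core = [["id", "institutionCode", "catalogNumber", "scientificName"],
            ["1", "XXX000", "ACC 1", "Hordeum vulgare"]]
    ext = [["coreid", "storageType"], ["1", "13"]]
    for core_row, ext_rows in read_joined(build_archive(core, ext)):
        print(core_row, ext_rows)
```

Because the extension rows carry only the core id plus their own terms, a consumer that does not know the germplasm extension can still read the core file on its own.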
    • 44. IPT service from NordGen at
      • TAPIR (2004 ->)
      • DiGIR (PHP, 2001-2006)
      • TapirLink (PHP, 2007 ->)
      • BioCASE (Python, 2001-2008)
      • PyWrapper3 (2006-2008)
      • EURISCO (tab-delimited, 2003)
      • ICIS (Java, 1996 ->)
      • BioMOBY (Perl, 2001 ->)
      • Embeds its own database
      • Multilingual
      • Has a user management feature based on roles, which allows for multiple data managers to share a common instance
      • Manages multiple data sources
      • Several upload options: relational database management systems or data files
      • Public web interface allows for data browsing and full text search
      • Customised detail pages
    • GBIF IPT
      GBIF IPT implements the Darwin Core Standard; and provides an interface to easily build extensions to the core Darwin Core terms.
      The draft germplasm extension is one example of how to extend the Darwin Core terms for the GBIF IPT.
    • 53. The IPT user interface includes the germplasm extension
    • 54. XML interface includes the germplasm extension
    • 55. The Harvesting and Indexing Toolkit (HIT)
      Addresses the need of node managers to aggregate indexes of published primary biodiversity data.
      Aims to ease the complexity of heterogeneous networks of data publishers, by shielding the end-user from the complexities of the different protocols.
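The idea of shielding the end-user from protocol differences can be sketched as one adapter per protocol behind a common interface. This is not the HIT's actual design or API. Every class and function name below is hypothetical, the adapters are stubs that return canned records instead of making network calls, and the endpoints are `example.org` placeholders.

```python
from abc import ABC, abstractmethod


class ProtocolAdapter(ABC):
    """Common interface: every adapter returns records with the same keys."""

    @abstractmethod
    def fetch(self, endpoint):
        """Return a list of dicts with keys source, protocol and catalogNumber."""


class StubTapirAdapter(ProtocolAdapter):
    def fetch(self, endpoint):
        # A real adapter would issue TAPIR requests here; this stub returns a canned record.
        return [{"source": endpoint, "protocol": "TAPIR", "catalogNumber": "ACC 1"}]


class StubArchiveAdapter(ProtocolAdapter):
    def fetch(self, endpoint):
        # A real adapter would download and parse a Darwin Core Archive; this is a stub.
        return [{"source": endpoint, "protocol": "DwC-A", "catalogNumber": "ACC 2"}]


def harvest(resources, adapters):
    """Build one flat index from (protocol, endpoint) pairs.

    The caller never touches protocol details: it only picks the
    adapter registered under the resource's protocol name.
    """
    index = []
    for protocol, endpoint in resources:
        index.extend(adapters[protocol].fetch(endpoint))
    return index


if __name__ == "__main__":
    adapters = {"TAPIR": StubTapirAdapter(), "DwC-A": StubArchiveAdapter()}
    resources = [
        ("TAPIR", "http://example.org/tapir"),
        ("DwC-A", "http://example.org/archive.zip"),
    ]
    for record in harvest(resources, adapters):
        print(record)
```

Supporting another protocol then means adding one adapter class, while the code that consumes the index stays unchanged.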
    • 56. Biodiversity Resources Discovery System (GBRDS)
      A Yellow Page reference of Biodiversity resources.
      The IPT and HIT instances installed in the course of this project will be registered in the GBRDS.
      Any biodiversity organisation should be able to register their resources and services into the GBRDS and contribute to the discovery services.
    • 57. Objectives of the European genebank project
      • Evaluate the GBIF decentralized architecture
      • Upgrade of the Integrated Publishing Toolkit (IPT) with the genebank extension and develop associated documentation.
      • Install and test the IPT installation in various genebanks in Europe that, as far as possible, are also EURISCO/ECPGR partners.
      • Test the registration of IPT installation through the GBIF Global Biodiversity Resources Discovery System (GBRDS).
      • Test the Harvesting and Indexing Toolkit (HIT) installation for the EURISCO platform.
      • Install an IPT instance on the EURISCO platform and synchronize with GBIF central Index.
      • Project runs until 20 December 2010.
    • IPT deployment in Europe
      • NordGen in Sweden covering 5 countries (Denmark, Sweden, Finland, Norway and Iceland)
      • EURISCO / Bioversity-HQ (Italy)
      • Bioversity-Montpellier (France)
      • IPK Gatersleben (Germany)
      • WUR CGN (The Netherlands)
      • CRI (Czech Republic)
      • VIR (Russia)
      • Balkan countries (Albania, Bosnia, Croatia, Macedonia, Serbia, Romania)
      • Baltic countries (Estonia, Latvia, Lithuania)
    • Possible PGR Network model
      • The gene bank dataset is shared from the holding gene bank.
      • The National Inventory (NI) endorses all national gene banks (and eventually individual accessions) for EURISCO.
      • ECPGR Crop databases can access passport data from EURISCO and additional crop specific data from the genebank IPT interface.
      • Standard data sharing tools ensure that the genebank dataset is available to other relevant decentralized thematic, regional or global networks.
    • Potential of GBIF technology
      Using GBIF technology (and contributing to its development), the PGR community can easily establish specific PGR networks without duplicating GBIF's work.
      The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community (TDWG, GBIF).
    • 75. Special thanks to:
      • GBIF, Global Biodiversity Information Facility
      • TDWG, Biodiversity Information Standards
      • BioCASE, The Biological Collection Access Service for Europe.
      • Bioversity International
      Things can happen in a band, or any type of collaboration, that would not otherwise happen. (Jim Coleman, Musician)