Danis and Parsons, presentation given at the World Conference on Marine Biodiversity, Aberdeen, September 2011.
ANSTRACT: In this paper, we present SCAR’s Marine Biodiversity Information Network (SCAR-MarBIN, www.scarmarbin.be), introduce the new Antarctic Biodiversity Information Facility (ANTABIF, www.biodiversity.aq) and argue that it has become vital and practicable to support an international mechanism for the exchange of scientific data. This approach allows to integrate large data volumes, and helps modern biologists to face a “data deluge” using new techniques and technologies currently developed in the field of biodiversity informatics. Biodiversity is an example of data-intensive science, and certainly requires an interdisciplinary, scalable approach to address complex systemic problems such as environmental change and its impact on marine ecosystems. This paper discusses the experience of data scientists seeking to collect, curate, and provide data during the timeframe of the International Polar Year. The data content of the SCAR-MarBIN and ANTABIF holdings has been explored, and recent published analyses are used to illustrate concrete examples. We find that while technology is a critical factor to address this dimension, the greater challenges are more socio-cultural than technical. We describe a vision of discoverable, open, linked, useful, and safe data and suggest the need for a rapid socio-technical evolution in the overall science data ecosystem.
Biodiversity Information Networks: Dataflows for interdisciplinary sciences
1. Biodiversity Information
Networks
Dataflows for interdisciplinary sciences
Bruno Danis & Mark Parsons
Royal Belgian Institute of Natural Sciences
National Snow & Ice Data Center
www.scarmarbin.be
www.biodiversity.aq
2. Layout
• Background
• Our vision
• Achievements and products
• Uptake of GBIF tools
• The Future: Technological and Cultural challenges
4. Antarctic Treaty
(our inspiration)
« In order to promote international cooperation in scientific
investigation in Antarctica, as provided for in Article III (1c) of the
Treaty, the Contracting Parties agree that, to the greatest extent
feasible and practicable: […] Scientific observations and results
from Antarctica shall be exchanged and made freely available. »
5. Special things about Antarc8ca
• Antarc8c Treaty system (no government)
• The Scien8fic Community on Antarc8c Research
• Very ac8ve community
• Logis8cs
• Biodiversity
– Highly adapted
– Mostly marine
– Probably extremely sensi8ve
5
6. SCAR‐MarBIN & AntaBIF
• www.scarmarbin.be: marine biodiversity informa8on network
• www.biodiversity.aq: biodiversity informa8on facility
• Core funding: BELSPO.be (un8l June 2012)
• Interna8onal Polar Year 2007/08
• Census of Antarc8c Marine Life
• Ocean Biogeographic Informa8on System
• Global Biodiversity Informa8on Facility
7. General philosophy
• Build an electronic ecosystem
• Offer free and open access to data and technology
• Expose all the (biodiversity) data and metadata, in mul8ple contexts
• Remain community‐driven, and collabora8ve
• Adopt strong standardiza8on
• Work for science, conserva8on, management
7
11. Webportals
• Up since Oct 2005
• taxonomy, occurrence,
metadata
• vizualisation
• open access
• 909,915 visitors
• 8,093,774 hits
• >50,000,000 dld records
• ANTABIF proto up
11
12. Data products
• Antarc8c Field Guides
• Biogeography Atlas of the Southern
Ocean
• Community projects
• Incen8ves for publishing/reviewing
data
12
13. Antarc8c Field Guides
afg.biodiversity.aq
• Iden8fica8on aid
• Best available pictures
• Descrip8ons
• Publica8on/sharing plahorm for customized FGs
• Many contributors
• Dynamically built from various sources
13
20. Biodiversity Atlas of the SO
atlas.biodiversity.aq
• Paper and digital versions
• Publish thema8c maps
• Publish predic8ve maps
• Build an interac8ve plahorm
• Dynamic genera8on of the content
• Capacity building
• Fill in gaps
• 60 contributors from 36 ins8tu8ons in 16 countries
20
25. (near) Future
• Beker documenta8on
• Many data portals
• (much) Enhanced data flow
• The community feels responsibility
• Conserva8on/Management are factual
• Op8miza8on of Biodiversity research efforts/resources
25
26. Future Future
• Expor8ng the model to other disciplines sounds like a plan
– SCADM: Standing Commikee on Antarc8c Data Management
– SOOS: Southern Ocean Observing System
• SCAR/IASC: Going bipolar
• (much) Enhanced interoperability
• Integra8ve, connected science
26
27. The Polar Informa8on Commons
www.polarcommons.org
• Drama8c environmental changes
• Emergency solu8on
• Setup of a commons
– IT cloud
– Set of norms
• All polar data (IPY)
• Simple procedure:
– badge dataset(s) (describe, choose a CC Licence)
– throw it in the cloud
– done
27
So starting with “Open”. I like this definition from wikipedia\n
OK. Linked. Again Wikipedia provides a good definition. The point is that data are more relevant and useful if they are associated and explicitly linked with other data, especially when they are linked in a way that computers can readily interpret.\n\n
Useful. I don’t define useful, because it depends on your perspective.\n\n
So if we think of interoperability of the actual data not just the metadata, there are two general approaches: A geospatial or map-based approach and a semantic approach that relies on common, well-defined concepts and relationships. Semantic work is really still in its infancy. There has been some work in specific domains and IPY has an agreement on basic terms, but major challenges remain. There has been progress in the geospatial realm.\n\n