• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Chavan Finland 13082009
 

Chavan Finland 13082009

on

  • 889 views

Public Interest Lecture given at Helsinki on 13th August 2009.

Public Interest Lecture given at Helsinki on 13th August 2009.

Statistics

Views

Total Views
889
Views on SlideShare
887
Embed Views
2

Actions

Likes
0
Downloads
16
Comments
0

2 Embeds 2

https://www.mturk.com 1
http://www.slideshare.net 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • If one analyze the trends of investment in biodiversity informatics activities in the North, major funding has been attracted by large, projects rather than small scale data management. In fact, R&D activities in the area of biodiversity informatics themselves are focused to address the needs of large data publishers. Small data publishers who has ability to contribute much required droplets to oceans of biodiversity data are the neglected mass!
  • Long ta80% of science funding is disbursed for small projects. NSF 2007 awards - 9347 ($2,137,636,716) BIG – 1869 ($1,199,088,125), SMALL – 7478 ($938,548,595). Computational and standards thinking needs to free the long tail or liberate the dark data.
  • Add a NODES component to this SLIDE as GBIF historical focus has collections as a core

Chavan Finland 13082009 Chavan Finland 13082009 Presentation Transcript

  • GLOBALBIODIVERSITY
    Helsinki, 13 August 2009
    INFORMATIONFACILITY
    Discovery and Mobilisation of Primary Biodiversity Data: Challenges and Potentials
    Vishwas Chavan
    GBIF Secretariat
    WWW.GBIF.ORG
    Building the Biodiversity Informatics Commons
  • Primary Biodiversity Data: Definition
    Primary Biodiversity Data are the digital text or multimedia data records that detail the instance of an organism – the ‘what, where, when, how and by whom’ of the organism’s occurrence and recording
  • Conservation and sustainable use of biodiversity
    Policy development and decision making (at local, national, regional, and global levels)
    Biodiversity Data
    Monitoring of status and trends of biodiversity
    (sound science)
    Significance of Biodiversity Data
  • Uses of primary biodiversity data
    What can you do with georeferenced biodiversity data?
    • Predict effects of climate change
    • Analyse and predict spread of pests and diseases of humans, crops, livestock, wildlife, etc.
    • Predict best places to set up new protected areas
    • Analyse invasive species and predict invasion pathways
    • Provide policymaker-relevant data of all kinds
    • Be a resource for biodiversity science communities
  • Key Millennium Assessment Solutions
    GBIF is a multi-country response to the need to mobilise information in order to facilitate improved decision-making...
    ”data without borders”
  • It is all about……
    Data
    Data
    Data
    Data
    Data
    Data
    Content
    Data
    Data
    Data
    Data
    Data
    Content
    Content
    Content
    Content
    Data
    Data
    Content
    Data
    Data
    Data
    Content
    Data
    Content
    Content
    Data
    Content
    Content
    Data
    Data
    Content
    Content
    Data
    Data
    Data
    Data
    Content
    Data
    Data
    Content
    Data
    Content
    Data
    Data
    Data
    Data
    Data
    Content
    Data
    Data
    Data
  • What is needed?
    • Data Digitisation, Management and Archiving
    • Data exchange / sharing
    • Digital Data Publishing
    • Free and Open Access
    • Data without barriers
  • Biodiversity and information about it are unevenly distributed…..
    biodiversity hotspot
    holder of large amounts of biodiversity data
  • Few more facts…..
    Investment in biodiversity information management is towards large projects
    Research in biodiversity informatics is focused towards large data publishers
    Small Data Publishers – A neglected mass!
  • Small are BIG!
    • Long tail or Dark Data is economically and ecologically very critical
    • Most of existing and future data would be hold by Small Data Publishers
    • 80% of current investment is towards Small Data Publishers
    • Total Awards: 9347
    • Big Awards: 1869
    • SMALL Awards: 7478
    • Source: Curating the Dark Data in the Long tail of science by P. Bryan Heidorn
  • Characteristics of Small Data Publishers
    • Heterogeneous
    • Distributed and isolated
    • Manually generated
    • Individual creation
    • Not maintained for reuse by others
    • Obscured or protected
    • Uneven distribution as well unequal access
    • It is highly “Unorganised” data sector.......
  • GBIF is a Govt-sponsored science research infrastructure initiative...
    • responsive to government needs/demands in biodiversity management – govt-initiated, and govt funded;
    • developing an advanced informatics infrastructure for the global scientific (+ national research institutions) community for accessing and sharing data and thereby making better use of the huge investments in data collection, storage, management and updating of biodiversity information in countries.
    • Only started in 2002....
  • GBIF’s growing global network
    Currently,
    50 countries,
    40 international organisations
  • GBIF’s multilateral mandate
    Catalyse building a global informatics research infrastructure by:
    - promoting global participation, working through and linking up a global network of partners
    - Enabling publishing of biodiversity data
    - promoting development of data exchange standards
    - building an informatics architecture
    - capacity building
    - catalysing development of analytical tools
    data provider / aggregator
  • The GBIF Data Portal
    • Seamless integration of data from many sources
    • Searches
    • Taxonomic
    • Geographic, by
    • country,
    • region, or
    • bounding-box
    • By dataset
    • or a combination of these
    • Taxonomic browse navigation using choice of classification
    • Web services
    http://data.gbif.org/
  • Georef’d data via GBIF portal
    (>181m records mapped to a 1 X 1 degree grid)
  • In summary…GBIF’s Informatics
    Improved access
    to Names, Metadata
    and Primary Biodiversity
    Data
    Distributed GBIF
    informatics architecture
    Faster and easier
    publishing of data
  • At the core, a Discovery System
    DiscoverySystem
    ServicePublishers
    Others…
    Registering
    Discovering
    DataPublishers
    Consumers
    Searching
    Retrieving
  • That links to resources…
    Who?
    Institutions, Collections …
    What?
    Data, Services, GUID/LSID…
    Where?
    Location, Access points…
    When?
    Temporal Scope…
    How
    Formats, protocols, qualities
    …./
    A distributed service
    …………..
    which resolves to information resources
  • GBRDS: empowering discovery
  • Key Components: the IPT
    Registration (GBRDS) +
    Publishing of Names, Metadata,
    Primary biodiversity data etc…
    IPT
    Data Publisher
    The Integrated Publishing Toolkit isa state-of-the-art tool to simplify the
    mobilization of biodiversity information resources such as Names, Metadata andprimary biodiversity data
  • Simple process!
    The Integrated Publishing Toolkit (IPT) is designed to simplify the mapping, indexing and harvesting of Names, Metadata and Primary Biodiversity Data!
  • Building a comprehensive global architecture for nomenclature
  • Examples of resources provided by GBIF
    all are free
  • Data mobilised by the USA, including other countries of origin
  • GBIF Data portal and data access/sharing/ repatriation
  • Users of GBIF data
    Scientists, experts
    Government officials at all levels
    Education at all levels
    NGOs and the general public
    These needs are highly varied, but can be met by open access to the same datasets
    The same data can be analysed differently for different uses
  • Integrating Biodiversity Data…
    World Database on Protected Areas
    Primary Biodiversity Data
    >60,000 protected areas
    >181 M specimenand observation data
    Species geo-referenced data.
    taxonomic resolution (intelligence)
    Spain
    Madagascar
    Visualisation on WDPA site
    Each Protected Area polygon processed….
    Data processing
    from the two databases
  • Moving towards…
    global integration
    ?
    Migratory Spp.
    ThreatenedSpp.; Red List Spp.
    Invasives, crop wild relatives,
    medicinals, etc.
  • Mapping Disease Vectors
    • Aedes albopictus “Asian Tiger Mosquito”
    • Invader - fastest spreading mosquito in the world
    • Aggressive daytime biter and pest
    • Known to transmit Dengue, La Crosse, St. Louis, Eastern Equine, Ross River, Rift Valley, and West Nile Viruses
  • Aedes albopictus – native range
    Modeled native range in Asia from specimens & observations (ENM)
  • Aedes albopictus:potential USA invasion
    Projected Asian niche onto present USA to create invasion risk-map.
  • Aedesalbopictus: actual USA invasion (by county)
  • Aedes albopictus: world risk-map (present)
  • Using GBIF data in CC models
    Sterculiaceae (Meliaceae, Dipterocapaceae)
  • Summary of CC impacts
    • All families and genera suffer habitat loss;
    • Some do gain habitat potential, but this requires migration and suitable habitat (for forests) at destination;
    • Important implications for REDD and other CC adaptation/mitigation programmes
  • Why should I publish data?
    What is there for me?
    Chavan, June 2009
  • Why should I publish data?
    Recognition
    • Opportunities
    What is there for me?
    • Investment
  • Elements of
    Data Publishing Framework
  • Data Publishing Framework
    • Bring in cultural change towards ‘free and open access' to biodiversity data
    • Addresses social, technical, and policy concerns
    • Answer ‘What is there for me?’ needs of ALL
  • Data Publication together with scholarly publication: ZooKeys experience
    Occurrence Data
    KML file
  • Data Publishing = Scholarly Publishing ?
  • Conservation and sustainable use of biodiversity
    Policy development and decision making (at local, national, regional, and global levels)
    Primary Biodiversity Data
    Monitoring of status and trends of biodiversity
    (sound science)
  • www.gbif.org
    Thank you!
    Vishwas Chavan: vchavan@gbif.org
  • How to contact GBIF:
    Web site:www.gbif.org
    Data portal:data.gbif.org
    GBIF Secretariat
    Universitetsparken 152100 CopenhagenDenmark
    E-mail:info@gbif.org
    Phone: +45 3532 1470
    Fax: +45 3532 1480
    GBIF Secretariat building, supported by a grant from the Aage V. Jensens Fonde