Vince Smith
Delivering biodiversity
knowledge in the
information age
Hellenic Botanical Society
Thessaloniki, Greece, 3-6 Oct. 2013
Overview
1. Background – biodiversity data diversity
• An introduction to me (lice to data infrastructures)
• The problem (integrating biodiversity research)
2. Example tools to manage biodiversity data
• Scratchpads (a platform to manage data)
• Biodiversity Data Journal (incentives to work digitally)
• eMonocot (aggregating data across communities)
3. Big community challenges – three examples
• Social issues (openness)
• Data issues (mobilizing existing data)
• Synthetic issues (modeling data)
4. Next steps
• Toward an integrated view for H2020 (strategy)
1. Background
Lice to data infrastructures (1997-2004)
Systematics (circa 1998)
- No high level keys
- Poor high level taxonomy
- Just one phylogeny
- Few living experts!
Circa 5,000 spp.
Mammals & birds
12,000 associations
15,000 potential hosts
http://darwin.zoology.gla.ac.uk/~rpage/LouseBase/2/
LouseBASE
Specimens Images
(SID)
http://darwin.zoology.gla.ac.uk/~SID/
Literature
PHPBib
http://myphpbib.sourceforge.net/
Lab Notebook
http://www2.flmnh.ufl.edu/pdb/
Host-Parasite Checklists
http://www2.flmnh.ufl.edu/adb/
Glasgow version at:
Lousy data infrastructures (circa 2004)
The problem – integrating biodiversity research (2004>)
How do we join up these activities? How do we use this as a tool?
Species conservation & protected areas
Impacts of human development
Biodiversity & human health
Impacts of climate change
Food, farming & biofuels
Invasive alien species
What infrastructures do we need?
(technologies, tools, standards…)
What processes do we need?
(Modelling, workflows…)
What data do we need?
(Genes, localities…)
2. Biodiversity data tools
- Scratchpads
- Biodiversity Data Journal
- eMonocot
Scratchpads – a space for your data
• Hosted websites for
biodiversity data
• Virtual research
environments
• Completely open access
& open source
• Modular & flexible
• Running since 2007
• Making taxonomy
digital, open & linked
http://scratchpads.eu
Scratchpads – a space for your data
Taxa Projects Regions Societies
544 Scratchpad Communities
by 6,644 active registered users
covering 91,631 taxa
in 535,317 pages.
81 paper citations in 2012
In total more than
1,300,000 visitors
http://scratchpads.eu
Biodiversity Data Journal – incentivising data publishing
• New, Open Access data journal
• Linked to Scratchpads via Publication
Module
• Supports the life cycle of a manuscript
• Writing, submission, review, publication
& dissemination, all in one place
• Structured, reusable, standardised data
• Launched in Sept 2013 with 24 articles
http://biodiversitydatajournal.com
Biodiversity Data Journal – easy manuscript assembly
Structured data
Review, publish, cite & disseminate
EOL
Dryad
GBIF
Wiki Species-Id
PubMed
Plazi
Select, describe & annotate data
Publication module
http://biodiversitydatajournal.com
eMonocot – aggregating data across communities
• Online resource for monocot
plants
• Collaboration between
Kew, Oxford University and
NHM
• Data to be open and usable
by other scientists
http://e-monocot.org
eMonocot – aggregating data across communities
• Linking monocot
communities
• Identification, checklist
& taxonomic data for:
- 275,000 taxa
- 8,300 images
- 15 identification keys
- 3 phylogenies
• A sustainable digital
portal
• A source of data for
analysis
http://e-monocot.org
3. Example challenges
- Social issues (openness)
- Data issues (mobilising existing data)
- Synthetic issues (modelling)
Social challenges: openness
E. Archambault et al., Proportion of Open Access Peer-Reviewed Papers at the
European and World Levels--2004-2011, June 2013, Science-Metrix Inc.
“One-half of all papers are now freely available
within a year or two of publication”
“A piece of data or content is open if anyone is free to use, reuse, and redistribute it -
subject, at most, to the requirement to attribute and/or share-alike.” http://opendefinition.org/
Many kinds of openness:
• Open Access
• Open Data
• Open Science
• Open Source
• Sharing data is a foundation
for our activities
• Normal practice in some
communities (molecular)
• Mandated by some funders
& governments
Need to continue to incentivise openness
Data challenges: mobilising existing data
Collections
• 1.5-3B specimens in collections worldwide
• Fragmented efforts / need coordination
Biodiversity literature
• >300M pages, BHL scanned 41M to date
• Copyright post-1923 & article metadata
Informatics challenges
• Automation & annotation
• Storage & persistence
• Business models to sustain activity
Collections, literature & metadata
How can we quickly, efficiently and cost
effectively mobilise biological data at scale?
Bibliography of Life
(RefFinder & RefBank)
BHL
literature
NHM
Digitisation
Synthetic challenges: Modeling the biosphere
Conceptually has many potential uses
• Identifying trends
• Explaining patterns
• Making predictions
• Real time alerts
- when data contradicts current knowledge
• The ultimate policy tool
Major informatics challenges
• Technically very difficult (many years off)
• Needs effective prototypes & platforms
• Some first steps e.g. Local Ecological Footprint Tool
Nature 2013, doi:10.1038/493295a
Reasoning across large, linked biodiversity datasets
A clear, singular, long-term vision, which
biodiversity data can contribute to
4. Next steps
- Further reading
- H2020 Opportunities
A strategic view: community informatics challenges
GBIF GBIC Report
(Sept. 2013)
Biodiv. Inf. Challenges
(April, 2013)
Grand Challenges for Biodiversity Informatics
(integrating activities for H2020)
QUESTIONS

More Related Content

What's hot

Preserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSPreserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSEDINA, University of Edinburgh
 
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...LEARN Project
 
Building research data management services at the University of Edinburgh: a ...
Building research data management services at the University of Edinburgh: a ...Building research data management services at the University of Edinburgh: a ...
Building research data management services at the University of Edinburgh: a ...Robin Rice
 
Use it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresUse it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresVince Smith
 
Dr Natalie Harrower - DRI and Open Data
Dr Natalie Harrower - DRI and Open DataDr Natalie Harrower - DRI and Open Data
Dr Natalie Harrower - DRI and Open Datadri_ireland
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introductionVince Smith
 
What’s Different about the Digital: Community Action via UK LOCKSS Alliance
What’s Different about the Digital: Community Action via UK LOCKSS AllianceWhat’s Different about the Digital: Community Action via UK LOCKSS Alliance
What’s Different about the Digital: Community Action via UK LOCKSS AllianceEDINA, University of Edinburgh
 
Dev8ed2012 lightening talk
Dev8ed2012 lightening talkDev8ed2012 lightening talk
Dev8ed2012 lightening talkSuzanneHardy
 
Fostering Open Science to Research Using a Taxonomy and an eLearning Portal
Fostering Open Science to Research Using a Taxonomy and an eLearning PortalFostering Open Science to Research Using a Taxonomy and an eLearning Portal
Fostering Open Science to Research Using a Taxonomy and an eLearning PortalNancy Pontika
 
Charting the Future - Ms Heather Jenks, ANU
Charting the Future - Ms Heather Jenks, ANUCharting the Future - Ms Heather Jenks, ANU
Charting the Future - Ms Heather Jenks, ANUARDC
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for librariesLEARN Project
 
Big Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben EvansBig Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben EvansARDC
 
Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Jisc
 

What's hot (20)

Preserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSPreserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSS
 
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
 
Building research data management services at the University of Edinburgh: a ...
Building research data management services at the University of Edinburgh: a ...Building research data management services at the University of Edinburgh: a ...
Building research data management services at the University of Edinburgh: a ...
 
Who is looking after your e-journals?
Who is looking after your e-journals?Who is looking after your e-journals?
Who is looking after your e-journals?
 
Use it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructuresUse it or lose it: a hybrid model for sustaining e-infrastructures
Use it or lose it: a hybrid model for sustaining e-infrastructures
 
Dr Natalie Harrower - DRI and Open Data
Dr Natalie Harrower - DRI and Open DataDr Natalie Harrower - DRI and Open Data
Dr Natalie Harrower - DRI and Open Data
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introduction
 
What’s Different about the Digital: Community Action via UK LOCKSS Alliance
What’s Different about the Digital: Community Action via UK LOCKSS AllianceWhat’s Different about the Digital: Community Action via UK LOCKSS Alliance
What’s Different about the Digital: Community Action via UK LOCKSS Alliance
 
Limitreal
LimitrealLimitreal
Limitreal
 
Dev8ed2012 lightening talk
Dev8ed2012 lightening talkDev8ed2012 lightening talk
Dev8ed2012 lightening talk
 
RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians? RDM through a UK lens - New Roles for Librarians?
RDM through a UK lens - New Roles for Librarians?
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Fostering Open Science to Research Using a Taxonomy and an eLearning Portal
Fostering Open Science to Research Using a Taxonomy and an eLearning PortalFostering Open Science to Research Using a Taxonomy and an eLearning Portal
Fostering Open Science to Research Using a Taxonomy and an eLearning Portal
 
COBWEB Project: Citizens Observatories Side Event
COBWEB Project: Citizens Observatories Side EventCOBWEB Project: Citizens Observatories Side Event
COBWEB Project: Citizens Observatories Side Event
 
Charting the Future - Ms Heather Jenks, ANU
Charting the Future - Ms Heather Jenks, ANUCharting the Future - Ms Heather Jenks, ANU
Charting the Future - Ms Heather Jenks, ANU
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
 
UKLA Update On Activities
UKLA Update On ActivitiesUKLA Update On Activities
UKLA Update On Activities
 
Ukla uksg 2013_final
Ukla uksg 2013_finalUkla uksg 2013_final
Ukla uksg 2013_final
 
Big Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben EvansBig Data is today: key issues for big data - Dr Ben Evans
Big Data is today: key issues for big data - Dr Ben Evans
 
Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014
 

Similar to Delivering biodiversity knowledge in the information age

Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveVince Smith
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeVince Smith
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessAbby Clobridge
 
Scratchpads introductory presentation 45mins
Scratchpads introductory presentation   45minsScratchpads introductory presentation   45mins
Scratchpads introductory presentation 45minsDimitrios Koureas
 
Building data infrastructures for science
Building data infrastructures for scienceBuilding data infrastructures for science
Building data infrastructures for scienceVince Smith
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...LIBER Europe
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyICZN
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
eROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder WorkshopeROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder Workshope-ROSA
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Hilmar Lapp
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Carole Goble
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...African Open Science Platform
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2Alex Hardisty
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Trish Rose-Sandler
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Vince Smith
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeEdward Baker
 

Similar to Delivering biodiversity knowledge in the information age (20)

Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notext
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspective
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
Getting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open AccessGetting Started with Institutional Repositories and Open Access
Getting Started with Institutional Repositories and Open Access
 
Scratchpads introductory presentation 45mins
Scratchpads introductory presentation   45minsScratchpads introductory presentation   45mins
Scratchpads introductory presentation 45mins
 
Building data infrastructures for science
Building data infrastructures for scienceBuilding data infrastructures for science
Building data infrastructures for science
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
eROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder WorkshopeROSA Policy WS2: Second Stakeholder Workshop
eROSA Policy WS2: Second Stakeholder Workshop
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
 
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...The role of libraries and information professionals during the Big Data Era/ ...
The role of libraries and information professionals during the Big Data Era/ ...
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...Virtual Research Environments supporting biodiversity research: Needs & prior...
Virtual Research Environments supporting biodiversity research: Needs & prior...
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 

More from Vince Smith

DiSSCo institutional benefits
DiSSCo institutional benefitsDiSSCo institutional benefits
DiSSCo institutional benefitsVince Smith
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeVince Smith
 
Moving beyond the box: automating the digitisation of insect collections
Moving beyond the box: automating the digitisation of insect collectionsMoving beyond the box: automating the digitisation of insect collections
Moving beyond the box: automating the digitisation of insect collectionsVince Smith
 
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...Vince Smith
 
No specimen left behind: Collections digitisation at the NHM, London*
No specimen left behind:  Collections digitisation at the NHM, London*No specimen left behind:  Collections digitisation at the NHM, London*
No specimen left behind: Collections digitisation at the NHM, London*Vince Smith
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 OverviewVince Smith
 
Consolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsConsolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsVince Smith
 
Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...Assisted restructure of web content for paper-based presentation: a look at w...
Assisted restructure of web content for paper-based presentation: a look at w...Vince Smith
 
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...Bibliography of Life: Comprehensive services for biodiversity bibliographic r...
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...Vince Smith
 
Scratchpads: the Virtual Research Environment for biodiversity data
Scratchpads: the Virtual Research Environment for biodiversity dataScratchpads: the Virtual Research Environment for biodiversity data
Scratchpads: the Virtual Research Environment for biodiversity dataVince Smith
 
Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...Next generation sequencing requires next generation publishing: the Biodivers...
Next generation sequencing requires next generation publishing: the Biodivers...Vince Smith
 
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...Vince Smith
 
Don't make me think: biodiversity data publishing made easy
Don't make me think: biodiversity data publishing made easyDon't make me think: biodiversity data publishing made easy
Don't make me think: biodiversity data publishing made easyVince Smith
 
Don’t make me think: biodiversity data publishing made easy
Don’t make me think: biodiversity data publishing made easyDon’t make me think: biodiversity data publishing made easy
Don’t make me think: biodiversity data publishing made easyVince Smith
 
Digitised collections: Toward a digital strategy for for the NHM, London
Digitised collections: Toward a digital strategy for for the NHM, LondonDigitised collections: Toward a digital strategy for for the NHM, London
Digitised collections: Toward a digital strategy for for the NHM, LondonVince Smith
 
2013 02 data portal science group update -v smith
2013 02 data portal science group update -v smith2013 02 data portal science group update -v smith
2013 02 data portal science group update -v smithVince Smith
 
Sharing, linking and publishing biodiversity data the ViBRANT way
Sharing, linking and publishing biodiversity data the ViBRANT waySharing, linking and publishing biodiversity data the ViBRANT way
Sharing, linking and publishing biodiversity data the ViBRANT wayVince Smith
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...
Making your data work for you: Scratchpads, publishing & the Biodiversity Dat...Vince Smith
 
Scratchpad 2, Virtual Research Environment: Project Update
 Scratchpad 2, Virtual Research Environment: Project Update Scratchpad 2, Virtual Research Environment: Project Update
Scratchpad 2, Virtual Research Environment: Project UpdateVince Smith
 

More from Vince Smith (20)

DiSSCo institutional benefits
DiSSCo institutional benefitsDiSSCo institutional benefits
DiSSCo institutional benefits
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
Moving beyond the box: automating the digitisation of insect collections
Moving beyond the box: automating the digitisation of insect collectionsMoving beyond the box: automating the digitisation of insect collections
Moving beyond the box: automating the digitisation of insect collections
 
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...FP7 Funded RI Project experiences: some overly honest tips from a project coo...
FP7 Funded RI Project experiences: some overly honest tips from a project coo...
 
No specimen left behind: Collections digitisation at the NHM, London*
No specimen left behind:  Collections digitisation at the NHM, London*No specimen left behind:  Collections digitisation at the NHM, London*
No specimen left behind: Collections digitisation at the NHM, London*
 
SYNTHESYS 3 Overview
SYNTHESYS 3 OverviewSYNTHESYS 3 Overview
SYNTHESYS 3 Overview
 
Consolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review PresentationsConsolidated ViBRANT Project Final Review Presentations
Consolidated ViBRANT Project Final Review Presentations
 



Delivering biodiversity knowledge in the information age

  • 1. Vince Smith Delivering biodiversity knowledge in the information age Hellenic Botanical Society Thessaloniki, Greece, 3-6 Oct. 2013
  • 2. Overview 1. Background – biodiversity data diversity • An introduction to me (lice to data infrastructures) • The problem (integrating biodiversity research) 2. Example tools to manage biodiversity data • Scratchpads (a platform to manage data) • Biodiversity Data Journal (incentives to work digitally) • eMonocot (aggregating data across communities) 3. Big community challenges – three examples • Social issues (openness) • Data issues (mobilizing existing data) • Synthetic issues (modeling data) 4. Next steps • Toward an integrated view for H2020 (strategy)
  • 4. Lice to data infrastructures (1997-2004) Systematics (circa 1998) - No high level keys - Poor high level taxonomy - Just one phylogeny - Few living experts! Circa 5,000 spp. Mammals & birds 12,000 associations 15,000 potential hosts
  • 6. The problem – integrating biodiversity research (2004>) How do we join up these activities? How do we use this as a tool? Species conservation & protected areas Impacts of human development Biodiversity & human health Impacts of climate change Food, farming & biofuels Invasive alien species What infrastructures do we need? (technologies, tools, standards…) What processes do we need? (Modelling, workflows…) What data do we need? (Genes, localities…)
  • 7. 2. Biodiversity data tools - Scratchpads - Biodiversity Data Journal - eMonocot
  • 8. Scratchpads – a space for your data • Hosted websites for biodiversity data • Virtual research environments • Completely open access & open source • Modular & flexible • Running since 2007 • Making taxonomy digital, open & linked http://scratchpads.eu
  • 9. Scratchpads – a space for your data Taxa Projects Regions Societies 544 Scratchpad Communities by 6,644 active registered users covering 91,631 taxa in 535,317 pages. 81 paper citations in 2012 In total more than 1,300,000 visitors http://scratchpads.eu
  • 10. Biodiversity Data Journal – incentivising data publishing • New, Open Access data journal • Linked to Scratchpads via Publication Module • Supports the life cycle of a manuscript • Writing, submission, review, publication & dissemination, all in one place • Structured, reusable, standardised data • Launched in Sept 2013 with 24 articles http://biodiversitydatajournal.com
  • 11. Biodiversity Data Journal – easy manuscript assembly Structured data Review, publish, cite & disseminate EOL Dryad GBIF Wiki Species-Id PubMed Plazi Select, describe & annotate data Publication module http://biodiversitydatajournal.com
  • 12. eMonocot – aggregating data across communities • Online resource for monocot plants • Collaboration between Kew, Oxford University and NHM • Data to be open and usable by other scientists http://e-monocot.org
  • 13. eMonocot – aggregating data across communities • Linking monocot communities • Identification, checklist & taxonomic data for: - 275,000 taxa - 8,300 images - 15 identification keys - 3 phylogenies • A sustainable digital portal • A source of data for analysis http://e-monocot.org
  • 14. 3. Example challenges - Social issues (openness) - Data issues (mobilising existing data) - Synthetic issues (modelling)
  • 15. Social challenges: openness E. Archambault et al., Proportion of Open Access Peer-Reviewed Papers at the European and World Levels--2004-2011, June 2013, Science-Metrix Inc. “One-half of all papers are now freely available within a year or two of publication” “A piece of data or content is open if anyone is free to use, reuse, and redistribute it - subject, at most, to the requirement to attribute and/or share-alike.” http://opendefinition.org/ Many kinds of openness: • Open Access • Open Data • Open Science • Open Source • Sharing data is a foundation for our activities • Normal practice in some communities (molecular) • Mandated by some funders & governments Need to continue to incentivise openness
  • 16. Data challenges: mobilising existing data Collections • 1.5-3B specimens in collections worldwide • Fragmented efforts / need coordination Biodiversity literature • >300M pages, BHL scanned 41M to date • Copyright post-1923 & article metadata Informatics challenges • Automation & annotation • Storage & persistence • Business models to sustain activity Collections, literature & metadata How can we quickly, efficiently and cost-effectively mobilise biological data at scale? Bibliography of Life (RefFinder & RefBank) BHL literature NHM Digitisation
  • 17. Synthetic challenges: Modeling the biosphere Conceptually has many potential uses • Identifying trends • Explaining patterns • Making predictions • Real time alerts - when data contradicts current knowledge • The ultimate policy tool Major informatics challenges • Technically very difficult (many years off) • Needs effective prototypes & platforms • Some first steps e.g. Local Ecological Footprint Tool Nature 2013, doi:10.1038/493295a Reasoning across large, linked biodiversity datasets A clear, singular, long-term vision, which biodiversity data can contribute to
  • 18. 4. Next steps - Further reading - H2020 Opportunities
  • 19. A strategic view: community informatics challenges GBIF GBIC Report (Sept. 2013) Biodiv. Inf. Challenges (April, 2013) Grand Challenges for Biodiversity Informatics (integrating activities for H2020)