SlideShare a Scribd company logo
1 of 19
Big Data Initiatives for
Agroecosystems
Cynthia Parr
Knowledge Services Division
National Agricultural Library
Ecological Society of America, 2015
Outline
• Data management at the
National Agricultural Library
• Four examples
1. Insects 5K – i5K Workspace
2. Life Cycle Assessment
3. Long-Term Agroecosystem
Research
4. Ag Data Commons
• General principles
8.1 million items,
Agricola, PubAg
3
http://blog.thingarage.com/
raw data
citable
publication
4
raw data
collection
cleaning, enrichment, analysis
registration, preservation
temporary
data
referable
data
citable
data
citable
publication
Modified from Peter Wittenberg, Research Data Alliance
https://rd-alliance.org/group/data-fabric-ig.html
i5k.nal.usda.gov
5
Genome project hosting at the
i5k Workspace
• 27 pilot genomes hosted; 45 total
– Storage and dissemination of a
genome assembly and anything
mapped to it.
– BLAST, JBrowse Genome Browser
• Manual Curation: Web Apollo
• Post-curation maintenance
– Quality Control
– Official Gene Set generation
• Research plan
• Generate material
• Sequencing
• Assembly
• Automated
annotation
• Manual Curation
• Official gene set
generation
• Genome project
maintenance
• Biological
insights/Publicatio
n
GenomeProjectTrajectory
Life Cycle Assessment Commons
7
www.lcacommons.gov
Unformatted,
non-standard
LCA Commons Concept
LCA Community
Open LCA Framework
Common computing environment, application,
data standards, and development
NAL
LCADC
NREL
USLCI
XYZ
LCI DB
ABC
LCI DB
Distributed computing
environment & application
Common data standards
Distributed computing
environment
DEF
LCI
DB
Common application
& data standards
Interoperability Tools
Ag Data
Commons
Catalog and
Repository
Long Term Agro-ecosystem Research
(LTAR)
LTAR Data
Common Observatory
– Meteorology
– Hydrology
– Eddy flux CO2
– Non-CO2 gasses
– Soil
– Biological
10
Common Experiment
Approach
– Business as usual
– Aspirational
Will include data about
– Management practices
– Results
LTAR Data Loss
N=194 of ~500 citations in 2011 LTAR site proposals
Bad links
to data
No data
available
80% of papers provide
no way to obtain data
Data are
accessible
Refers to
general data
source
LTAR information management
• Support for download of files, web services
• Metadata in FGDC CSDGM, ISO 19115, EML,
Project Open Data
• Catalog of instrument specs using SensorML 2
• Data dictionaries in ISO 19110
• Weather data to be converted to other formats
• Field names could be converted to match different
conventions (AgMIP, etc.)
Ag Data Commons
13
data.nal.usda.gov
Enhanced
DKAN
Distributed
repositories
Search &
Knowledge
Discovery
Thesaurus &
Indexing
Ag Data
Commons
Repository
Organization
& Curation
Grant
management
systems
INGESTION DISSEMINATION
PubAg
Dataset
Submission
Analytics
& Tools
Data.gov
Forest Service
NCBI
Ag Data
Commons
Catalog
Color Legend:
Building
Adapt/Re-use
Existing
LCA Commons
Guiding principle 1:
a distributed network ….
Geospatial
Catalog
Geospatial
Repository
STEWARDS
Ag Data
Commons
(catalog)
Ag Data
Commons
(repository)
USDA
Enterprise
Inventory
National
Weather
Service
Data.gov
Ecosystems
.data.gov
of Networks…
Public access to open, machine
readable data enables larger
scale, integrative and innovative
data science
The long tail
Guiding principle 2:
big data AND long tail
Guiding principle 3:
curation adds value
• Data dictionaries
• Standards & templates
• Linkages
• Semantics
• Preservation
Thanks!
National Agricultural Library
Knowledge Services Division: Susan McCarthy
LTAR Jeffrey Campbell, Charles Lockwood
i5K Monica Poelchau, Chris Childers
LCA Commons Peter Arbuckle, Ezra Kahn
Ag Data Commons Ursula Pieper, Jocelyn McNamara,
Qing Qu, Erin Antognoli, Melissa
Lowrey, Jaylen Nathwani, NuCivic
… and collaborators and testers

More Related Content

What's hot

Introduction to Big data
Introduction to Big dataIntroduction to Big data
Introduction to Big datacthanopoulos
 
Dataverse: Helping Researchers Publish Their Data Through Automation
Dataverse: Helping Researchers Publish Their Data Through Automation�Dataverse: Helping Researchers Publish Their Data Through Automation�
Dataverse: Helping Researchers Publish Their Data Through AutomationEleni Castro, MLIS
 
Tripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIITripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIIVivek Krishnakumar
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...Catherine Canevet
 
ICIC 2013 New Product Introductions InfoChem
ICIC 2013 New Product Introductions InfoChemICIC 2013 New Product Introductions InfoChem
ICIC 2013 New Product Introductions InfoChemDr. Haxel Consult
 
Information and Data Management CoP: Metadata Working Group
Information and Data Management CoP: Metadata Working Group Information and Data Management CoP: Metadata Working Group
Information and Data Management CoP: Metadata Working Group ILRI
 
DataStarR: A Data Sharing and Publication Infrastructure to Support Research
DataStarR: A Data Sharing and Publication Infrastructure to Support ResearchDataStarR: A Data Sharing and Publication Infrastructure to Support Research
DataStarR: A Data Sharing and Publication Infrastructure to Support ResearchIAALD Community
 
L clarke faang_dcc_isag_2017_compress
L clarke faang_dcc_isag_2017_compressL clarke faang_dcc_isag_2017_compress
L clarke faang_dcc_isag_2017_compressLaura Clarke
 
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...Araport
 
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...LIBER Europe
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of AraportAraport
 
Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
Web-based Tools for Integrative Analysis of Pancreatic Cancer DataWeb-based Tools for Integrative Analysis of Pancreatic Cancer Data
Web-based Tools for Integrative Analysis of Pancreatic Cancer DataDerek Wright
 
Efforts for Research Data Management in Japanese university / institution lib...
Efforts for Research Data Management in Japanese university / institution lib...Efforts for Research Data Management in Japanese university / institution lib...
Efforts for Research Data Management in Japanese university / institution lib...Yasuyuki Minamiyama
 
The GoGeo Vision for Repositories (Pecha Kucha) - Tony Mathys
The GoGeo Vision for Repositories (Pecha Kucha) - Tony MathysThe GoGeo Vision for Repositories (Pecha Kucha) - Tony Mathys
The GoGeo Vision for Repositories (Pecha Kucha) - Tony MathysRepository Fringe
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...Open Science Fair
 
Providing Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.orgProviding Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.orgJingbo Wang
 
Biothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web servicesBiothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web servicesChunlei Wu
 
Bioschemas findability and interoperability
Bioschemas findability and interoperabilityBioschemas findability and interoperability
Bioschemas findability and interoperabilityBioschemas
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsGESIS
 

What's hot (20)

Introduction to Big data
Introduction to Big dataIntroduction to Big data
Introduction to Big data
 
Dataverse: Helping Researchers Publish Their Data Through Automation
Dataverse: Helping Researchers Publish Their Data Through Automation�Dataverse: Helping Researchers Publish Their Data Through Automation�
Dataverse: Helping Researchers Publish Their Data Through Automation
 
Tripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIITripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIII
 
From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...From data to knowledge – the Ondex System for integrating Life Sciences data ...
From data to knowledge – the Ondex System for integrating Life Sciences data ...
 
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
 
ICIC 2013 New Product Introductions InfoChem
ICIC 2013 New Product Introductions InfoChemICIC 2013 New Product Introductions InfoChem
ICIC 2013 New Product Introductions InfoChem
 
Information and Data Management CoP: Metadata Working Group
Information and Data Management CoP: Metadata Working Group Information and Data Management CoP: Metadata Working Group
Information and Data Management CoP: Metadata Working Group
 
DataStarR: A Data Sharing and Publication Infrastructure to Support Research
DataStarR: A Data Sharing and Publication Infrastructure to Support ResearchDataStarR: A Data Sharing and Publication Infrastructure to Support Research
DataStarR: A Data Sharing and Publication Infrastructure to Support Research
 
L clarke faang_dcc_isag_2017_compress
L clarke faang_dcc_isag_2017_compressL clarke faang_dcc_isag_2017_compress
L clarke faang_dcc_isag_2017_compress
 
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
 
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
Acquisition, Storage and Management of Research Data in Chemical Sciences: De...
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of Araport
 
Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
Web-based Tools for Integrative Analysis of Pancreatic Cancer DataWeb-based Tools for Integrative Analysis of Pancreatic Cancer Data
Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
 
Efforts for Research Data Management in Japanese university / institution lib...
Efforts for Research Data Management in Japanese university / institution lib...Efforts for Research Data Management in Japanese university / institution lib...
Efforts for Research Data Management in Japanese university / institution lib...
 
The GoGeo Vision for Repositories (Pecha Kucha) - Tony Mathys
The GoGeo Vision for Repositories (Pecha Kucha) - Tony MathysThe GoGeo Vision for Repositories (Pecha Kucha) - Tony Mathys
The GoGeo Vision for Repositories (Pecha Kucha) - Tony Mathys
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
 
Providing Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.orgProviding Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.org
 
Biothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web servicesBiothings APIs: high-performance bioentity-centric web services
Biothings APIs: high-performance bioentity-centric web services
 
Bioschemas findability and interoperability
Bioschemas findability and interoperabilityBioschemas findability and interoperability
Bioschemas findability and interoperability
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations Systems
 

Similar to Big Data Initiatives for Agroecosystems

re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Cyndy Parr
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptxvijayapraba1
 
Global RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataGlobal RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataVassilis Protonotarios
 
i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017Monica Poelchau
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
DAS game: how a programmer thinks
DAS game: how a programmer thinksDAS game: how a programmer thinks
DAS game: how a programmer thinksRafael C. Jimenez
 
Emerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksEmerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksNational Institute of Informatics
 
From Data to Data: One version of a History of Scholarly Communication
From Data to Data: One version of a History of Scholarly CommunicationFrom Data to Data: One version of a History of Scholarly Communication
From Data to Data: One version of a History of Scholarly CommunicationAndrew Treloar
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisCatherine Canevet
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopAaike De Wever
 
Agro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemAgro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemNikos Manouselis
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Spark Summit
 

Similar to Big Data Initiatives for Agroecosystems (20)

Pieper NISO Virtual Conf Feb17
Pieper NISO Virtual Conf Feb17Pieper NISO Virtual Conf Feb17
Pieper NISO Virtual Conf Feb17
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
Data integration
Data integrationData integration
Data integration
 
Global RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm DataGlobal RDF Descriptors for Germplasm Data
Global RDF Descriptors for Germplasm Data
 
i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017i5k Workspace Workshop - AGS2017
i5k Workspace Workshop - AGS2017
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
iMicrobe_ASLO_2015
iMicrobe_ASLO_2015iMicrobe_ASLO_2015
iMicrobe_ASLO_2015
 
DAS game: how a programmer thinks
DAS game: how a programmer thinksDAS game: how a programmer thinks
DAS game: how a programmer thinks
 
Introduction of Linked Data for Science
Introduction of Linked Data for ScienceIntroduction of Linked Data for Science
Introduction of Linked Data for Science
 
DataNet Federation Consortium Preservation Policy Toolkit. Reagan Moore, Arco...
DataNet Federation Consortium Preservation Policy Toolkit. Reagan Moore, Arco...DataNet Federation Consortium Preservation Policy Toolkit. Reagan Moore, Arco...
DataNet Federation Consortium Preservation Policy Toolkit. Reagan Moore, Arco...
 
Emerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksEmerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networks
 
From Data to Data: One version of a History of Scholarly Communication
From Data to Data: One version of a History of Scholarly CommunicationFrom Data to Data: One version of a History of Scholarly Communication
From Data to Data: One version of a History of Scholarly Communication
 
The CIARD RINGValeri
The CIARD RINGValeriThe CIARD RINGValeri
The CIARD RINGValeri
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
 
Agro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemAgro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystem
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
 

More from Cyndy Parr

Open data and the ag data commons
Open data and the ag data commonsOpen data and the ag data commons
Open data and the ag data commonsCyndy Parr
 
Biodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeCyndy Parr
 
Public access to research results at USDA
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDACyndy Parr
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataCyndy Parr
 
Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Cyndy Parr
 
TDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeCyndy Parr
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princetonCyndy Parr
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK Cyndy Parr
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life Cyndy Parr
 
Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Cyndy Parr
 
Using and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataCyndy Parr
 
How the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataCyndy Parr
 
The Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeCyndy Parr
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Cyndy Parr
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesCyndy Parr
 
Species pages and portals
Species pages and portals Species pages and portals
Species pages and portals Cyndy Parr
 
Building EOL species pages
Building EOL species pagesBuilding EOL species pages
Building EOL species pagesCyndy Parr
 
Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...Cyndy Parr
 
Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientistsCyndy Parr
 
EOL and Science: Yes we can!
EOL and Science: Yes we can!EOL and Science: Yes we can!
EOL and Science: Yes we can!Cyndy Parr
 

More from Cyndy Parr (20)

Open data and the ag data commons
Open data and the ag data commonsOpen data and the ag data commons
Open data and the ag data commons
 
Biodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscape
 
Public access to research results at USDA
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDA
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
 
Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.
 
TDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's Welcome
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
 
Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...
 
Using and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute data
 
How the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute data
 
The Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of Life
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
 
Species pages and portals
Species pages and portals Species pages and portals
Species pages and portals
 
Building EOL species pages
Building EOL species pagesBuilding EOL species pages
Building EOL species pages
 
Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...Leveraging an international infrastructure: Case studies from the Encyclopeda...
Leveraging an international infrastructure: Case studies from the Encyclopeda...
 
Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
 
EOL and Science: Yes we can!
EOL and Science: Yes we can!EOL and Science: Yes we can!
EOL and Science: Yes we can!
 

Recently uploaded

Artificial Intelligence in Philippine Local Governance: Challenges and Opport...
Artificial Intelligence in Philippine Local Governance: Challenges and Opport...Artificial Intelligence in Philippine Local Governance: Challenges and Opport...
Artificial Intelligence in Philippine Local Governance: Challenges and Opport...CedZabala
 
PPT Item # 4 - 231 Encino Ave (Significance Only)
PPT Item # 4 - 231 Encino Ave (Significance Only)PPT Item # 4 - 231 Encino Ave (Significance Only)
PPT Item # 4 - 231 Encino Ave (Significance Only)ahcitycouncil
 
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Call Girls in Nagpur High Profile
 
EDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxEDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxaaryamanorathofficia
 
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
“Exploring the world: One page turn at a time.” World Book and Copyright Day ...
“Exploring the world: One page turn at a time.” World Book and Copyright Day ...“Exploring the world: One page turn at a time.” World Book and Copyright Day ...
“Exploring the world: One page turn at a time.” World Book and Copyright Day ...Christina Parmionova
 
Climate change and safety and health at work
Climate change and safety and health at workClimate change and safety and health at work
Climate change and safety and health at workChristina Parmionova
 
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Roomishabajaj13
 
How the Congressional Budget Office Assists Lawmakers
How the Congressional Budget Office Assists LawmakersHow the Congressional Budget Office Assists Lawmakers
How the Congressional Budget Office Assists LawmakersCongressional Budget Office
 
DNV publication: China Energy Transition Outlook 2024
DNV publication: China Energy Transition Outlook 2024DNV publication: China Energy Transition Outlook 2024
DNV publication: China Energy Transition Outlook 2024Energy for One World
 
VIP Call Girl mohali 7001035870 Enjoy Call Girls With Our Escorts
VIP Call Girl mohali 7001035870 Enjoy Call Girls With Our EscortsVIP Call Girl mohali 7001035870 Enjoy Call Girls With Our Escorts
VIP Call Girl mohali 7001035870 Enjoy Call Girls With Our Escortssonatiwari757
 
(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service
(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service
(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 

Recently uploaded (20)

Artificial Intelligence in Philippine Local Governance: Challenges and Opport...
Artificial Intelligence in Philippine Local Governance: Challenges and Opport...Artificial Intelligence in Philippine Local Governance: Challenges and Opport...
Artificial Intelligence in Philippine Local Governance: Challenges and Opport...
 
PPT Item # 4 - 231 Encino Ave (Significance Only)
PPT Item # 4 - 231 Encino Ave (Significance Only)PPT Item # 4 - 231 Encino Ave (Significance Only)
PPT Item # 4 - 231 Encino Ave (Significance Only)
 
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Hadapsar ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
 
EDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptxEDUROOT SME_ Performance upto March-2024.pptx
EDUROOT SME_ Performance upto March-2024.pptx
 
How to Save a Place: 12 Tips To Research & Know the Threat
How to Save a Place: 12 Tips To Research & Know the ThreatHow to Save a Place: 12 Tips To Research & Know the Threat
How to Save a Place: 12 Tips To Research & Know the Threat
 
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Chakan ( 7001035870 ) HI-Fi Pune Escorts Service
 
(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service
(TARA) Call Girls Sanghavi ( 7001035870 ) HI-Fi Pune Escorts Service
 
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
(PRIYA) Call Girls Rajgurunagar ( 7001035870 ) HI-Fi Pune Escorts Service
 
“Exploring the world: One page turn at a time.” World Book and Copyright Day ...
“Exploring the world: One page turn at a time.” World Book and Copyright Day ...“Exploring the world: One page turn at a time.” World Book and Copyright Day ...
“Exploring the world: One page turn at a time.” World Book and Copyright Day ...
 
Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...
Call Girls Service Connaught Place @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
 
Rohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 37 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Climate change and safety and health at work
Climate change and safety and health at workClimate change and safety and health at work
Climate change and safety and health at work
 
The Federal Budget and Health Care Policy
The Federal Budget and Health Care PolicyThe Federal Budget and Health Care Policy
The Federal Budget and Health Care Policy
 
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
(SUHANI) Call Girls Pimple Saudagar ( 7001035870 ) HI-Fi Pune Escorts Service
 
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With RoomVIP Kolkata Call Girl Jatin Das Park 👉 8250192130  Available With Room
VIP Kolkata Call Girl Jatin Das Park 👉 8250192130 Available With Room
 
How the Congressional Budget Office Assists Lawmakers
How the Congressional Budget Office Assists LawmakersHow the Congressional Budget Office Assists Lawmakers
How the Congressional Budget Office Assists Lawmakers
 
DNV publication: China Energy Transition Outlook 2024
DNV publication: China Energy Transition Outlook 2024DNV publication: China Energy Transition Outlook 2024
DNV publication: China Energy Transition Outlook 2024
 
Call Girls In Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In  Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCeCall Girls In  Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
Call Girls In Rohini ꧁❤ 🔝 9953056974🔝❤꧂ Escort ServiCe
 
VIP Call Girl mohali 7001035870 Enjoy Call Girls With Our Escorts
VIP Call Girl mohali 7001035870 Enjoy Call Girls With Our EscortsVIP Call Girl mohali 7001035870 Enjoy Call Girls With Our Escorts
VIP Call Girl mohali 7001035870 Enjoy Call Girls With Our Escorts
 
(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service
(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service
(VASUDHA) Call Girls Balaji Nagar ( 7001035870 ) HI-Fi Pune Escorts Service
 

Big Data Initiatives for Agroecosystems

  • 1. Big Data Initiatives for Agroecosystems Cynthia Parr Knowledge Services Division National Agricultural Library Ecological Society of America, 2015
  • 2. Outline • Data management at the National Agricultural Library • Four examples 1. Insects 5K – i5K Workspace 2. Life Cycle Assessment 3. Long-Term Agroecosystem Research 4. Ag Data Commons • General principles 8.1 million items, Agricola, PubAg
  • 4. 4 raw data collection cleaning, enrichment, analysis registration, preservation temporary data referable data citable data citable publication Modified from Peter Wittenberg, Research Data Alliance https://rd-alliance.org/group/data-fabric-ig.html
  • 6. Genome project hosting at the i5k Workspace • 27 pilot genomes hosted; 45 total – Storage and dissemination of a genome assembly and anything mapped to it. – BLAST, JBrowse Genome Browser • Manual Curation: Web Apollo • Post-curation maintenance – Quality Control – Official Gene Set generation • Research plan • Generate material • Sequencing • Assembly • Automated annotation • Manual Curation • Official gene set generation • Genome project maintenance • Biological insights/Publicatio n GenomeProjectTrajectory
  • 7. Life Cycle Assessment Commons 7 www.lcacommons.gov
  • 8. Unformatted, non-standard LCA Commons Concept LCA Community Open LCA Framework Common computing environment, application, data standards, and development NAL LCADC NREL USLCI XYZ LCI DB ABC LCI DB Distributed computing environment & application Common data standards Distributed computing environment DEF LCI DB Common application & data standards Interoperability Tools Ag Data Commons Catalog and Repository
  • 9. Long Term Agro-ecosystem Research (LTAR)
  • 10. LTAR Data Common Observatory – Meteorology – Hydrology – Eddy flux CO2 – Non-CO2 gasses – Soil – Biological 10 Common Experiment Approach – Business as usual – Aspirational Will include data about – Management practices – Results
  • 11. LTAR Data Loss N=194 of ~500 citations in 2011 LTAR site proposals Bad links to data No data available 80% of papers provide no way to obtain data Data are accessible Refers to general data source
  • 12. LTAR information management • Support for download of files, web services • Metadata in FGDC CSDGM, ISO 19115, EML, Project Open Data • Catalog of instrument specs using SensorML 2 • Data dictionaries in ISO 19110 • Weather data to be converted to other formats • Field names could be converted to match different conventions (AgMIP, etc.)
  • 15. Distributed repositories Search & Knowledge Discovery Thesaurus & Indexing Ag Data Commons Repository Organization & Curation Grant management systems INGESTION DISSEMINATION PubAg Dataset Submission Analytics & Tools Data.gov Forest Service NCBI Ag Data Commons Catalog Color Legend: Building Adapt/Re-use Existing LCA Commons
  • 16. Guiding principle 1: a distributed network …. Geospatial Catalog Geospatial Repository STEWARDS Ag Data Commons (catalog) Ag Data Commons (repository) USDA Enterprise Inventory National Weather Service Data.gov Ecosystems .data.gov of Networks…
  • 17. Public access to open, machine readable data enables larger scale, integrative and innovative data science The long tail Guiding principle 2: big data AND long tail
  • 18. Guiding principle 3: curation adds value • Data dictionaries • Standards & templates • Linkages • Semantics • Preservation
  • 19. Thanks! National Agricultural Library Knowledge Services Division: Susan McCarthy LTAR Jeffrey Campbell, Charles Lockwood i5K Monica Poelchau, Chris Childers LCA Commons Peter Arbuckle, Ezra Kahn Ag Data Commons Ursula Pieper, Jocelyn McNamara, Qing Qu, Erin Antognoli, Melissa Lowrey, Jaylen Nathwani, NuCivic … and collaborators and testers

Editor's Notes

  1. The National Ag Library is providing tools to assist with both the top part of this diagram as well as the bottom part
  2. i5K (Insect 5000 genomes) Sequence and annotate the 5000 genomes of arthropod species known to be important to worldwide agriculture, food safety, energy production, and medicine Most of them are not well-funded model organism communities, so rather than build a website for each of these 5000 organisms my colleagues Monica Poelchau and Chris chlders have built a general workspace where communities tools and data can be hosted and shared. Here’s a list of some of the organisms in the i5K workspace already
  3. We take over all of the infrastructural challenges, but a community coordinator to identify curation priorities and organize curators is still a necessity Collaborations between NAL, Baylor College of medicine, Lawrence Berkeley, National Taiwan University
  4. My colleagues Peter Arbuckle and Ezra Kahn have been working with their colleagues on the LCA commons Life Cycle Assessment is a set of methodologies for doing complex accounting of end-to-end inputs and outputs into any kind of production or manufacturing process. For agriculture this means tracking things like energy pesticide and water inputs and carbon emissions and yield outputs and seeking ways to make the processes more sustainable. The LCA commons will provide three things 1) Life Cycle Inventory Database for agriculture 2) ADC collection for related tools and unformatted data 3) Federal network of databases and resources – still under development Life Cycle Assessment (LCA) Commons open access to LCA datasets and tools for researchers studying sustainable methods in crop and livestock production 1 and 2 have recently been release together in new web site and 3 is still under development. Bringing partners together and discussing terms and business model.
  5. Vision for LCA commons is to be a distributed network of databases, applications, and tools that support LCA with interoperable data to the extent possible. At this point this model represents what we envision for the US Federal LCA Commons
  6. Jeff Campbell at the Library is working with Mark Walbridge and his the team who are building the LTAR network. LTAR is a set of 18 sites designed to help determine who managed systems behave within their ecosystems under regional and continental conditions, to be able to better predict what the impacts might be on agriculture and the environment as the world changes, for example under conditions of climate change. You can’t read these but some of these site have been intentionally chosen to overlap with existing LTER and NEON sites. They all have long legacies of agroecology research so there will be both deep background data and expertise for future research.
  7. Some Observatory may be real time open, generic, the contexts needed to interpret all kinds of experimental data Common experiment across all sites research approach
  8. 11
  9. Ag Data Commons general catalog and repository for agricultural data which can promote effective discovery of and add value to often widely distributed and seemingly disparate datasets
  10. Dark Blue: develop as part of AgDatacCommons Light blue:Enhance existing systems. Gray: Already exist