SlideShare a Scribd company logo
1 of 33
Download to read offline
Reusable Software and Open Data To Optimize Agriculture
David LeBauer
AGU 2015 Fall Meetings
@dlebauer
Overview
Ideas:
Software: Modular, Reusable, and Useable
Data: Harmonization, Distribution
Workflows: Reproducible, Automated
Science: Cumulative and Synthetic
Examples:
PEcAn ProjectBETYdb TERRA Ref
Agriculture: Model and Application
Food, fuel, and other ecosystem services (e.g. C, N, H2O)
Basic science: genes to organism to ecosystem
Engineering applications: computing, data collection, prediction
Enzyme Ecosystem Continent
Overview
Ideas:
Software: Modular, Reusable, and Useable
Data: Harmonization, Distribution
Workflows: Reproducible, Automated
Science: Cumulative and Synthetic
Examples:
PEcAn ProjectBETYdb TERRA Ref
betydb.org github.com/pecanproject/bety @BETYdatabase
BETYdb
Database for meta-analysis (BETYdb)
Model-data synthesis, provenance (PEcAn)
Link Genomics to Phenomics (TERRA Ref)
Primary Research
Measurement PublicationAnalysis
LeBauer 2008, 2010
Spreadsheet
Publication Extraction
Analysis…
Publication Extraction
Meta Analysis
LeBauer and Treseder 2008
BETYdb: Data entry Workflow
BETYdb.org →docs →data entry workflow LeBauer et al, in prep
Technicians Enter and CheckScientist Identifies Data
Open Data Entry and QAQC
Shared Spreadsheets Issue Tracker
Data Access
/search?search=Salix+vcmax
RWeb Application + API
BETYdb.org →docs →Data Access
PEcAn
Overview
Ideas:
Software: Modular, Reusable, and Useable
Data: Harmonization, Distribution
Workflows: Reproducible, Automated
Science: Cumulative and Synthetic
Examples:
PEcAn ProjectBETYdb TERRA Ref
pecanproject.org github.com/pecanproject/pecan @PEcAnproject
Ecosystem Modeling c. 2009
PEcAn
LeBauer et al, 2013
Ecological Model-data Synthesis
PEcAn:
complex models in complex workflows
Modeling Information Systems
Dietze 2016 Princeton University Press
BioCro / Wimovac Crop Model
Humphries and Long, 2005
Miguez et al 2009
Ecosystem Modeling c. 2012
Select Site Configure Run Visualize, ExportRun Model
Dietze, Kooper, LeBauer 2012
LeBauer et al 2013
Given available data,
How well do we know parameters?
How does this affect prediction?
What should we collect?
PEcAn:
Sensitivity Analysis & Variance Decomposition
PEcAn, Automated
Dietze et al 2014
BETYdb + PEcAn
BETYdb is PEcAn’s informatics backend
Provides data, workflow and data provenance
Federated network of databases
Overview
Ideas:
Software: Modular, Reusable, and Useable
Data: Harmonization, Distribution
Workflows: Reproducible, Automated
Science: Cumulative and Synthetic
Examples:
PEcAn ProjectBETYdb TERRA Ref
terraref.ncsa.illinois.edu github.com/terraref @terra_ref
TERRA: Better Breeding Through Science
We have increased yields many times in the last 60 years.
What new opportunities does modern science provide?
University of Illinois
Integrated Pest Management
• Use scientific understanding to select for traits
• Replace manual measurement with remote sensing
• Target specific genes and phenotypes in crosses
ARPA-E TERRA Program
Six Funded Teams
$30 m in awards
$5 m in sensors
TERRA Ref:
Public reference data
HPC Computing
TERRA Ref: An Agricultural Observatory
Similar to and informed by:
Large Synoptic Survey Telescope
National Ecological Observatory Network
Open: Science, Data, Software
Useable: Useful and Familiar to Scientists, Breeders, Precision Ag
Modular: Extensible, Distributed, Automated, Interoperable
Interdisciplinary: Genes to Ecosystems with Robots, Vision, Statistics
Scalable: From Mobile Devices to High Performance Computers
terraref.ncsa.illinois.edu @terra_ref github.com/terraref
TERRA Reference Data and Computing
Sensor Data Sources
Lemnatec Indoor
Danforth, St. Louis
Lemnatec Field
USDA ALRC, Maricopa, AZ
Tractor and UAV
Kansas State
Plus, other teams, public, (sharing optional)
Shared Sorghum genomics and germplasm,
Reference Data
Raw Sequence Data
Aligned Reads
SNPs
Images
Spectra
Point clouds
Shapes
Biomass, Growth
Tissue Chemistry
Photosynthesis
Yield
Stress Tolerance
Ecosystem Services
Big Data Volume & Velocity
Imaging Spectrometers:
VNIR ~3-4 TB/d
SWIR ~1 TB/d
3D Laser Scanner ~ 1 TB/d
4 Year Total: 1 - 40 PB
VNIR
SWIR 3D …
Everything else
Computing and Storage
Roger Server: 1PB online, GIS optimized
Nebula: NCSA Open Stack Server
Blue Waters: 10 PB tape storage
Your Local: [Desktop, HPC, or Sensor Platform]
Data Products Standards Committee
Paul Bartlett Near Earth Autonomy
Jeff White USDA ALARC, ICASA
Melba Crawford Purdue University
Michael Gore,
Elodie Garazave
Cornell University
Matt Colgan Blue River
Christer Janssen PNNL
Barnabas Poczos Carnegie Mellon
Alex Thomasson Texas A&M University
Cheryl Porter University of Florida, AgMIP, USDA
Shawn Serbin Brookhaven National Lab, PEcAn
Shelly Petroy
Christine Laney
NEON
Carolyn J. Lawrence-Dill Iowa State, AgBioData
Eric Lyons University of Arizona, CoGE
Ted Habermann HDF Group
Participants
• Project representatives
• Domain Experts
• Scientific Community (You)*
Responsibilities
• Define Data
• Revise, Improve
• Training, Outreach
* github.com/terraref/reference-data/issues
Computing Pipeline
Data Uploaded via API
Triggers Analytical Pipeline
Generates and Stores Data, Metadata
Users select data, launch VM:
Favorite Software
Data Mounted
HPC Access
States can be Shared, Archived
Acknowledgements
Projects: PEcAn, NCSA, BrownDog, Plants In Silico, CyberGIS,
National Data Service, USDA, AgMIP, National Data Service
Data: Providers and Curators
Mentors: Mike Dietze, Steve Long, Kathleen Treseder
Funding: NSF, EBI, ARPA-E, DOE, NASA
Contact
Web GitHub s Twitter a
David LeBauer dlebauer@illinois.edu dlebauer @dlebauer
BETYdb betydb.org pecanproject/bety @BETYdatabase
PEcAn Project pecaproject.org pecanproject/pecan @PEcAnproject
TERRA Ref terraref.ncsa.illinois.edu terraref @terra_ref
PIs Amy Colin-Marshal, Steve Long, James O’Dwyer, Diwakar Shukla
Plants in Silico
Multi-scale modeling platform to predict crop response to climate change
Plants in Silico: Modular Architecture
Zhu et al, 2015 Plant Cell Environ

More Related Content

What's hot

Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...EarthCube
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Anubhav Jain
 
Eco-informatics: Data services for bringing together and publishing the full ...
Eco-informatics: Data services for bringing together and publishing the full ...Eco-informatics: Data services for bringing together and publishing the full ...
Eco-informatics: Data services for bringing together and publishing the full ...TERN Australia
 
Scientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible researchScientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible researchPeter van Heusden
 
Genome-scale Big Data Pipelines
Genome-scale Big Data PipelinesGenome-scale Big Data Pipelines
Genome-scale Big Data PipelinesLynn Langit
 
Networking Materials Data
Networking Materials DataNetworking Materials Data
Networking Materials DataIan Foster
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Robert Grossman
 
Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Robert Grossman
 
Assessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsAssessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsPeter van Heusden
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceRaul Palma
 
AusPlots field data collection with AusScribe
AusPlots field data collection with AusScribeAusPlots field data collection with AusScribe
AusPlots field data collection with AusScribeTERN Australia
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsYaoyu Wang
 
Open Notebook Science HUBzero 2011
Open Notebook Science HUBzero 2011Open Notebook Science HUBzero 2011
Open Notebook Science HUBzero 2011Jean-Claude Bradley
 

What's hot (20)

Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
 
Eco-informatics: Data services for bringing together and publishing the full ...
Eco-informatics: Data services for bringing together and publishing the full ...Eco-informatics: Data services for bringing together and publishing the full ...
Eco-informatics: Data services for bringing together and publishing the full ...
 
Scientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible researchScientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible research
 
Genome-scale Big Data Pipelines
Genome-scale Big Data PipelinesGenome-scale Big Data Pipelines
Genome-scale Big Data Pipelines
 
Beyond the PDF 2, 2013
Beyond the PDF 2, 2013Beyond the PDF 2, 2013
Beyond the PDF 2, 2013
 
Networking Materials Data
Networking Materials DataNetworking Materials Data
Networking Materials Data
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
 
Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)
 
Assessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsAssessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformatics
 
4A2B2C-2013
4A2B2C-20134A2B2C-2013
4A2B2C-2013
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth Science
 
ISMB Workshop 2014
ISMB Workshop 2014ISMB Workshop 2014
ISMB Workshop 2014
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2014 moore-ddd
2014 moore-ddd2014 moore-ddd
2014 moore-ddd
 
AusPlots field data collection with AusScribe
AusPlots field data collection with AusScribeAusPlots field data collection with AusScribe
AusPlots field data collection with AusScribe
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing InformaticsBio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
Bio-IT 2017 - Session 7: Next-Gen Sequencing Informatics
 
Open Notebook Science HUBzero 2011
Open Notebook Science HUBzero 2011Open Notebook Science HUBzero 2011
Open Notebook Science HUBzero 2011
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 

Similar to Reusable Software and Open Data To Optimize Agriculture

eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodDuncan Hull
 
Smith T Bio Hdf Bosc2008
Smith T Bio Hdf Bosc2008Smith T Bio Hdf Bosc2008
Smith T Bio Hdf Bosc2008bosc_2008
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...aceas13tern
 
D1 1440 cesar wong next generation sequencing & bio medical data analysis
D1 1440 cesar wong next generation sequencing & bio medical data analysisD1 1440 cesar wong next generation sequencing & bio medical data analysis
D1 1440 cesar wong next generation sequencing & bio medical data analysisDr. Wilfred Lin (Ph.D.)
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...GigaScience, BGI Hong Kong
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
Data dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLAnubhav Jain
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational ScienceChelle Gentemann
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data CommonsVivien Bonazzi
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...GigaScience, BGI Hong Kong
 
cyREST: Cytoscape as a Service
cyREST: Cytoscape as a ServicecyREST: Cytoscape as a Service
cyREST: Cytoscape as a ServiceKeiichiro Ono
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Materials Data Facility as Community Database to Share Nano-manufacturing Rec...
Materials Data Facility as Community Database to Share Nano-manufacturing Rec...Materials Data Facility as Community Database to Share Nano-manufacturing Rec...
Materials Data Facility as Community Database to Share Nano-manufacturing Rec...Globus
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarshiptsbbbu
 

Similar to Reusable Software and Open Data To Optimize Agriculture (20)

eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific Method
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
Smith T Bio Hdf Bosc2008
Smith T Bio Hdf Bosc2008Smith T Bio Hdf Bosc2008
Smith T Bio Hdf Bosc2008
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
 
D1 1440 cesar wong next generation sequencing & bio medical data analysis
D1 1440 cesar wong next generation sequencing & bio medical data analysisD1 1440 cesar wong next generation sequencing & bio medical data analysis
D1 1440 cesar wong next generation sequencing & bio medical data analysis
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Data dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNL
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
 
cyREST: Cytoscape as a Service
cyREST: Cytoscape as a ServicecyREST: Cytoscape as a Service
cyREST: Cytoscape as a Service
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Materials Data Facility as Community Database to Share Nano-manufacturing Rec...
Materials Data Facility as Community Database to Share Nano-manufacturing Rec...Materials Data Facility as Community Database to Share Nano-manufacturing Rec...
Materials Data Facility as Community Database to Share Nano-manufacturing Rec...
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 

More from David LeBauer

Software and data as scaffolds for integrative science
Software and data as scaffolds for integrative scienceSoftware and data as scaffolds for integrative science
Software and data as scaffolds for integrative scienceDavid LeBauer
 
Matt Jones software-interoperability
Matt Jones software-interoperabilityMatt Jones software-interoperability
Matt Jones software-interoperabilityDavid LeBauer
 
Aaron Ellison: Analytic Web
Aaron Ellison: Analytic WebAaron Ellison: Analytic Web
Aaron Ellison: Analytic WebDavid LeBauer
 
Ilkay Altintas: Kepler
Ilkay Altintas: KeplerIlkay Altintas: Kepler
Ilkay Altintas: KeplerDavid LeBauer
 
Jeroen Steenbeek Ecopath with Ecosim
Jeroen Steenbeek Ecopath with EcosimJeroen Steenbeek Ecopath with Ecosim
Jeroen Steenbeek Ecopath with EcosimDavid LeBauer
 
Aaron Ellison Keynote: Reaching the 99%
Aaron Ellison Keynote: Reaching the 99%Aaron Ellison Keynote: Reaching the 99%
Aaron Ellison Keynote: Reaching the 99%David LeBauer
 
Andrew Thomas OpenBUGS
Andrew Thomas OpenBUGSAndrew Thomas OpenBUGS
Andrew Thomas OpenBUGSDavid LeBauer
 
Anders Nielsen AD Model-Builder
Anders Nielsen AD Model-BuilderAnders Nielsen AD Model-Builder
Anders Nielsen AD Model-BuilderDavid LeBauer
 
Janine Illian R-INLA
Janine Illian R-INLAJanine Illian R-INLA
Janine Illian R-INLADavid LeBauer
 
Anders Nielsen template model-builder
Anders Nielsen template model-builderAnders Nielsen template model-builder
Anders Nielsen template model-builderDavid LeBauer
 
Advancing Software for Ecological Forecasting: Public Sessions
Advancing Software for Ecological Forecasting: Public SessionsAdvancing Software for Ecological Forecasting: Public Sessions
Advancing Software for Ecological Forecasting: Public SessionsDavid LeBauer
 

More from David LeBauer (20)

Software and data as scaffolds for integrative science
Software and data as scaffolds for integrative scienceSoftware and data as scaffolds for integrative science
Software and data as scaffolds for integrative science
 
Yiqi Luo EcoPAD
Yiqi Luo EcoPADYiqi Luo EcoPAD
Yiqi Luo EcoPAD
 
David LeBauer PEcAn
David LeBauer PEcAnDavid LeBauer PEcAn
David LeBauer PEcAn
 
David Kelly SWIFT
David Kelly SWIFTDavid Kelly SWIFT
David Kelly SWIFT
 
Matt Jones software-interoperability
Matt Jones software-interoperabilityMatt Jones software-interoperability
Matt Jones software-interoperability
 
Aaron Ellison: Analytic Web
Aaron Ellison: Analytic WebAaron Ellison: Analytic Web
Aaron Ellison: Analytic Web
 
Ilkay Altintas: Kepler
Ilkay Altintas: KeplerIlkay Altintas: Kepler
Ilkay Altintas: Kepler
 
Jeroen Steenbeek Ecopath with Ecosim
Jeroen Steenbeek Ecopath with EcosimJeroen Steenbeek Ecopath with Ecosim
Jeroen Steenbeek Ecopath with Ecosim
 
Aaron Ellison Keynote: Reaching the 99%
Aaron Ellison Keynote: Reaching the 99%Aaron Ellison Keynote: Reaching the 99%
Aaron Ellison Keynote: Reaching the 99%
 
Andrew Thomas OpenBUGS
Andrew Thomas OpenBUGSAndrew Thomas OpenBUGS
Andrew Thomas OpenBUGS
 
Martyn Plummer JAGS
Martyn Plummer JAGSMartyn Plummer JAGS
Martyn Plummer JAGS
 
Anders Nielsen AD Model-Builder
Anders Nielsen AD Model-BuilderAnders Nielsen AD Model-Builder
Anders Nielsen AD Model-Builder
 
Daniel Lee STAN
Daniel Lee STANDaniel Lee STAN
Daniel Lee STAN
 
Ed Ionides POMP
Ed Ionides POMPEd Ionides POMP
Ed Ionides POMP
 
Janine Illian R-INLA
Janine Illian R-INLAJanine Illian R-INLA
Janine Illian R-INLA
 
Andy Fox DART
Andy Fox DARTAndy Fox DART
Andy Fox DART
 
Mike Dietze PEcAn
Mike Dietze PEcAnMike Dietze PEcAn
Mike Dietze PEcAn
 
de Valpine NIMBLE
de Valpine NIMBLEde Valpine NIMBLE
de Valpine NIMBLE
 
Anders Nielsen template model-builder
Anders Nielsen template model-builderAnders Nielsen template model-builder
Anders Nielsen template model-builder
 
Advancing Software for Ecological Forecasting: Public Sessions
Advancing Software for Ecological Forecasting: Public SessionsAdvancing Software for Ecological Forecasting: Public Sessions
Advancing Software for Ecological Forecasting: Public Sessions
 

Recently uploaded

Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 

Recently uploaded (20)

Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 

Reusable Software and Open Data To Optimize Agriculture

  • 1. Reusable Software and Open Data To Optimize Agriculture David LeBauer AGU 2015 Fall Meetings @dlebauer
  • 2. Overview Ideas: Software: Modular, Reusable, and Useable Data: Harmonization, Distribution Workflows: Reproducible, Automated Science: Cumulative and Synthetic Examples: PEcAn ProjectBETYdb TERRA Ref
  • 3. Agriculture: Model and Application Food, fuel, and other ecosystem services (e.g. C, N, H2O) Basic science: genes to organism to ecosystem Engineering applications: computing, data collection, prediction Enzyme Ecosystem Continent
  • 4. Overview Ideas: Software: Modular, Reusable, and Useable Data: Harmonization, Distribution Workflows: Reproducible, Automated Science: Cumulative and Synthetic Examples: PEcAn ProjectBETYdb TERRA Ref betydb.org github.com/pecanproject/bety @BETYdatabase
  • 5. BETYdb Database for meta-analysis (BETYdb) Model-data synthesis, provenance (PEcAn) Link Genomics to Phenomics (TERRA Ref)
  • 8. BETYdb: Data entry Workflow BETYdb.org →docs →data entry workflow LeBauer et al, in prep Technicians Enter and CheckScientist Identifies Data
  • 9. Open Data Entry and QAQC Shared Spreadsheets Issue Tracker
  • 10. Data Access /search?search=Salix+vcmax RWeb Application + API BETYdb.org →docs →Data Access PEcAn
  • 11. Overview Ideas: Software: Modular, Reusable, and Useable Data: Harmonization, Distribution Workflows: Reproducible, Automated Science: Cumulative and Synthetic Examples: PEcAn ProjectBETYdb TERRA Ref pecanproject.org github.com/pecanproject/pecan @PEcAnproject
  • 13. PEcAn LeBauer et al, 2013 Ecological Model-data Synthesis
  • 14. PEcAn: complex models in complex workflows Modeling Information Systems Dietze 2016 Princeton University Press BioCro / Wimovac Crop Model Humphries and Long, 2005 Miguez et al 2009
  • 15. Ecosystem Modeling c. 2012 Select Site Configure Run Visualize, ExportRun Model Dietze, Kooper, LeBauer 2012
  • 16. LeBauer et al 2013 Given available data, How well do we know parameters? How does this affect prediction? What should we collect? PEcAn: Sensitivity Analysis & Variance Decomposition
  • 18. BETYdb + PEcAn BETYdb is PEcAn’s informatics backend Provides data, workflow and data provenance Federated network of databases
  • 19. Overview Ideas: Software: Modular, Reusable, and Useable Data: Harmonization, Distribution Workflows: Reproducible, Automated Science: Cumulative and Synthetic Examples: PEcAn ProjectBETYdb TERRA Ref terraref.ncsa.illinois.edu github.com/terraref @terra_ref
  • 20. TERRA: Better Breeding Through Science We have increased yields many times in the last 60 years. What new opportunities does modern science provide? University of Illinois Integrated Pest Management • Use scientific understanding to select for traits • Replace manual measurement with remote sensing • Target specific genes and phenotypes in crosses
  • 21. ARPA-E TERRA Program Six Funded Teams $30 m in awards $5 m in sensors TERRA Ref: Public reference data HPC Computing
  • 22. TERRA Ref: An Agricultural Observatory Similar to and informed by: Large Synoptic Survey Telescope National Ecological Observatory Network
  • 23. Open: Science, Data, Software Useable: Useful and Familiar to Scientists, Breeders, Precision Ag Modular: Extensible, Distributed, Automated, Interoperable Interdisciplinary: Genes to Ecosystems with Robots, Vision, Statistics Scalable: From Mobile Devices to High Performance Computers terraref.ncsa.illinois.edu @terra_ref github.com/terraref TERRA Reference Data and Computing
  • 24. Sensor Data Sources Lemnatec Indoor Danforth, St. Louis Lemnatec Field USDA ALRC, Maricopa, AZ Tractor and UAV Kansas State Plus, other teams, public, (sharing optional) Shared Sorghum genomics and germplasm,
  • 25. Reference Data Raw Sequence Data Aligned Reads SNPs Images Spectra Point clouds Shapes Biomass, Growth Tissue Chemistry Photosynthesis Yield Stress Tolerance Ecosystem Services
  • 26. Big Data Volume & Velocity Imaging Spectrometers: VNIR ~3-4 TB/d SWIR ~1 TB/d 3D Laser Scanner ~ 1 TB/d 4 Year Total: 1 - 40 PB VNIR SWIR 3D … Everything else
  • 27. Computing and Storage Roger Server: 1PB online, GIS optimized Nebula: NCSA Open Stack Server Blue Waters: 10 PB tape storage Your Local: [Desktop, HPC, or Sensor Platform]
  • 28. Data Products Standards Committee Paul Bartlett Near Earth Autonomy Jeff White USDA ALARC, ICASA Melba Crawford Purdue University Michael Gore, Elodie Garazave Cornell University Matt Colgan Blue River Christer Janssen PNNL Barnabas Poczos Carnegie Mellon Alex Thomasson Texas A&M University Cheryl Porter University of Florida, AgMIP, USDA Shawn Serbin Brookhaven National Lab, PEcAn Shelly Petroy Christine Laney NEON Carolyn J. Lawrence-Dill Iowa State, AgBioData Eric Lyons University of Arizona, CoGE Ted Habermann HDF Group Participants • Project representatives • Domain Experts • Scientific Community (You)* Responsibilities • Define Data • Revise, Improve • Training, Outreach * github.com/terraref/reference-data/issues
  • 29. Computing Pipeline Data Uploaded via API Triggers Analytical Pipeline Generates and Stores Data, Metadata Users select data, launch VM: Favorite Software Data Mounted HPC Access States can be Shared, Archived
  • 30. Acknowledgements Projects: PEcAn, NCSA, BrownDog, Plants In Silico, CyberGIS, National Data Service, USDA, AgMIP, National Data Service Data: Providers and Curators Mentors: Mike Dietze, Steve Long, Kathleen Treseder Funding: NSF, EBI, ARPA-E, DOE, NASA
  • 31. Contact Web GitHub s Twitter a David LeBauer dlebauer@illinois.edu dlebauer @dlebauer BETYdb betydb.org pecanproject/bety @BETYdatabase PEcAn Project pecaproject.org pecanproject/pecan @PEcAnproject TERRA Ref terraref.ncsa.illinois.edu terraref @terra_ref
  • 32. PIs Amy Colin-Marshal, Steve Long, James O’Dwyer, Diwakar Shukla Plants in Silico Multi-scale modeling platform to predict crop response to climate change
  • 33. Plants in Silico: Modular Architecture Zhu et al, 2015 Plant Cell Environ