SlideShare a Scribd company logo
EarthCube Conceptual Design:
Enterprise Architecture for
Transformative Research
and Collaboration
Across the Geosciences
http://workspace.earthcube.org/transformative-research-collaboration
ILYA ZASLAVSKY, DAVID VALENTINE, AMARNATH GUPTA
San Diego Supercomputer Center/UCSD
STEPHEN RICHARD
Arizona Geological Survey
TANU MALIK
University of Chicago
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
The Science Enterprise
• Ask questions
• Collect information
• Formulate hypotheses
• Test hypotheses to
determine which (if any)
provide satisfactory answer
• Document, curate, and
disseminate data and
results.
…. AND INCREASINGLY:
• Integrate data, analyses,
models across domains
• Collaborate: leverage pooled expertise and resources
increasing amount of data produced in modern science. LSDMA
bridges the gap between data production and data analysis using
a novel approach by combining specific community support and
generic, cross community development. In the Data Life Cycle
Labs (DLCL) experts from the data domain work closely with
scientific groups of selected research domains in joint R&D
where community-specific data life cycles are iteratively
optimized, data and meta-data formats are defined and
standardized, simple access and use is established as well as data
and scientific insights are preserved in long-term and open
accessible archives.
Keywords: data management, data life cycle, data intensive
computing, data analysis, data exploration, LSDMA, support, data
infrastructure
I. INTRODUCTION
Today data is knowledge – data exploration has become the
4th pillar in modern science besides experiment, theory, and
simulation as postulated by Jim Gray in 2007 [1]. Rapidly
increasing data rates in experiments, measurements and
simulation are limiting the speed of scientific production in
various research communities and the gap between the
generated data and data entering the data life cycle (cf. Fig1) is
widening. By providing high performance data management
components, analysis tools, computing resources, storage and
services it is possible to address this challenge but the
realization of a data intensive infrastructure at institutes and
universities is usually time consuming and always expensive.
The introduced “Large Scale Data Management and Analysis”
(LSDMA) project extends the services for research of the
Helmholtz Association of research centers in Germany with
community specific Data Life Cycle Laboratories (DLCL). The
The LSDMA project initiated at the Karlsruhe Institute of
Technology (KIT), builds on the familiarity with supporting
local scientists at a computer center, the knowledge of running
the Grid Computing Centre Karlsruhe (GridKa) [2] as the
German Tier 1 hub in the World Wide LHC Computing
infrastructure [3], the Large Scale Data Facility (LSDF) [4] and
the experience with the very successful Simulation Labs [5]
that specialize at supporting HPC users.
Figure 1. The scientific data life cycle
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Design Framework:
Federation of Systems
Research enterprise includes subsystems at the project, program and
agency level, many of which are independent of NSF
• Requirements are a moving target
• Emergent behavior is to be expected
• Technology is constantly changing
• Community governance within constraints of funding agencies
• Evolutionary process and adaptation:
• Lots of variation; Mechanism to select ‘fittest’; Composability
• Technology must foster delegation of responsibilities and communication:
• Promote self-organization, Cultivate ideas, Maintain feedback between
subsystems
• Reliability: responsiveness, robustness, correctness
• Identity of system is based on shared goals and practices
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Communication loops
Bottom-up Studies
Top-down Studies
Cross-Domain Scientists
Trends and
Patterns
Data
interoperability
best practices
Scientific Governance
Success stories
Technical Governance
Data Providers
Feasibility
Priorities
Strategies
Data Products
Options
Costs
Problems
and issues
Related work
Questions and
clarifications
Questions and
clarifications
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Communication metrics
Components and Perspectives on EarthCube
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Converging on reference
architecture semantics
 Analysis of existing building blocks, and their variability
 Component
 System
 Function
 Description
 Interfaces
 Implementation
 Steward Organization
 Availability
 Reference
 Developing cross-domain vocabularies, connecting domain models
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Requirements Process
 Workshop Summaries
 Surveys
 Architecture Designs
 Analyze what worked
 Incorporate social
technologies
 Inventory CI building blocks
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Concerns
 Hitting the right level of granularity in the design
 Identifying necessary communication channels
 Account for all key perspectives
 Fixing the scope and technologies
 Balancing current and future requirements
 Harmonizing technical and social subsystems and managing
interactions between them
 Uneven standardization and convergence across domains
and functional components
 Constructing a self-organizing plug-and-play system
 Inventorying building blocks
Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences
Summary
 System is defined by:
 Specifications for interfaces and interchange formats (the gateways)
 Definition of key functional components at an abstract level
 Discovery, Workflow s, Data processing, annotation, documentation
 Technology needs to support
 Communication between subsystems (people and machines)
 Collection of metrics required to assess what is working (selection
of the fittest)
 Assembly of components

More Related Content

What's hot

SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
Philip Bourne
 
Infrastructure for Supporting Computational Social Science
Infrastructure for Supporting Computational Social ScienceInfrastructure for Supporting Computational Social Science
Infrastructure for Supporting Computational Social Science
Derek Hansen
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
Louise Corti
 
BD2K Update
BD2K UpdateBD2K Update
BD2K Update
Philip Bourne
 
Energy resource management
Energy resource managementEnergy resource management
Energy resource management
Riddhima Kartik
 
Smart Geo. Guido Satta (Maggio 2015)
Smart Geo. Guido Satta (Maggio 2015)Smart Geo. Guido Satta (Maggio 2015)
Smart Geo. Guido Satta (Maggio 2015)
CRS4 Research Center in Sardinia
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia Research
David De Roure
 
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
Philip Bourne
 
SEEKing our way to better presentation of data and models from scientific inv...
SEEKing our way to better presentation of data and models from scientific inv...SEEKing our way to better presentation of data and models from scientific inv...
SEEKing our way to better presentation of data and models from scientific inv...
Natalie Stanford
 
20160607 citation4software opening
20160607 citation4software opening20160607 citation4software opening
20160607 citation4software opening
Daniel S. Katz
 
NIH Data Commons - Note: Presentation has animations
NIH Data Commons  - Note:  Presentation has animations NIH Data Commons  - Note:  Presentation has animations
NIH Data Commons - Note: Presentation has animations
Vivien Bonazzi
 
The commons credit model pilot
The commons credit model pilotThe commons credit model pilot
The commons credit model pilot
George Komatsoulis
 
Big Data becomes Big Analysis
Big Data becomes Big Analysis Big Data becomes Big Analysis
Big Data becomes Big Analysis
OSTHUS
 
Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...
Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...
Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...
Brian Wee
 
Life science requirements from e-infrastructure: initial results from a joint...
Life science requirements from e-infrastructure:initial results from a joint...Life science requirements from e-infrastructure:initial results from a joint...
Life science requirements from e-infrastructure: initial results from a joint...
Rafael C. Jimenez
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
ARDC
 
Table of Content - International Journal of Managing Information Technology (...
Table of Content - International Journal of Managing Information Technology (...Table of Content - International Journal of Managing Information Technology (...
Table of Content - International Journal of Managing Information Technology (...
IJMIT JOURNAL
 
Data lifecycle mgt across the enterprise
Data lifecycle mgt across the enterpriseData lifecycle mgt across the enterprise
Data lifecycle mgt across the enterprise
OSTHUS
 
Trust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADTrust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEAD
Beth Plale
 
Building Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated DiscoveryBuilding Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated Discovery
adamkraut
 

What's hot (20)

SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?SWOT Analysis - What Does it Tell Us?
SWOT Analysis - What Does it Tell Us?
 
Infrastructure for Supporting Computational Social Science
Infrastructure for Supporting Computational Social ScienceInfrastructure for Supporting Computational Social Science
Infrastructure for Supporting Computational Social Science
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
 
BD2K Update
BD2K UpdateBD2K Update
BD2K Update
 
Energy resource management
Energy resource managementEnergy resource management
Energy resource management
 
Smart Geo. Guido Satta (Maggio 2015)
Smart Geo. Guido Satta (Maggio 2015)Smart Geo. Guido Satta (Maggio 2015)
Smart Geo. Guido Satta (Maggio 2015)
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia Research
 
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
The Role of Automated Function Prediction in the Era of Big Data and Small Bu...
 
SEEKing our way to better presentation of data and models from scientific inv...
SEEKing our way to better presentation of data and models from scientific inv...SEEKing our way to better presentation of data and models from scientific inv...
SEEKing our way to better presentation of data and models from scientific inv...
 
20160607 citation4software opening
20160607 citation4software opening20160607 citation4software opening
20160607 citation4software opening
 
NIH Data Commons - Note: Presentation has animations
NIH Data Commons  - Note:  Presentation has animations NIH Data Commons  - Note:  Presentation has animations
NIH Data Commons - Note: Presentation has animations
 
The commons credit model pilot
The commons credit model pilotThe commons credit model pilot
The commons credit model pilot
 
Big Data becomes Big Analysis
Big Data becomes Big Analysis Big Data becomes Big Analysis
Big Data becomes Big Analysis
 
Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...
Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...
Conceptual Architecture for USDA and NSF Terrestrial Observation Network Inte...
 
Life science requirements from e-infrastructure: initial results from a joint...
Life science requirements from e-infrastructure:initial results from a joint...Life science requirements from e-infrastructure:initial results from a joint...
Life science requirements from e-infrastructure: initial results from a joint...
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
Table of Content - International Journal of Managing Information Technology (...
Table of Content - International Journal of Managing Information Technology (...Table of Content - International Journal of Managing Information Technology (...
Table of Content - International Journal of Managing Information Technology (...
 
Data lifecycle mgt across the enterprise
Data lifecycle mgt across the enterpriseData lifecycle mgt across the enterprise
Data lifecycle mgt across the enterprise
 
Trust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADTrust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEAD
 
Building Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated DiscoveryBuilding Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated Discovery
 

Similar to AHM 2014: Enterprise Architecture for Transformative Research and Collaboration Across Geoscinces

Cyberistructure
CyberistructureCyberistructure
Cyberistructure
Lab Southwest
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
The University of Edinburgh
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
EDINA, University of Edinburgh
 
Virtual Research Environments
Virtual Research EnvironmentsVirtual Research Environments
Virtual Research Environments
Joss Winn
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
Vivien Bonazzi
 
Hattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop SlidesHattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop Slides
Jason Hattrick-Simpers
 
Mendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperMendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 Paper
William Gunn
 
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
Ilkay Altintas, Ph.D.
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
ASIS&T
 
Hedstrom Infrastructure
Hedstrom InfrastructureHedstrom Infrastructure
Hedstrom Infrastructure
guest2c9ba28e
 
Replicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearchReplicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearch
Andrea Wiggins
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016
Jisc RDM
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
Carole Goble
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACI
Daniel S. Katz
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Sandra Gesing
 
Research process and research data management
Research  process and research data managementResearch  process and research data management
Research process and research data management
Ken Chad Consulting Ltd
 
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research DataUKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG: connecting the knowledge community
 
PresentationTest
PresentationTestPresentationTest
PresentationTest
bolu804
 
Presentation 2019 08-30
Presentation 2019 08-30Presentation 2019 08-30
Presentation 2019 08-30
Mahdi_Fahmideh
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
TERN Australia
 

Similar to AHM 2014: Enterprise Architecture for Transformative Research and Collaboration Across Geoscinces (20)

Cyberistructure
CyberistructureCyberistructure
Cyberistructure
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Virtual Research Environments
Virtual Research EnvironmentsVirtual Research Environments
Virtual Research Environments
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Hattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop SlidesHattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop Slides
 
Mendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 PaperMendeley Open Repositories 2011 Paper
Mendeley Open Repositories 2011 Paper
 
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Hedstrom Infrastructure
Hedstrom InfrastructureHedstrom Infrastructure
Hedstrom Infrastructure
 
Replicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearchReplicating FLOSS Research as eResearch
Replicating FLOSS Research as eResearch
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACI
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
 
Research process and research data management
Research  process and research data managementResearch  process and research data management
Research process and research data management
 
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research DataUKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
 
PresentationTest
PresentationTestPresentationTest
PresentationTest
 
Presentation 2019 08-30
Presentation 2019 08-30Presentation 2019 08-30
Presentation 2019 08-30
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
 

More from EarthCube

Community Webinar: Tune up for AGU
Community Webinar: Tune up for AGUCommunity Webinar: Tune up for AGU
Community Webinar: Tune up for AGU
EarthCube
 
Engagement Team monthly meeting 10.10.2014
Engagement Team monthly meeting 10.10.2014Engagement Team monthly meeting 10.10.2014
Engagement Team monthly meeting 10.10.2014
EarthCube
 
Sci Committee Meeting Slides 10.06.14
Sci Committee Meeting Slides 10.06.14Sci Committee Meeting Slides 10.06.14
Sci Committee Meeting Slides 10.06.14
EarthCube
 
Funded teams slides 10.10.14
Funded teams slides 10.10.14Funded teams slides 10.10.14
Funded teams slides 10.10.14
EarthCube
 
Technology and Architecture Committee meeting slides 10.06.14
Technology and Architecture Committee meeting slides 10.06.14Technology and Architecture Committee meeting slides 10.06.14
Technology and Architecture Committee meeting slides 10.06.14
EarthCube
 
EarthCube Governance Intro for Solar Terrestrial End-user Workshop
EarthCube Governance Intro for Solar Terrestrial End-user WorkshopEarthCube Governance Intro for Solar Terrestrial End-user Workshop
EarthCube Governance Intro for Solar Terrestrial End-user Workshop
EarthCube
 
EarthCube Community Webinar: Introduction to Committees and Teams
EarthCube Community Webinar: Introduction to Committees and TeamsEarthCube Community Webinar: Introduction to Committees and Teams
EarthCube Community Webinar: Introduction to Committees and Teams
EarthCube
 
AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...
AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...
AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...
EarthCube
 
AHM 2014: PolarHub: A Global Hub for Geospatial Service Discovery
AHM 2014: PolarHub: A Global Hub for Geospatial Service DiscoveryAHM 2014: PolarHub: A Global Hub for Geospatial Service Discovery
AHM 2014: PolarHub: A Global Hub for Geospatial Service Discovery
EarthCube
 
AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...
AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...
AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...
EarthCube
 
AHM 2014: Revisting Governance Model, Preparing for Next Steps
AHM 2014: Revisting Governance Model, Preparing for Next StepsAHM 2014: Revisting Governance Model, Preparing for Next Steps
AHM 2014: Revisting Governance Model, Preparing for Next Steps
EarthCube
 
AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...
AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...
AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...
EarthCube
 
AHM 2014: Crawling for EarthCube
AHM 2014: Crawling for EarthCubeAHM 2014: Crawling for EarthCube
AHM 2014: Crawling for EarthCube
EarthCube
 
AHM 2014: The Flow Simulation Tools on VHub
AHM 2014: The Flow Simulation Tools on VHubAHM 2014: The Flow Simulation Tools on VHub
AHM 2014: The Flow Simulation Tools on VHub
EarthCube
 
AHM 2014: Integrated Data Management System for Critical Zone Observatories
AHM 2014: Integrated Data Management System for Critical Zone ObservatoriesAHM 2014: Integrated Data Management System for Critical Zone Observatories
AHM 2014: Integrated Data Management System for Critical Zone Observatories
EarthCube
 
Peckham 2014 i_em_ss
Peckham 2014 i_em_ssPeckham 2014 i_em_ss
Peckham 2014 i_em_ss
EarthCube
 
AHM 2014: BCube Brokering Framework
AHM 2014: BCube Brokering FrameworkAHM 2014: BCube Brokering Framework
AHM 2014: BCube Brokering Framework
EarthCube
 
AHM 2014: EarthCube Architecture Forum Introduction
AHM 2014: EarthCube Architecture Forum IntroductionAHM 2014: EarthCube Architecture Forum Introduction
AHM 2014: EarthCube Architecture Forum Introduction
EarthCube
 
AHM 2014: A Few Notes on GEOSS Architecture
AHM 2014: A Few Notes on GEOSS ArchitectureAHM 2014: A Few Notes on GEOSS Architecture
AHM 2014: A Few Notes on GEOSS Architecture
EarthCube
 
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
EarthCube
 

More from EarthCube (20)

Community Webinar: Tune up for AGU
Community Webinar: Tune up for AGUCommunity Webinar: Tune up for AGU
Community Webinar: Tune up for AGU
 
Engagement Team monthly meeting 10.10.2014
Engagement Team monthly meeting 10.10.2014Engagement Team monthly meeting 10.10.2014
Engagement Team monthly meeting 10.10.2014
 
Sci Committee Meeting Slides 10.06.14
Sci Committee Meeting Slides 10.06.14Sci Committee Meeting Slides 10.06.14
Sci Committee Meeting Slides 10.06.14
 
Funded teams slides 10.10.14
Funded teams slides 10.10.14Funded teams slides 10.10.14
Funded teams slides 10.10.14
 
Technology and Architecture Committee meeting slides 10.06.14
Technology and Architecture Committee meeting slides 10.06.14Technology and Architecture Committee meeting slides 10.06.14
Technology and Architecture Committee meeting slides 10.06.14
 
EarthCube Governance Intro for Solar Terrestrial End-user Workshop
EarthCube Governance Intro for Solar Terrestrial End-user WorkshopEarthCube Governance Intro for Solar Terrestrial End-user Workshop
EarthCube Governance Intro for Solar Terrestrial End-user Workshop
 
EarthCube Community Webinar: Introduction to Committees and Teams
EarthCube Community Webinar: Introduction to Committees and TeamsEarthCube Community Webinar: Introduction to Committees and Teams
EarthCube Community Webinar: Introduction to Committees and Teams
 
AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...
AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...
AHM 2014: The CSDMS Standard Names, Cross-Domain Naming Conventions for Descr...
 
AHM 2014: PolarHub: A Global Hub for Geospatial Service Discovery
AHM 2014: PolarHub: A Global Hub for Geospatial Service DiscoveryAHM 2014: PolarHub: A Global Hub for Geospatial Service Discovery
AHM 2014: PolarHub: A Global Hub for Geospatial Service Discovery
 
AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...
AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...
AHM 2014: Addressing Data and Heterogeneity, Semantic Building Blocks & CI Pe...
 
AHM 2014: Revisting Governance Model, Preparing for Next Steps
AHM 2014: Revisting Governance Model, Preparing for Next StepsAHM 2014: Revisting Governance Model, Preparing for Next Steps
AHM 2014: Revisting Governance Model, Preparing for Next Steps
 
AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...
AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...
AHM 2014: The World of VHub.org ONline Collaboration, Sharing, Data, Models...
 
AHM 2014: Crawling for EarthCube
AHM 2014: Crawling for EarthCubeAHM 2014: Crawling for EarthCube
AHM 2014: Crawling for EarthCube
 
AHM 2014: The Flow Simulation Tools on VHub
AHM 2014: The Flow Simulation Tools on VHubAHM 2014: The Flow Simulation Tools on VHub
AHM 2014: The Flow Simulation Tools on VHub
 
AHM 2014: Integrated Data Management System for Critical Zone Observatories
AHM 2014: Integrated Data Management System for Critical Zone ObservatoriesAHM 2014: Integrated Data Management System for Critical Zone Observatories
AHM 2014: Integrated Data Management System for Critical Zone Observatories
 
Peckham 2014 i_em_ss
Peckham 2014 i_em_ssPeckham 2014 i_em_ss
Peckham 2014 i_em_ss
 
AHM 2014: BCube Brokering Framework
AHM 2014: BCube Brokering FrameworkAHM 2014: BCube Brokering Framework
AHM 2014: BCube Brokering Framework
 
AHM 2014: EarthCube Architecture Forum Introduction
AHM 2014: EarthCube Architecture Forum IntroductionAHM 2014: EarthCube Architecture Forum Introduction
AHM 2014: EarthCube Architecture Forum Introduction
 
AHM 2014: A Few Notes on GEOSS Architecture
AHM 2014: A Few Notes on GEOSS ArchitectureAHM 2014: A Few Notes on GEOSS Architecture
AHM 2014: A Few Notes on GEOSS Architecture
 
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
 

Recently uploaded

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Zilliz
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 

Recently uploaded (20)

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 

AHM 2014: Enterprise Architecture for Transformative Research and Collaboration Across Geoscinces

  • 1. EarthCube Conceptual Design: Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences http://workspace.earthcube.org/transformative-research-collaboration ILYA ZASLAVSKY, DAVID VALENTINE, AMARNATH GUPTA San Diego Supercomputer Center/UCSD STEPHEN RICHARD Arizona Geological Survey TANU MALIK University of Chicago
  • 2. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences The Science Enterprise • Ask questions • Collect information • Formulate hypotheses • Test hypotheses to determine which (if any) provide satisfactory answer • Document, curate, and disseminate data and results. …. AND INCREASINGLY: • Integrate data, analyses, models across domains • Collaborate: leverage pooled expertise and resources increasing amount of data produced in modern science. LSDMA bridges the gap between data production and data analysis using a novel approach by combining specific community support and generic, cross community development. In the Data Life Cycle Labs (DLCL) experts from the data domain work closely with scientific groups of selected research domains in joint R&D where community-specific data life cycles are iteratively optimized, data and meta-data formats are defined and standardized, simple access and use is established as well as data and scientific insights are preserved in long-term and open accessible archives. Keywords: data management, data life cycle, data intensive computing, data analysis, data exploration, LSDMA, support, data infrastructure I. INTRODUCTION Today data is knowledge – data exploration has become the 4th pillar in modern science besides experiment, theory, and simulation as postulated by Jim Gray in 2007 [1]. Rapidly increasing data rates in experiments, measurements and simulation are limiting the speed of scientific production in various research communities and the gap between the generated data and data entering the data life cycle (cf. Fig1) is widening. By providing high performance data management components, analysis tools, computing resources, storage and services it is possible to address this challenge but the realization of a data intensive infrastructure at institutes and universities is usually time consuming and always expensive. The introduced “Large Scale Data Management and Analysis” (LSDMA) project extends the services for research of the Helmholtz Association of research centers in Germany with community specific Data Life Cycle Laboratories (DLCL). The The LSDMA project initiated at the Karlsruhe Institute of Technology (KIT), builds on the familiarity with supporting local scientists at a computer center, the knowledge of running the Grid Computing Centre Karlsruhe (GridKa) [2] as the German Tier 1 hub in the World Wide LHC Computing infrastructure [3], the Large Scale Data Facility (LSDF) [4] and the experience with the very successful Simulation Labs [5] that specialize at supporting HPC users. Figure 1. The scientific data life cycle
  • 3. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Design Framework: Federation of Systems Research enterprise includes subsystems at the project, program and agency level, many of which are independent of NSF • Requirements are a moving target • Emergent behavior is to be expected • Technology is constantly changing • Community governance within constraints of funding agencies • Evolutionary process and adaptation: • Lots of variation; Mechanism to select ‘fittest’; Composability • Technology must foster delegation of responsibilities and communication: • Promote self-organization, Cultivate ideas, Maintain feedback between subsystems • Reliability: responsiveness, robustness, correctness • Identity of system is based on shared goals and practices
  • 4. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Communication loops Bottom-up Studies Top-down Studies Cross-Domain Scientists Trends and Patterns Data interoperability best practices Scientific Governance Success stories Technical Governance Data Providers Feasibility Priorities Strategies Data Products Options Costs Problems and issues Related work Questions and clarifications Questions and clarifications
  • 5. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Communication metrics
  • 7. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Converging on reference architecture semantics  Analysis of existing building blocks, and their variability  Component  System  Function  Description  Interfaces  Implementation  Steward Organization  Availability  Reference  Developing cross-domain vocabularies, connecting domain models
  • 8. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Requirements Process  Workshop Summaries  Surveys  Architecture Designs  Analyze what worked  Incorporate social technologies  Inventory CI building blocks
  • 9. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Concerns  Hitting the right level of granularity in the design  Identifying necessary communication channels  Account for all key perspectives  Fixing the scope and technologies  Balancing current and future requirements  Harmonizing technical and social subsystems and managing interactions between them  Uneven standardization and convergence across domains and functional components  Constructing a self-organizing plug-and-play system  Inventorying building blocks
  • 10. Enterprise Architecture for Transformative Research and Collaboration Across the Geosciences Summary  System is defined by:  Specifications for interfaces and interchange formats (the gateways)  Definition of key functional components at an abstract level  Discovery, Workflow s, Data processing, annotation, documentation  Technology needs to support  Communication between subsystems (people and machines)  Collection of metrics required to assess what is working (selection of the fittest)  Assembly of components