Sustainable and Smart Energy Carriers
for Decentralised Energy Production
PRESENTATION
Data Mining Challenges in Distributed Generation
Edward S. Blurock
Blurock Consulting AB
(previously with
Malmö University: Computer Science Dept.
Lund University: Combustion Physics, Energy Sciences
Research Institute for Symbolic Computation
University of California, Irvine: Thesis, Computational Chemistry)
bottom line: a career in (chemical) modelling
(using data, AI and machine learning/data mining …)
(SLIGHTLY) REVISED TITLE
Data Mining Challenges in Distributed Generation
Community(?)
Combustion Community(?)
Data Mining Challenges
from the widely distributed data generated by the scientific community
specifically for those dealing with all aspects of combustion
WHAT WE ARE TALKING ABOUT
Data
Theme:
you have to have data available
before you can do something with it
PRESENTATION
• Introduction (with disclaimers and revisions)
• Motivation:
• Data exchange moving into the clouds
• WG4:
• Standard definition for data collection and mining toward a virtual
chemistry of Smart Energy Carriers
• WG4 Task Force:
• Toward efficient data exchange in the combustion community
DATA PERSPECTIVE: DATA EXCHANGE
• Data is the backbone of modern scientific research
• Exchange of data is paramount to successful interaction between research groups
OPEN DATA
Publications and
conferences
Data exchanged between
researchers (email, etc)
Virtual Research Environment
papers
Data files
Clouds (infrastructures)
TOWARD A VIRTUAL SCIENTIFIC ENVIRONMENT
We are not alone in this development (maybe a bit behind)
DATA PERSPECTIVE GOALS: SOCIAL NETWORK
Need tools to promote efficient data sharing within the community
DATA PERSPECTIVE: MANY SOURCES
Need to accommodate the varied data that needs to be handled
DATA PERSPECTIVE: INTERRELATIONSHIPS
There is no such thing as an isolated data point
DATA PERSPECTIVE: QUALITY CONTROL
Reproducibility
Reliability
Accountability
Due to accountability requirements (financial incentives),
data management tools are already being used
An important aspect of interdependency of data
is quality control
(calculation of sensitivity or error bars)
Efficient data exchange and availability
(beyond just published data)
is the key
ACCOUNTABILITY: ELECTRONIC LAB NOTEBOOKS (ELN)
In other fields (pharma), accountability has financial motivations (patents),
which has led to the development and use of ELNs
TOWARDS EFFICIENT DATA EXCHANGE
SMARTCATS
WG4
WG4 SUMMARY
WG4 can be summarized in one word: DATA
Management of data:
How do we keep track of, exchange and manage all the data
that is generated by the SMARTCATS community?
Use of data:
How can we efficiently use the immense amount of data
that the SMARTCATS community generates?
WG4: TITLE
Standard definition
for data collection and mining
toward a virtual chemistry of smart carriers
WG4 CHALLENGE
The main challenge of this WG is to provide a
forum for all experts in the combustion
community to formulate a common set of
requirements for a universal combustion
database that is not only capable of efficiently
storing the vast amount of raw data generated
by experiments and modeling but also, more
importantly, efficiently accessible for
future use and maintenance.
WG4: AIMS
• Identification of the main requirements and
tools for the development of databases,
software and mathematical tools for data
collection and handling as well as chemistry
optimization using data mining techniques.
• Definition of “crucial” experiments and
simulations, uncertainty and sensitivity
analysis in combustion modeling
DEFINITIONS, REQUIREMENTS AND TOOLS
WG4: INCREASED DIALOG ABOUT DATA
WG4: DATA PERSPECTIVE
Definition of specific sets of prerequisites and
goals for the establishment of a
combustion database that will allow
efficient electronic communication of
combustion-related data.
WG4 TASK FORCE
Goal: To use the expertise within the action
to promote efficient data exchange
among combustion researchers
First task: Cataloging
1. State of the art (in and out of the community)
2. Data within the community
GOALS OF PHASE I: CATALOGING
• Roles and Perspectives:
• For each role/perspective catalog a prioritised set of requirements,
expectations and desires
• Data to Disseminate
• For each apparatus and tool, outline (in words, mainly) the data
that could/should be available, from raw data to final published
results.
• Current efforts (inside the Action and outside)
• Catalog how different groups are storing data
• Catalog other data handling from other disciplines
• Projects/proposals/discussions having to do with data
PERSPECTIVES AND ROLES
• User: Interested in using the tools to acquire and use the data.
• In this role, the user is interested in accessing data in a convenient and efficient way.
The user is also interested in what data is available.
• Generator: Generates data, both experimental and theoretical.
• The first focus is how much, in which detail and in what form the data should/can be
disseminated.
• An important aspect of this is to make this as painless and efficient as possible so as
to not generate more burden.
• Software/Database Developer: Developer of the tool.
• From User: How and in what form the data can be accessed.
• From Generator: Incorporating their data into whatever system they are developing.
LEVELS OF DATA
•Public: tables or figures within the text of the publication, or as more
detailed information in supplementary material
•Preliminary: Data leading up to published data
•Experiment: Data directly from the device, uninterpreted and
unedited.
•Intermediate: Data that has been processed but is still very device
dependent and not necessarily useful to others. In a sense, this is only
useful within the research group.
•Collaboration: Data that is useful to exchange among
(knowledgeable?) colleagues and collaborators
PARTICULAR FOCUS
•Preliminary:
•Data leading up to published data
• Accessibility
• Usefulness
• Characterisation
• Breadth of exchange: Public, collaborators, within group…
CATALOGING
WHAT
data is to be cataloged, and its availability,
has to be settled first (a major goal of the first phase)
HOW
the data is to be stored
is of secondary importance:
• Catalog with respect to particular devices and models
• Within each:
• What are the data types and forms
• Characterisation of the data
• Quality of the data
• Usefulness
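The catalog items above could be captured in a small record type. The following is a minimal Python sketch, not a format agreed on by WG4: the class and field names (`CatalogEntry`, `source`, `data_types` and so on) and the two sample entries are hypothetical, and the level names are taken from the "Levels of data" slide.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from enum import Enum


class DataLevel(Enum):
    """Levels of data named on the 'Levels of data' slide."""
    EXPERIMENT = "experiment"
    INTERMEDIATE = "intermediate"
    COLLABORATION = "collaboration"
    PRELIMINARY = "preliminary"
    PUBLIC = "public"


@dataclass
class CatalogEntry:
    """One catalogued data set (hypothetical field names)."""
    source: str                                     # device or model that produced the data
    data_types: list = field(default_factory=list)  # data types and forms
    level: DataLevel = DataLevel.PRELIMINARY        # breadth of exchange
    characterisation: str = ""
    quality: str = ""
    usefulness: str = ""


def group_by_source(entries):
    """Catalog with respect to particular devices and models."""
    grouped = defaultdict(list)
    for entry in entries:
        grouped[entry.source].append(entry)
    return dict(grouped)


# Illustrative entries only; they do not describe a real data set.
entries = [
    CatalogEntry("jet stirred reactor", ["mole fraction vs. temperature table"], DataLevel.PUBLIC),
    CatalogEntry("jet stirred reactor", ["raw instrument output"], DataLevel.EXPERIMENT),
]
catalog = group_by_source(entries)
for source, items in catalog.items():
    print(source, [item.level.value for item in items])
```

Keeping the level as an explicit field makes the breadth of exchange (public, collaborators, within group) something a tool can filter on rather than something implied by where a file happens to be stored.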
HOW: SECONDARY FOCUS
• De facto standards
• In moving towards electronic representation, the community is already in the
process of establishing standards
• Convenience:
• Researchers generate data and ‘store’ it in the most convenient form
available to them (generation of data is the primary concern).
• Software:
• As long as the format is ‘consistent’, intelligent software can interpret it and
then convert to another ‘standard’ form.
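The software point above can be illustrated with a minimal Python sketch: as long as each line follows one consistent pattern, a few lines of code can read it and emit another representation. The "name: value unit" pattern, the function names and the output element names (`commonProperties`, `property`, `value`) are assumptions made for this illustration, loosely modeled on the two format excerpts shown later in this presentation; this is not an official converter for either format.

```python
import re
import xml.etree.ElementTree as ET

# Lines in a consistent "name: value unit" form, copied from the
# spreadsheet-style excerpt shown later in this presentation.
LINES = [
    "Pressure: 106.7 kPa",
    "Volume: 30 cm3",
    "Residence Time: 2 s",
]

# Assumed pattern: a name, a colon, a number, then an optional unit.
LINE_PATTERN = re.compile(
    r"^\s*(?P<name>[^:]+):\s*(?P<value>[-+0-9.eE]+)\s*(?P<units>\S*)\s*$"
)


def parse_line(line):
    """Interpret one consistently formatted line as a property record."""
    match = LINE_PATTERN.match(line)
    if match is None:
        raise ValueError(f"unrecognised line: {line!r}")
    return {
        "name": match["name"].strip().lower(),
        "value": float(match["value"]),
        "units": match["units"],
    }


def to_xml(records):
    """Convert the records to an XML tree (element names are illustrative)."""
    root = ET.Element("commonProperties")
    for record in records:
        prop = ET.SubElement(root, "property", name=record["name"], units=record["units"])
        ET.SubElement(prop, "value").text = str(record["value"])
    return root


records = [parse_line(line) for line in LINES]
xml_text = ET.tostring(to_xml(records), encoding="unicode")
print(xml_text)
```

The parser is the only part that depends on the source format, so supporting another consistent layout means adding another `parse_line` variant while the conversion step stays unchanged.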
DIFFERENT ‘STANDARDS’: TWO TYPICAL FORMATS
<?xml version="1.0" encoding="utf-8"?>
<experiment>
<fileAuthor>Chemical Kinetics Laboratory, Institute of Chemistry, ELTE, Budapest, Hungary</fileAuthor>
<fileVersion>
<major>1</major>
<minor>0</minor>
</fileVersion>
<ReSpecThVersion>
<major>1</major>
<minor>0</minor>
</ReSpecThVersion>
<bibliographyLink preferredKey="N. Leplat, P. Dagaut, C. Togbe, J. Vandooren,
Combust. Flame 158 (2011) 705-725, Fig. 9, C3H6 not taken"/>
<apparatus>
<kind>stirred reactor</kind>
</apparatus>
<experimentType>Jet stirred reactor measurement</experimentType>
<commonProperties>
<property description="" label="P" name="pressure" units="atm">
<value>1</value></property>
<property description="" label="V" name="volume" units="cm3">
<value>30</value></property>
<property description="" label="tau" name="residence time" units="s" >
<value>0.07</value></property>
<property name="initial composition">
<component><speciesLink preferredKey="C2H5OH" />
<amount units="mole fraction">0.002</amount></component>
<component><speciesLink preferredKey="O2" />
<amount units="mole fraction">0.024</amount></component>
<component><speciesLink preferredKey="N2" />
<amount units="mole fraction">0.974</amount></component>
</property>
Table 1
Experiment Type: Jet stirred reactor measurement
Paper Title: Oxidation of Cyclohexane in a Jet-Stirred Reactor
Common Properties
Pressure: 106.7 kPa
Volume: 30 cm3
Phi: 0.5
Residence Time: 2 s
Fuel inlet mole fraction: 0.0067
Temperature range: 500 - 1100 K
Inlet mole composition
Species    Mole fraction
CH3CHO     0.0067
H2         0.0345
N2         0.9
Mole fraction as a function of temperature
Temperature (K)    H2    O2          CO    CO2
500                0     5.87E-02    0     0
525                0     5.63E-02    0     0
XML format from ReSpecTh; spreadsheet format from CloudFlame
State of the art: what is in use now….
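To make the XML excerpt above concrete, here is a minimal Python sketch of how software might read its common properties with the standard library. The `SAMPLE` string is an abridged copy of the excerpt with closing tags added so that it parses; the full ReSpecTh schema is not shown on the slide, so the function name and the assumption that every scalar property has a single `value` child are illustrative only.

```python
import xml.etree.ElementTree as ET

# Abridged copy of the excerpt above; the closing tags are added here
# (an assumption) so that the fragment is well-formed.
SAMPLE = """<experiment>
  <apparatus><kind>stirred reactor</kind></apparatus>
  <commonProperties>
    <property label="P" name="pressure" units="atm"><value>1</value></property>
    <property label="V" name="volume" units="cm3"><value>30</value></property>
    <property label="tau" name="residence time" units="s"><value>0.07</value></property>
    <property name="initial composition">
      <component><speciesLink preferredKey="C2H5OH" />
        <amount units="mole fraction">0.002</amount></component>
      <component><speciesLink preferredKey="O2" />
        <amount units="mole fraction">0.024</amount></component>
      <component><speciesLink preferredKey="N2" />
        <amount units="mole fraction">0.974</amount></component>
    </property>
  </commonProperties>
</experiment>"""


def read_common_properties(root):
    """Return (scalar properties, initial composition) from an experiment element."""
    scalars = {}
    composition = {}
    for prop in root.findall("./commonProperties/property"):
        value = prop.find("value")
        if value is not None:
            # Scalar property: keep the number together with its units.
            scalars[prop.get("name")] = (float(value.text), prop.get("units"))
        for component in prop.findall("component"):
            species = component.find("speciesLink").get("preferredKey")
            composition[species] = float(component.find("amount").text)
    return scalars, composition


root = ET.fromstring(SAMPLE)
scalars, composition = read_common_properties(root)
print(root.findtext("./apparatus/kind"))
print(scalars)
print(composition)
```

Because units travel with each value as an attribute, a reader like this can compare the pressure in atm here with the pressure in kPa in the spreadsheet-style record without guessing.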
Review the Hierarchical Data Format (HDF5) used by the PrIMe database; its hierarchy enables
extension (new groups).
XML DATA REPRESENTATION: WE ARE NOT ALONE
XML is the language of the internet == many tools for its manipulation
Important note: though understandable for humans,
XML is not necessarily convenient to generate, so tools are needed
Gaining ground in scientific computing
SOFTWARE SOLUTION SUPPORTING MANY FORMATS
From a software technical point of view:
interchange between formats
Example in computational chemistry
DATABASES WITHIN THE ACTION
http://respecth.chem.elte.hu/respecth
http://www.chemicalkinetics.info
http://primekinetics.org/
OUTPUT
Through input from actors in the SMARTCATS action
a white paper on
Data within the combustion community
We need YOUR input

More Related Content

What's hot

Energy efficient information and communication infrastructures in the smart g...
Energy efficient information and communication infrastructures in the smart g...Energy efficient information and communication infrastructures in the smart g...
Energy efficient information and communication infrastructures in the smart g...redpel dot com
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands Vivien Bonazzi
 
The NIH Data Commons - BD2K All Hands Meeting 2015
The NIH Data Commons -  BD2K All Hands Meeting 2015The NIH Data Commons -  BD2K All Hands Meeting 2015
The NIH Data Commons - BD2K All Hands Meeting 2015Vivien Bonazzi
 
Energies 11-02832
Energies 11-02832Energies 11-02832
Energies 11-02832Fawad52
 
Valuation of Energy Storage White Paper by SCE
Valuation of Energy Storage White Paper by SCEValuation of Energy Storage White Paper by SCE
Valuation of Energy Storage White Paper by SCEUCSD-Strategic-Energy
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...ICT FOOTPRINT .eu
 
ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...
ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...
ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...ijaia
 
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...European Data Forum
 

What's hot (11)

Energy efficient information and communication infrastructures in the smart g...
Energy efficient information and communication infrastructures in the smart g...Energy efficient information and communication infrastructures in the smart g...
Energy efficient information and communication infrastructures in the smart g...
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
 
The NIH Data Commons - BD2K All Hands Meeting 2015
The NIH Data Commons -  BD2K All Hands Meeting 2015The NIH Data Commons -  BD2K All Hands Meeting 2015
The NIH Data Commons - BD2K All Hands Meeting 2015
 
Energies 11-02832
Energies 11-02832Energies 11-02832
Energies 11-02832
 
Valuation of Energy Storage White Paper by SCE
Valuation of Energy Storage White Paper by SCEValuation of Energy Storage White Paper by SCE
Valuation of Energy Storage White Paper by SCE
 
The Integrated Grid epri
The Integrated Grid epriThe Integrated Grid epri
The Integrated Grid epri
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
Calculation Tools & ICT Insights on energy saving: SAT-S, Save@Work, GreenSpe...
 
ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...
ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...
ENERGY MANAGEMENT ALGORITHMS IN SMART GRIDS: STATE OF THE ART AND EMERGING TR...
 
Syamsir Abduh-Integrated power-2007
Syamsir Abduh-Integrated power-2007Syamsir Abduh-Integrated power-2007
Syamsir Abduh-Integrated power-2007
 
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
 

Similar to Data Mining Challenges in Distributed Generation

Prospering from the Energy Revolution: Six in Sixty - Data and Digitalisation
Prospering from the Energy Revolution: Six in Sixty - Data and DigitalisationProspering from the Energy Revolution: Six in Sixty - Data and Digitalisation
Prospering from the Energy Revolution: Six in Sixty - Data and DigitalisationKTN
 
Show and Tell - Data and Digitalisation, Digital Twins.pdf
Show and Tell - Data and Digitalisation, Digital Twins.pdfShow and Tell - Data and Digitalisation, Digital Twins.pdf
Show and Tell - Data and Digitalisation, Digital Twins.pdfSIFOfgem
 
Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...
Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...
Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...lenses
 
Selecting Ontologies and Publishing Data of Electrical Appliances: A Refrige...
Selecting Ontologies  and Publishing Data of Electrical Appliances: A Refrige...Selecting Ontologies  and Publishing Data of Electrical Appliances: A Refrige...
Selecting Ontologies and Publishing Data of Electrical Appliances: A Refrige...Anna Fensel
 
Integrated approach for the introduction of renewable energies in remote site...
Integrated approach for the introduction of renewable energies in remote site...Integrated approach for the introduction of renewable energies in remote site...
Integrated approach for the introduction of renewable energies in remote site...Mar Martinez
 
Bringing Enterprise IT into the 21st Century: A Management and Sustainabilit...
Bringing Enterprise IT into the 21st Century:  A Management and Sustainabilit...Bringing Enterprise IT into the 21st Century:  A Management and Sustainabilit...
Bringing Enterprise IT into the 21st Century: A Management and Sustainabilit...Jonathan Koomey
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Enter the New World of Online Learning with NTER!
Enter the New World of Online Learning with NTER!Enter the New World of Online Learning with NTER!
Enter the New World of Online Learning with NTER!NTERlearning
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityTERN Australia
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Vivien Bonazzi
 
Smart Data for Behavioural Change: Towards Energy Efficient Buildings
Smart Data for Behavioural Change: Towards Energy Efficient BuildingsSmart Data for Behavioural Change: Towards Energy Efficient Buildings
Smart Data for Behavioural Change: Towards Energy Efficient BuildingsAnna Fensel
 
Shared Economy & Open Data in #EnergyEfficiency Markets
Shared Economy & Open Data in #EnergyEfficiency MarketsShared Economy & Open Data in #EnergyEfficiency Markets
Shared Economy & Open Data in #EnergyEfficiency MarketsUmesh Bhutoria
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017Vivien Bonazzi
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data ProjectEdward Blurock
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...datacite
 
Hattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop SlidesHattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop SlidesJason Hattrick-Simpers
 
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR dataTurning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR datadri_ireland
 
Enabling Re-Use and Sustainability: The role of information infrastructure fu...
Enabling Re-Use and Sustainability: The role of information infrastructure fu...Enabling Re-Use and Sustainability: The role of information infrastructure fu...
Enabling Re-Use and Sustainability: The role of information infrastructure fu...Platforma Otwartej Nauki
 

Similar to Data Mining Challenges in Distributed Generation (20)

Prospering from the Energy Revolution: Six in Sixty - Data and Digitalisation
Prospering from the Energy Revolution: Six in Sixty - Data and DigitalisationProspering from the Energy Revolution: Six in Sixty - Data and Digitalisation
Prospering from the Energy Revolution: Six in Sixty - Data and Digitalisation
 
Show and Tell - Data and Digitalisation, Digital Twins.pdf
Show and Tell - Data and Digitalisation, Digital Twins.pdfShow and Tell - Data and Digitalisation, Digital Twins.pdf
Show and Tell - Data and Digitalisation, Digital Twins.pdf
 
Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...
Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...
Sustainable energy for all system design tool: the E.DRE tool, Estimator of D...
 
Selecting Ontologies and Publishing Data of Electrical Appliances: A Refrige...
Selecting Ontologies  and Publishing Data of Electrical Appliances: A Refrige...Selecting Ontologies  and Publishing Data of Electrical Appliances: A Refrige...
Selecting Ontologies and Publishing Data of Electrical Appliances: A Refrige...
 
Integrated approach for the introduction of renewable energies in remote site...
Integrated approach for the introduction of renewable energies in remote site...Integrated approach for the introduction of renewable energies in remote site...
Integrated approach for the introduction of renewable energies in remote site...
 
Bringing Enterprise IT into the 21st Century: A Management and Sustainabilit...
Bringing Enterprise IT into the 21st Century:  A Management and Sustainabilit...Bringing Enterprise IT into the 21st Century:  A Management and Sustainabilit...
Bringing Enterprise IT into the 21st Century: A Management and Sustainabilit...
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Enter the New World of Online Learning with NTER!
Enter the New World of Online Learning with NTER!Enter the New World of Online Learning with NTER!
Enter the New World of Online Learning with NTER!
 
Green IT Report
Green IT ReportGreen IT Report
Green IT Report
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
 
Smart Data for Behavioural Change: Towards Energy Efficient Buildings
Smart Data for Behavioural Change: Towards Energy Efficient BuildingsSmart Data for Behavioural Change: Towards Energy Efficient Buildings
Smart Data for Behavioural Change: Towards Energy Efficient Buildings
 
Shared Economy & Open Data in #EnergyEfficiency Markets
Shared Economy & Open Data in #EnergyEfficiency MarketsShared Economy & Open Data in #EnergyEfficiency Markets
Shared Economy & Open Data in #EnergyEfficiency Markets
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
 
Hattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop SlidesHattrick Simpers TMS Machine Learning Workshop Slides
Hattrick Simpers TMS Machine Learning Workshop Slides
 
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR dataTurning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
 
Enabling Re-Use and Sustainability: The role of information infrastructure fu...
Enabling Re-Use and Sustainability: The role of information infrastructure fu...Enabling Re-Use and Sustainability: The role of information infrastructure fu...
Enabling Re-Use and Sustainability: The role of information infrastructure fu...
 

More from Edward Blurock

KEOD23-JThermodynamcsCloud
KEOD23-JThermodynamcsCloudKEOD23-JThermodynamcsCloud
KEOD23-JThermodynamcsCloudEdward Blurock
 
BlurockPresentation-KEOD2023
BlurockPresentation-KEOD2023BlurockPresentation-KEOD2023
BlurockPresentation-KEOD2023Edward Blurock
 
ChemConnect: Poster for European Combustion Meeting 2017
ChemConnect: Poster for European Combustion Meeting 2017ChemConnect: Poster for European Combustion Meeting 2017
ChemConnect: Poster for European Combustion Meeting 2017Edward Blurock
 
ChemConnect: SMARTCATS presentation
ChemConnect: SMARTCATS presentationChemConnect: SMARTCATS presentation
ChemConnect: SMARTCATS presentationEdward Blurock
 
ChemConnect: Viewing the datasets in the repository
ChemConnect: Viewing the datasets in the repositoryChemConnect: Viewing the datasets in the repository
ChemConnect: Viewing the datasets in the repositoryEdward Blurock
 
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...
ChemConnect: Characterizing CombusAon KineAc Data with ontologies and meta-­‐...Edward Blurock
 
Poster: Characterizing Ignition behavior through morphing to generic curves
Poster: Characterizing Ignition behavior through morphing to generic curvesPoster: Characterizing Ignition behavior through morphing to generic curves
Poster: Characterizing Ignition behavior through morphing to generic curvesEdward Blurock
 
Poster: Adaptive On-­‐the-­‐fly Regression Tabula@on: Beyond ISAT
Poster: Adaptive On-­‐the-­‐fly Regression Tabula@on: Beyond ISATPoster: Adaptive On-­‐the-­‐fly Regression Tabula@on: Beyond ISAT
Poster: Adaptive On-­‐the-­‐fly Regression Tabula@on: Beyond ISATEdward Blurock
 
Characterization Ignition Behavior through Morphing to Generic Ignition Curves
Characterization Ignition Behavior through Morphing to Generic Ignition CurvesCharacterization Ignition Behavior through Morphing to Generic Ignition Curves
Characterization Ignition Behavior through Morphing to Generic Ignition CurvesEdward Blurock
 
Computability, turing machines and lambda calculus
Computability, turing machines and lambda calculusComputability, turing machines and lambda calculus
Computability, turing machines and lambda calculusEdward Blurock
 
Imperative programming
Imperative programmingImperative programming
Imperative programmingEdward Blurock
 
Database normalization
Database normalizationDatabase normalization
Database normalizationEdward Blurock
 
Generalization abstraction
Generalization abstractionGeneralization abstraction
Generalization abstractionEdward Blurock
 
Computability and Complexity
Computability and ComplexityComputability and Complexity
Computability and ComplexityEdward Blurock
 




Data Mining Challenges in Distributed Generation

  • 1. Sustainable and Smart Energy Carriers for Decentralised Energy Production PRESENTATION Data Mining Challenges in Distributed Generation Edward S. Blurock Blurock Consulting AB (previously with Malmö University: Computer Science Dept. Lund University: Combustion Physics, Energy Sciences Research Institute for Symbolic Computation University of California, Irvine: Thesis, Computational Chemistry) bottom line: a career in (chemical) modelling (using data, AI and machine learning/data mining …)
  • 2. Sustainable and Smart Energy Carriers for Decentralised Energy Production (SLIGHTLY) REVISED TITLE Data Mining Challenges in Distributed Generation Data Mining Challenges in Distributed Generation Data Mining Challenges in Distributed Generation Community(?) Data Mining Challenges in Distributed Generation Combustion Community(?) Data Mining Challenges from the widely distributed generated data from the scientific community specifically for those dealing with all aspects of combustion
  • 3. Sustainable and Smart Energy Carriers for Decentralised Energy Production WHAT WE ARE TALKING ABOUT Data Theme: you have to have data available before you can do something with it
  • 4. Sustainable and Smart Energy Carriers for Decentralised Energy Production PRESENTATION • Introduction (with disclaimers and revisions) • Motivation: • Data exchange moving into the clouds • WG4: • Standard definition for data collection and mining toward a virtual chemistry of Smart Energy Carriers • WG4 Task Force: • Toward efficient data exchange in the combustion community
  • 5. Sustainable and Smart Energy Carriers for Decentralised Energy Production DATA PERSPECTIVE: DATA EXCHANGE • Data is the backbone of modern scientific research • Exchange of data is paramount to successful interaction between research groups OPEN DATA Publications and conferences Data exchanged between researchers (email, etc) Virtual Research Environment papers Data files Clouds (infrastructures)
  • 6. Sustainable and Smart Energy Carriers for Decentralised Energy Production TOWARD A VIRTUAL SCIENTIFIC ENVIRONMENT We are not alone in this development (maybe a bit behind)
  • 7. Sustainable and Smart Energy Carriers for Decentralised Energy Production DATA PERSPECTIVE GOALS: SOCIAL NETWORK Need tools to promote efficient data sharing within the community
  • 8. Sustainable and Smart Energy Carriers for Decentralised Energy Production DATA PERSPECTIVE: MANY SOURCES Need to accommodate the varied data that needs to be handled
  • 9. Sustainable and Smart Energy Carriers for Decentralised Energy Production DATA PERSPECTIVE: INTERRELATIONSHIPS There is no such thing as an isolated data point
  • 10. Sustainable and Smart Energy Carriers for Decentralised Energy Production DATA PERSPECTIVE: QUALITY CONTROL Reproducibility, Reliability, Accountability. Due to accountability requirements (financial incentives), data management tools are already being used. An important aspect of the interdependency of data is quality control (calculation of sensitivity or error bars). Efficient data exchange and availability (beyond just published data) is the key.
  • 11. Sustainable and Smart Energy Carriers for Decentralised Energy Production ACCOUNTABILITY: ELECTRONIC LAB NOTEBOOKS (ELN) In other fields (pharma), accountability has financial motivations (patents) and has led to the development and use of ELNs.
  • 12. Sustainable and Smart Energy Carriers for Decentralised Energy Production TOWARDS EFFICIENT DATA EXCHANGE SMARTCATS WG4
  • 13. Sustainable and Smart Energy Carriers for Decentralised Energy Production PRESENTATION • Introduction (with disclaimers and revisions) • Motivation: • Data exchange moving into the clouds • WG4: • Standard definition for data collection and mining toward a virtual chemistry of Smart Energy Carriers • WG4 Task Force: • Toward efficient data exchange in the combustion community
  • 14. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4 SUMMARY WG4 can be summarized in one word: DATA. Management of data: How do we keep track of, exchange and manage all the data that is generated by the SMARTCATS community? Use of data: How can we efficiently use the immense amount of data that the SMARTCATS community generates?
  • 15. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4: TITLE Standard definition for data collection and mining toward a virtual chemistry of smart carriers
  • 16. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4 CHALLENGE The main challenge of this WG is to provide a forum for all experts in the combustion community to formulate a common set of requirements for a universal combustion database that is not only capable of efficiently storing the vast amount of raw data generated by experiments and modeling but is also, more importantly, efficiently accessible for future use and maintenance.
  • 17. Sustainable and Smart Energy Carriers for Decentralised Energy Production
  • 18. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4: AIMS • Identification of the main requirements and tools for the development of databases, software and mathematical tools for data collection and handling as well as chemistry optimization using data mining techniques. • Definition of “crucial” experiments and simulations, uncertainty and sensitivity analysis in combustion modeling
  • 19. Sustainable and Smart Energy Carriers for Decentralised Energy Production DEFINITIONS, REQUIREMENTS AND TOOLS
  • 20. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4: INCREASED DIALOG ABOUT DATA
  • 21. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4: DATA PERSPECTIVE Definition of specific sets of prerequisites and goals for the establishment of a combustion database that will allow efficient electronic communication of combustion-related data.
  • 22. Sustainable and Smart Energy Carriers for Decentralised Energy Production PRESENTATION • Introduction (with disclaimers and revisions) • Motivation: • Data exchange moving into the clouds • WG4: • Standard definition for data collection and mining toward a virtual chemistry of Smart Energy Carriers • WG4 Task Force: • Toward efficient data exchange in the combustion community
  • 23. Sustainable and Smart Energy Carriers for Decentralised Energy Production WG4 TASK FORCE Goal: To use the expertise within the action to promote efficient data exchange among combustion researchers First task: Cataloging 1. State of the art (in and out of the community) 2. Data within the community
  • 24. Sustainable and Smart Energy Carriers for Decentralised Energy Production GOALS OF PHASE I: CATALOGING • Roles and Perspectives: • For each role/perspective catalog a prioritised set of requirements, expectations and desires • Data to Disseminate • For each apparatus and tool, outline (in words, mainly) the data that could/should be available, from raw data to final published results. • Current efforts (inside the Action and outside) • Catalog how different groups are storing data • Catalog other data handling from other disciplines • Projects/proposals/discussions having to do with data
  • 25. Sustainable and Smart Energy Carriers for Decentralised Energy Production PERSPECTIVES AND ROLES • User: Interested in using the tools to acquire and use the data. • In this role, the user is interested in accessing data in a convenient and efficient way. The user is also interested in what data is available. • Generator: Generates data, both experimental and theoretical. • The first focus is how much, in which detail and in what form the data should/can be disseminated. • An important aspect of this is to make this as painless and efficient as possible so as to not generate more burden. • Software/Database Developer: Developer of the tool. • From User: How and in what form the data can be accessed. • From Generator: Incorporating their data into whatever system they are developing.
  • 26. Sustainable and Smart Energy Carriers for Decentralised Energy Production LEVELS OF DATA • Public: tables or figures within the text of the publication, or as more detailed information in supplementary material • Preliminary: Data leading up to published data • Experiment: Data directly from the device, uninterpreted and unedited. • Intermediate: Data that has been processed, but is basically very device dependent and not necessarily useful to others. In a sense, this is only useful within the research group. • Collaboration: Data that is useful to exchange among (knowledgeable?) colleagues and collaborators
  • 27. Sustainable and Smart Energy Carriers for Decentralised Energy Production PARTICULAR FOCUS •Preliminary: •Data leading up to published data • Accessibility • Usefulness • Characterisation • Breadth of exchange: Public, collaborators, within group…
  • 28. Sustainable and Smart Energy Carriers for Decentralised Energy Production CATALOGING WHAT data is to be cataloged, and its availability, has to be established first (a major goal of the first phase). HOW the data is to be stored is of secondary importance: • Catalog with respect to particular devices and models • Within each: • What are the data types and forms • Characterisation of the data • Quality of the data • Usefulness
  • 29. Sustainable and Smart Energy Carriers for Decentralised Energy Production HOW: SECONDARY FOCUS HOW the data is to be stored is of secondary importance: • De facto standards • In moving towards electronic representation, the community is already in the process of establishing standards • Convenience: • Researchers generate data and ‘store’ it in the most convenient form available to them (generation of data is the primary concern). • Software: • As long as the format is ‘consistent’, intelligent software can interpret it and then convert it to another ‘standard’ form.
  • 30. Sustainable and Smart Energy Carriers for Decentralised Energy Production DIFFERENT ‘STANDARDS’: TWO TYPICAL FORMATS State of the art: what is in use now….

    XML format from ReSpecTh:

    <?xml version="1.0" encoding="utf-8"?>
    <experiment>
      <fileAuthor>Chemical Kinetics Laboratory, Institute of Chemistry, ELTE, Budapest, Hungary</fileAuthor>
      <fileVersion> <major>1</major> <minor>0</minor> </fileVersion>
      <ReSpecThVersion> <major>1</major> <minor>0</minor> </ReSpecThVersion>
      <bibliographyLink preferredKey="N. Leplat, P. Dagaut, C. Togbe, J. Vandooren, Combust. Flame 158 (2011) 705-725, Fig. 9, C3H6 not taken"/>
      <apparatus> <kind>stirred reactor</kind> </apparatus>
      <experimentType>Jet stirred reactor measurement</experimentType>
      <commonProperties>
        <property description="" label="P" name="pressure" units="atm"> <value>1</value></property>
        <property description="" label="V" name="volume" units="cm3"> <value>30</value></property>
        <property description="" label="tau" name="residence time" units="s"> <value>0.07</value></property>
        <property name="initial composition">
          <component><speciesLink preferredKey="C2H5OH" /> <amount units="mole fraction">0.002</amount></component>
          <component><speciesLink preferredKey="O2" /> <amount units="mole fraction">0.024</amount></component>
          <component><speciesLink preferredKey="N2" /> <amount units="mole fraction">0.974</amount></component>
        </property>

    Spreadsheet: CloudFlame

    Table 1
    Experiment Type: Jet stirred reactor measurement
    Paper Title: Oxidation of Cyclohexane in a Jet-Stirred Reactor
    Common Properties
      Pressure: 106.7 kPa
      Volume: 30 cm3
      Phi: 0.5
      Residence Time: 2 s
      Fuel inlet mole fraction: 0.0067
      Temperature range: 500 - 1100 K
    Inlet mole composition
      CH3CHO  0.0067  mole fraction
      H2      0.0345  mole fraction
      N2      0.9     mole fraction
    Temperature(K)/Mole Fraction
      Temperature (K)   H2   O2         CO   CO2
      500               0    5.87E-02   0    0
      525               0    5.63E-02   0    0
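
As a rough illustration of how software might read the XML form shown on slide 30, the sketch below pulls the common properties out of a shortened fragment using Python's standard `xml.etree.ElementTree`. The fragment is modelled on the slide but has closing tags added so that it parses; it is not a complete ReSpecTh file, and the function name `read_common_properties` is a hypothetical helper, not part of any ReSpecTh tool.

```python
import xml.etree.ElementTree as ET

# Shortened fragment modelled on the slide's XML. The closing tags are added
# here only so the text is well-formed; this is NOT a full ReSpecTh file.
SNIPPET = """<experiment>
  <experimentType>Jet stirred reactor measurement</experimentType>
  <commonProperties>
    <property description="" label="P" name="pressure" units="atm"><value>1</value></property>
    <property description="" label="V" name="volume" units="cm3"><value>30</value></property>
    <property name="initial composition">
      <component><speciesLink preferredKey="C2H5OH"/><amount units="mole fraction">0.002</amount></component>
      <component><speciesLink preferredKey="O2"/><amount units="mole fraction">0.024</amount></component>
    </property>
  </commonProperties>
</experiment>"""


def read_common_properties(xml_text):
    """Hypothetical helper: return (scalars, composition) dictionaries.

    scalars maps a property name to (value, units);
    composition maps a species key to (amount, units).
    """
    root = ET.fromstring(xml_text)
    scalars, composition = {}, {}
    for prop in root.find("commonProperties").findall("property"):
        name = prop.get("name")
        if name == "initial composition":
            # Each component carries a species link and an amount with units.
            for comp in prop.findall("component"):
                species = comp.find("speciesLink").get("preferredKey")
                amount = comp.find("amount")
                composition[species] = (float(amount.text), amount.get("units"))
        else:
            scalars[name] = (float(prop.find("value").text), prop.get("units"))
    return scalars, composition


scalars, composition = read_common_properties(SNIPPET)
print(scalars)
print(composition)
```

Keeping the units next to each value, rather than assuming them, is what would let a later step convert between the `atm` of one file and the `kPa` of another.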
  • 31. Sustainable and Smart Energy Carriers for Decentralised Energy Production Review the Hierarchical Data Format (HDF5) used by PrIMe database; hierarchy enables extension (new groups).
  • 32. Sustainable and Smart Energy Carriers for Decentralised Energy Production XML DATA REPRESENTATION: WE ARE NOT ALONE XML is the language of the internet, so many tools exist for its manipulation. Important note: though understandable for humans, XML is not necessarily convenient to generate, so tools are needed. Gaining ground in scientific computing.
  • 33. Sustainable and Smart Energy Carriers for Decentralised Energy Production SOFTWARE SOLUTION SUPPORTING MANY FORMATS From a software technical point of view: interchange between formats Example in computational chemistry
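
The interchange idea on slide 33 can be sketched as a reader and a writer that meet at a common in-memory record. The example below is a minimal sketch under assumptions: the `Name: value unit` input lines imitate the spreadsheet-style listing on slide 30, the output imitates the `<property>` elements of the XML form, and both function names are hypothetical rather than taken from any existing tool.

```python
import xml.etree.ElementTree as ET


def parse_key_value_lines(text):
    """Hypothetical reader: 'Name: value unit' lines -> {name: (value, unit)}."""
    record = {}
    for line in text.strip().splitlines():
        name, _, rest = line.partition(":")
        value, _, unit = rest.strip().partition(" ")
        record[name.strip().lower()] = (float(value), unit.strip())
    return record


def to_property_xml(record):
    """Hypothetical writer: common record -> <commonProperties> XML text."""
    root = ET.Element("commonProperties")
    for name, (value, unit) in record.items():
        prop = ET.SubElement(root, "property", name=name, units=unit)
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


SPREADSHEET_TEXT = """
Pressure: 106.7 kPa
Volume: 30 cm3
Residence Time: 2 s
"""

record = parse_key_value_lines(SPREADSHEET_TEXT)
xml_text = to_property_xml(record)
print(xml_text)
```

With a shared record in the middle, supporting a further format would mean adding one reader and one writer rather than a converter for every pair of formats.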
  • 34. Sustainable and Smart Energy Carriers for Decentralised Energy Production DATABASES WITHIN THE ACTION http://respecth.chem.elte.hu/respecth http://www.chemicalkinetics.info http://primekinetics.org/
  • 35. Sustainable and Smart Energy Carriers for Decentralised Energy Production OUTPUT Through input from actors in the SMARTCATS action: a white paper on data within the combustion community. We need YOUR input.