3. The problem
• Researchers focus on a particular time frame and scope for testing their hypotheses.
• But the conclusions of the research are projected into the future.
• Paradox: the work that predicts tomorrow becomes a snapshot of what happened up to today.
4. Proposed approach
• New data relevant to some hypotheses is continuously aggregated as time passes.
• With common semantics, it can be combined with or related to other datasets.
• Represent hypotheses as programs that are executed repeatedly (a minimal sketch in R follows).
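Since the deck later mentions an R script, a minimal R sketch of the idea: the hypothesis is an ordinary function that can be re-run as the data grows. `fetch_data` and `test_hypothesis` are hypothetical placeholders, not the authors' actual code.

    # A hypothesis as a re-executable program. Both arguments are
    # hypothetical plug-ins: one fetches everything aggregated so far,
    # the other re-evaluates the claim on it.
    run_hypothesis <- function(fetch_data, test_hypothesis) {
      data   <- fetch_data()
      result <- test_hypothesis(data)
      list(checked_at = Sys.time(), result = result)
    }

    # Re-running run_hypothesis() later, after more data has been
    # ingested, yields a new verdict without touching the hypothesis code.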
5. The method
• The case study
– Lenten, L. J., & Moosa, I. A. (2003). An empirical investigation into long-term climate change in Australia. Environmental Modelling & Software, 18(1), 59-70.
• The authors claim that the temperature series exhibits a trend (see the sketch below).
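The claim can be phrased as a testable statement about the slope of the series. A minimal R illustration on synthetic data; the original paper's time-series modelling is more careful than this plain regression:

    # Illustrative only: test for a deterministic linear trend.
    # Synthetic data stands in for the real temperature observations.
    set.seed(42)
    years <- 1910:2000
    temps <- 21 + 0.008 * (years - 1910) + rnorm(length(years), sd = 0.3)

    fit <- lm(temps ~ years)
    summary(fit)$coefficients["years", ]  # slope estimate and its p-value

A small p-value for the slope is what "the series has a trend" cashes out to in this simplified setting.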
6. The method (II)
• Let’s find some data sources.
– ACORN-SAT, from the Australian Bureau of Meteorology. This is published as Linked Data (LD)!
– NOAA weather data, not in LD but easy to parse…
• Periodically ingest data (e.g., into a relational database).
• An R script checks whether the trend in the data has changed (see the sketch after this list)…
• Ingested data is semantically tagged…
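One way to realise the ingest-then-check loop, sketched in R with DBI/RSQLite. The file names, table schema, and station identifier are assumptions for illustration, not the project's actual setup:

    library(DBI)
    library(RSQLite)

    # Hypothetical schema: one row per (station, date) observation.
    con <- dbConnect(RSQLite::SQLite(), "observations.sqlite")
    dbExecute(con, "CREATE TABLE IF NOT EXISTS temperature (
                      station TEXT, obs_date TEXT, temp_c REAL)")

    # Ingest a newly downloaded batch; CSV parsing stands in for the
    # actual ACORN-SAT / NOAA fetching, whose formats differ.
    batch <- read.csv("latest_batch.csv")  # columns: station, obs_date, temp_c
    dbWriteTable(con, "temperature", batch, append = TRUE)

    # The periodic check: has the trend changed for one station?
    series <- dbGetQuery(con,
      "SELECT obs_date, temp_c FROM temperature
       WHERE station = 'SYDNEY' ORDER BY obs_date")
    fit <- lm(temp_c ~ seq_along(temp_c), data = series)
    slope_p <- summary(fit)$coefficients[2, "Pr(>|t|)"]

    dbDisconnect(con)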
7. Results
• We re-check Lenten & Moosa's hypothesis every week.
– A more extensive time scope.
– A wider geographical scope, covering all data available for Australia.
• The snapshot becomes a movie (sketched below).
• An executable paper.
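The "movie" effect comes from logging each weekly verdict instead of overwriting it. A hedged sketch, continuing the hypothetical SQLite schema above:

    library(DBI)
    library(RSQLite)

    con <- dbConnect(RSQLite::SQLite(), "observations.sqlite")
    dbExecute(con, "CREATE TABLE IF NOT EXISTS hypothesis_log (
                      checked_at TEXT, station TEXT,
                      slope REAL, p_value REAL)")

    # Run once a week (e.g., from a scheduler); appends, never
    # overwrites, so the hypothesis's status over time is itself data.
    log_check <- function(con, station, slope, p_value) {
      dbExecute(con,
        "INSERT INTO hypothesis_log VALUES (?, ?, ?, ?)",
        params = list(as.character(Sys.time()), station, slope, p_value))
    }

    # Replaying the log is the movie: one frame per weekly check.
    history <- dbGetQuery(con,
      "SELECT * FROM hypothesis_log ORDER BY checked_at")
    dbDisconnect(con)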
8. Conclusions
• The tools we already have allow us to easily use large-scale computing infrastructures to support science.
– The agINFRA project
• Massive data ingestion.
• Data integration and interlinking.
• User-tailored service execution.
9. Strengths
• Data availability
– The data is ingested (from LD sources, among others) and published.
• Data interoperability
– The data is not stored in isolation; common semantics link it to other datasets.
• Actionable data
– Ready to be addressed, used, and to generate new actionable data.
11. Future work
• Further dataset interlinking
– More plural values for physical parameters (the same parameter from several sources).
– Dataset value error detection.
• Advances in hypothesis representation
– Machine-readable research processes.