SlideShare a Scribd company logo
1 of 20
Download to read offline
Information Half-day
Irish Centre for High End Computing (ICHEC)
November 18, 2013
BasicR 1
Overview
What is Analytics?
Why is it necessary?
Some examples of how it is used.
BasicR 2
What is Analytics?
Lies, DAMN LIES and STATISTICS.
The dictionary definition is ”the systematic computational analysis of
data or statistics.”.
Today we shall look at three areas:
1. Hypothesis testing,
2. Model construction,
3. Prediction.
BasicR 3
Some Definitions
Population: this represents a large group of
observations/measurements.
For example it could be the height or age of people in Ireland.
Sample: is a subset of the measurements/observations from the
population.
Could be the height/age of people in this room.
Variable or random variable, are the set of measurements/observations
of the same type. For instance age measurements would be one
variable and height measurements another.
BasicR 4
Sample Ireland
BasicR 5
Hypothesis Testing
The simplest form of hypothesis is does this sample come from this
population.
This might not seem particularly useful, however if we consider the
effects of a drug.
Patients blood pressure is measured before and after the drug is
administered.
Using a paired T-test the effectiveness of the drug can be determined.
BasicR 6
Some Definitions
When modeling there is usually one variable that you want to model,
this is called the ”response variable”.
The other variables are the ”explanatory variables”.
The goal of the model is to ”explain” the variation in the response
variable by the variation in the explanatory ones.
BasicR 7
Model Building
The simplest model is a linear regression model with one response
and one explanatory variable.
Figure:
BasicR 8
Regression
Regression techniques can be extended to many explanatory variables.
With this comes the possibility of variables interacting and a choice of
models or model selection.
It is important to realize that even if a explanatory variable perfectly
models the response variable, it does not imply an effect!
BasicR 9
Classification
Regression is a technique used for continuous variables.
Classification techniques are like models for categorical data.
Typically you can train a machine-learning algorithm to classify
objects/people from a set of explanatory variables.
Given a new set of measurements, the algorithm can then classify the
new object/person.
BasicR 10
Prediction
Models are used to make predictions outside the range of experimental
values. For example the phases of the moon and the tides.
Care must be taken when using statistically derived models, in that
they may not hold outside this range.
Even when a system is completely deterministic, if it is chaotic
predictions can be difficult.
Monte-Carlo approaches can be used to determine the range of
responses (hence the error) in such systems.
BasicR 11
Time Series Analysis
Time series data are measurements collected at regular time intervals.
The data can be split into three components:
1. Seasonal, or regular fluctuations on a frequency higher than that of the
dataset.
2. Trend, fluctuations on a frequency larger than that of the dataset.
3. Random, fluctuations with no apparent pattern.
Time series analysis is a technique that allows prediction of events
into the future using data from the past.
BasicR 12
Trend Discovery
A trend is a steady one-way change in a response variable after
removing the random and/or known variation.
One of the most topical trends at the moment is Global Warming.
Trends are linked to model building in some sense in that discovery of
a trend indicates that the model is incomplete.
In the case of global warming, we know that temperature varies daily
and seasonally and over much longer time periods. The temperature
trend is the change in temperature when these effects are removed.
BasicR 13
Time Series Plot
BasicR 14
Why is it Important?
From a scientific stand point, all measurements we take are subject to
error.
That means any conclusion given this flawed data must also have an
error.
The use of analytics provides a mechanism to objectively evaluate the
error in our conclusion given the data and some assumptions about
the data.
BasicR 15
ICHEC and BDI
First example is a collaboration with Biomedical Devices Ireland
(BDI).
We are measuring properties of blood platelets of normal individuals
vs those with blood disorders, under arterial shear.
We hope to be able to flag individuals for further testing using a
machine-learning algorithm.
BasicR 16
Workflow
BasicR 17
ICHEC and Wind Energy
Wind farms are mandated to provide an estimate of their future
power production.
Penalties exist for inaccurate information.
ICHEC has developed a system that will take weather forecasts from
Met Eirann and other sources that can be applied to farm in question.
Using model averaging techniques, the inevitable forecast errors can
be reduced.
BasicR 18
Wind Prediction
BasicR 19
Summary
Covered the:
What,
Why,
How is it used.
BasicR 20

More Related Content

Similar to 2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides

EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...Servio Fernando Lima Reina
 
Statistical modeling in pharmaceutical research and development
Statistical modeling in pharmaceutical research and developmentStatistical modeling in pharmaceutical research and development
Statistical modeling in pharmaceutical research and developmentPV. Viji
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_ReportRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report​Iván Rodríguez
 
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IJDKP
 
A Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic RegressionA Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic Regressionijtsrd
 
EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...
EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...
EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...IJDKP
 
Efficiency of Prediction Algorithms for Mining Biological Databases
Efficiency of Prediction Algorithms for Mining Biological  DatabasesEfficiency of Prediction Algorithms for Mining Biological  Databases
Efficiency of Prediction Algorithms for Mining Biological DatabasesIOSR Journals
 
IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...
IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...
IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...IRJET Journal
 
Sample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdfSample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdfstatsanjal
 
EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...
EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...
EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...ijesajournal
 
Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEditor IJCATR
 
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Editor IJCATR
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...Editor IJCATR
 
Cenduit_Whitepaper_Forecasting_Present_14June2016
Cenduit_Whitepaper_Forecasting_Present_14June2016Cenduit_Whitepaper_Forecasting_Present_14June2016
Cenduit_Whitepaper_Forecasting_Present_14June2016Praveen Chand
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
 
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...Scott Faria
 
Advice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation ResearchAdvice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation ResearchNancy Ideker
 
Adaptive Clinical Trials: Role of Modelling and Simulation
Adaptive Clinical Trials: Role of Modelling and Simulation Adaptive Clinical Trials: Role of Modelling and Simulation
Adaptive Clinical Trials: Role of Modelling and Simulation SGS
 

Similar to 2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides (20)

EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
 
Statistical modeling in pharmaceutical research and development
Statistical modeling in pharmaceutical research and developmentStatistical modeling in pharmaceutical research and development
Statistical modeling in pharmaceutical research and development
 
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_ReportRodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
Rodriguez_Ullmayer_Rojo_RUSIS@UNR_REU_Technical_Report
 
Presentation 5.pptx
Presentation 5.pptxPresentation 5.pptx
Presentation 5.pptx
 
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
IDENTIFICATION OF OUTLIERS IN OXAZOLINES AND OXAZOLES HIGH DIMENSION MOLECULA...
 
A Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic RegressionA Heart Disease Prediction Model using Logistic Regression
A Heart Disease Prediction Model using Logistic Regression
 
EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...
EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...
EXAMINING THE EFFECT OF FEATURE SELECTION ON IMPROVING PATIENT DETERIORATION ...
 
woot2
woot2woot2
woot2
 
Efficiency of Prediction Algorithms for Mining Biological Databases
Efficiency of Prediction Algorithms for Mining Biological  DatabasesEfficiency of Prediction Algorithms for Mining Biological  Databases
Efficiency of Prediction Algorithms for Mining Biological Databases
 
IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...
IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...
IRJET- Extending Association Rule Summarization Techniques to Assess Risk of ...
 
Sample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdfSample Size Determination.23.11.2021.pdf
Sample Size Determination.23.11.2021.pdf
 
EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...
EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...
EXPERIMENTAL IMPLEMENTATION OF EMBARRASINGLY PARALLEL PROCESS IN ANALYSIS OF ...
 
Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication Networks
 
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
 
Cenduit_Whitepaper_Forecasting_Present_14June2016
Cenduit_Whitepaper_Forecasting_Present_14June2016Cenduit_Whitepaper_Forecasting_Present_14June2016
Cenduit_Whitepaper_Forecasting_Present_14June2016
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
An Empirical Study On Diabetes Mellitus Prediction For Typical And Non-Typica...
 
Advice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation ResearchAdvice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation Research
 
Adaptive Clinical Trials: Role of Modelling and Simulation
Adaptive Clinical Trials: Role of Modelling and Simulation Adaptive Clinical Trials: Role of Modelling and Simulation
Adaptive Clinical Trials: Role of Modelling and Simulation
 

More from NUI Galway

Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...
Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...
Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...NUI Galway
 
Tom Turner, Tipping the scales for labour in Ireland?
Tom Turner, Tipping the scales for labour in Ireland? Tom Turner, Tipping the scales for labour in Ireland?
Tom Turner, Tipping the scales for labour in Ireland? NUI Galway
 
Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...
Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...
Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...NUI Galway
 
Stephen Byrne, A non-employment index for Ireland
Stephen Byrne, A non-employment index for IrelandStephen Byrne, A non-employment index for Ireland
Stephen Byrne, A non-employment index for IrelandNUI Galway
 
Sorcha Foster, The risk of automation of work in Ireland
Sorcha Foster, The risk of automation of work in IrelandSorcha Foster, The risk of automation of work in Ireland
Sorcha Foster, The risk of automation of work in IrelandNUI Galway
 
Sinead Pembroke, Living with uncertainty: The social implications of precario...
Sinead Pembroke, Living with uncertainty: The social implications of precario...Sinead Pembroke, Living with uncertainty: The social implications of precario...
Sinead Pembroke, Living with uncertainty: The social implications of precario...NUI Galway
 
Paul MacFlynn, A low skills equilibrium in Northern Ireland
Paul MacFlynn, A low skills equilibrium in Northern IrelandPaul MacFlynn, A low skills equilibrium in Northern Ireland
Paul MacFlynn, A low skills equilibrium in Northern IrelandNUI Galway
 
Nuala Whelan, The role of labour market activation in building a healthy work...
Nuala Whelan, The role of labour market activation in building a healthy work...Nuala Whelan, The role of labour market activation in building a healthy work...
Nuala Whelan, The role of labour market activation in building a healthy work...NUI Galway
 
Michéal Collins, and Dr Michelle Maher, Auto enrolment
 Michéal Collins, and Dr Michelle Maher, Auto enrolment Michéal Collins, and Dr Michelle Maher, Auto enrolment
Michéal Collins, and Dr Michelle Maher, Auto enrolmentNUI Galway
 
Michael Taft, A new enterprise model
Michael Taft, A new enterprise modelMichael Taft, A new enterprise model
Michael Taft, A new enterprise modelNUI Galway
 
Luke Rehill, Patterns of firm-level productivity in Ireland
Luke Rehill, Patterns of firm-level productivity in IrelandLuke Rehill, Patterns of firm-level productivity in Ireland
Luke Rehill, Patterns of firm-level productivity in IrelandNUI Galway
 
Lucy Pyne, Evidence from the Social Inclusion and Community Activation Programme
Lucy Pyne, Evidence from the Social Inclusion and Community Activation ProgrammeLucy Pyne, Evidence from the Social Inclusion and Community Activation Programme
Lucy Pyne, Evidence from the Social Inclusion and Community Activation ProgrammeNUI Galway
 
Lisa Wilson, The gendered nature of job quality and job insecurity
Lisa Wilson, The gendered nature of job quality and job insecurityLisa Wilson, The gendered nature of job quality and job insecurity
Lisa Wilson, The gendered nature of job quality and job insecurityNUI Galway
 
Karina Doorley, axation, labour force participation and gender equality in Ir...
Karina Doorley, axation, labour force participation and gender equality in Ir...Karina Doorley, axation, labour force participation and gender equality in Ir...
Karina Doorley, axation, labour force participation and gender equality in Ir...NUI Galway
 
Jason Loughrey, Household income volatility in Ireland
Jason Loughrey, Household income volatility in IrelandJason Loughrey, Household income volatility in Ireland
Jason Loughrey, Household income volatility in IrelandNUI Galway
 
Ivan Privalko, What do Workers get from Mobility?
Ivan Privalko, What do Workers get from Mobility?Ivan Privalko, What do Workers get from Mobility?
Ivan Privalko, What do Workers get from Mobility?NUI Galway
 
Helen Johnston, Labour market transitions: barriers and enablers
Helen Johnston, Labour market transitions: barriers and enablersHelen Johnston, Labour market transitions: barriers and enablers
Helen Johnston, Labour market transitions: barriers and enablersNUI Galway
 
Gail Irvine, Fulfilling work in Ireland
Gail Irvine, Fulfilling work in IrelandGail Irvine, Fulfilling work in Ireland
Gail Irvine, Fulfilling work in IrelandNUI Galway
 
Frank Walsh, Assessing competing explanations for the decline in trade union ...
Frank Walsh, Assessing competing explanations for the decline in trade union ...Frank Walsh, Assessing competing explanations for the decline in trade union ...
Frank Walsh, Assessing competing explanations for the decline in trade union ...NUI Galway
 
Eamon Murphy, An overview of labour market participation in Ireland over the ...
Eamon Murphy, An overview of labour market participation in Ireland over the ...Eamon Murphy, An overview of labour market participation in Ireland over the ...
Eamon Murphy, An overview of labour market participation in Ireland over the ...NUI Galway
 

More from NUI Galway (20)

Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...
Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...
Vincenzo MacCarrone, Explaining the trajectory of collective bargaining in Ir...
 
Tom Turner, Tipping the scales for labour in Ireland?
Tom Turner, Tipping the scales for labour in Ireland? Tom Turner, Tipping the scales for labour in Ireland?
Tom Turner, Tipping the scales for labour in Ireland?
 
Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...
Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...
Tom McDonnell, Medium-term trends in the Irish labour market and possibilitie...
 
Stephen Byrne, A non-employment index for Ireland
Stephen Byrne, A non-employment index for IrelandStephen Byrne, A non-employment index for Ireland
Stephen Byrne, A non-employment index for Ireland
 
Sorcha Foster, The risk of automation of work in Ireland
Sorcha Foster, The risk of automation of work in IrelandSorcha Foster, The risk of automation of work in Ireland
Sorcha Foster, The risk of automation of work in Ireland
 
Sinead Pembroke, Living with uncertainty: The social implications of precario...
Sinead Pembroke, Living with uncertainty: The social implications of precario...Sinead Pembroke, Living with uncertainty: The social implications of precario...
Sinead Pembroke, Living with uncertainty: The social implications of precario...
 
Paul MacFlynn, A low skills equilibrium in Northern Ireland
Paul MacFlynn, A low skills equilibrium in Northern IrelandPaul MacFlynn, A low skills equilibrium in Northern Ireland
Paul MacFlynn, A low skills equilibrium in Northern Ireland
 
Nuala Whelan, The role of labour market activation in building a healthy work...
Nuala Whelan, The role of labour market activation in building a healthy work...Nuala Whelan, The role of labour market activation in building a healthy work...
Nuala Whelan, The role of labour market activation in building a healthy work...
 
Michéal Collins, and Dr Michelle Maher, Auto enrolment
 Michéal Collins, and Dr Michelle Maher, Auto enrolment Michéal Collins, and Dr Michelle Maher, Auto enrolment
Michéal Collins, and Dr Michelle Maher, Auto enrolment
 
Michael Taft, A new enterprise model
Michael Taft, A new enterprise modelMichael Taft, A new enterprise model
Michael Taft, A new enterprise model
 
Luke Rehill, Patterns of firm-level productivity in Ireland
Luke Rehill, Patterns of firm-level productivity in IrelandLuke Rehill, Patterns of firm-level productivity in Ireland
Luke Rehill, Patterns of firm-level productivity in Ireland
 
Lucy Pyne, Evidence from the Social Inclusion and Community Activation Programme
Lucy Pyne, Evidence from the Social Inclusion and Community Activation ProgrammeLucy Pyne, Evidence from the Social Inclusion and Community Activation Programme
Lucy Pyne, Evidence from the Social Inclusion and Community Activation Programme
 
Lisa Wilson, The gendered nature of job quality and job insecurity
Lisa Wilson, The gendered nature of job quality and job insecurityLisa Wilson, The gendered nature of job quality and job insecurity
Lisa Wilson, The gendered nature of job quality and job insecurity
 
Karina Doorley, axation, labour force participation and gender equality in Ir...
Karina Doorley, axation, labour force participation and gender equality in Ir...Karina Doorley, axation, labour force participation and gender equality in Ir...
Karina Doorley, axation, labour force participation and gender equality in Ir...
 
Jason Loughrey, Household income volatility in Ireland
Jason Loughrey, Household income volatility in IrelandJason Loughrey, Household income volatility in Ireland
Jason Loughrey, Household income volatility in Ireland
 
Ivan Privalko, What do Workers get from Mobility?
Ivan Privalko, What do Workers get from Mobility?Ivan Privalko, What do Workers get from Mobility?
Ivan Privalko, What do Workers get from Mobility?
 
Helen Johnston, Labour market transitions: barriers and enablers
Helen Johnston, Labour market transitions: barriers and enablersHelen Johnston, Labour market transitions: barriers and enablers
Helen Johnston, Labour market transitions: barriers and enablers
 
Gail Irvine, Fulfilling work in Ireland
Gail Irvine, Fulfilling work in IrelandGail Irvine, Fulfilling work in Ireland
Gail Irvine, Fulfilling work in Ireland
 
Frank Walsh, Assessing competing explanations for the decline in trade union ...
Frank Walsh, Assessing competing explanations for the decline in trade union ...Frank Walsh, Assessing competing explanations for the decline in trade union ...
Frank Walsh, Assessing competing explanations for the decline in trade union ...
 
Eamon Murphy, An overview of labour market participation in Ireland over the ...
Eamon Murphy, An overview of labour market participation in Ireland over the ...Eamon Murphy, An overview of labour market participation in Ireland over the ...
Eamon Murphy, An overview of labour market participation in Ireland over the ...
 

Recently uploaded

20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 

Recently uploaded (20)

20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 

2013.11.14 Big Data Workshop Adam Ralph - 1st set of slides

  • 1. Information Half-day Irish Centre for High End Computing (ICHEC) November 18, 2013 BasicR 1
  • 2. Overview What is Analytics? Why is it necessary? Some examples of how it is used. BasicR 2
  • 3. What is Analytics? Lies, DAMN LIES and STATISTICS. The dictionary definition is ”the systematic computational analysis of data or statistics.”. Today we shall look at three areas: 1. Hypothesis testing, 2. Model construction, 3. Prediction. BasicR 3
  • 4. Some Definitions Population: this represents a large group of observations/measurements. For example it could be the height or age of people in Ireland. Sample: is a subset of the measurements/observations from the population. Could be the height/age of people in this room. Variable or random variable, are the set of measurements/observations of the same type. For instance age measurements would be one variable and height measurements another. BasicR 4
  • 6. Hypothesis Testing The simplest form of hypothesis is does this sample come from this population. This might not seem particularly useful, however if we consider the effects of a drug. Patients blood pressure is measured before and after the drug is administered. Using a paired T-test the effectiveness of the drug can be determined. BasicR 6
  • 7. Some Definitions When modeling there is usually one variable that you want to model, this is called the ”response variable”. The other variables are the ”explanatory variables”. The goal of the model is to ”explain” the variation in the response variable by the variation in the explanatory ones. BasicR 7
  • 8. Model Building The simplest model is a linear regression model with one response and one explanatory variable. Figure: BasicR 8
  • 9. Regression Regression techniques can be extended to many explanatory variables. With this comes the possibility of variables interacting and a choice of models or model selection. It is important to realize that even if a explanatory variable perfectly models the response variable, it does not imply an effect! BasicR 9
  • 10. Classification Regression is a technique used for continuous variables. Classification techniques are like models for categorical data. Typically you can train a machine-learning algorithm to classify objects/people from a set of explanatory variables. Given a new set of measurements, the algorithm can then classify the new object/person. BasicR 10
  • 11. Prediction Models are used to make predictions outside the range of experimental values. For example the phases of the moon and the tides. Care must be taken when using statistically derived models, in that they may not hold outside this range. Even when a system is completely deterministic, if it is chaotic predictions can be difficult. Monte-Carlo approaches can be used to determine the range of responses (hence the error) in such systems. BasicR 11
  • 12. Time Series Analysis Time series data are measurements collected at regular time intervals. The data can be split into three components: 1. Seasonal, or regular fluctuations on a frequency higher than that of the dataset. 2. Trend, fluctuations on a frequency larger than that of the dataset. 3. Random, fluctuations with no apparent pattern. Time series analysis is a technique that allows prediction of events into the future using data from the past. BasicR 12
  • 13. Trend Discovery A trend is a steady one-way change in a response variable after removing the random and/or known variation. One of the most topical trends at the moment is Global Warming. Trends are linked to model building in some sense in that discovery of a trend indicates that the model is incomplete. In the case of global warming, we know that temperature varies daily and seasonally and over much longer time periods. The temperature trend is the change in temperature when these effects are removed. BasicR 13
  • 15. Why is it Important? From a scientific stand point, all measurements we take are subject to error. That means any conclusion given this flawed data must also have an error. The use of analytics provides a mechanism to objectively evaluate the error in our conclusion given the data and some assumptions about the data. BasicR 15
  • 16. ICHEC and BDI First example is a collaboration with Biomedical Devices Ireland (BDI). We are measuring properties of blood platelets of normal individuals vs those with blood disorders, under arterial shear. We hope to be able to flag individuals for further testing using a machine-learning algorithm. BasicR 16
  • 18. ICHEC and Wind Energy Wind farms are mandated to provide an estimate of their future power production. Penalties exist for inaccurate information. ICHEC has developed a system that will take weather forecasts from Met Eirann and other sources that can be applied to farm in question. Using model averaging techniques, the inevitable forecast errors can be reduced. BasicR 18