SlideShare a Scribd company logo
+ 
Data Basics 
Slides edited by Valerio Di Fonzo for www.globalpolis.org 
Based on the work of Mine Çetinkaya-Rundel of OpenIntro 
The slides may be copied, edited, and/or shared via the CC BY-SA license
Data matrix 
Data are organized in what we call a data matrix, where each row 
represents an observation or a case, and each column represents a 
variable:
Types of variables 
There are two types of variables, 
numerical and categorical. Numerical, in 
other words quantitative, variables take on 
numerical values. It is sensible to add, 
subtract, take averages, et cetera, with 
these values. Categorical, or qualitative, 
variables. Take on a limited number 
of distinct categories. These categories 
can be identified with numbers. For 
example, it is customary to see the gender 
variable coded as zero for male and one 
for females, but it wouldn't be sensible to 
do arithmetic operations with these 
values. 
Numerical variables can further be 
categorized as continuous or discrete. 
Continuous numerical variables are 
usually measured, such as height. 
These variables can take on any 
number of infinite values given, 
within a given range. Discrete 
numerical variables are those 
that take on one of a specific set of 
numeric values where we're able to 
count or enumerate all of the 
possibilities. One example of a 
discrete variable is the number of cars 
a household owns. In general, count 
data are an example of discrete 
variables. 
Categorical variables that have 
ordered levels are called ordinal. 
Think about a survey question 
where you're asked how satisfied 
you are with the customer service 
you received. And the options are 
very unsatisfied, unsatisfied, neutral, 
satisfied, and very satisfied. These 
levels have an inherent ordering. 
Hence the variable would be called 
ordinal. If the levels of categorical 
variable do not have an inherent 
ordering to them, then the variable is 
simply called categorical.
Types of variables (cont.) 
gender - categorical 
sleep - numerical, continuous 
bedtime - categorical, ordinal 
countries - numerical, discrete 
dread - categorical, ordinal (could also be used as numerical)
Associated vs. independent 
● When two variables show some connection with one 
another, they are called associated variables. 
○ Associated variables can also be called 
dependent variables and vice-versa. 
● If two variables are not associated, i.e. there is no 
evident connection between the two, then they are 
said to be independent.
Hypothesis Testing Framework 
● We start with a null hypothesis (H0) that represents the status 
quo. 
● We also have an alternative hypothesis (HA) that represents 
our research question, i.e. what we're testing for. 
● We conduct a hypothesis test under the assumption that the 
null hypothesis is true, either via simulation or theoretical 
methods. 
● If the test results suggest that the data do not provide 
convincing evidence for the alternative hypothesis, we stick 
with the null hypothesis. If they do, then we reject the null 
hypothesis in favor of the alternative. 
N.B. The slides of the case study on gender discrimination provide specific 
information regarding hypothesis testing.

More Related Content

What's hot

Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...
Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...
Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...
Stats Statswork
 
Attribute grammer
Attribute grammerAttribute grammer
Attribute grammer
Md. Al-Amin
 
Logistic Regression Analysis
Logistic Regression AnalysisLogistic Regression Analysis
Logistic Regression Analysis
COSTARCH Analytical Consulting (P) Ltd.
 
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
Istituto nazionale di statistica
 
Statistical Hypothesis Testing - Spinning the Wheel
Statistical Hypothesis Testing - Spinning the WheelStatistical Hypothesis Testing - Spinning the Wheel
Statistical Hypothesis Testing - Spinning the Wheel
Chi-Squared Innovations
 
Multicollinearity
MulticollinearityMulticollinearity
Multicollinearity
Bernard Asia
 
The wilcoxon matched pairs signed-ranks test
The wilcoxon matched pairs signed-ranks testThe wilcoxon matched pairs signed-ranks test
The wilcoxon matched pairs signed-ranks test
Regent University
 
Structured equation model
Structured equation modelStructured equation model
Structured equation model
King Abidi
 
WF ED 540, Data Analysis, Fall 2017
WF ED 540, Data Analysis, Fall 2017WF ED 540, Data Analysis, Fall 2017
WF ED 540, Data Analysis, Fall 2017
Penn State University
 
The application of irt using the rasch model presnetation1
The application of irt using the rasch model presnetation1The application of irt using the rasch model presnetation1
The application of irt using the rasch model presnetation1Carlo Magno
 
Multicolinearity
MulticolinearityMulticolinearity
Multicolinearity
Pawan Kawan
 

What's hot (13)

Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...
Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...
Lecture 2: Latent/Manifest/Observed Variables using in SEM Analysis (www.stat...
 
Lei&wu
Lei&wuLei&wu
Lei&wu
 
Attribute grammer
Attribute grammerAttribute grammer
Attribute grammer
 
Logistic Regression Analysis
Logistic Regression AnalysisLogistic Regression Analysis
Logistic Regression Analysis
 
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
M.E.Bontempi-Panel data: Models, estimation,and the role of attrition and Mea...
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Statistical Hypothesis Testing - Spinning the Wheel
Statistical Hypothesis Testing - Spinning the WheelStatistical Hypothesis Testing - Spinning the Wheel
Statistical Hypothesis Testing - Spinning the Wheel
 
Multicollinearity
MulticollinearityMulticollinearity
Multicollinearity
 
The wilcoxon matched pairs signed-ranks test
The wilcoxon matched pairs signed-ranks testThe wilcoxon matched pairs signed-ranks test
The wilcoxon matched pairs signed-ranks test
 
Structured equation model
Structured equation modelStructured equation model
Structured equation model
 
WF ED 540, Data Analysis, Fall 2017
WF ED 540, Data Analysis, Fall 2017WF ED 540, Data Analysis, Fall 2017
WF ED 540, Data Analysis, Fall 2017
 
The application of irt using the rasch model presnetation1
The application of irt using the rasch model presnetation1The application of irt using the rasch model presnetation1
The application of irt using the rasch model presnetation1
 
Multicolinearity
MulticolinearityMulticolinearity
Multicolinearity
 

Similar to Data, Variables, Hypothesis

Measurement of scales
Measurement of scalesMeasurement of scales
Measurement of scales
AkifIshaque
 
Variables.ppt
Variables.pptVariables.ppt
Variables.ppt
rupasi13
 
Research methodology Chapter 6
Research methodology Chapter 6Research methodology Chapter 6
Research methodology Chapter 6Pulchowk Campus
 
Formulating a Hypothesis
Formulating a HypothesisFormulating a Hypothesis
Formulating a Hypothesisbjkim0228
 
Lecture 7.ppt
Lecture 7.pptLecture 7.ppt
Lecture 7.ppt
DanishAhmed586293
 
Chapter_1_Lecture.pptx
Chapter_1_Lecture.pptxChapter_1_Lecture.pptx
Chapter_1_Lecture.pptx
ZelalemGebreegziabhe
 
Research Methodology
Research MethodologyResearch Methodology
Research Methodology
Aneel Raza
 
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.pptQUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
Bhawna173140
 
Advanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docxAdvanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docx
nettletondevon
 
Libro en ingles
Libro en inglesLibro en ingles
Libro en ingles
Irving Molina
 
April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021
April Heyward
 
Mr 4. quantitative research design and methods
Mr 4. quantitative research design and methodsMr 4. quantitative research design and methods
Mr 4. quantitative research design and methods
S'Roni Roni
 
Statistics
StatisticsStatistics
Statistics
DataScience14
 
What is research & process
What is research & processWhat is research & process
What is research & process
Abaid Manj
 
Variables scales of measure
Variables scales of measureVariables scales of measure
Variables scales of measure
Shameem Rizwan
 
Sampling-A compact study of different types of sample
Sampling-A compact study of different types of sampleSampling-A compact study of different types of sample
Sampling-A compact study of different types of sample
Asith Paul.K
 
DISCRIMINANT ANALYSIS.pptx
DISCRIMINANT ANALYSIS.pptxDISCRIMINANT ANALYSIS.pptx
DISCRIMINANT ANALYSIS.pptx
Anup597384
 
Ch # 6 brm
Ch # 6 brmCh # 6 brm
Ch # 6 brm
Hekmatullah Samsor
 
1 introduction to psychological statistics
1 introduction to psychological statistics1 introduction to psychological statistics
1 introduction to psychological statistics
Mary Anne (Riyan) Portuguez
 
Scientific method ppt
Scientific method pptScientific method ppt
Scientific method pptMaria Donohue
 

Similar to Data, Variables, Hypothesis (20)

Measurement of scales
Measurement of scalesMeasurement of scales
Measurement of scales
 
Variables.ppt
Variables.pptVariables.ppt
Variables.ppt
 
Research methodology Chapter 6
Research methodology Chapter 6Research methodology Chapter 6
Research methodology Chapter 6
 
Formulating a Hypothesis
Formulating a HypothesisFormulating a Hypothesis
Formulating a Hypothesis
 
Lecture 7.ppt
Lecture 7.pptLecture 7.ppt
Lecture 7.ppt
 
Chapter_1_Lecture.pptx
Chapter_1_Lecture.pptxChapter_1_Lecture.pptx
Chapter_1_Lecture.pptx
 
Research Methodology
Research MethodologyResearch Methodology
Research Methodology
 
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.pptQUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
 
Advanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docxAdvanced StatisticsUnit 5There are several r.docx
Advanced StatisticsUnit 5There are several r.docx
 
Libro en ingles
Libro en inglesLibro en ingles
Libro en ingles
 
April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021April Heyward Research Methods Class Session - 8-5-2021
April Heyward Research Methods Class Session - 8-5-2021
 
Mr 4. quantitative research design and methods
Mr 4. quantitative research design and methodsMr 4. quantitative research design and methods
Mr 4. quantitative research design and methods
 
Statistics
StatisticsStatistics
Statistics
 
What is research & process
What is research & processWhat is research & process
What is research & process
 
Variables scales of measure
Variables scales of measureVariables scales of measure
Variables scales of measure
 
Sampling-A compact study of different types of sample
Sampling-A compact study of different types of sampleSampling-A compact study of different types of sample
Sampling-A compact study of different types of sample
 
DISCRIMINANT ANALYSIS.pptx
DISCRIMINANT ANALYSIS.pptxDISCRIMINANT ANALYSIS.pptx
DISCRIMINANT ANALYSIS.pptx
 
Ch # 6 brm
Ch # 6 brmCh # 6 brm
Ch # 6 brm
 
1 introduction to psychological statistics
1 introduction to psychological statistics1 introduction to psychological statistics
1 introduction to psychological statistics
 
Scientific method ppt
Scientific method pptScientific method ppt
Scientific method ppt
 

More from Global Polis

Binomial distribution
Binomial distributionBinomial distribution
Binomial distribution
Global Polis
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
Global Polis
 
Bayeasian inference
Bayeasian inferenceBayeasian inference
Bayeasian inference
Global Polis
 
Conditional probability, and probability trees
Conditional probability, and probability treesConditional probability, and probability trees
Conditional probability, and probability trees
Global Polis
 
Introduction to probability
Introduction to probabilityIntroduction to probability
Introduction to probability
Global Polis
 
Observational Studies and Sampling
Observational Studies and Sampling Observational Studies and Sampling
Observational Studies and Sampling
Global Polis
 
Experimental Design and Blocking
Experimental Design and BlockingExperimental Design and Blocking
Experimental Design and Blocking
Global Polis
 
Data Collection and Sampling
Data Collection and SamplingData Collection and Sampling
Data Collection and Sampling
Global Polis
 
Categorical Data
Categorical DataCategorical Data
Categorical Data
Global Polis
 
Gender Discrimination: A case study
Gender Discrimination: A case studyGender Discrimination: A case study
Gender Discrimination: A case study
Global Polis
 
Working with Numerical Data
Working with  Numerical DataWorking with  Numerical Data
Working with Numerical Data
Global Polis
 

More from Global Polis (11)

Binomial distribution
Binomial distributionBinomial distribution
Binomial distribution
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Bayeasian inference
Bayeasian inferenceBayeasian inference
Bayeasian inference
 
Conditional probability, and probability trees
Conditional probability, and probability treesConditional probability, and probability trees
Conditional probability, and probability trees
 
Introduction to probability
Introduction to probabilityIntroduction to probability
Introduction to probability
 
Observational Studies and Sampling
Observational Studies and Sampling Observational Studies and Sampling
Observational Studies and Sampling
 
Experimental Design and Blocking
Experimental Design and BlockingExperimental Design and Blocking
Experimental Design and Blocking
 
Data Collection and Sampling
Data Collection and SamplingData Collection and Sampling
Data Collection and Sampling
 
Categorical Data
Categorical DataCategorical Data
Categorical Data
 
Gender Discrimination: A case study
Gender Discrimination: A case studyGender Discrimination: A case study
Gender Discrimination: A case study
 
Working with Numerical Data
Working with  Numerical DataWorking with  Numerical Data
Working with Numerical Data
 

Recently uploaded

一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 

Recently uploaded (20)

一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 

Data, Variables, Hypothesis

  • 1. + Data Basics Slides edited by Valerio Di Fonzo for www.globalpolis.org Based on the work of Mine Çetinkaya-Rundel of OpenIntro The slides may be copied, edited, and/or shared via the CC BY-SA license
  • 2. Data matrix Data are organized in what we call a data matrix, where each row represents an observation or a case, and each column represents a variable:
  • 3. Types of variables There are two types of variables, numerical and categorical. Numerical, in other words quantitative, variables take on numerical values. It is sensible to add, subtract, take averages, et cetera, with these values. Categorical, or qualitative, variables. Take on a limited number of distinct categories. These categories can be identified with numbers. For example, it is customary to see the gender variable coded as zero for male and one for females, but it wouldn't be sensible to do arithmetic operations with these values. Numerical variables can further be categorized as continuous or discrete. Continuous numerical variables are usually measured, such as height. These variables can take on any number of infinite values given, within a given range. Discrete numerical variables are those that take on one of a specific set of numeric values where we're able to count or enumerate all of the possibilities. One example of a discrete variable is the number of cars a household owns. In general, count data are an example of discrete variables. Categorical variables that have ordered levels are called ordinal. Think about a survey question where you're asked how satisfied you are with the customer service you received. And the options are very unsatisfied, unsatisfied, neutral, satisfied, and very satisfied. These levels have an inherent ordering. Hence the variable would be called ordinal. If the levels of categorical variable do not have an inherent ordering to them, then the variable is simply called categorical.
  • 4. Types of variables (cont.) gender - categorical sleep - numerical, continuous bedtime - categorical, ordinal countries - numerical, discrete dread - categorical, ordinal (could also be used as numerical)
  • 5. Associated vs. independent ● When two variables show some connection with one another, they are called associated variables. ○ Associated variables can also be called dependent variables and vice-versa. ● If two variables are not associated, i.e. there is no evident connection between the two, then they are said to be independent.
  • 6. Hypothesis Testing Framework ● We start with a null hypothesis (H0) that represents the status quo. ● We also have an alternative hypothesis (HA) that represents our research question, i.e. what we're testing for. ● We conduct a hypothesis test under the assumption that the null hypothesis is true, either via simulation or theoretical methods. ● If the test results suggest that the data do not provide convincing evidence for the alternative hypothesis, we stick with the null hypothesis. If they do, then we reject the null hypothesis in favor of the alternative. N.B. The slides of the case study on gender discrimination provide specific information regarding hypothesis testing.