SlideShare a Scribd company logo
1 of 20
Master the Art of Analytics
A Simplistic Explainer Series For Citizen Data Scientists
J o u r n e y To w a r d s A u g m e n t e d A n a l y t i c s
Paired Samples T-test
Basic Terminologies
 Sample data is the subset of population data used to represent the entire
group as whole
 For instance, if we want to come up with average value of all cars in
united states, it is impractical to assess the each car value in united
states, adding these numbers and dividing by total number of cars
 Instead, we can randomly select some of the cars, say 200 and get value
of each of these 200 cars and find average of these 200 numbers
 These 200 numbers containing randomly selected 200 cars’ values is
called a sample data of entire United states’ cars’ values (population
data)
 There are two popular sampling techniques namely simple random
sampling and stratified sampling which are explained in annexure section
 P- value : In case of Paired samples t test, it indicates whether there is a
statistically significant difference between two samples
 For different levels of accuracy desired, the p-value can be checked at different
thresholds and inference can be made accordingly
 For instance, for confidence level or accuracy = 95% ( error =5%) , we have to
check p-value against the threshold of 0.05.
 If p-value < 0.05 then the difference is significant and treatment has been
effective else the difference is insignificant and treatment has been not
significantly effective
 Similarly, for confidence level =98% (error =2%), we have to check p-value
against the threshold of 0.02.
 If p-value < 0.02 then the difference is significant and treatment has been
effective else the difference is insignificant and treatment has been not
significantly effective and so on
Basic Terminologies
Introduction
• It is used to determine whether the mean of a dependent variable (e.g.,
weight, anxiety level, salary, reaction time, etc.) is the same in two related
groups (e.g., two groups of participants that are measured at two different
"time points" or who undergo two different "conditions")
• Thus the classic use of the Paired t-Test is to evaluate the before and after
of some treatment
• Examples :
• Understand whether there was a difference in managers' salaries before and after
undertaking a PhD (i.e., your dependent variable would be "salary", and your two
related groups would be the two different "time points"; that is, salaries "before"
and "after" undertaking the PhD)
• Measure the blood pressure of patient A, give him something (pharmaceutical,
exercise, Tilapia) to reduce his blood pressure, then measure the blood pressure of
patient A again. Repeat for patients B, C, D, ... In this case, the data of "Before" and
"After" are paired by patient
Example : Input
Let’s conduct the Paired sample t-test on following two variables, one is a time
dimension containing months and the other is a measure :
Month Value
January 90
February 95
March 80
April 78
May 75
June 70
Time dimension to divide
data into two groups
Dependent Variable
Let’s say, measure values before April belong to ‘before’ or ‘pre’ sample and from
April belong to ‘After’ or ‘post’ sample
Example : Output
Pre sample mean 55
Post sample mean 74.3
Mean Difference 19.3
P-value 0.041
 At 95% confidence level (5% chance of error) :
 As p-value = 0.041 which is less than 0.05, there is a statistically significant
difference between means of pre and post sample values
 The treatment has been effective
 At 98 % confidence level (2% chance of error) :
 As p-value = 0.041 which is greater than 0.02, there is no statistically
significant difference between the means of pre and post samples
 The treatment has not been effective
Standard
input
parameters
& sample UI
SAMPLE OUTPUT
1 :
INTERPRETATION
SAMPLE
OUTPUT 2 :
MODEL
SUMMARY
SAMPLE OUTPUT 3 :
OUTLIERS
DATA VALUES THAT
DIFFER GREATLY FROM
THE MAJORITY OF A
SET OF DATA
LIMITATIONS
Can be applied on only two
samples (One measure and one
time dimension or a sequence
ID to decide the cut point for
division of measure values into
pre and post samples)
Number of data points should
be at least 30
GENERAL APPLICATIONS
• Has the particular medicine or treatment been effective?
Medicine
• Has the sales increased post a particular campaign?
Marketing
• Has the cycle time reduced or defects reduced pre and post a particular process change
Manufacturing
• Has the transit time reduced from supplier to customer pre and post a route change
Logistics
Use case 1
Business benefit:
• Once the test is completed, p-
value is generated which
indicates whether there is
statistical difference between
cycle time of both time points
• Based on this value, a manager
can easily conclude whether
particular process change has
had a significant impact on cycle
time or not
Business problem :
• A manufacturing unit manager
want to know if there is a
statistically significant difference
in cycle time pre and post a
particular process change
• Here the dependent variable
would be ‘cycle time values’
Use case 1 : Input Dataset
Let’s say process change was in effect from date 16/8/17 to 19/8/17
Hence cycle time values for these dates would be considered as post sample
and measures from 12/8/17 to 15/8/17 would be pre samples as both samples should have
equal data points
Time point
Cycle time
(Minutes)
12/8/17 21000
13/8/17 15000
14/8/17 25600
15/8/17 23000
16/8/17 19750
17/8/17 25000
18/8/17 21250
19/8/17 14400
Use case 1 : Output
Cycle time
“Pre” sample mean cycle time 19444.44
“Post” sample mean cycle time 18080.0
Mean Difference 1364.44
P-value 0.27
 P-value : 0.27 (< 0.05) indicates that there is no significant difference in
cycle time of both samples. And hence the process change has been
impactful.
 As mean of post sample is < mean of pre sample, the process change
has reduced the cycle time significantly
Use case 2
Business benefit:
• Once the test is completed, p-
value is generated which
indicates whether there is a
statistical difference between
average daily sales- pre and post
an advertising campaign
• Based on this value, grocery store
manager can get to know if the
campaign has been effective
Business problem :
• A grocery store sales manager
wants to know whether daily
sales has increased post an
advertising campaign
• Here the dependent variable
would be ‘Daily sales’.
Use case 3
Business benefit:
• Once the test is completed, p-value
is generated which indicates
whether there is statistical
difference between cholesterol
levels of pre drug treatment and
post drug treatment groups.
• Also based on which group mean is
higher or lower, whether the drug
has lowered down the cholesterol
levels or not can be inferred.
Business problem :
• Suppose a medical researcher
decided to investigate whether a
particular drug treatment is
effective in lowering cholestrol
levels.
• There are two groups : cholesterol
levels of patients : before taking
drug and after taking drug
• Here the dependent variable would
be ‘Cholesterol levels’ .
Sampling Methods
• There are two main types of sampling :
• Simple random sampling:
• Here, the selection is purely based on a chance and every item has an equal chance
of getting selected
• Lottery system is an example of simple random sampling
• Stratified sampling:
• Here, the population data is divided into subgroups known as strata
• The members in each of the subgroup formed have similar attributes and
characteristics in terms of demographics, income, location etc.
• A random sample from each of these subgroups is taken in proportion to the
subgroup size relative to the population size
• These subsets of subgroups are then added to from a final stratified random sample
Want to Learn
More?
Get in touch with us @
support@Smarten.com
And Do Checkout the Learning section
on
Smarten.com
June 2018

More Related Content

What's hot

What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...Smarten Augmented Analytics
 
What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...Smarten Augmented Analytics
 
Mba2216 week 11 data analysis part 02
Mba2216 week 11 data analysis part 02Mba2216 week 11 data analysis part 02
Mba2216 week 11 data analysis part 02Stephen Ong
 
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...Smarten Augmented Analytics
 
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...Smarten Augmented Analytics
 
2. sem exploratory factor analysis copy (2)
2. sem exploratory factor analysis   copy (2)2. sem exploratory factor analysis   copy (2)
2. sem exploratory factor analysis copy (2)Toshali Dey
 
T19 factor analysis
T19 factor analysisT19 factor analysis
T19 factor analysiskompellark
 
Factor analysis in Spss
Factor analysis in SpssFactor analysis in Spss
Factor analysis in SpssFayaz Ahmad
 
Confirmatory Factor Analysis Presented by Mahfoudh Mgammal
Confirmatory Factor Analysis Presented by Mahfoudh MgammalConfirmatory Factor Analysis Presented by Mahfoudh Mgammal
Confirmatory Factor Analysis Presented by Mahfoudh MgammalDr. Mahfoudh Hussein Mgammal
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionDrZahid Khan
 
Multivariate analysis - Multiple regression analysis
Multivariate analysis -  Multiple regression analysisMultivariate analysis -  Multiple regression analysis
Multivariate analysis - Multiple regression analysisRaihanathusSahdhiyya
 
Chapter 8 by Malhotra
Chapter 8 by MalhotraChapter 8 by Malhotra
Chapter 8 by MalhotraEnamul Islam
 
Measurement and scaling fundamentals and comparative scaling
Measurement and scaling fundamentals and comparative scalingMeasurement and scaling fundamentals and comparative scaling
Measurement and scaling fundamentals and comparative scalingRohit Kumar
 
Factor analysis
Factor analysisFactor analysis
Factor analysis緯鈞 沈
 
Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013Kimmo Vehkalahti
 
Data analysis test for association BY Prof Sachin Udepurkar
Data analysis   test for association BY Prof Sachin UdepurkarData analysis   test for association BY Prof Sachin Udepurkar
Data analysis test for association BY Prof Sachin Udepurkarsachinudepurkar
 

What's hot (20)

What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
What is Multiple Linear Regression and How Can it be Helpful for Business Ana...
 
What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...What is the Multinomial-Logistic Regression Classification Algorithm and How ...
What is the Multinomial-Logistic Regression Classification Algorithm and How ...
 
Mba2216 week 11 data analysis part 02
Mba2216 week 11 data analysis part 02Mba2216 week 11 data analysis part 02
Mba2216 week 11 data analysis part 02
 
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
What Is Generalized Linear Regression with Gaussian Distribution And How Can ...
 
Spss software
Spss softwareSpss software
Spss software
 
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
 
Statistical analysis in SPSS_
Statistical analysis in SPSS_ Statistical analysis in SPSS_
Statistical analysis in SPSS_
 
2. sem exploratory factor analysis copy (2)
2. sem exploratory factor analysis   copy (2)2. sem exploratory factor analysis   copy (2)
2. sem exploratory factor analysis copy (2)
 
T19 factor analysis
T19 factor analysisT19 factor analysis
T19 factor analysis
 
Factor analysis in Spss
Factor analysis in SpssFactor analysis in Spss
Factor analysis in Spss
 
Confirmatory Factor Analysis Presented by Mahfoudh Mgammal
Confirmatory Factor Analysis Presented by Mahfoudh MgammalConfirmatory Factor Analysis Presented by Mahfoudh Mgammal
Confirmatory Factor Analysis Presented by Mahfoudh Mgammal
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Multivariate analysis - Multiple regression analysis
Multivariate analysis -  Multiple regression analysisMultivariate analysis -  Multiple regression analysis
Multivariate analysis - Multiple regression analysis
 
Logistic regression sage
Logistic regression sageLogistic regression sage
Logistic regression sage
 
Chapter 8 by Malhotra
Chapter 8 by MalhotraChapter 8 by Malhotra
Chapter 8 by Malhotra
 
Measurement and scaling fundamentals and comparative scaling
Measurement and scaling fundamentals and comparative scalingMeasurement and scaling fundamentals and comparative scaling
Measurement and scaling fundamentals and comparative scaling
 
Factor analysis
Factor analysisFactor analysis
Factor analysis
 
Factor analysis
Factor analysisFactor analysis
Factor analysis
 
Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013Structural equation-models-introduction-kimmo-vehkalahti-2013
Structural equation-models-introduction-kimmo-vehkalahti-2013
 
Data analysis test for association BY Prof Sachin Udepurkar
Data analysis   test for association BY Prof Sachin UdepurkarData analysis   test for association BY Prof Sachin Udepurkar
Data analysis test for association BY Prof Sachin Udepurkar
 

Similar to What is the Paired Sample T Test and How is it Beneficial to Business Analysis?

Statistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxStatistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxrajalakshmi5921
 
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxNeed a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxlea6nklmattu
 
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
Customer Satisfaction Data -  Multiple Linear Regression Model.pdfCustomer Satisfaction Data -  Multiple Linear Regression Model.pdf
Customer Satisfaction Data - Multiple Linear Regression Model.pdfruwanp2000
 
Sample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact EvaluationsSample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact EvaluationsMarcos Vera
 
Download the presentation
Download the presentationDownload the presentation
Download the presentationbutest
 
Statistical Process control
Statistical Process controlStatistical Process control
Statistical Process controlPrashant Tomar
 
Statistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxStatistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxdarwinming1
 
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptMarket Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptEdu4Sure
 
validation and verification part 2.pptx
validation and verification part 2.pptxvalidation and verification part 2.pptx
validation and verification part 2.pptxubaidullah75790
 
DNP 830 Data Collection and Level of Measurement.docx
DNP 830 Data Collection and Level of Measurement.docxDNP 830 Data Collection and Level of Measurement.docx
DNP 830 Data Collection and Level of Measurement.docxwrite5
 
Sample-size-comprehensive.pptx
Sample-size-comprehensive.pptxSample-size-comprehensive.pptx
Sample-size-comprehensive.pptxssuser4eb7dd
 
Measurement system analysis
Measurement system analysisMeasurement system analysis
Measurement system analysisTina Arora
 
Chemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptx
Chemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptxChemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptx
Chemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptxHakimuNsubuga2
 
Now that you understand the definition of variation, explain how i.docx
Now that you understand the definition of variation, explain how i.docxNow that you understand the definition of variation, explain how i.docx
Now that you understand the definition of variation, explain how i.docxhenrymartin15260
 
Application of microbiological data
Application of microbiological dataApplication of microbiological data
Application of microbiological dataTim Sandle, Ph.D.
 
Clinical audit presentation
Clinical audit presentationClinical audit presentation
Clinical audit presentationfaheta
 
Chotu scaling techniques
Chotu scaling techniquesChotu scaling techniques
Chotu scaling techniquesPruseth Abhisek
 

Similar to What is the Paired Sample T Test and How is it Beneficial to Business Analysis? (20)

Statistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxStatistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptx
 
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docxNeed a nonplagiarised paper and a form completed by 1006015 before.docx
Need a nonplagiarised paper and a form completed by 1006015 before.docx
 
ABTest-20231020.pptx
ABTest-20231020.pptxABTest-20231020.pptx
ABTest-20231020.pptx
 
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
Customer Satisfaction Data -  Multiple Linear Regression Model.pdfCustomer Satisfaction Data -  Multiple Linear Regression Model.pdf
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
 
Sample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact EvaluationsSample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact Evaluations
 
Download the presentation
Download the presentationDownload the presentation
Download the presentation
 
Statistical Process control
Statistical Process controlStatistical Process control
Statistical Process control
 
Statistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxStatistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docx
 
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptMarket Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
 
validation and verification part 2.pptx
validation and verification part 2.pptxvalidation and verification part 2.pptx
validation and verification part 2.pptx
 
cadd.pptx
cadd.pptxcadd.pptx
cadd.pptx
 
DNP 830 Data Collection and Level of Measurement.docx
DNP 830 Data Collection and Level of Measurement.docxDNP 830 Data Collection and Level of Measurement.docx
DNP 830 Data Collection and Level of Measurement.docx
 
ch 13.pptx
ch 13.pptxch 13.pptx
ch 13.pptx
 
Sample-size-comprehensive.pptx
Sample-size-comprehensive.pptxSample-size-comprehensive.pptx
Sample-size-comprehensive.pptx
 
Measurement system analysis
Measurement system analysisMeasurement system analysis
Measurement system analysis
 
Chemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptx
Chemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptxChemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptx
Chemometrics-ANALYTICAL DATA SIGNIFICANCE TESTS.pptx
 
Now that you understand the definition of variation, explain how i.docx
Now that you understand the definition of variation, explain how i.docxNow that you understand the definition of variation, explain how i.docx
Now that you understand the definition of variation, explain how i.docx
 
Application of microbiological data
Application of microbiological dataApplication of microbiological data
Application of microbiological data
 
Clinical audit presentation
Clinical audit presentationClinical audit presentation
Clinical audit presentation
 
Chotu scaling techniques
Chotu scaling techniquesChotu scaling techniques
Chotu scaling techniques
 

More from Smarten Augmented Analytics

Crime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenCrime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenSmarten Augmented Analytics
 
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...Smarten Augmented Analytics
 
What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?Smarten Augmented Analytics
 
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?Smarten Augmented Analytics
 
Students' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenStudents' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenSmarten Augmented Analytics
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
 
Fraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenFraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenSmarten Augmented Analytics
 
Quality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenQuality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenSmarten Augmented Analytics
 
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenMachine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenSmarten Augmented Analytics
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenSmarten Augmented Analytics
 
Human Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenHuman Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenSmarten Augmented Analytics
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?Smarten Augmented Analytics
 
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...Smarten Augmented Analytics
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?Smarten Augmented Analytics
 
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...Smarten Augmented Analytics
 
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...Smarten Augmented Analytics
 
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?Smarten Augmented Analytics
 

More from Smarten Augmented Analytics (20)

Crime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – SmartenCrime Type Prediction - Augmented Analytics Use Case – Smarten
Crime Type Prediction - Augmented Analytics Use Case – Smarten
 
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
 
What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?What Is Random Forest Classification And How Can It Help Your Business?
What Is Random Forest Classification And How Can It Help Your Business?
 
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
What is Isotonic Regression and How Can a Business Utilize it to Analyze Data?
 
Students' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – SmartenStudents' Academic Performance Predictive Analytics Use Case – Smarten
Students' Academic Performance Predictive Analytics Use Case – Smarten
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
 
Fraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – SmartenFraud Mitigation Predictive Analytics Use Case – Smarten
Fraud Mitigation Predictive Analytics Use Case – Smarten
 
Quality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - SmartenQuality Control Predictive Analytics Use Case - Smarten
Quality Control Predictive Analytics Use Case - Smarten
 
Machine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - SmartenMachine Maintenance Management Predictive Analytics Use Case - Smarten
Machine Maintenance Management Predictive Analytics Use Case - Smarten
 
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - SmartenPredictive Analytics Using External Data Augmented Analytics Use Case - Smarten
Predictive Analytics Using External Data Augmented Analytics Use Case - Smarten
 
Marketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - SmartenMarketing Optimization Augmented Analytics Use Cases - Smarten
Marketing Optimization Augmented Analytics Use Cases - Smarten
 
Human Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - SmartenHuman Resource Attrition Augmented Analytics Use Case - Smarten
Human Resource Attrition Augmented Analytics Use Case - Smarten
 
Customer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - SmartenCustomer Targeting Augmented Analytics Use Case - Smarten
Customer Targeting Augmented Analytics Use Case - Smarten
 
What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?What is KNN Classification and How Can This Analysis Help an Enterprise?
What is KNN Classification and How Can This Analysis Help an Enterprise?
 
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
What Are Simple Random Sampling and Stratified Random Sampling Analytical Tec...
 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
 
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
What is Karl Pearson Correlation Analysis and How Can it be Used for Enterpri...
 
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
What is Hierarchical Clustering and How Can an Organization Use it to Analyze...
 
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?What is SVM Classification Analysis and How Can It Benefit Business Analytics?
What is SVM Classification Analysis and How Can It Benefit Business Analytics?
 

Recently uploaded

Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noidabntitsolutionsrishis
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfLivetecs LLC
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 

Recently uploaded (20)

Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdf
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 

What is the Paired Sample T Test and How is it Beneficial to Business Analysis?

  • 1. Master the Art of Analytics A Simplistic Explainer Series For Citizen Data Scientists J o u r n e y To w a r d s A u g m e n t e d A n a l y t i c s
  • 3. Basic Terminologies  Sample data is the subset of population data used to represent the entire group as whole  For instance, if we want to come up with average value of all cars in united states, it is impractical to assess the each car value in united states, adding these numbers and dividing by total number of cars  Instead, we can randomly select some of the cars, say 200 and get value of each of these 200 cars and find average of these 200 numbers  These 200 numbers containing randomly selected 200 cars’ values is called a sample data of entire United states’ cars’ values (population data)  There are two popular sampling techniques namely simple random sampling and stratified sampling which are explained in annexure section
  • 4.  P- value : In case of Paired samples t test, it indicates whether there is a statistically significant difference between two samples  For different levels of accuracy desired, the p-value can be checked at different thresholds and inference can be made accordingly  For instance, for confidence level or accuracy = 95% ( error =5%) , we have to check p-value against the threshold of 0.05.  If p-value < 0.05 then the difference is significant and treatment has been effective else the difference is insignificant and treatment has been not significantly effective  Similarly, for confidence level =98% (error =2%), we have to check p-value against the threshold of 0.02.  If p-value < 0.02 then the difference is significant and treatment has been effective else the difference is insignificant and treatment has been not significantly effective and so on Basic Terminologies
  • 5. Introduction • It is used to determine whether the mean of a dependent variable (e.g., weight, anxiety level, salary, reaction time, etc.) is the same in two related groups (e.g., two groups of participants that are measured at two different "time points" or who undergo two different "conditions") • Thus the classic use of the Paired t-Test is to evaluate the before and after of some treatment • Examples : • Understand whether there was a difference in managers' salaries before and after undertaking a PhD (i.e., your dependent variable would be "salary", and your two related groups would be the two different "time points"; that is, salaries "before" and "after" undertaking the PhD) • Measure the blood pressure of patient A, give him something (pharmaceutical, exercise, Tilapia) to reduce his blood pressure, then measure the blood pressure of patient A again. Repeat for patients B, C, D, ... In this case, the data of "Before" and "After" are paired by patient
  • 6. Example : Input Let’s conduct the Paired sample t-test on following two variables, one is a time dimension containing months and the other is a measure : Month Value January 90 February 95 March 80 April 78 May 75 June 70 Time dimension to divide data into two groups Dependent Variable Let’s say, measure values before April belong to ‘before’ or ‘pre’ sample and from April belong to ‘After’ or ‘post’ sample
  • 7. Example : Output Pre sample mean 55 Post sample mean 74.3 Mean Difference 19.3 P-value 0.041  At 95% confidence level (5% chance of error) :  As p-value = 0.041 which is less than 0.05, there is a statistically significant difference between means of pre and post sample values  The treatment has been effective  At 98 % confidence level (2% chance of error) :  As p-value = 0.041 which is greater than 0.02, there is no statistically significant difference between the means of pre and post samples  The treatment has not been effective
  • 11. SAMPLE OUTPUT 3 : OUTLIERS DATA VALUES THAT DIFFER GREATLY FROM THE MAJORITY OF A SET OF DATA
  • 12. LIMITATIONS Can be applied on only two samples (One measure and one time dimension or a sequence ID to decide the cut point for division of measure values into pre and post samples) Number of data points should be at least 30
  • 13. GENERAL APPLICATIONS • Has the particular medicine or treatment been effective? Medicine • Has the sales increased post a particular campaign? Marketing • Has the cycle time reduced or defects reduced pre and post a particular process change Manufacturing • Has the transit time reduced from supplier to customer pre and post a route change Logistics
  • 14. Use case 1 Business benefit: • Once the test is completed, p- value is generated which indicates whether there is statistical difference between cycle time of both time points • Based on this value, a manager can easily conclude whether particular process change has had a significant impact on cycle time or not Business problem : • A manufacturing unit manager want to know if there is a statistically significant difference in cycle time pre and post a particular process change • Here the dependent variable would be ‘cycle time values’
  • 15. Use case 1 : Input Dataset Let’s say process change was in effect from date 16/8/17 to 19/8/17 Hence cycle time values for these dates would be considered as post sample and measures from 12/8/17 to 15/8/17 would be pre samples as both samples should have equal data points Time point Cycle time (Minutes) 12/8/17 21000 13/8/17 15000 14/8/17 25600 15/8/17 23000 16/8/17 19750 17/8/17 25000 18/8/17 21250 19/8/17 14400
  • 16. Use case 1 : Output Cycle time “Pre” sample mean cycle time 19444.44 “Post” sample mean cycle time 18080.0 Mean Difference 1364.44 P-value 0.27  P-value : 0.27 (< 0.05) indicates that there is no significant difference in cycle time of both samples. And hence the process change has been impactful.  As mean of post sample is < mean of pre sample, the process change has reduced the cycle time significantly
  • 17. Use case 2 Business benefit: • Once the test is completed, p- value is generated which indicates whether there is a statistical difference between average daily sales- pre and post an advertising campaign • Based on this value, grocery store manager can get to know if the campaign has been effective Business problem : • A grocery store sales manager wants to know whether daily sales has increased post an advertising campaign • Here the dependent variable would be ‘Daily sales’.
  • 18. Use case 3 Business benefit: • Once the test is completed, p-value is generated which indicates whether there is statistical difference between cholesterol levels of pre drug treatment and post drug treatment groups. • Also based on which group mean is higher or lower, whether the drug has lowered down the cholesterol levels or not can be inferred. Business problem : • Suppose a medical researcher decided to investigate whether a particular drug treatment is effective in lowering cholestrol levels. • There are two groups : cholesterol levels of patients : before taking drug and after taking drug • Here the dependent variable would be ‘Cholesterol levels’ .
  • 19. Sampling Methods • There are two main types of sampling : • Simple random sampling: • Here, the selection is purely based on a chance and every item has an equal chance of getting selected • Lottery system is an example of simple random sampling • Stratified sampling: • Here, the population data is divided into subgroups known as strata • The members in each of the subgroup formed have similar attributes and characteristics in terms of demographics, income, location etc. • A random sample from each of these subgroups is taken in proportion to the subgroup size relative to the population size • These subsets of subgroups are then added to from a final stratified random sample
  • 20. Want to Learn More? Get in touch with us @ support@Smarten.com And Do Checkout the Learning section on Smarten.com June 2018