SlideShare a Scribd company logo
1 of 15
STATISTICS
ECON 355 – Regression Analysis
DISTRIBUTION REVIEW
• We use distribution to better understand what data looks like
• A police unit records speed of passing vehicles on a road with speed limit of 45
miles/hour
• Use speed.csv dataset on blackboard to find out the mean, standard deviation, and
histogram of the results
• For the histogram (in Stata) set the number of bins to 5, Y-axis should display frequency,
add height label to bars, change Range/value (major tick-label properties) of X-axis to
min 25, max 90 and delta of 5
POPULATION VS. SAMPLE
• Most of the time our questions are about the universe (population).
• For example:
• “do people who sleep more drink less coffee?”
• Or “are smaller class sizes associated with higher test scores?”
• Or “what is the relationship between price and age of residential real estate?”
• Unfortunately, we are not able to collect data from every single person, class size, or
house in the world
• We use samples of data to answer the question about the population
SOME OTHER QUESTIONS WE CAN ANSWER BY USING
SAMPLES
• What are the mean earnings of people in their 20s?
• What is the standard deviation of the earnings of people in their 20s?
• What is the variance of the earnings of people in their 20s?
• What is the average unemployment rate in New York city?
• What is the average height of a GU student?
PARAMETER, STATISTIC, AND ESTIMATOR
• When we want to find out the mean earnings (𝜇) of people in their 20s we are looking
for a population parameter
• When we collect answers from a sample of people in their 20s and calculate the average
earnings ( 𝑥) we have found the sample statistic
• We use sample statistics to estimate population parameters
• Sample statistics are estimators of population parameters
• In this specific example 𝑥 is an estimator of 𝜇
• Estimate – numerical value of the estimator when it is actually computed using data
from a specific sample
PARAMETER, STATISTIC, AND ESTIMATOR EXAMPLE
• In the regression of test scores on class size
• What is the general regression?
• What is the fitted regression?
• What is the population here?
• What is the sample?
• What is the parameter we are trying to estimate?
• What is the estimator of that parameter?
• What is the estimate?
• Difference between estimator and estimate?
IN CLASS
You run a regression of coffee consumption on the number of hours one sleeps at night.
And find that the coefficient on the number of hours slept at night is -0.75. Please choose
the answer that lists the following in the correct order: population in question; sample;
parameter; estimator; estimate
A. Everyone on the planet, the group of people you interview for your homework; 𝛽1; 1; -0.75
B. Everyone on the planet, the group of people you interview for your homework; 1; 𝛽1; -0.75
C. The group of people you interview for your homework, everyone on the planet; 𝛽1; 1; -0.75
D. Everyone on the planet, the group of people you interview for your homework; 1; -0.75; 𝛽1
REPEATED SAMPLES
• Suppose you are trying to evaluate an estimator by repeatedly drawing random samples
from a population
• Example 1.
• To estimate the relationship between hours slept and caffeine consumed you divided into 8
groups, collected data and ran 8 regressions. Here we have 8 estimates of the relationship. What
is the estimator here?
• Example 2.
• To estimate the average height of a GU student each of you asks 30 students how tall they are.
This way we will get 24 estimates of the average height of a GU student. What is the estimator
here?
• Repeated samples mean that the estimator is not one number but rather a series of
numbers. We are going to be working with its distribution.
RANDOM SAMPLE
• If we estimate the height of GU students by using the sample of people we meet outside
of basketball locker room we are not going to have a random sample
• If we estimate the unemployment rate by going to a park at 10 am on a weekday and
ask people we run into if they are employed we are not going to have a random sample
• I am trying to figure out the relationship between the number of DUIs in a state and
alcohol consumption. What sample should I use?
• Random sample means that everyone in the population has an equal chance of ending
up in the sample
ESTIMATOR PROPERTY 1. UNBIASEDNESS
• If you evaluate an estimator many times by using repeatedly drawn samples, it is
reasonable to hope that on average you would get the right answer.
• Example: we are trying to estimate the average height of a GU student. We collect data
from random samples 5 times and get the following results
Sample number Average height in the
sample
1 167.64
2 175.26
3 160.02
4 152.4
5 177.8
UNBIASEDNESS CONT’D
• What is the average number across the samples?
• If our estimator 𝑥 is unbiased then the average height of a GU student is about 5 feet 6
inches (166 cm)
• If you decided to estimate the average height in your sample by using the following
formula:
• 𝑖=1
𝑁
ℎ𝑒𝑖𝑔ℎ𝑡 𝑖
𝑁
+ 2
• you would have a biased estimator
ESTIMATOR PROPERTY 2. CONSISTENCY.
• The larger the sample the closer the estimator is to the population parameter
• Example: The larger the sample of GU students you survey about their height the closer
the estimate of the height will be to the actual average height of a GU student
ESTIMATOR PROPERTY 3. VARIANCE AND EFFICIENCY
• We want the estimator with the smaller variance. That estimator will be the most
efficient.
• Example: we are trying to estimate the average height of a GU student. We collect data
from random samples 5 times and get the following results
Sample number Average height in the
sample
Median height in the
sample
1 167.64 167.64
2 175.26 172.72
3 160.02 162.56
4 152.4 165.1
5 177.8 162.56
Variance 111.6 18.06
EFFICIENCY CONT’D
• Median appears to have a smaller variance, which means it is a more efficient estimator
of the average height of a GU student
REVIEW.
• Please explain unbiasedness, consistency, and variance by applying the ideas to the
example of the relationship between coffee consumed and hours slept.
• What is the difference between parameter, estimator and estimate?

More Related Content

What's hot

Central spread - symmetry (jejit + indepth)
Central   spread - symmetry (jejit + indepth)Central   spread - symmetry (jejit + indepth)
Central spread - symmetry (jejit + indepth)Ken Plummer
 
5 Things You Are Probably Doing Wrong With Data
5 Things You Are Probably Doing Wrong With Data5 Things You Are Probably Doing Wrong With Data
5 Things You Are Probably Doing Wrong With DataSahil Manekia
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notescartlidge
 
Proportion test using Chi square
Proportion test using Chi squareProportion test using Chi square
Proportion test using Chi squareParag Shah
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxMaarten van Smeden
 
Lesson 10 rm psych stats & graphs 2013
Lesson 10   rm psych stats & graphs 2013Lesson 10   rm psych stats & graphs 2013
Lesson 10 rm psych stats & graphs 2013coburgpsych
 
Ppt scientific method no variables
Ppt scientific method no variablesPpt scientific method no variables
Ppt scientific method no variablesaimorales
 
Standard error and sample size
Standard error and sample sizeStandard error and sample size
Standard error and sample sizedrplayfoot
 
Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...Manoj Sharma
 

What's hot (15)

Data in science
Data in science Data in science
Data in science
 
Central spread - symmetry (jejit + indepth)
Central   spread - symmetry (jejit + indepth)Central   spread - symmetry (jejit + indepth)
Central spread - symmetry (jejit + indepth)
 
5 Things You Are Probably Doing Wrong With Data
5 Things You Are Probably Doing Wrong With Data5 Things You Are Probably Doing Wrong With Data
5 Things You Are Probably Doing Wrong With Data
 
Statistical Tools
Statistical ToolsStatistical Tools
Statistical Tools
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
 
Proportion test using Chi square
Proportion test using Chi squareProportion test using Chi square
Proportion test using Chi square
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradox
 
Correlational Designs
Correlational Designs Correlational Designs
Correlational Designs
 
Lesson 10 rm psych stats & graphs 2013
Lesson 10   rm psych stats & graphs 2013Lesson 10   rm psych stats & graphs 2013
Lesson 10 rm psych stats & graphs 2013
 
Sti2018 jws
Sti2018 jwsSti2018 jws
Sti2018 jws
 
Ppt scientific method no variables
Ppt scientific method no variablesPpt scientific method no variables
Ppt scientific method no variables
 
Data analysis, statistics, and probability review
Data analysis, statistics, and probability reviewData analysis, statistics, and probability review
Data analysis, statistics, and probability review
 
Exam rmt
Exam rmtExam rmt
Exam rmt
 
Standard error and sample size
Standard error and sample sizeStandard error and sample size
Standard error and sample size
 
Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...
 

Similar to Regression Analysis Key Concepts

Lecture 2 What is Statistics, Anyway
Lecture 2 What is Statistics, AnywayLecture 2 What is Statistics, Anyway
Lecture 2 What is Statistics, AnywayJason Edington
 
Qt business statistics-lesson1-2013
Qt business statistics-lesson1-2013Qt business statistics-lesson1-2013
Qt business statistics-lesson1-2013sonu kumar
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingRyan Herzog
 
ststs nw.pptx
ststs nw.pptxststs nw.pptx
ststs nw.pptxMrymNb
 
introduction to statistical theory
introduction to statistical theoryintroduction to statistical theory
introduction to statistical theoryUnsa Shakir
 
statistical inference.pptx
statistical inference.pptxstatistical inference.pptx
statistical inference.pptxsuerie2
 
Estimating populations and samples
Estimating populations and samplesEstimating populations and samples
Estimating populations and samplesMuhammadFardeen4
 
Statistics for analytics
Statistics for analyticsStatistics for analytics
Statistics for analyticsdeepika4721
 
How to Analyse Data
How to Analyse DataHow to Analyse Data
How to Analyse DataAmit Sharma
 
03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdfSugumarSarDurai
 
The role of statistics and the data analysis process.ppt
The role of statistics and the data analysis process.pptThe role of statistics and the data analysis process.ppt
The role of statistics and the data analysis process.pptJakeCuenca10
 
probability.pptx
probability.pptxprobability.pptx
probability.pptxbisan3
 
De-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statisticsDe-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statisticsGillian Byrne
 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excelParag Shah
 
2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdfCHANSreyya1
 
Methods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeMethods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeAnup Suchak
 
R - what do the numbers mean? #RStats
R - what do the numbers mean? #RStatsR - what do the numbers mean? #RStats
R - what do the numbers mean? #RStatsJen Stirrup
 

Similar to Regression Analysis Key Concepts (20)

Lecture 2 What is Statistics, Anyway
Lecture 2 What is Statistics, AnywayLecture 2 What is Statistics, Anyway
Lecture 2 What is Statistics, Anyway
 
Qt business statistics-lesson1-2013
Qt business statistics-lesson1-2013Qt business statistics-lesson1-2013
Qt business statistics-lesson1-2013
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis Testing
 
ststs nw.pptx
ststs nw.pptxststs nw.pptx
ststs nw.pptx
 
introduction to statistical theory
introduction to statistical theoryintroduction to statistical theory
introduction to statistical theory
 
LR 9 Estimation.pdf
LR 9 Estimation.pdfLR 9 Estimation.pdf
LR 9 Estimation.pdf
 
statistical inference.pptx
statistical inference.pptxstatistical inference.pptx
statistical inference.pptx
 
Estimating populations and samples
Estimating populations and samplesEstimating populations and samples
Estimating populations and samples
 
Statistics for analytics
Statistics for analyticsStatistics for analytics
Statistics for analytics
 
How to Analyse Data
How to Analyse DataHow to Analyse Data
How to Analyse Data
 
Sampling
SamplingSampling
Sampling
 
03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf03-Data-Analysis-Final.pdf
03-Data-Analysis-Final.pdf
 
The role of statistics and the data analysis process.ppt
The role of statistics and the data analysis process.pptThe role of statistics and the data analysis process.ppt
The role of statistics and the data analysis process.ppt
 
A.2 se and sd
A.2 se  and sdA.2 se  and sd
A.2 se and sd
 
probability.pptx
probability.pptxprobability.pptx
probability.pptx
 
De-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statisticsDe-Mystifying Stats: A primer on basic statistics
De-Mystifying Stats: A primer on basic statistics
 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excel
 
2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf2_Lecture 2_Confidence_Interval_3.pdf
2_Lecture 2_Confidence_Interval_3.pdf
 
Methods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample SizeMethods of Sampling Techniques and Sample Size
Methods of Sampling Techniques and Sample Size
 
R - what do the numbers mean? #RStats
R - what do the numbers mean? #RStatsR - what do the numbers mean? #RStats
R - what do the numbers mean? #RStats
 

More from Ryan Herzog

Chapter 14 - Great Recession
Chapter 14 - Great RecessionChapter 14 - Great Recession
Chapter 14 - Great RecessionRyan Herzog
 
Chapter 13 - AD/AS
Chapter 13 - AD/ASChapter 13 - AD/AS
Chapter 13 - AD/ASRyan Herzog
 
Chapter 12 - Monetary Policy
Chapter 12 - Monetary PolicyChapter 12 - Monetary Policy
Chapter 12 - Monetary PolicyRyan Herzog
 
Chapter 11 - IS Curve
Chapter 11 - IS CurveChapter 11 - IS Curve
Chapter 11 - IS CurveRyan Herzog
 
Chapter 10 - Great Recession
Chapter 10 - Great RecessionChapter 10 - Great Recession
Chapter 10 - Great RecessionRyan Herzog
 
Chapter 9 - Short Run
Chapter 9 - Short RunChapter 9 - Short Run
Chapter 9 - Short RunRyan Herzog
 
Chapter 8 - Inflation
Chapter 8 - InflationChapter 8 - Inflation
Chapter 8 - InflationRyan Herzog
 
Chapter 7 - Labor Market
Chapter 7 - Labor MarketChapter 7 - Labor Market
Chapter 7 - Labor MarketRyan Herzog
 
Chapter 6 - Romer Model
Chapter 6 - Romer Model Chapter 6 - Romer Model
Chapter 6 - Romer Model Ryan Herzog
 
Chapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for GrowthChapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for GrowthRyan Herzog
 
Chapter 4 - Model of Production
Chapter 4 - Model of ProductionChapter 4 - Model of Production
Chapter 4 - Model of ProductionRyan Herzog
 
Chapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic GrowthChapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic GrowthRyan Herzog
 
Chapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the MacroeconomyChapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the MacroeconomyRyan Herzog
 
Topic 7 (questions)
Topic 7 (questions)Topic 7 (questions)
Topic 7 (questions)Ryan Herzog
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 
Topic 6 (model specification)
Topic 6 (model specification)Topic 6 (model specification)
Topic 6 (model specification)Ryan Herzog
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 
Topic 4 (binary)
Topic 4 (binary)Topic 4 (binary)
Topic 4 (binary)Ryan Herzog
 

More from Ryan Herzog (20)

Chapter 14 - Great Recession
Chapter 14 - Great RecessionChapter 14 - Great Recession
Chapter 14 - Great Recession
 
Chapter 13 - AD/AS
Chapter 13 - AD/ASChapter 13 - AD/AS
Chapter 13 - AD/AS
 
Chapter 12 - Monetary Policy
Chapter 12 - Monetary PolicyChapter 12 - Monetary Policy
Chapter 12 - Monetary Policy
 
Chapter 11 - IS Curve
Chapter 11 - IS CurveChapter 11 - IS Curve
Chapter 11 - IS Curve
 
Chapter 10 - Great Recession
Chapter 10 - Great RecessionChapter 10 - Great Recession
Chapter 10 - Great Recession
 
Chapter 9 - Short Run
Chapter 9 - Short RunChapter 9 - Short Run
Chapter 9 - Short Run
 
Chapter 8 - Inflation
Chapter 8 - InflationChapter 8 - Inflation
Chapter 8 - Inflation
 
Chapter 7 - Labor Market
Chapter 7 - Labor MarketChapter 7 - Labor Market
Chapter 7 - Labor Market
 
Chapter 6 - Romer Model
Chapter 6 - Romer Model Chapter 6 - Romer Model
Chapter 6 - Romer Model
 
Chapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for GrowthChapter 5 - Solow Model for Growth
Chapter 5 - Solow Model for Growth
 
Chapter 4 - Model of Production
Chapter 4 - Model of ProductionChapter 4 - Model of Production
Chapter 4 - Model of Production
 
Chapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic GrowthChapter 3 - Long-Run Economic Growth
Chapter 3 - Long-Run Economic Growth
 
Chapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the MacroeconomyChapter 2 - Measuring the Macroeconomy
Chapter 2 - Measuring the Macroeconomy
 
Topic 7 (data)
Topic 7 (data)Topic 7 (data)
Topic 7 (data)
 
Inequality
InequalityInequality
Inequality
 
Topic 7 (questions)
Topic 7 (questions)Topic 7 (questions)
Topic 7 (questions)
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 
Topic 6 (model specification)
Topic 6 (model specification)Topic 6 (model specification)
Topic 6 (model specification)
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 
Topic 4 (binary)
Topic 4 (binary)Topic 4 (binary)
Topic 4 (binary)
 

Recently uploaded

internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 

Recently uploaded (20)

internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 

Regression Analysis Key Concepts

  • 1. STATISTICS ECON 355 – Regression Analysis
  • 2. DISTRIBUTION REVIEW • We use distribution to better understand what data looks like • A police unit records speed of passing vehicles on a road with speed limit of 45 miles/hour • Use speed.csv dataset on blackboard to find out the mean, standard deviation, and histogram of the results • For the histogram (in Stata) set the number of bins to 5, Y-axis should display frequency, add height label to bars, change Range/value (major tick-label properties) of X-axis to min 25, max 90 and delta of 5
  • 3. POPULATION VS. SAMPLE • Most of the time our questions are about the universe (population). • For example: • “do people who sleep more drink less coffee?” • Or “are smaller class sizes associated with higher test scores?” • Or “what is the relationship between price and age of residential real estate?” • Unfortunately, we are not able to collect data from every single person, class size, or house in the world • We use samples of data to answer the question about the population
  • 4. SOME OTHER QUESTIONS WE CAN ANSWER BY USING SAMPLES • What are the mean earnings of people in their 20s? • What is the standard deviation of the earnings of people in their 20s? • What is the variance of the earnings of people in their 20s? • What is the average unemployment rate in New York city? • What is the average height of a GU student?
  • 5. PARAMETER, STATISTIC, AND ESTIMATOR • When we want to find out the mean earnings (𝜇) of people in their 20s we are looking for a population parameter • When we collect answers from a sample of people in their 20s and calculate the average earnings ( 𝑥) we have found the sample statistic • We use sample statistics to estimate population parameters • Sample statistics are estimators of population parameters • In this specific example 𝑥 is an estimator of 𝜇 • Estimate – numerical value of the estimator when it is actually computed using data from a specific sample
  • 6. PARAMETER, STATISTIC, AND ESTIMATOR EXAMPLE • In the regression of test scores on class size • What is the general regression? • What is the fitted regression? • What is the population here? • What is the sample? • What is the parameter we are trying to estimate? • What is the estimator of that parameter? • What is the estimate? • Difference between estimator and estimate?
  • 7. IN CLASS You run a regression of coffee consumption on the number of hours one sleeps at night. And find that the coefficient on the number of hours slept at night is -0.75. Please choose the answer that lists the following in the correct order: population in question; sample; parameter; estimator; estimate A. Everyone on the planet, the group of people you interview for your homework; 𝛽1; 1; -0.75 B. Everyone on the planet, the group of people you interview for your homework; 1; 𝛽1; -0.75 C. The group of people you interview for your homework, everyone on the planet; 𝛽1; 1; -0.75 D. Everyone on the planet, the group of people you interview for your homework; 1; -0.75; 𝛽1
  • 8. REPEATED SAMPLES • Suppose you are trying to evaluate an estimator by repeatedly drawing random samples from a population • Example 1. • To estimate the relationship between hours slept and caffeine consumed you divided into 8 groups, collected data and ran 8 regressions. Here we have 8 estimates of the relationship. What is the estimator here? • Example 2. • To estimate the average height of a GU student each of you asks 30 students how tall they are. This way we will get 24 estimates of the average height of a GU student. What is the estimator here? • Repeated samples mean that the estimator is not one number but rather a series of numbers. We are going to be working with its distribution.
  • 9. RANDOM SAMPLE • If we estimate the height of GU students by using the sample of people we meet outside of basketball locker room we are not going to have a random sample • If we estimate the unemployment rate by going to a park at 10 am on a weekday and ask people we run into if they are employed we are not going to have a random sample • I am trying to figure out the relationship between the number of DUIs in a state and alcohol consumption. What sample should I use? • Random sample means that everyone in the population has an equal chance of ending up in the sample
  • 10. ESTIMATOR PROPERTY 1. UNBIASEDNESS • If you evaluate an estimator many times by using repeatedly drawn samples, it is reasonable to hope that on average you would get the right answer. • Example: we are trying to estimate the average height of a GU student. We collect data from random samples 5 times and get the following results Sample number Average height in the sample 1 167.64 2 175.26 3 160.02 4 152.4 5 177.8
  • 11. UNBIASEDNESS CONT’D • What is the average number across the samples? • If our estimator 𝑥 is unbiased then the average height of a GU student is about 5 feet 6 inches (166 cm) • If you decided to estimate the average height in your sample by using the following formula: • 𝑖=1 𝑁 ℎ𝑒𝑖𝑔ℎ𝑡 𝑖 𝑁 + 2 • you would have a biased estimator
  • 12. ESTIMATOR PROPERTY 2. CONSISTENCY. • The larger the sample the closer the estimator is to the population parameter • Example: The larger the sample of GU students you survey about their height the closer the estimate of the height will be to the actual average height of a GU student
  • 13. ESTIMATOR PROPERTY 3. VARIANCE AND EFFICIENCY • We want the estimator with the smaller variance. That estimator will be the most efficient. • Example: we are trying to estimate the average height of a GU student. We collect data from random samples 5 times and get the following results Sample number Average height in the sample Median height in the sample 1 167.64 167.64 2 175.26 172.72 3 160.02 162.56 4 152.4 165.1 5 177.8 162.56 Variance 111.6 18.06
  • 14. EFFICIENCY CONT’D • Median appears to have a smaller variance, which means it is a more efficient estimator of the average height of a GU student
  • 15. REVIEW. • Please explain unbiasedness, consistency, and variance by applying the ideas to the example of the relationship between coffee consumed and hours slept. • What is the difference between parameter, estimator and estimate?

Editor's Notes

  1. Underline that the statistic is the estimator of the population value
  2. You could have a different estimator, for example, you could say that the correlation coefficient is the estimator. Correct answer: B
  3. For your project make sure the sample you collect is random. You will have to discuss why you think your sample is random. If the sample is random then the estimator will be close to the population parameter
  4. The average height turns out to be 166.6 or about 5.6
  5. Gonzaga university has about 7500 students. If you only estimate height of 4 students that sample size would not be enough.