This document summarizes an alternative allocation design proposed for the Occupational Employment Statistics (OES) Survey. The new design aims to allocate more samples to industry strata that have high occupational variability and large populations, to improve reliability. It uses a Neyman allocation approach that factors in an occupational variability measure (Sh) calculated for each industry stratum. This variability measure is a weighted mean of coefficient of variations for the top 90th percentile of occupations within each stratum. Adjusting the exponent in the Neyman formula helps spread out sample allocation across geographic areas for more reliable estimates.
Presentation: "Researchers’ perceptions of DH trends and topics in the English and Spanish-speaking community. Day of DH data as a case study" at DH2016 Conference in Krakow.
Authors:
Gimena Del Rio Riande (gdelrio.riande@gmail.com)
CONICET, Universidad de Buenos Aires
Salvador Ros (sros@scc.uned.es)
Elena González-Blanco (egonzalezblanco@flog.uned.es)
Antonio Robles Gomez (arobles@scc.uned.es)
Spanish University for Distance Education, UNED
Presentation: "Researchers’ perceptions of DH trends and topics in the English and Spanish-speaking community. Day of DH data as a case study" at DH2016 Conference in Krakow.
Authors:
Gimena Del Rio Riande (gdelrio.riande@gmail.com)
CONICET, Universidad de Buenos Aires
Salvador Ros (sros@scc.uned.es)
Elena González-Blanco (egonzalezblanco@flog.uned.es)
Antonio Robles Gomez (arobles@scc.uned.es)
Spanish University for Distance Education, UNED
İnovatif Kimya Dergisi Sayı-12 Anlatılan Konu Başlıkları
NaOH Üretimi
Biyomalzemeler ve Biyouyum
Nükleer Silahsızlandırma-Toryum
İpucu
Poliester Mamüllerde Haşıl Sökümü
Sigara
Kimya ve Kemometri
Ayın Röportajı : Üsküdar Üniversitesi Rektör Yardımcısı Adli Bilimler Uzmanı Prof. Dr. Sevil ATASOY ile Ayın Röportajı.
Ayrıca Her Ay 3 Web Sitesi ve Kimya Bulmacası, Kimya Sektöründen Haberler ile Kimya Sözlüğü
İyi okumalar dileriz.
Assignment Exercise 17–1: Variance Analysis
Greenview Hospital operated at 120% of normal capacity in two of its departments during the year. It operated 120% times 20,000 normal capacity direct labor nursing hours in routine services and it operated 120% times 20,000 normal capacity equipment hours in the laboratory. The lab allocates overhead by measuring minutes and hours the equipment is used; thus equipment hours.
Assumptions:
For Routine Services Nursing:
• 20,000 hours × 120% = 24,000 direct labor nursing hours.
• Budgeted Overhead at 24,000 hours = $42,000 fixed plus $6,000 variable = $48,000 total.
• Actual Overhead at 24,000 hours = $42,000 fixed plus $7,000 variable = $49,000 total.
• Applied Overhead for 24,000 hours at $2.35 = $56,400.
For Laboratory:
• 20,000 hours × 120% = 24,000 equipment hours.
• Budgeted Overhead at 24,000 hours = $59,600 fixed plus $11,400 variable = $71,000 total.
• Actual Overhead at 24,000 hours = $59,600 fixed plus $11,600 variable = $71,200 total.
• Applied Overhead for 24,000 hours at $3.455 = $82,920.
Required:
1. Set up a worksheet for applied overhead costs and volume variance with a column for Routine Services Nursing and a second column for Laboratory.
2. Set up a worksheet for actual overhead costs and budget variance with a column for Routine Services Nursing and a second column for Laboratory.
3. Set up a worksheet for volume variance and budget variance totaling net variance with a column for Routine Services Nursing and a second column for Laboratory.
4. Insert input data from the Assumptions.
5. Complete computations for all three worksheets
Assignment Exercise 17–2: Three-Level Revenue Forecast
Three eye-ear-nose-and-throat physicians decide to hire an experienced audiologist in order to add a new service line to their practice.* They ask the practice manager to prepare a three-level volume forecast as a first step in their decision-making.
Assumptions: for the base level (most likely) revenue forecast, assume $200 per procedure times 4 procedures per day times 5 days equals 20 procedures per week times 50 weeks per year equals 1,000 potential procedures per year.
For the best case revenue forecast, assume an increase in volume of one procedure per day average, for an annual increase of 250 procedures (5 days per week times 50 weeks equals 250). (The best case is if the practice gains a particular managed care contract.)
For the worst case revenue forecast, assume a decrease in volume of 2 procedures per day average, for an annual decrease of 500 procedures. (The worst case is if the practice loses a major payer.)
*Audiologists were designated as “eligible for physician and other prescriber incentives” as discussed elsewhere. Thus the new service line was a logical move.
Required: Using the above assumptions, prepare a three-level forecast similar to the example in Figure 17–5 and document your calculations.
Assignment Exercise 17–3: Target Operating Income
Acme Medical Supply Company d ...
Python and the Holy Grail of Causal Inference - Dennis Ramondt, Huib KeeminkPyData
PyData Amsterdam 2018
Causal Inference, AKA how effective is your new product, policy or feature? Inspired by A\B testing in tech, organizations have turned to randomized testing. However, randomization often fails, leaving us in a biased reality. Join us on our quest to dispel myths about randomized testing and build practical models for effect measurement in business situations, in this Eneco-Heineken joint talk.
İnovatif Kimya Dergisi Sayı-12 Anlatılan Konu Başlıkları
NaOH Üretimi
Biyomalzemeler ve Biyouyum
Nükleer Silahsızlandırma-Toryum
İpucu
Poliester Mamüllerde Haşıl Sökümü
Sigara
Kimya ve Kemometri
Ayın Röportajı : Üsküdar Üniversitesi Rektör Yardımcısı Adli Bilimler Uzmanı Prof. Dr. Sevil ATASOY ile Ayın Röportajı.
Ayrıca Her Ay 3 Web Sitesi ve Kimya Bulmacası, Kimya Sektöründen Haberler ile Kimya Sözlüğü
İyi okumalar dileriz.
Assignment Exercise 17–1: Variance Analysis
Greenview Hospital operated at 120% of normal capacity in two of its departments during the year. It operated 120% times 20,000 normal capacity direct labor nursing hours in routine services and it operated 120% times 20,000 normal capacity equipment hours in the laboratory. The lab allocates overhead by measuring minutes and hours the equipment is used; thus equipment hours.
Assumptions:
For Routine Services Nursing:
• 20,000 hours × 120% = 24,000 direct labor nursing hours.
• Budgeted Overhead at 24,000 hours = $42,000 fixed plus $6,000 variable = $48,000 total.
• Actual Overhead at 24,000 hours = $42,000 fixed plus $7,000 variable = $49,000 total.
• Applied Overhead for 24,000 hours at $2.35 = $56,400.
For Laboratory:
• 20,000 hours × 120% = 24,000 equipment hours.
• Budgeted Overhead at 24,000 hours = $59,600 fixed plus $11,400 variable = $71,000 total.
• Actual Overhead at 24,000 hours = $59,600 fixed plus $11,600 variable = $71,200 total.
• Applied Overhead for 24,000 hours at $3.455 = $82,920.
Required:
1. Set up a worksheet for applied overhead costs and volume variance with a column for Routine Services Nursing and a second column for Laboratory.
2. Set up a worksheet for actual overhead costs and budget variance with a column for Routine Services Nursing and a second column for Laboratory.
3. Set up a worksheet for volume variance and budget variance totaling net variance with a column for Routine Services Nursing and a second column for Laboratory.
4. Insert input data from the Assumptions.
5. Complete computations for all three worksheets
Assignment Exercise 17–2: Three-Level Revenue Forecast
Three eye-ear-nose-and-throat physicians decide to hire an experienced audiologist in order to add a new service line to their practice.* They ask the practice manager to prepare a three-level volume forecast as a first step in their decision-making.
Assumptions: for the base level (most likely) revenue forecast, assume $200 per procedure times 4 procedures per day times 5 days equals 20 procedures per week times 50 weeks per year equals 1,000 potential procedures per year.
For the best case revenue forecast, assume an increase in volume of one procedure per day average, for an annual increase of 250 procedures (5 days per week times 50 weeks equals 250). (The best case is if the practice gains a particular managed care contract.)
For the worst case revenue forecast, assume a decrease in volume of 2 procedures per day average, for an annual decrease of 500 procedures. (The worst case is if the practice loses a major payer.)
*Audiologists were designated as “eligible for physician and other prescriber incentives” as discussed elsewhere. Thus the new service line was a logical move.
Required: Using the above assumptions, prepare a three-level forecast similar to the example in Figure 17–5 and document your calculations.
Assignment Exercise 17–3: Target Operating Income
Acme Medical Supply Company d ...
Python and the Holy Grail of Causal Inference - Dennis Ramondt, Huib KeeminkPyData
PyData Amsterdam 2018
Causal Inference, AKA how effective is your new product, policy or feature? Inspired by A\B testing in tech, organizations have turned to randomized testing. However, randomization often fails, leaving us in a biased reality. Join us on our quest to dispel myths about randomized testing and build practical models for effect measurement in business situations, in this Eneco-Heineken joint talk.
Grade RubricPoint UniversityBUSI 555 Cost Management & Decision Making Week 3 Excel AssignmentChapters 6, 7 & 9Exercise or ProblemPointsPoints EarnedComments6.115
Dr. Paz: Instructor will enter points earned here6.1657.1857.1957.34159.29155000%
Ex 6-11Exercise 6.11Name:1.Select the appropriate accounts from the drop-down menus in the shaded cells in columns B and C.Enter the appropriate amounts in the shaded cells in columns I and J.Journal EntriesWork in Process—Encapsulating64,800Finished GoodsWork in Process—Mixing64,800Work in Process—BottlingWork in Process—EncapsulatingWork in Process—Bottling126,000Work in Process—MixingWork in Process—Encapsulating126,000Finished Goods194,400Work in Process—Bottling194,4002.Enter the appropriate amounts in the shaded cells in columns C, D, F, G, I, J, and L.T-accountsWIP—MixingWIP—EncapsulatingWIP—BottlingFinished Goods86,40064,80064,800126,000126,000194,400194,40079,20072,00021,60018,0003,600194,400
Ex 6.16Exercise 6.16Name:Enter the appropriate amounts in the shaded cells in column K.1.Physical Flow ScheduleUnitsUnits to account for:Units, beginning work in process80,000Units started95,000Units to account for175,000Units accounted for:Units completed and transferred out:Started and completed78,000From beginning work in process80,000Units in ending work in process17,000Total units accounted for175,000Enter the appropriate amounts in the shaded cells in columns E, G, I, and K.2.Equivalent Units - Weighted Average MethodMaterialsConversionUnits completed158,000158,000Units, ending work in process:Percent
Materials17,000x100%17,000Conversion17,000x25%4,250Equivalent units of output175,000162,2503.Equivalent Units - FIFO MethodMaterialsConversionUnits started and completed78,00078,000Units, beginning work in process:Percent
Materials80,000x0%0Conversion80,000x70%56,000Units, ending work in process:Materials17,000x100%17,000Conversion17,000x25%4,250Equivalent units of output95,000138,250
Ex 7.18Exercise 7.18Name:Enter the appropriate amounts in the gray-shaded cells. The essay answers will not be graded.1.Cost Driver InformationDept. ADept B.TotalYear 1Actual direct labor hours24,00036,00060,0002400036000Cost allocation ratios40%60%100%Year 2Actual direct labor hours25,00025,00050,0002500025000Cost allocation ratios50%50%100%Allocation Human Resources
DepartmentProducing DepartmentsYear 1Dept. ADept B.TotalActual Human Resources Costs$ 120,000120000Allocation to producing departmentsDept. A$ 120,000x40%$ 48,0000.4Dept. B$ 120,000x60%$ 72,0000.6Total$ 48,000$ 72,000$ 120,000Allocation Human Resources
DepartmentProducing DepartmentsYear 2Dept. ADept B.TotalActual Human Resources Costs$ 120,000120000Allocation to producing departmentsDept. A$ 120,000x50%$ 60,0000.5Dept. B$ 120,000x50%$ 60,0000.5Total$ 60,000$ 60,000$ 120,0002.The manager of department B is not controlling HR cost better than manager of depar ...
PA 1c. Decision VariablesabcdCalculated values0.21110.531110.09760.docxgerardkortney
PA 1c. Decision VariablesabcdCalculated values0.21110.531110.09760.16019TotalObjective Function0.860.940.930.850.90772Constraints1111110.774-0.094-0.093-0.0850.09077>=0-0.0860.846-0.093-0.0850.40847>=0-0.086-0.0940.837-0.0851.90E-17>=0-0.086-0.094-0.0930.7650.04539>=00.94-2.790.22693>=00.86-1.86-2.00E-16>=0-0.129-0.141-0.13950.72256.90E-17>=0
a.
Let the weights be a, b, c and d to midterm, final, individual assignment and Participation respectively.
Korey would like to maximize the course grade. Therefore the course grade (Maximization):
=0.86a + 0.94b + 0.93c + 0.85d
Restrictions to course grade working: a+b+c+d=1
The weights must be non-negative, Non negativity constraints: a, b, c, d ≥ 0
The four components for each should determine 10% of the sum of the grade at least.
0.86a ≥ 0.1 (0.86a + 0.94b + 0.93c + 0.85d)
0.86a ≥ 0.086a + 0.094b + 0.093c + 0.085d
0.774a – 0.094b – 0.093c -0.085d ≥ 0
0.94b ≥ 0.1 (0.86a + 0.94b + 0.93c + 0.85d)
0846b ≥ 0.086a + 0.094b + 0.093c + 0.085d
0.846b – 0.086a – 0.093c – 0.085d ≥ 0
0.93c ≥ 0.1 (0.86a + 0.94b + 0.93c + 0.85d)
0.93c ≥ 0.086a +0.094b +0.093c + 0.085d
0.837c – 0.086a – 0.094b – 0.085d ≥ 0
0.85d ≥ 0.1 (0.86a + 0.94b + 0.93c + 0.85d)
0.85d ≥ 0.086a + 0.094b + 0.093c + 0.085d
0.765d – 0.086a – 0.094b – 0.093c ≥ 0
Here it is three times the particular assignment grade.
0.94b ≥ 3(0.93c)
0.94b ≥ 2.79c
0.94b – 2.79c ≥ 0
Midterm grade must count at least twice as much as the individual assignment score.
0.86a ≥ 2(0.93c)
0.86a ≥ 1.86c
0.86a – 1.86c ≥ 0
The presence of the grade should be less than the 15% of the whole grade.
0.85d ≤ 0.15(0.86a + 0.94b +0.93c +0.85d)
0.85d ≤ 0.129a + 0.141b +0.1395c + 0.1275d
0.7225d – 00.129a – 0.141b – 0.1395c ≥ 0
b.
The complete optimization model is Course grade (Maximization):
= 0.86a + 0.94b + 0.93c + 0.85d
a+b+c+d=1
0.774a – 0.094b - 0.093c – 0.085d ≥ 0
0.846b – 0.086a – 0.093c – 0.085d ≥ 0
0.837c – 0.086a – 0.094b – 0.085d ≥ 0
0.765d – 0.086a – 0.094b – 0.093c ≥ 0
0.94b – 2.79c ≥ 0
0.86a – 1.86c ≥ 0
0.7225d – 0.129a – 0.141b – 0.1395c ≥ 0
c.
Therefore midterm weights should be 21%, final weights 53%, individual assignment 10%, Participation should be 16%.
The maximum course grade is 90%.
PA 5b.Rosenberg Land DevelopmentDataOneTwoThreeBedroomBedroomBedroomUnitUnitUnit1BR2BR3BRAvailableConstruction cost$450,000$600,000$750,000$180,000,000Total units325Profit/ unit$45,000$60,000$75,000Minimum15%25%25%ModelTotalUnits Build4067162270Minimum406767Construction cost$18,202,247$40,449,438$121,348,315$180,000,000Contribution in profit$1,820,225$4,044,944$12,134,831$18,000,000c.ModelTotalUnits Build4981195325Minimum498181Construction cost$21,937,500$48,750,000$146,250,000$216,937,500Contribution in profit$2,193,750$4,875,000$14,625,000$21,693,750
a.
1BR = number of one bedroom units produced
2BR = number of two bedroom units produced
3BR = number of three bedroom units produced
Maximize Total Profit = $45,000 (1BR) + $60,000 (2BR) + $75,000 (3BR)
(1BR) + (2BR) + (.
1. Alternative Allocation Design for
the Occupational Employment
Statistics (OES) Survey
Ernest Lawley, Bureau of Labor Statistics
Marie C. Stetser, Bureau of Labor Statistics
Dr. Eduardas Valaitis, American University
OEUS ANNUAL MEETING 2007
Washington, DC
2. Alternative Allocation Design for the
Occupational Employment
Statistics (OES) Survey
• Occupational Employment Statistics (OES)
Survey
• Frame Development
• Frame Stratification
• Sample Requirements
• Prior Allocation Design
• Current Allocation Design
• Calculating Sh (standard error)
• Reliability
3. OES Survey
• Partnership with 50 States + DC, Guam,
Puerto Rico, US Virgin Islands
• Measures occupational employment and
wages within 300+ industry groups*
– Approximately 800 detailed occupations
(SOC)
– Broken down by MSA—aggregated Statewide
and Nationwide
*using 4-digit and 5-digit NAICS codes
4. Frame Development
• Quarterly Census of Employment and Wages (QCEW)
– Collects non-railroad data for all business establishments for 50
States + DC, PR, USVI
– Data includes pertinent information for each establishment such
as: Trade Name, Legal Name, Address information, and Monthly
Employment for the past 12 months
– Data compiled into Bureau’s Longitudinal Database (LDB)
• Railroad Frame File
– Collected by Bureau’s Office of Safety and Health (OSH)
• Guam Frame File
– Collected by one of the BLS Regional Offices
All three elements combined; OES Frame≈6.7 million
business establishments
5. Frame Stratification
• Frame initially stratified geographically
– Approximately 600 geographic areas
• Approximately 400 State/Metropolitan Statistical Areas (MSAs)
• Approximately 200 Non-MSA Areas (“rural”)
• Frame further stratified by detailed industry (NAICS 4-
digit, selected NAICS 5-digit)
– Approximately 350 industries
– Industry is related to occupation
• Approximately 170,000 total non-empty strata
– Each business establishment in the nation fits into exactly one of
these defined strata
– Each non-empty stratum contains one business establishment to
hundreds of business establishments
6. Frame Stratification
State 1
MSA X MSA Y
Industry 1 Industry 2Industry 1 Industry 2
State 2
MSA X MSA Z
Industry
1
Industry
2
Industry
1
Industry
2
7. Sample Requirements
• Sample allocated by stratum
• Sample Allocation≈1.2 million establishments
• Individual State Sample Sizes (∑≈1.2 million)
– Confidential value for each State
– Based on State employment population
– Last modified in 1996
Example:
Hypothetically (exact values are confidential):
State State Sample Size
California 120,000
Texas 100,000
New York 100,000
Florida 85,000
And so forth… Σ≈1.2 million
8. Prior Allocation Design
“Proportional-to-Employment”
• Maximum Employment
– Maximum monthly employment value in LDB for each
establishment
STEPS:
1. Sum max employment values across stratum, Nh
2. Sum max employment values across state, ΣNh
3. Look up Individual State Sample Size, n
4. Calculate stratum allocation: nh=n∙(Nh/ΣNh)
5. Repeat calculation for all strata, approx. 170,000
times
Note: n may require iterative reduction to work
minimum sample allocation requirements for each
9. Prior Allocation Design
• Advantages
– Simple
– Strata with larger populations are allocated
more sample
• Is this necessarily an advantage?
10. Prior Allocation Design
“A sample should allocate most heavily to those
strata where the least amount of certainty
exists.”
Causes for uncertainty (less reliability)
within a sampled stratum:
• Undersampling a large population
• Undersampling where there is a large
variability in occupations
11. Prior Allocation Design
• Disadvantage
– Estimates in smaller strata that have large
occupational variability may not be reliable
due to allocation of smaller sample size
12. Prior Allocation Design
Accomodations/Food
Services Industry
• 90% of all employees work in
88 occupations
• 12.8 million workers in this
industry
Wholesale Trade Industry
• 90% of all employees work in
175 occupations
• 6.1 million workers in this
industry
EXAMPLE
Which of these cells should be allocated more
sample?
Using “Proportional Allocation”:
Accom/Food Services Wholesale Trade
120,000 establishments 72,000 establishments
13. Current Allocation Design
Neyman Allocation
( )∑
=
⋅
⋅
•= H
1h
hh
hh
h
SN
SN
nn
n=Individual State “fixed” sample size
Nh = sum of stratum frame employees
Sh represents an occupational
variability measure within a stratum
Occupations for each stratum (or cell)
obtained from recent estimates
file; weighted data
Denominator summed overall by
state
14. Current Allocation Design
Neyman Allocation Proportional Allocation
( )∑
=
⋅
⋅
•= H
1h
hh
hh
h
SN
SN
nn
( )∑
=
•= H
1h
h
h
h
N
N
nn
“Occupational Variability” measure; notice that the
“adjustment” from the Proportional Allocation
formula.
15. Calculating Sh
1. Calculate a “coefficient of variation” for
each occupation within an industry.
2. Determine 90th
-percentile of occupations
within each industry.
3. Sh (for each industry) is calculated by
obtaining the weighted mean of CVs for
the 90th
-percentile of occupations within
each industry.
16. Calculating Sh
Step 1: Calculating a “coefficient of variation” for
each occupation within stratum
– Using most recent weighted estimates file:
• Count # of employees in each occupation for each business
establishment (call this yi)
• Count # of employees total for each business establishment
(call this xi)
• Sample weight, wi, represents the number of business
establishments that each establishment on the estimates
file (i) represents
• Create a “weighted ratio”Rw=Σ(wi∙yi)/Σ(wi∙xi); summed over
a defined cell
– Note: This ratio is the ratio of occupational employment to
overall employment; ratio will always be ≤ 1.
17. Calculating Sh
• CV formula (unweighted)
– Derived from variance formula
– Relative variance (CV2
) for an original variate Yi:
– Using a little algebra (remember R=y/x):
( )
2
N
i
2
i
2
Y
2
2
Y
Y)1N(
YY
Y
S
CV
⋅−
−
==
∑
R
S
x
1
xR
S
y
S
CV
y
yy
Y
⋅
=
⋅
==
( )
R
1N
xRy
x
1
CV
N
1i
2
ii
Y
−
⋅−
⋅
=
∑
=
18. Calculating Sh
( )[ ]
w
i
i
n
1i
2
iwii
Y
R
1w
xRyw
x
1
CV R
−
⋅−
⋅
≈
∑
∑
=
• CV formula (for each defined “Sh cell”),
summed by cell (including weights):
• Note: x-bar is a weighted average.
∑
∑
= n
1
i
n
i
ii
w
xw
x
19. Calculating Sh
EXAMPLE (hypothetical cell w/ sampled 2 business establishments)
• Restaurant ABC; represents 5 businesses
• What is ABC’s weight?
• Restaurant XYZ; represents itself (1 business)
• What is XYZ’s weight?
ABC’s Staffing Pattern
Occupation # employed
Waitress/Waiter 8
Cook 4
Dishwasher 2
Janitor 1
Manager 1
TOTAL 16
XYZ’s Staffing Pattern
Occupation # employed
Waitress/Waiter 32
Cook 15
Dishwasher 10
Manager 3
TOTAL 60
Calculations for ABC
Waitress/Waiter Cook Dishwasher Janitor Manager
yi
= 8 yi
= 4 yi
= 2 yi
= 1 yi
= 1
wi
yi
=5∙8=40 wi
yi
=5∙4=20 wi
yi
=5∙2=10 wi
yi
=5∙1=5 wi
yi
=5∙1=5
xi = 16 xi = 16 xi = 16 xi = 16 xi = 16
wi
xi
=5∙16=80 wi
xi
=5∙16=80 wi
xi
=5∙16=80 wi
xi
=5∙16=80 wi
xi
=5∙16=80
Calculations for XYZ
Waitress/Waiter Cook Dishwasher Manager
yi
= 32 yi
= 15 yi
= 10 yi
= 3
wi
yi
=1∙32=32 wi
yi
=1∙15=15 wi
yi
=1∙10=10 wi
yi
=1∙3=3
xi
= 60 xi
= 60 xi
= 60 xi
= 60
wi
xi
=1∙60=60 wi
xi
=1∙60=60 wi
xi
=1∙60=60 wi
xi
=1∙60=60
20. Calculating Sh
( )[ ]
w
i
i
n
1i
2
iwii
Y
R
1w
xRyw
x
1
CV R
−
⋅−
⋅
≈
∑
∑
=
ABC
yi
=8
wiyi=5∙8=40
xi
=16
wi
xi
=5∙16=80
XYZ
yi
=32
wiyi=1∙32=32
xi
=60
wi
xi
=1∙60=60
Example: CVs for
Occupations
Occupation CV
Waitress/Waiter 0.060
Cook 0
Dishwasher 0.271
Janitor 1.626
Manager 0.203
Waitress/Waiter
( )
( ) ( )
( ) 060.0
140
3240
16
140
32406032
140
32408040
15
6080
1
CV
22
YR
≈
+
−
+⋅−+
+⋅−
⋅
+
+
≈
The smaller the CV
value, the less diverse
the occupation is within
the defined cell.
21. Step 2: Avoiding “atypical” occupations
within each cell:
• Conservative approach: utilize 90th
-
percentile until further research is done
• Exclude bottom 10th
percentile of
occupations
Calculating Sh
22. Calculating Sh
Step 3: A CV is created for each occupation
within a defined cell—How are occupations
within a cell “combined” to create one value
for the cell?
– Weighted mean of 90th
-percentile occupations
• Obtain occupational proportion for each cell
• Obtain Sh by calculating weighted mean of the top-90th
-
percentile of occupations
– Less prevalent (bottom 10%) occupations are eliminated
– Sh=weighted mean of 90th-
percentile CVs within defined cell
23. Calculating Sh
Example (sorted in “proportional order”)
90th
percentile
(Look at proportions)
• 90th
-percentile Occupations
– Weighted mean=Sh=Σ ”products”
≈ 0.03 + 0 + 0.04 = 0.07
Weighted Mean of CVs of All Occupations
Occupation CV Proportion Product
Waitress/Waiter 0.060 72/140≈0.51 0.060*0.51≈0.03
Cook 0 35/140=0.25 0*0.25=0
Dishwasher 0.271 20/140≈0.14 0.271*0.14≈0.04
Manager 0.203 8/140≈0.06 0.203*0.06≈0.01
Janitor 1.626 5/140≈0.04 1.626*0.04≈0.07
24. Calculating Sh
Defining Sh “cell”
– Normality of individual CVs
– Sufficient amount of data to create reliable estimate of
occupational variability (Sh)
25. Calculating Sh
Aggregation by National Industry (Industry-only)
Concerns:
– Assumption that national aggregates of industry will produce
accurate CVs and Sh values
• Aggregation necessary due to lack of data for finely-detailed cells
• 88.6% of industry MSA-BOS staffing patterns were similar to
corresponding nationally-aggregated industry staffing patterns
(α=0.10)
27. Reliability
• Problem of small populations in geographic areas
• Desire to produce similar reliability in large and small areas
– Example: Utilizing the Neyman Allocation method illustrated,
Chicago takes up approximately 54% if Illinois’s sample allocation;
this may lead to a possible unreliable sample in non-Chicago areas
within Illinois
28. Reliability
How to “spread out” sample allocation?
Bankier (1988): Power Allocations: Determining Sample Sizes for
Subnational Areas
• Adjust exponent for Nh (numerator and denominator) in the
Neyman Allocation
• Drops Chicago’s value to approx. 34% of IL’s sample allocation
( )∑=
⋅
⋅
•= H
h
hh
hh
h
SN
SN
nn
1
Nh = sum of stratum frame employees
Sh represents an occupational
variability measure within a stratum
Occupations for each stratum (or
cell) obtained from recent
estimates file; weighted data
Denominator summed overall by state
29. Total Allocation for Illinois
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
B
loom
ingtonC
ham
paign
C
hicago
D
anvilleR
ockIsland
D
ecaturK
ankakee
P
eoria
R
ockford
E
.S
t.LouisS
pringfield
B
O
S
1
B
O
S
2
B
O
S
3
B
O
S
4
Allocation
Neyman 90th
Neyman 90th(SqRoot)
Reliability
30. Alternative Allocation Design for
the Occupational Employment
Statistics (OES) Survey
QUESTIONS?
lawley.ernest@bls.gov
Editor's Notes
QCEW/LDB will be referenced in a couple of future slides
State/MSA=MSAs split into two or more parts if crosses a state line; i.e. Kansas City is divided into Kansas City, MO and Kansas City, KS.
Now the frame has been created and stratified; time to allocate for the sample: 6.7 million frame1.2 million sample
Last modified in 1996 and hasn’t been modified since due to possible shifting of budgets from state-to-state. States do not like when budgets are shifted because it takes money away from one State and gives it to another.
Maximum Employment: refer to QCEW/LDB slide 4
An objective of a good sample design is to produce similar reliability of estimates within stratum; namely, similar occupational reliability for each defined area.
Note: you want to select an appropriate sample (and not short-change) in strata where populations are large, even though occupations are not so variable. This is due to the fact that you still want to create reliable estimates.
The “proportional to employment” design ensured that large populations would not be undersampled, but did not ensure a proper sample for populations where large occupational variability existed.
Inefficiencytoo much sample wasted on (occupational) homogenous strata; better to collect more heterogenous information.
Notice that Acc/Food Services industry contains less occupations, thus is more homogenous than Wholesale Trade industry which contains more occupations (more heterogeneous or exhibits more occupational diversity).
Proportional Allocation does not consider occupational variability within a stratum (or aggregated stratum)
Is there a method that takes into consideration both frame size AND occupational variability?
Notice that values in the numerator, when each is increased—this results in an increase in allocation (denominator is fixed for each state).
Question: what do we use for Sh? Sh is defined as the “key variable” of interest; measures some sort of “variability”. We are measuring employment and wages for 800 different occupations by stratum. Within each stratum exists an “occupational variance”.
Where to obtain occupational information? Occupational information is NOT on the QCEW (slide 4). Must obtain occupational information from the most readily available source; the most recent OES estimates file.
How to calculate Sh?
VERY CAREFULLY!
For simplicity, an Sh cell will be defined as industry-only (disregard geography).
Including weights in formula.
We do not want to include any unusual occupations when aggregating CV values; these unusual occupations tend to have high CV values and may heavily (and unncessesarily) influence occupational variance, Sh.
Put step 1 and step 2 together, calculate weighted mean of CVs for 90th percentile occupations within industry.
90th percentile: .51+.25+.14=.90
Note: Utilizing the 90th-percentile weighted mean is a more objective measure of Sh than using the “unweighted mean of typical occupations” method. Choosing typical occupations for an industry may be a subjective measure.
How are we going to group CVs to get a weighted average for Sh? Best grouping was determined nationally by industry. That is, take a weighed mean of all CVs for the 90th percentile of occupations in each national industry cell (or stratum).
Does staffing pattern of a local industry match staffing pattern of national industry? Does staffing pattern of education industry in Miami match national education staffing pattern? Does staffing pattern of education industry in Salt Lake City match national education staffing pattern?
How are we going to group CVs to get a weighted average for Sh? Best grouping was determined nationally by industry. That is, take a weighed mean of all CVs for the 90th percentile of occupations in each national industry cell (or stratum).
Neyman Allocation using occupational variability measure for Sh takes care of the problem of unreliable estimates within industries that may have a small number of employees relative to occupational variability, but what about unreliable estimates in areas of states where there are small populations?
Bloomington, Champaign, Danville, Kankakee, Springfield
Adjusted exponent to Nh and tested; acceptable level of “spread” of allocation when exponent=0.5.