Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Teacher evaluation presentation mississippi
1. Taking Control of the Mississippi Teacher Evaluation
Framework
John Cronin, Ph.D. – Senior Director of Education Research
Northwest Evaluation Association
You can view and download this presentation at:
http://www.slideshare.net/JFCronin/teacher-evaluation-present-41205047
2. Percent of students who say
they do not receive their state
accountability test results.
37%
Make Assessment Matter: Students and Educators
Want Tests that Support Learning (2014). –
Portland, OR. NWEA and Grunwald Associates LLC.
4. What NWEA supports
• The evaluation process should focus on
helping teachers improve.
• The principal or designated evaluator
should control the evaluation.
• Tests should inform principal decision-
making and not be the deciding factor in
an evaluation.
• Multiple measures should be used.
5. Ultimately – the principal should
decide
• Evaluation inherently involves
judgment – not a bad thing.
• Evidence should inform and not
direct their judgment.
• The implemented system should
differentiate performance.
• Courts respect the judgment of
school administrators relative to
personnel decisions.
6. Effective teaching
and professional
job performance
Evidence of
professional
responsibilities
Evidence of
student
learning
Evidence of
professional
practice
The evaluation of teaching
by classroom observation
and use of artifacts
The evaluation of the
teacher’s effectiveness in
making progress toward
their goals and fulfilling the
responsibilities of a
professional educator.
The evaluation of a
teacher’s contribution to
student learning and
growth
A simple framework for teacher evaluation
7. Effective teaching
and professional
job performance
Evidence of
professional
responsibilities
Evidence of
student
learning
Evidence of
professional
practice
MSTAR-= Domains 1-4
• Planning
• Assessment
• Instruction
• Learning Environment
M-STAR Domain 5:
Professional practices and
responsibilities
Individual Growth or
Student Learning Objectives
- 30%
Schoolwide Growth – 20%
A simple framework for teacher evaluation –
Mississippi
8. Distinguishing teacher effectiveness from
teacher evaluation
• Teacher effectiveness – The judgment of a teacher’s
ability to positively impact learning in the classroom.
• Teacher evaluation – The judgment of a teacher’s
overall performance including:
– Teacher effectiveness
– Common standards of job performance
– Participation in the school community
– Adherence to professional standards
9. What teacher effectiveness infers
• Evidence of Learning – A claim that the
improvement in learning (or lack of it)
reflected on one or more tests is caused by
the teacher.
• Evidence of good practice – That the
observers ratings or conclusions are reliable
and associated with behaviors that cause
improved learning in the classroom.
10. Purposes of summative evaluation
• Make an accurate and defensible
judgment of an educator’s job
performance.
• Provide ratings of performance that
provide meaningful differentiation across
educators.
• Help educators focus on their students
and their practice.
• Retain your top educators.
• Dismiss ineffective educators.
11. The greatest tragedy of this century in
education so far, was the number of
young, talented teachers who lost their
positions in the last recession.
12. Employment of Elementary Teachers
2007-2012
1538000 1544270 1544300
1485600
1415000
1360380
2007 2008 2009 2010 2011 2012
NUMBER OF TEACHERS
Source: (2012, May) Bureau of Labor Statistics – Occupational Employment Statistics
Numbers exclude special education and kindergarten teachers
The elementary school
teacher workforce shrunk by
178,000 teachers (11%)
between May, 2007 and May,
2012.
13. The impact of seniority based layoffs on
learning
Source: Boyd, L., Lankford, H., Loeb, S., and Wycoff, J. (2011). Center for Education Policy.
Stanford University.
In a simulation study of implementation of a layoff of 5%
of teachers using New York City data, reliance on seniority
based layoffs resulted would:
• Result in 25% more teachers laid off.
• Teachers laid off would be .31 standard deviations more
effective (using a value-added criterion) than those lost
using an effectiveness criterion.
• 84% of teachers with unsatisfactory ratings would be
retained.
14. Elements of a teacher’s score
M-STAR –
supervisor’s
evaluation
School-wide
Growth
Student Growth
Percentile or
Student Learning
Objective
15. Teacher observation as a part of
teacher evaluation
Systematic observation of teacher performance
is a central part of every state’s teacher
evaluation plan.
16. If performance ratings aren’t
differentiated, then all
differentiation comes from
test data.
21. “The (Race to the Top teacher evaluation) changes, already
under way in some cities and states, are intended to
provide meaningful feedback and, critically, to weed out
weak performers. And here are some of the early results:
In Florida, 97 percent of teachers were deemed effective
or highly effective in the most recent evaluations. In
Tennessee, 98 percent of teachers were judged to be “at
expectations.” In Michigan, 98 percent of teachers were
rated effective or better.”
Source: New York Times (2013, March 30). Curious Grade for Teachers: Nearly all Pass.
Retrieved from:
http://www.nytimes.com/2013/03/31/education/curious-grade-for-teachers-nearly-
all-pass.html?pagewanted=all&_r=0
22. Results of Georgia Teacher Evaluation
Pilot
1% 2%
75%
23%
Evaluator Rating
ineffective
Minimally Effective
Effective
Highly Effective
25. Bill and Melina Gates Foundation (2013, January). Ensuring Fair and Reliable
Measures of Effective Teaching: Culminating Findings from the MET Projects Three-
Year Study
Observation by Reliability coefficient
(relative to state test
value-added gain)
Proportion of test
variance
explained
Principal – 1 .51 26.0%
Principal – 2 .58 33.6%
Principal and other administrator .67 44.9%
Principal and three short
observations by peer observers
.67 44.9%
Two principal observations and
two peer observations
.66 43.6%
Two principal observations and
two different peer observers
.69 47.6%
Two principal observations one
peer observation and three short
observations by peers
.72 51.8%
Reliability of a variety of teacher observation
implementations
26. Elements of a teacher’s score
M-STAR –
supervisor’s
evaluation
School-wide
Growth
Student Growth
Percentile or
Student Learning
Objective
28. • Produce rankings of student growth. A teacher’s SGP
is the median of that teacher’s students.
• Do not introduce controls for factors outside a
teacher’s control that may influence growth.
• Advances a claim of causation – that the teachers
ranking is based on learning caused.
• Can be applied to as few as 20% of the teachers in a
school system (Whitehurst, 2013).
Student Growth Percentiles
29. • Poverty rate of students in the classroom.
• Language development of students.
• Special education status of students.
• Prior disciplinary record of students.
• Student attendance.
• Non-random assignment of students to teachers.
• Class size.
• Gender.
Factors that may influence a
teacher’s growth percentile
Baker B., Oluwole, J., Green, P. (2013). The legal consequences of mandating high stakes decisions
based on low quality information: Teacher evaluation in the Race to the Top Era. Education Policy
Analysis Archives. Vol 21. No 5.
30. • They do not support claims of causation.
• They do not control for external factors, outside the
teacher’s control, that may impact growth.
• If teachers or students improve as a group, it will not
be reflected in SGP data.
Issues with Student Growth
Percentiles
Whitehurst, G. J. (2013). Teacher value- added: Do we want a ten percent solution? The
Brown Center Chalkboard, April 24. Washington, DC: Brookings Institution.
Retrieved October 2, 2014, from www.brookings.edu/blogs/brown-center-chalkboard/
posts/2013/04/24-merit-pay-whitehurst
32. Why test design matters
In 2014 the 8th grade Mississippi math test provided
nearly double the information function for students
performing one standard deviation above the mean as
students at the mean and four times the information as
compared with those one standard deviation below.
33. Tests are not equally accurate for all
students
California STAR NWEA MAP
34. Issues in the use of growth and value-added
measures
“Among those who ranked in the top
category on the TAKS reading test, more
than 17% ranked among the lowest two
categories on the Stanford. Similarly
more than 15% of the lowest value-added
teachers on the TAKS were in the highest
two categories on the Stanford.”
Corcoran, S., Jennings, J., & Beveridge, A., Teacher Effectiveness on High and Low
Stakes Tests, Paper presented at the Institute for Research on Poverty summer
workshop, Madison, WI (2010).
35. What Makes Schools Work Study -
Mathematics
-10.0
-5.0
0.0
5.0
10.0
15.0
-10.0 -5.0 0.0 5.0 10.0 15.0
Year2
Year 1
Value-added index by teacher
Data used represents a portion of the teachers who participated in Vanderbilt
University’s What Makes Schools Work Project, funded by the federal Institute of
Education Sciences
37. Elements of a teacher’s score
M-STAR –
supervisor’s
evaluation
School-wide
Growth
Student Growth
Percentile or
Student Learning
Objective
38. Benefits and risks around the use of
school-wide growth
• Benefits
• Encourages collaboration
• Increases focus on language development and
mathematics
• Risks
• Not a valid measure of job performance
• Believed to be unfair by teachers in these
subjects.
39. “What aggravates teachers most is that 40 to 50 percent
of their evaluation is based on "student achievement"
— but it's not always their own students who are being
measured.
For example, fourth- and fifth-grade teachers are rated
partly on their students' FCAT scores. But the FCAT is not
given until third grade. So if you teach a lower grade,
then your "student achievement" score is based on the
scores of older students at your school. Similarly,
teachers of subjects that don't even appear on the
state's standardized test are being evaluated, at least in
part, on FCAT scores.”
Krueger, C. Solochek, J., and Sokol, M. (2012, October 19)
VAM, the new teacher evaluation system, stirs concern,
confusion. Tampa Bay Times.
40. The actual proportion of
teachers for which student
growth can be measured
through the typical state
assessment.
25%
41. • Are a contract negotiated between the principal and teacher
around student results.
• Do not produce rankings that compare teacher results across
settings
• Do not introduce controls to account for factors that may
influence growth that are outside the teachers influence.
• Do not advance a claim of causation – teacher competence is
demonstrated by fulfillment of the contract
Student Learning Objectives
42. • Do not provide evidence of teacher effectiveness.
• Teachers using SLOs may be evaluated against less
rigorous criteria than teachers evaluated by value-added
methods.
• Goals are not consistent in difficulty.
• Goals are not consistent across teachers.
Student Learning Objectives
43. Employing value-added methodologies, Jackson
found that teachers had a substantive effect on
non-cognitive outcomes that was independent
of their effect on test scores
Source: Jackson, K. (2013). Non-Cognitive Ability, Test Scores and Teacher
Quality: Evidence from 9th Grade Teachers in North Carolina. Northwestern
University and NBER
Non-cognitive factors
44. • Lowered the average student absenteeism by 7.4 days.
• Improved the probability that students would enroll in
the next grade by 5 percentage points.
• Reduced the likelihood of suspension by 2.8%
• Improved the average GPA by .09 (Algebra) or .05
(English)
Source: Jackson, K. (2013). Non-Cognitive Ability, Test Scores and Teacher
Quality: Evidence from 9th Grade Teachers in North Carolina. Northwestern
University and NBER
Non-cognitive factors
46. Suggested reading
Baker B., Oluwole, J., Green, P. (2013). The legal
consequences of mandating high stakes decisions based
on low quality information: Teacher evaluation in the
Race to the Top Era. Education Policy Analysis Archives.
Vol 21. No 5.