SlideShare a Scribd company logo
The seven habits of highly
effective statisticians
Stephen Senn
Consultant Statistician, Edinburgh, UK
© Stephen Senn 2020 1
A Question to Keep You Amused
Consider a ‘coin of ignorance’
   
 
, 1 ,
1, 0 1
P H P T
f
 
 
  
  
The coin is tossed 100 times. If X is the number of heads,
which of these two is more likely?:
 
 
50
100 ?
P X
P X


100!/(50!50!)  1029 sequences
One sequence
© Stephen Senn 2020 2
 Is the
probability
of a head
Every
value of 
is equally
likely
Of course, this is an ironic title
• Any statistician knows that you should think in terms of the three Cs:
• Causation
• Control
• Comparison
• To which a fourth might be added
• Counterfactuals
• The question of interest is
• What habits have a beneficial effect on your probability of being an effective
statistician?
• Many effective statisticians will be in the habit of taking breakfast. This
doesn’t make taking breakfast a cause of being an effective statistician.
© Stephen Senn 2020 3
That which
would have
happened
had you
acted
differently
And my advice is hypocritical
• I earn my living as a statistician promoting, using and evaluating
numerical evidence
• Based on studies with
• Control
• Randomisation
• Replication
• I am proposing instead to give you advice based on one uncontrolled
example
• Me
© Stephen Senn 2020 4
The magnificent seven
• Read
• Listen ( & see)
• Understand
• Think
• Do
• Calculate
• Communicate
• Include some classics in your reading
• Fit the answer to the problem not vice versa
• Requires some subject matter comprehension
• It’s not just a matter of mathematics (but it also is)
• The devil is the detail and doing discovers it
• Use calculations to increase, not instead of understanding
• Think hard about what the simplest honest way is to
communicate the message
© Stephen Senn 2020 5
I am not going to go through this list in detail
• Instead I shall illustrate some of these points by a few examples I shall
present
• Invalid inversion
• Regression to the mean
• Some statistical ‘howlers’
• These will illustrate between them the value of
• Understand
• Communicate
• Think
• Do
• Read
• Calculate
© Stephen Senn 2020 6
What happened to Listen?
That’s where you come in!
A Simple Example of ‘Invalid Inversion’
• Most women do not suffer from breast cancer
• It would be a mistake to conclude, however, that most breast cancer
victims are not women
• To do so would be to transpose the conditionals
• This is an example of invalid inversion
• Why is this important?
• People regularly confuse the probability of the data given the
hypothesis with the probability of the hypothesis given the data
• Misinterpretation of P-values is linked to this
7(c) Stephen Senn
Some Plausible Figures for the UK
8(c) Stephen Senn
Some Plausible Figures for the UK
Probability breast cancer given female = 550/31,418=0.018
9(c) Stephen Senn
Some Plausible Figures for the UK
Probability female given breast cancer =550/553=0.995
10(c) Stephen Senn
The difference is in the denominator
The numerator is the same
11(c) Stephen Senn
Invalid inversion is an error caused by mistaking the relevant marginal class
550/31418 or 550/553
A Little Maths
 
 
 
 
 
 
       Unless ,
P A B
P A B
P B
P A B
P B A
P A
P B P A P A B P B A




 
So invalid inversion is equivalent to a confusion of the marginal probabilities. The
same joint probability is involved in the two conditional probabilities but different
marginal probabilities are involved
12(c) Stephen Senn
The Regression Analogue
Predicting Y from X is not the same as predicting X from Y.
2
2
XY
Y X
X
XY
X Y
Y








Note the similarity with the probability case.
The numerator (the covariance) is a statistic of joint variation.
The denominators (the variances) are statistics of marginal variation. These
marginal statistics are not the same.
13(c) Stephen Senn
The difference is in the denominator
The numerator is the same
Dimensional analysis
• Consider the example of regressing weight from height and vice versa
• Suppose you put height in cm into your ‘black’ box to predict weight in kg
• The input is in cm
• The output is in kg
• You must multiply the cm by a regression coefficient that is in kg/cm
• The covariance is in units of kg x cm and you divide by a variance that is in cm2 to get
kg/cm
• Suppose you put weight in kg into your black box to predict height in cm
• You must multiply the kg in a coefficient that is in cm/kg
• The numerator is the covariance in both cases
• A different variance is used for the denominator
© Stephen Senn 2020 14
Just to make that perfectly clear
© Stephen Senn 2020 15
𝑤𝑒𝑖𝑔ℎ𝑡 𝑘𝑔 = 𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝑘𝑔 +
𝑐𝑜𝑣 𝑐𝑚 × 𝑘𝑔
𝑣𝑎𝑟 𝑐𝑚 × 𝑐𝑚
× ℎ𝑒𝑖𝑔ℎ𝑡 𝑐𝑚
ℎ𝑒𝑖𝑔ℎ𝑡 𝑐𝑚 = 𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝑐𝑚 +
𝑐𝑜𝑣(𝑐𝑚 × 𝑘𝑔)
𝑣𝑎𝑟(𝑘𝑔 × 𝑘𝑔)
× 𝑤𝑒𝑖𝑔ℎ𝑡(𝑘𝑔)
Morals
• Think carefully about basic and fundamental concepts in probability and
statistics
• Seek an understanding that is not just mathematical but that reveals why
things have to be the way they are
• Make parallels
• Regression is similar to conditional probability in some way
• Dimensional analysis (a tool used by physicists and engineers) is very valuable
• Find the simplest way to communicate important points
• Proofs are good but not for this
• Examples are excellent
• Read widely and seek different explanations of the same thing
© Stephen Senn 2020 16
Regression to the Mean
A Simulated Example
• Diastolic blood pressure (DBP)
• Mean 90mmHg
• Between patient variance 50mmHg2
• Within patient variance 15 mmHg2
• Boundary for hypertensive 95 mmHg
• Simulation of 1000 patients whose DBP at baseline
and outcome are shown
• Blue consistent normotensive
• Red Consistent hypertensive
• Orange hypertensive/normotensive or vice versa
17(c) Stephen Senn
18(c) Stephen Senn
What you will
see if all
patients
are followed up
19(c) Stephen Senn
What you will
see if hypertensive
patients
are followed up
(c) Stephen Senn 20
Mean at baseline and
outcome are the same
Mean at outcome is
lower than at baseline
All patients are hypertensive
at baseline
Many are not at outcome
Probably not the best way to explain this
© Stephen Senn 2020 21
Who wrote this?
Senn, S. J. (1988). How much of the placebo 'effect' is really statistical
regression? [letter]. Statistics in Medicine, 7(11), 1203
Doing and calculating avoids stupid mistakes
Stupid mistake Cure
Proposing allocation ratios of 7:5:3
for a three armed trial.
Calculate the minimum block size.
Hint: It’s 105.
Proposing some software for cross-
over trials that could adjust the
treatments to which patients are
allocated depending on results in
earlier periods.
Try do this is real time.
Hint: This may help you learn that patients do not
arrive simultaneously in a clinical trial.
Claim that the use of placebos in
clinical trials is unethical if there is
an effective treatment.
Run a clinical trial in a serious disease where there
is a partially effective treatment.
Hint: How do you avoid withdrawing the partially
effective treatment from some patients?
© Stephen Senn 2020 22
Advice on Understanding, Thinking, Reading
etc.
• Mathematics is important
• But it’s not enough
• Statistics is not a branch of mathematics although probability theory is
• Applications are important
• Loving your data
• Getting to know the application area
• Biology!
• Pharmacology!
• Reading the classics is good for you
• Especially Fisher
© Stephen Senn 2020 23
That problem
The two events are equally likely. In fact,
 
1
, , 0,1, .
1
n P X k k n
n
   

L
Proof could involve some or all of the following:
marginal, conditional and joint probabilities
calculus
Bayes theorem
posterior probability
predictive distribution
proof by induction
© Stephen Senn 2020 24
Intuition
Imagine one billion tosses.
Your posterior probability would have to be very close to the
observed relative frequency, which would be close to the
‘true’ value.
But your prior probability says every true value is equally
likely.
Therefore, every observable ratio is equally likely.
But the result is also trivially true for n = 1. It is hardly
surprising, therefore, if the result is true for every value of n
between 1 and 1 billion.
© Stephen Senn 2020 25
Moral
• It is important to think about your assumptions carefully
• If you do this you can understand what they imply
• Trying simple cases is helpful
• If you do this you can often see what the solution must be
• Extreme cases (one billion tosses) can also be helpful
• The mathematical solution is valuable but it is not a substitute for this
• Statistics is more than just mathematics
• It is also science and philosophy
© Stephen Senn 2020 26
real problem
real problem
operational
problem
solution application
solution application
idealised
problem
Mathematics
Statistics
© Stephen Senn 2020 27
In the mathematical formulation of any problem it is necessary
to base oneself on some appropriate idealizations and
simplification…... One loses sight of the original nature of the
problem, falls in love with the idealization, and then blames
reality for not conforming to it.
de Finetti, (1975).
It seems a pity that while we statisticians have an opportunity
to rate as first-class scientists we should settle for the rather
dreary role of second-class mathematicians.
George Box (1990)
© Stephen Senn 2020 28
Statistics is a subject where everything has to
be understood three times
•In terms of mathematics
•In terms of philosophy
•In terms of application
© Stephen Senn 2020 29
•Finally, I would like
to leave you with
this question
•Did you know there
are only 120 days
to Christmas?
Traditional Polish Present
Piernik
Alternative suggestion
© Stephen Senn 2020 30
3rd edition out soon

More Related Content

What's hot

Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient health
Maarten van Smeden
 
Has modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurtHas modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurt
Stephen Senn
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...
Evangelos Kritsotakis
 
Introduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIIntroduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part II
Maarten van Smeden
 
Minimally important differences
Minimally important differencesMinimally important differences
Minimally important differences
Stephen Senn
 
Choosing Regression Models
Choosing Regression ModelsChoosing Regression Models
Choosing Regression Models
Stephen Senn
 
Mann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi SquaredMann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi Squaredguest2137aa
 
Minimally important differences v2
Minimally important differences v2Minimally important differences v2
Minimally important differences v2
Stephen Senn
 
Sample size calculations
Sample size calculationsSample size calculations
Whatever happened to design based inference
Whatever happened to design based inferenceWhatever happened to design based inference
Whatever happened to design based inference
StephenSenn2
 
T test
T testT test
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradox
Maarten van Smeden
 
Methods of randomisation in clinical trials
Methods of randomisation in clinical trialsMethods of randomisation in clinical trials
Methods of randomisation in clinical trials
Amy Mehaboob
 
Introduction To Survival Analysis
Introduction To Survival AnalysisIntroduction To Survival Analysis
Introduction To Survival Analysis
federicorotolo
 
Sample size calculation
Sample size calculationSample size calculation
Sample size calculation
Santam Chakraborty
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Ewout Steyerberg
 
Correcting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingCorrecting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confounding
Maarten van Smeden
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptx
StephenSenn2
 

What's hot (20)

Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient health
 
Has modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurtHas modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurt
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...
 
Introduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIIntroduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part II
 
Minimally important differences
Minimally important differencesMinimally important differences
Minimally important differences
 
Choosing Regression Models
Choosing Regression ModelsChoosing Regression Models
Choosing Regression Models
 
Mc Nemar
Mc NemarMc Nemar
Mc Nemar
 
Mann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi SquaredMann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi Squared
 
Minimally important differences v2
Minimally important differences v2Minimally important differences v2
Minimally important differences v2
 
Sample size calculations
Sample size calculationsSample size calculations
Sample size calculations
 
Whatever happened to design based inference
Whatever happened to design based inferenceWhatever happened to design based inference
Whatever happened to design based inference
 
Part 1 Survival Analysis
Part 1 Survival AnalysisPart 1 Survival Analysis
Part 1 Survival Analysis
 
T test
T testT test
T test
 
Improving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradoxImproving predictions: Lasso, Ridge and Stein's paradox
Improving predictions: Lasso, Ridge and Stein's paradox
 
Methods of randomisation in clinical trials
Methods of randomisation in clinical trialsMethods of randomisation in clinical trials
Methods of randomisation in clinical trials
 
Introduction To Survival Analysis
Introduction To Survival AnalysisIntroduction To Survival Analysis
Introduction To Survival Analysis
 
Sample size calculation
Sample size calculationSample size calculation
Sample size calculation
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
 
Correcting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confoundingCorrecting for missing data, measurement error and confounding
Correcting for missing data, measurement error and confounding
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptx
 

Similar to The Seven Habits of Highly Effective Statisticians

What should we expect from reproducibiliry
What should we expect from reproducibiliryWhat should we expect from reproducibiliry
What should we expect from reproducibiliry
Stephen Senn
 
Thinking statistically v3
Thinking statistically v3Thinking statistically v3
Thinking statistically v3
Stephen Senn
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis Testing
Ryan Herzog
 
Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error.
EPINOR
 
How to do the maths
How to do the mathsHow to do the maths
How to do the maths
Ahmed Elfaitury
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
jemille6
 
Seven myths of randomisation
Seven myths of randomisation Seven myths of randomisation
Seven myths of randomisation
Stephen Senn
 
In search of the lost loss function
In search of the lost loss function In search of the lost loss function
In search of the lost loss function
Stephen Senn
 
Statistics in clinical and translational research common pitfalls
Statistics in clinical and translational research  common pitfallsStatistics in clinical and translational research  common pitfalls
Statistics in clinical and translational research common pitfalls
Pavlos Msaouel, MD, PhD
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
StephenSenn2
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
jemille6
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notescartlidge
 
The challenge of small data
The challenge of small dataThe challenge of small data
The challenge of small data
Stephen Senn
 
Why I hate minimisation
Why I hate minimisationWhy I hate minimisation
Why I hate minimisation
Stephen Senn
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
jasondeveau
 
On being Bayesian
On being BayesianOn being Bayesian
On being Bayesian
Stephen Senn
 
P values and replication
P values and replicationP values and replication
P values and replication
Stephen Senn
 
probability.pptx
probability.pptxprobability.pptx
probability.pptx
bisan3
 
Insights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityInsights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibility
Dorothy Bishop
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptx
StephenSenn3
 

Similar to The Seven Habits of Highly Effective Statisticians (20)

What should we expect from reproducibiliry
What should we expect from reproducibiliryWhat should we expect from reproducibiliry
What should we expect from reproducibiliry
 
Thinking statistically v3
Thinking statistically v3Thinking statistically v3
Thinking statistically v3
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis Testing
 
Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error.
 
How to do the maths
How to do the mathsHow to do the maths
How to do the maths
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Seven myths of randomisation
Seven myths of randomisation Seven myths of randomisation
Seven myths of randomisation
 
In search of the lost loss function
In search of the lost loss function In search of the lost loss function
In search of the lost loss function
 
Statistics in clinical and translational research common pitfalls
Statistics in clinical and translational research  common pitfallsStatistics in clinical and translational research  common pitfalls
Statistics in clinical and translational research common pitfalls
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
 
The challenge of small data
The challenge of small dataThe challenge of small data
The challenge of small data
 
Why I hate minimisation
Why I hate minimisationWhy I hate minimisation
Why I hate minimisation
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
On being Bayesian
On being BayesianOn being Bayesian
On being Bayesian
 
P values and replication
P values and replicationP values and replication
P values and replication
 
probability.pptx
probability.pptxprobability.pptx
probability.pptx
 
Insights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityInsights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibility
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptx
 

More from Stephen Senn

What is your question
What is your questionWhat is your question
What is your question
Stephen Senn
 
Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19
Stephen Senn
 
To infinity and beyond v2
To infinity and beyond v2To infinity and beyond v2
To infinity and beyond v2
Stephen Senn
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVA
Stephen Senn
 
Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?
Stephen Senn
 
A century of t tests
A century of t testsA century of t tests
A century of t tests
Stephen Senn
 
Is ignorance bliss
Is ignorance blissIs ignorance bliss
Is ignorance bliss
Stephen Senn
 
To infinity and beyond
To infinity and beyond To infinity and beyond
To infinity and beyond
Stephen Senn
 
De Finetti meets Popper
De Finetti meets PopperDe Finetti meets Popper
De Finetti meets Popper
Stephen Senn
 
Understanding randomisation
Understanding randomisationUnderstanding randomisation
Understanding randomisation
Stephen Senn
 
In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?
Stephen Senn
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTs
Stephen Senn
 
The Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradoxThe Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradox
Stephen Senn
 
The revenge of RA Fisher
The revenge of RA Fisher The revenge of RA Fisher
The revenge of RA Fisher
Stephen Senn
 
The story of MTA/02
The story of MTA/02The story of MTA/02
The story of MTA/02
Stephen Senn
 
Confounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricksConfounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricks
Stephen Senn
 
And thereby hangs a tail
And thereby hangs a tailAnd thereby hangs a tail
And thereby hangs a tail
Stephen Senn
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA Fisher
Stephen Senn
 

More from Stephen Senn (18)

What is your question
What is your questionWhat is your question
What is your question
 
Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19
 
To infinity and beyond v2
To infinity and beyond v2To infinity and beyond v2
To infinity and beyond v2
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVA
 
Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?
 
A century of t tests
A century of t testsA century of t tests
A century of t tests
 
Is ignorance bliss
Is ignorance blissIs ignorance bliss
Is ignorance bliss
 
To infinity and beyond
To infinity and beyond To infinity and beyond
To infinity and beyond
 
De Finetti meets Popper
De Finetti meets PopperDe Finetti meets Popper
De Finetti meets Popper
 
Understanding randomisation
Understanding randomisationUnderstanding randomisation
Understanding randomisation
 
In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTs
 
The Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradoxThe Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradox
 
The revenge of RA Fisher
The revenge of RA Fisher The revenge of RA Fisher
The revenge of RA Fisher
 
The story of MTA/02
The story of MTA/02The story of MTA/02
The story of MTA/02
 
Confounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricksConfounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricks
 
And thereby hangs a tail
And thereby hangs a tailAnd thereby hangs a tail
And thereby hangs a tail
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA Fisher
 

Recently uploaded

Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
2023240532
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 

Recently uploaded (20)

Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 

The Seven Habits of Highly Effective Statisticians

  • 1. The seven habits of highly effective statisticians Stephen Senn Consultant Statistician, Edinburgh, UK © Stephen Senn 2020 1
  • 2. A Question to Keep You Amused Consider a ‘coin of ignorance’       , 1 , 1, 0 1 P H P T f           The coin is tossed 100 times. If X is the number of heads, which of these two is more likely?:     50 100 ? P X P X   100!/(50!50!)  1029 sequences One sequence © Stephen Senn 2020 2  Is the probability of a head Every value of  is equally likely
  • 3. Of course, this is an ironic title • Any statistician knows that you should think in terms of the three Cs: • Causation • Control • Comparison • To which a fourth might be added • Counterfactuals • The question of interest is • What habits have a beneficial effect on your probability of being an effective statistician? • Many effective statisticians will be in the habit of taking breakfast. This doesn’t make taking breakfast a cause of being an effective statistician. © Stephen Senn 2020 3 That which would have happened had you acted differently
  • 4. And my advice is hypocritical • I earn my living as a statistician promoting, using and evaluating numerical evidence • Based on studies with • Control • Randomisation • Replication • I am proposing instead to give you advice based on one uncontrolled example • Me © Stephen Senn 2020 4
  • 5. The magnificent seven • Read • Listen ( & see) • Understand • Think • Do • Calculate • Communicate • Include some classics in your reading • Fit the answer to the problem not vice versa • Requires some subject matter comprehension • It’s not just a matter of mathematics (but it also is) • The devil is the detail and doing discovers it • Use calculations to increase, not instead of understanding • Think hard about what the simplest honest way is to communicate the message © Stephen Senn 2020 5
  • 6. I am not going to go through this list in detail • Instead I shall illustrate some of these points by a few examples I shall present • Invalid inversion • Regression to the mean • Some statistical ‘howlers’ • These will illustrate between them the value of • Understand • Communicate • Think • Do • Read • Calculate © Stephen Senn 2020 6 What happened to Listen? That’s where you come in!
  • 7. A Simple Example of ‘Invalid Inversion’ • Most women do not suffer from breast cancer • It would be a mistake to conclude, however, that most breast cancer victims are not women • To do so would be to transpose the conditionals • This is an example of invalid inversion • Why is this important? • People regularly confuse the probability of the data given the hypothesis with the probability of the hypothesis given the data • Misinterpretation of P-values is linked to this 7(c) Stephen Senn
  • 8. Some Plausible Figures for the UK 8(c) Stephen Senn
  • 9. Some Plausible Figures for the UK Probability breast cancer given female = 550/31,418=0.018 9(c) Stephen Senn
  • 10. Some Plausible Figures for the UK Probability female given breast cancer =550/553=0.995 10(c) Stephen Senn
  • 11. The difference is in the denominator The numerator is the same 11(c) Stephen Senn Invalid inversion is an error caused by mistaking the relevant marginal class 550/31418 or 550/553
  • 12. A Little Maths                    Unless , P A B P A B P B P A B P B A P A P B P A P A B P B A       So invalid inversion is equivalent to a confusion of the marginal probabilities. The same joint probability is involved in the two conditional probabilities but different marginal probabilities are involved 12(c) Stephen Senn
  • 13. The Regression Analogue Predicting Y from X is not the same as predicting X from Y. 2 2 XY Y X X XY X Y Y         Note the similarity with the probability case. The numerator (the covariance) is a statistic of joint variation. The denominators (the variances) are statistics of marginal variation. These marginal statistics are not the same. 13(c) Stephen Senn The difference is in the denominator The numerator is the same
  • 14. Dimensional analysis • Consider the example of regressing weight from height and vice versa • Suppose you put height in cm into your ‘black’ box to predict weight in kg • The input is in cm • The output is in kg • You must multiply the cm by a regression coefficient that is in kg/cm • The covariance is in units of kg x cm and you divide by a variance that is in cm2 to get kg/cm • Suppose you put weight in kg into your black box to predict height in cm • You must multiply the kg in a coefficient that is in cm/kg • The numerator is the covariance in both cases • A different variance is used for the denominator © Stephen Senn 2020 14
  • 15. Just to make that perfectly clear © Stephen Senn 2020 15 𝑤𝑒𝑖𝑔ℎ𝑡 𝑘𝑔 = 𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝑘𝑔 + 𝑐𝑜𝑣 𝑐𝑚 × 𝑘𝑔 𝑣𝑎𝑟 𝑐𝑚 × 𝑐𝑚 × ℎ𝑒𝑖𝑔ℎ𝑡 𝑐𝑚 ℎ𝑒𝑖𝑔ℎ𝑡 𝑐𝑚 = 𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝑐𝑚 + 𝑐𝑜𝑣(𝑐𝑚 × 𝑘𝑔) 𝑣𝑎𝑟(𝑘𝑔 × 𝑘𝑔) × 𝑤𝑒𝑖𝑔ℎ𝑡(𝑘𝑔)
  • 16. Morals • Think carefully about basic and fundamental concepts in probability and statistics • Seek an understanding that is not just mathematical but that reveals why things have to be the way they are • Make parallels • Regression is similar to conditional probability in some way • Dimensional analysis (a tool used by physicists and engineers) is very valuable • Find the simplest way to communicate important points • Proofs are good but not for this • Examples are excellent • Read widely and seek different explanations of the same thing © Stephen Senn 2020 16
  • 17. Regression to the Mean A Simulated Example • Diastolic blood pressure (DBP) • Mean 90mmHg • Between patient variance 50mmHg2 • Within patient variance 15 mmHg2 • Boundary for hypertensive 95 mmHg • Simulation of 1000 patients whose DBP at baseline and outcome are shown • Blue consistent normotensive • Red Consistent hypertensive • Orange hypertensive/normotensive or vice versa 17(c) Stephen Senn
  • 18. 18(c) Stephen Senn What you will see if all patients are followed up
  • 19. 19(c) Stephen Senn What you will see if hypertensive patients are followed up
  • 20. (c) Stephen Senn 20 Mean at baseline and outcome are the same Mean at outcome is lower than at baseline All patients are hypertensive at baseline Many are not at outcome
  • 21. Probably not the best way to explain this © Stephen Senn 2020 21 Who wrote this? Senn, S. J. (1988). How much of the placebo 'effect' is really statistical regression? [letter]. Statistics in Medicine, 7(11), 1203
  • 22. Doing and calculating avoids stupid mistakes Stupid mistake Cure Proposing allocation ratios of 7:5:3 for a three armed trial. Calculate the minimum block size. Hint: It’s 105. Proposing some software for cross- over trials that could adjust the treatments to which patients are allocated depending on results in earlier periods. Try do this is real time. Hint: This may help you learn that patients do not arrive simultaneously in a clinical trial. Claim that the use of placebos in clinical trials is unethical if there is an effective treatment. Run a clinical trial in a serious disease where there is a partially effective treatment. Hint: How do you avoid withdrawing the partially effective treatment from some patients? © Stephen Senn 2020 22
  • 23. Advice on Understanding, Thinking, Reading etc. • Mathematics is important • But it’s not enough • Statistics is not a branch of mathematics although probability theory is • Applications are important • Loving your data • Getting to know the application area • Biology! • Pharmacology! • Reading the classics is good for you • Especially Fisher © Stephen Senn 2020 23
  • 24. That problem The two events are equally likely. In fact,   1 , , 0,1, . 1 n P X k k n n      L Proof could involve some or all of the following: marginal, conditional and joint probabilities calculus Bayes theorem posterior probability predictive distribution proof by induction © Stephen Senn 2020 24
  • 25. Intuition Imagine one billion tosses. Your posterior probability would have to be very close to the observed relative frequency, which would be close to the ‘true’ value. But your prior probability says every true value is equally likely. Therefore, every observable ratio is equally likely. But the result is also trivially true for n = 1. It is hardly surprising, therefore, if the result is true for every value of n between 1 and 1 billion. © Stephen Senn 2020 25
  • 26. Moral • It is important to think about your assumptions carefully • If you do this you can understand what they imply • Trying simple cases is helpful • If you do this you can often see what the solution must be • Extreme cases (one billion tosses) can also be helpful • The mathematical solution is valuable but it is not a substitute for this • Statistics is more than just mathematics • It is also science and philosophy © Stephen Senn 2020 26
  • 27. real problem real problem operational problem solution application solution application idealised problem Mathematics Statistics © Stephen Senn 2020 27
  • 28. In the mathematical formulation of any problem it is necessary to base oneself on some appropriate idealizations and simplification…... One loses sight of the original nature of the problem, falls in love with the idealization, and then blames reality for not conforming to it. de Finetti, (1975). It seems a pity that while we statisticians have an opportunity to rate as first-class scientists we should settle for the rather dreary role of second-class mathematicians. George Box (1990) © Stephen Senn 2020 28
  • 29. Statistics is a subject where everything has to be understood three times •In terms of mathematics •In terms of philosophy •In terms of application © Stephen Senn 2020 29
  • 30. •Finally, I would like to leave you with this question •Did you know there are only 120 days to Christmas? Traditional Polish Present Piernik Alternative suggestion © Stephen Senn 2020 30 3rd edition out soon

Editor's Notes

  1. If you know why the title of this talk is extremely stupid, then you clearly know something about control, data and reasoning: in short, you have most of what it takes to be a statistician. If you have studied statistics then you will also know that a large amount of anything, and this includes successful careers, is luck. In this talk I shall try share some of my experiences of being a statistician in the hope that it will help you make the most of whatever luck life throws you, In so doing, I shall try my best to overcome the distorting influence of that easiest of sciences hindsight. Without giving too much away, I shall be recommending that you read, listen, think, calculate, understand, communicate, and do. I shall give you some example of what I think works and what I think doesn’t In all of this you should never forget the power of negativity and also the joy of being able to wake up every day and say to yourself ‘I love the small of data in the morning’. 30 minutes presentation plus 5 minutes questions
  2. This example is covered in chapter 4 of Senn, S. J. (2003). Dicing with Death. Cambridge: Cambridge University Press.
  3. See Senn, S. J. (2013). Invalid inversion. Significance, 10(2), 40-42
  4. Since we are calculating the probability of having breast cancer given that someone is female, we condition on being ‘female’. We thus strike out the column ‘male’ as being irrelevant. The probability we require is the joint frequency ‘breast cancer’ and ‘female’ divide by the relevant marginal frequency ‘female’
  5. Since we are calculating the probability of being female given that someone suffering from breast cancer, we condition on suffering from breast cancer ’. We thus strike out the column ‘not suffering from breast cancer ’ as being irrelevant. The probability we require is the joint frequency ‘breast cancer’ and ‘female’ divide by the relevant marginal frequency ‘suffering from breast cancer ’
  6. Extract of GenStat program "To simulate regression to the mean" "This version used to try and reproduce the numbers selected (285)in original version of Significance paper" "Set parameters" SCALAR NSIM,mean,betvar,withvar,cut,lower,upper;VALUE=1000,90,50,15,95,60,120 TEXT xlabel,ylabel,title; VALUES='DBP at Baseline (mmHg)','DBP at Outcome (mmHg)','Diastolic blood pressure' "Begin simulation" FOR [NTIMES=1000] GRANDOM [DISTRIBUTION=Normal; NVALUES=NSIM; SEED=0; MEAN=mean; VARIANCE=betvar] True GRANDOM [DISTRIBUTION=Normal; NVALUES=NSIM; SEED=0; MEAN=0; VARIANCE=withvar] E1 CALCULATE X=True+E1 CALCULATE HBase=X>=cut CALCULATE Check=SUM(HBase) IF Check.EQ.285 PRINT Check; DECIMALS=0 EXIT [CONTROL=for] ENDIF ENDFOR VARIATE [NVALUES=2]Xline1,Xline2,Xline3,Yline1,Yline2,Yline3 CALCULATE Xline1=cut CALCULATE Yline1$[1],Yline1$[2]=lower,upper CALCULATE Xline2$[1],Xline2$[2]=lower,upper CALCULATE Yline2=cut CALCULATE Xline3$[1],Xline3$[2]=lower, upper CALCULATE Yline3$[1],Yline3$[2]=lower, upper
  7. See Senn, S. J. (2009). Three things every medical writer should know about statistics. The Write Stuff, 18(3), 159-162
  8. These are prime numbers. The minimum block size is thus the product of them all and that is 105. By the time the last patient has completed period two (say) many of the patients will have completed the whole trial. The way to run such a trial is as an add-on trial. All patients receive the current therapy as standard and they receive either placebo or the new treatment in addition. Trials of HIV infection were often of this sort and (correctly) described as placebo controlled. Senn, S. J. (2001). The Misunderstood Placebo. Applied Clinical Trials, 10(5), 40-46
  9. See Senn, S. J. (1998). Mathematics: governess or handmaiden? Journal of the Royal Statistical Society Series D-The Statistician, 47(2), 251-259