SlideShare a Scribd company logo
1 of 34
Download to read offline
Statistical Artifacts in the 
Ratio of Discrete Quantities 
Roger G. Johnston (*) 
Shayla D. Schroder 
Rajika Mallawaaratchy 
Los Alamos National Laboratory 
(*) Since October 2007: 
Argonne National Laboratory 
http://www.ne.anl.gov/capabilities/vat 
LAUR-02-1782
Common Problems 
in Using the Ratio 
„ Choosing units poorly 
„ Keeping wrong number of digits 
„ Ignoring covariance in error analysis 
„ Undefined when denominator = 0
Common Problems 
in Using the Ratio (con’t) 
These problems are fairly well recognized 
(except by students), 
but two others aren’’t...
Other Problems with the Ratio 
(Less Widely Recognized) 
„ Artifacts in the ratio when the numerator & 
denominator are discrete 
„ Lexicon
The Statistical Artifact 
„ Weird fine-structure (sometimes not so fine) shows up in the 
histogram of the ratio of two discrete variables. 
„ This can, and often has, been misinterpreted as instrumentation 
problems, or as potentially interesting science or engineering. 
„ But it really is an artifact of ratioing discrete numbers. 
„ Yet the artifact is not a binning error!
Batting Average: an instructive example 
BA = number of hits 
number of at bats 
.000 ” BA ” 1.000 
(1001 possible batting averages)
Batting Average (con’t) 
Batting .333 is ““easy””. 
I can go: 1 for 3, 2 for 6, 3 for 9, ... 
but 
Batting .334 is difficult! 
I must go: 96 for 287, 97 for 290, 98 for 293, ...
Batting Average (con’t) 
Many players don’’t get 287 official at bats 
in an entire season, so they never 
even get a shot at batting .334! 
(Thus, .334 is nearly unobtainable.)
Just to be Specific... 
Consider the ratio R=A/B 
where A & B are: 
•• integers in the range 0-255 
•• uncorrelated 
•• given by a Gaussian probability distribution with 
mean=127.5, s=32 
and R values are: 
•• in the range 0-5 
•• digitized (quantized) over 256 values (channels or bins)
What Does the Ratio 
Histogram Look Like? 
A: integers, 0-255 
(256 bins = 8 bit resolution) 
B: integers, 0-255 
(256 bins = 8 bit resolution) 
R=ratio=A/B: 0-5 
in 256 bins 
(8-bit resolution)
The Artifact Gets Worse With 
Higher Histogram Resolution! 
A: integers, 0-255 
(256 bins = 8 bit resolution) 
B: integers, 0-255 
(256 bins = 8 bit resolution) 
R=ratio=A/B: 0-5 
in 1024 bins 
(10-bit resolution)
Thus, the Artifact is Not 
Due to Binning Errors! 
With higher resolution for the ratio, the histogram 
artifact gets worse, not better. 
Why? 
Because there are more 
nearly unobtainable ““batting averages””
Artifacts in the Ratio 
Histogram 
How bad can it get?
Artifacts in the Ratio 
Histogram (con’t) 
A: integers, 0-9 
B: integers, 0-9 
R=ratio=A/B: 0-5 
in 100 bins
Artifacts in the Ratio 
Histogram (con’t) 
So how CAN we reduce the artifact?
Artifacts in the Ratio 
Histogram (con’t) 
A: integers, 0-99 
(100 bins) 
B: integers, 0-99 
(100 bins) 
R=ratio=A/B: 0-5 in 1000 
bins
More Bins for A & B Reduces 
the Artifactual Fine Structure! 
A: integers, 0-999 
(1000 bins) 
B: integers, 0-999 
(1000 bins) 
R=ratio=A/B: 0-5 in 1000 
bins
Artifacts in the Ratio 
Histogram (con’t) 
But is it just a matter of 
oscillating high and low values 
in adjacent bins ?
Runs are Possible! 
A: integers, 0-73 
B: integers, 0-108 
R=ratio=A/B: 0-5 
in 264 bins
Getting Fooled By the 
Statistical Artifact 
But is the statistical artifact 
in the ratio REALLY a problem?
Getting Fooled By the 
Statistical Artifact (con’t) 
Yes! 
We’’re aware of 7 examples 
at Los Alamos National Laboratory 
of the artifact fooling scientists, 
engineers, or technicians.
Getting Fooled By the 
Statistical Artifact -- example 1 
Application 
data acquisition software 
Artifact Misinterpreted As 
software bug
Getting Fooled By the 
Statistical Artifact -- example 2 
Application 
analog-to-digital converter electronics 
Artifact Misinterpreted As 
electronic noise
Getting Fooled By the 
Statistical Artifact -- example 3 
Application 
image processing (ratio of one image to 
another) 
Artifact Misinterpreted As 
video noise
Getting Fooled By the 
Statistical Artifact -- example 4 
Application 
computer modeling 
Artifact Misinterpreted As 
numeric non-convergence
Getting Fooled By the 
Statistical Artifact -- example 5 
Application 
light scattering (normalizing to laser intensity) 
Artifact Misinterpreted As 
instrument problems
Getting Fooled By the 
Statistical Artifact -- example 6 
Application 
fluorescence from biological cells during flow 
cytometry 
Artifact Misinterpreted As 
a new subset population of cells
Getting Fooled By the 
Statistical Artifact -- example 7 
Application 
finding data outliers 
Artifact Misinterpreted As 
excessive number of outliers
Recommendations for Not 
Getting Fooled by the Artifact 
•• Use the highest practical resolution (lots of 
bits) for the numerator & denominator but the 
lowest practical resolution for the ratio. 
•• Add a small amount of real random noise to the 
numerator and/or denominator.
Recommendations for Not 
Getting Fooled by the Artifact 
(con’t) 
•• Smooth the ratio histogram 
•• Use analog electronics to measure the analog ratio 
of the numerator & denominator before digitizing. 
•• Model the artifact
Recommendations for Not 
Getting Fooled by the Artifact 
(con’t) 
•• If nothing else, at least be aware of the artifact so as 
not to get fooled!
Lexicon Problems 
If you believe the Dictionary (usually a bad idea), then ““ratio”” is 
only a noun. Thus, these statements are not allowed: 
““We are going to ratio 2 numbers.”” (verb) 
““The artifact shows up during ratioing.”” (gerund) 
““I promise to never get fooled again by the ratioing (or ratio) 
process.”” (adjective)
Lexicon Problems (con’t) 
But the only important test of the appropriateness of a given 
(non-obscene) word or phrase in English is: 
(1) is it unambiguous? 
and 
(2) is it concise? 
Thus, we should surely allow ““ratio”” to be used as a verb, 
gerund, and adjective (not just as a noun) as is the case with 
many words in English and most technical words!
References 
•• Roger G. Johnston, Shayla D. Schroder, and A. Rajika Mallawaaratchy, ““Statistical 
Artifacts in the Ratio of Discrete Quantities””, American Statistician 49, 285-291 
(1995). 
•• Comments by Cornel G. Ormsby and Reply by Roger G. Johnston, American 
Statistician 50, 281 (1996). 
•• Argonne National Laboratory Vulnerability Assessment Team Home Page: 
http://www.ne.anl.gov/capabilities/vat/ (since October 2007)

More Related Content

Similar to Statistical Artifacts in the Ratio of Discrete Quantities

Motion 4 introduction to measurement (shared)
Motion 4  introduction to measurement (shared)Motion 4  introduction to measurement (shared)
Motion 4 introduction to measurement (shared)
LThistlewood
 
pdf_20221218_125132_0000 (1).pdf
pdf_20221218_125132_0000 (1).pdfpdf_20221218_125132_0000 (1).pdf
pdf_20221218_125132_0000 (1).pdf
AmitSing9
 
Cs221 lecture5-fall11
Cs221 lecture5-fall11Cs221 lecture5-fall11
Cs221 lecture5-fall11
darwinrlo
 

Similar to Statistical Artifacts in the Ratio of Discrete Quantities (20)

Part Numbering and ID codes: general considerations and check digits
Part Numbering and ID codes: general considerations and check digitsPart Numbering and ID codes: general considerations and check digits
Part Numbering and ID codes: general considerations and check digits
 
2dig circ
2dig circ2dig circ
2dig circ
 
An introduction to R
An introduction to RAn introduction to R
An introduction to R
 
Motion 4 introduction to measurement (shared)
Motion 4  introduction to measurement (shared)Motion 4  introduction to measurement (shared)
Motion 4 introduction to measurement (shared)
 
Rthch2nts 160709112024
Rthch2nts 160709112024Rthch2nts 160709112024
Rthch2nts 160709112024
 
Simplification of Circuits.pdf
Simplification of Circuits.pdfSimplification of Circuits.pdf
Simplification of Circuits.pdf
 
Tolerance analysis for pm
Tolerance analysis for pmTolerance analysis for pm
Tolerance analysis for pm
 
Alpine ML Talk: Vtreat: A Package for Automating Variable Treatment in R By ...
Alpine ML Talk:  Vtreat: A Package for Automating Variable Treatment in R By ...Alpine ML Talk:  Vtreat: A Package for Automating Variable Treatment in R By ...
Alpine ML Talk: Vtreat: A Package for Automating Variable Treatment in R By ...
 
2dig circ
2dig circ2dig circ
2dig circ
 
Beyond PFCount: Shrif Nada
Beyond PFCount: Shrif NadaBeyond PFCount: Shrif Nada
Beyond PFCount: Shrif Nada
 
Statistics Assignment Help
Statistics Assignment HelpStatistics Assignment Help
Statistics Assignment Help
 
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
Data Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessingData Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessing
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
 
pdf_20221218_125132_0000 (1).pdf
pdf_20221218_125132_0000 (1).pdfpdf_20221218_125132_0000 (1).pdf
pdf_20221218_125132_0000 (1).pdf
 
Cs221 lecture5-fall11
Cs221 lecture5-fall11Cs221 lecture5-fall11
Cs221 lecture5-fall11
 
ilp-nlp-slides.pdf
ilp-nlp-slides.pdfilp-nlp-slides.pdf
ilp-nlp-slides.pdf
 
FinalReport
FinalReportFinalReport
FinalReport
 
2013 - Andrei Zmievski: Machine learning para datos
2013 - Andrei Zmievski: Machine learning para datos2013 - Andrei Zmievski: Machine learning para datos
2013 - Andrei Zmievski: Machine learning para datos
 
Error detection and correction
Error detection and correctionError detection and correction
Error detection and correction
 
Digital 1
Digital 1Digital 1
Digital 1
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 

More from Roger Johnston

More from Roger Johnston (20)

In Risu Veritas: Humor & Security
In Risu Veritas: Humor & SecurityIn Risu Veritas: Humor & Security
In Risu Veritas: Humor & Security
 
Journal of Physical Security 15(1)
Journal of Physical Security 15(1)Journal of Physical Security 15(1)
Journal of Physical Security 15(1)
 
Security Audits.pdf
Security Audits.pdfSecurity Audits.pdf
Security Audits.pdf
 
Camera Obscura and Security/Privacy
Camera Obscura and Security/PrivacyCamera Obscura and Security/Privacy
Camera Obscura and Security/Privacy
 
Vulnerability Assessment: The Missing Manual for the Missing Link
Vulnerability Assessment:  The Missing Manual for the Missing Link  Vulnerability Assessment:  The Missing Manual for the Missing Link
Vulnerability Assessment: The Missing Manual for the Missing Link
 
Journal of Physical Security 14(1)
Journal of Physical Security 14(1)Journal of Physical Security 14(1)
Journal of Physical Security 14(1)
 
Want seals with that?
Want seals with that?Want seals with that?
Want seals with that?
 
Journal of Physical Security 13(1)
Journal of Physical Security 13(1)Journal of Physical Security 13(1)
Journal of Physical Security 13(1)
 
Election Security 2020
Election Security 2020Election Security 2020
Election Security 2020
 
Security Assurance
Security AssuranceSecurity Assurance
Security Assurance
 
A New Approach to Vulnerability Assessment
A New Approach to Vulnerability AssessmentA New Approach to Vulnerability Assessment
A New Approach to Vulnerability Assessment
 
Understanding Vulnerability Assessments
Understanding Vulnerability AssessmentsUnderstanding Vulnerability Assessments
Understanding Vulnerability Assessments
 
Devil's Dictionary of Security Terms
Devil's Dictionary of Security Terms Devil's Dictionary of Security Terms
Devil's Dictionary of Security Terms
 
Vulnerability Assessments
Vulnerability Assessments  Vulnerability Assessments
Vulnerability Assessments
 
Design Reviews Versus Vulnerability Assessments for Physical Security
Design Reviews Versus Vulnerability Assessments for Physical SecurityDesign Reviews Versus Vulnerability Assessments for Physical Security
Design Reviews Versus Vulnerability Assessments for Physical Security
 
Journal of Physical Security 12(3)
Journal of Physical Security 12(3)Journal of Physical Security 12(3)
Journal of Physical Security 12(3)
 
Journal of Physical Security 12(2)
Journal of Physical Security 12(2)Journal of Physical Security 12(2)
Journal of Physical Security 12(2)
 
Unconventional Security Devices
Unconventional Security DevicesUnconventional Security Devices
Unconventional Security Devices
 
Making the Business Case for Security Investment
Making the Business Case for Security InvestmentMaking the Business Case for Security Investment
Making the Business Case for Security Investment
 
Journal of Physical Security 11(1)
Journal of Physical Security 11(1)Journal of Physical Security 11(1)
Journal of Physical Security 11(1)
 

Recently uploaded

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 

Recently uploaded (20)

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 

Statistical Artifacts in the Ratio of Discrete Quantities

  • 1. Statistical Artifacts in the Ratio of Discrete Quantities Roger G. Johnston (*) Shayla D. Schroder Rajika Mallawaaratchy Los Alamos National Laboratory (*) Since October 2007: Argonne National Laboratory http://www.ne.anl.gov/capabilities/vat LAUR-02-1782
  • 2. Common Problems in Using the Ratio „ Choosing units poorly „ Keeping wrong number of digits „ Ignoring covariance in error analysis „ Undefined when denominator = 0
  • 3. Common Problems in Using the Ratio (con’t) These problems are fairly well recognized (except by students), but two others aren’’t...
  • 4. Other Problems with the Ratio (Less Widely Recognized) „ Artifacts in the ratio when the numerator & denominator are discrete „ Lexicon
  • 5. The Statistical Artifact „ Weird fine-structure (sometimes not so fine) shows up in the histogram of the ratio of two discrete variables. „ This can, and often has, been misinterpreted as instrumentation problems, or as potentially interesting science or engineering. „ But it really is an artifact of ratioing discrete numbers. „ Yet the artifact is not a binning error!
  • 6. Batting Average: an instructive example BA = number of hits number of at bats .000 ” BA ” 1.000 (1001 possible batting averages)
  • 7. Batting Average (con’t) Batting .333 is ““easy””. I can go: 1 for 3, 2 for 6, 3 for 9, ... but Batting .334 is difficult! I must go: 96 for 287, 97 for 290, 98 for 293, ...
  • 8. Batting Average (con’t) Many players don’’t get 287 official at bats in an entire season, so they never even get a shot at batting .334! (Thus, .334 is nearly unobtainable.)
  • 9. Just to be Specific... Consider the ratio R=A/B where A & B are: •• integers in the range 0-255 •• uncorrelated •• given by a Gaussian probability distribution with mean=127.5, s=32 and R values are: •• in the range 0-5 •• digitized (quantized) over 256 values (channels or bins)
  • 10. What Does the Ratio Histogram Look Like? A: integers, 0-255 (256 bins = 8 bit resolution) B: integers, 0-255 (256 bins = 8 bit resolution) R=ratio=A/B: 0-5 in 256 bins (8-bit resolution)
  • 11. The Artifact Gets Worse With Higher Histogram Resolution! A: integers, 0-255 (256 bins = 8 bit resolution) B: integers, 0-255 (256 bins = 8 bit resolution) R=ratio=A/B: 0-5 in 1024 bins (10-bit resolution)
  • 12. Thus, the Artifact is Not Due to Binning Errors! With higher resolution for the ratio, the histogram artifact gets worse, not better. Why? Because there are more nearly unobtainable ““batting averages””
  • 13. Artifacts in the Ratio Histogram How bad can it get?
  • 14. Artifacts in the Ratio Histogram (con’t) A: integers, 0-9 B: integers, 0-9 R=ratio=A/B: 0-5 in 100 bins
  • 15. Artifacts in the Ratio Histogram (con’t) So how CAN we reduce the artifact?
  • 16. Artifacts in the Ratio Histogram (con’t) A: integers, 0-99 (100 bins) B: integers, 0-99 (100 bins) R=ratio=A/B: 0-5 in 1000 bins
  • 17. More Bins for A & B Reduces the Artifactual Fine Structure! A: integers, 0-999 (1000 bins) B: integers, 0-999 (1000 bins) R=ratio=A/B: 0-5 in 1000 bins
  • 18. Artifacts in the Ratio Histogram (con’t) But is it just a matter of oscillating high and low values in adjacent bins ?
  • 19. Runs are Possible! A: integers, 0-73 B: integers, 0-108 R=ratio=A/B: 0-5 in 264 bins
  • 20. Getting Fooled By the Statistical Artifact But is the statistical artifact in the ratio REALLY a problem?
  • 21. Getting Fooled By the Statistical Artifact (con’t) Yes! We’’re aware of 7 examples at Los Alamos National Laboratory of the artifact fooling scientists, engineers, or technicians.
  • 22. Getting Fooled By the Statistical Artifact -- example 1 Application data acquisition software Artifact Misinterpreted As software bug
  • 23. Getting Fooled By the Statistical Artifact -- example 2 Application analog-to-digital converter electronics Artifact Misinterpreted As electronic noise
  • 24. Getting Fooled By the Statistical Artifact -- example 3 Application image processing (ratio of one image to another) Artifact Misinterpreted As video noise
  • 25. Getting Fooled By the Statistical Artifact -- example 4 Application computer modeling Artifact Misinterpreted As numeric non-convergence
  • 26. Getting Fooled By the Statistical Artifact -- example 5 Application light scattering (normalizing to laser intensity) Artifact Misinterpreted As instrument problems
  • 27. Getting Fooled By the Statistical Artifact -- example 6 Application fluorescence from biological cells during flow cytometry Artifact Misinterpreted As a new subset population of cells
  • 28. Getting Fooled By the Statistical Artifact -- example 7 Application finding data outliers Artifact Misinterpreted As excessive number of outliers
  • 29. Recommendations for Not Getting Fooled by the Artifact •• Use the highest practical resolution (lots of bits) for the numerator & denominator but the lowest practical resolution for the ratio. •• Add a small amount of real random noise to the numerator and/or denominator.
  • 30. Recommendations for Not Getting Fooled by the Artifact (con’t) •• Smooth the ratio histogram •• Use analog electronics to measure the analog ratio of the numerator & denominator before digitizing. •• Model the artifact
  • 31. Recommendations for Not Getting Fooled by the Artifact (con’t) •• If nothing else, at least be aware of the artifact so as not to get fooled!
  • 32. Lexicon Problems If you believe the Dictionary (usually a bad idea), then ““ratio”” is only a noun. Thus, these statements are not allowed: ““We are going to ratio 2 numbers.”” (verb) ““The artifact shows up during ratioing.”” (gerund) ““I promise to never get fooled again by the ratioing (or ratio) process.”” (adjective)
  • 33. Lexicon Problems (con’t) But the only important test of the appropriateness of a given (non-obscene) word or phrase in English is: (1) is it unambiguous? and (2) is it concise? Thus, we should surely allow ““ratio”” to be used as a verb, gerund, and adjective (not just as a noun) as is the case with many words in English and most technical words!
  • 34. References •• Roger G. Johnston, Shayla D. Schroder, and A. Rajika Mallawaaratchy, ““Statistical Artifacts in the Ratio of Discrete Quantities””, American Statistician 49, 285-291 (1995). •• Comments by Cornel G. Ormsby and Reply by Roger G. Johnston, American Statistician 50, 281 (1996). •• Argonne National Laboratory Vulnerability Assessment Team Home Page: http://www.ne.anl.gov/capabilities/vat/ (since October 2007)