SlideShare a Scribd company logo
Causes of False Alarms in
  Inferential Event Detection
Systems for Distribution System
   Water Quality Monitoring

   Ed Roehl, John Cook, Ruby Daamen, and Uwe Mundry
        Advanced Data Mining International, LLC
                Greenville, South Carolina
Early Results from



   WRF PROJECT 4182

Interpreting real-time online
  monitoring data for water
   quality event detection
Acknowledgements

ADMi gratefully acknowledges the Water
 Research Foundation as the joint owners of
 certain technical information upon which this
 presentation is based. ADMi thanks the
 Foundation for their financial, technical, and
 administrative assistance in funding the
 project through which much of this
 information was discovered.
Thanks to Our Utility Partners


• City of Columbus, Ohio, Division of Power
  and Water
• Greenville Water System
• Newport News Waterworks
• Oklahoma City Water Department
• Startex Jackson Wellford Duncan Water
  District (SJWD)
The Concept
Inferential Event Detection System (IEDS)

• Focus on distribution system security
• Real-time monitoring of “conventional” WQ
  parameters - CL2, PH, COND/SC, TURB,
  TOC, TEMP
• Infers an “event” by detecting anomalous
  patterns of WQ behavior
  – Does not measure concentrations of specific
    compounds like liquid chromatography
• Systems have been available for a number
  years
Colorado State pilot loop from Project 3086
    Hach “panel” SC, pH, Cl2, turbidity   TOC
data acquisition                        analyzer



                                              ventilation


                                                   toxin
 flow                                              injection
pump                                                 pump

    flow direction flow loop     injection point
CSU pilot loop results




  chlorine residual    chlorine residual response
response to Aldicarb          to Na Cyanide
conductivity                        CSU pilot loop results, cont.




                                          pH
               conductivity response to        pH response to Na
                    Na Arsenate                     Cyanide
HOW? - event detection

                                           event
SCADA                                                     historical database of
            1                                             “normal” behaviors”
                                                               5


feature vector                                           6
    CL2       TURB                                     distances to
       dCL2 pH   SC
                                                       neighbors
2       dt          TOC


                                                                                   CL2
                                                   4
                                                   vector
                                                    track
          3D projection of n-dimensional
                  feature space       3
Reality
Measurement errors, tank cycling, etc.
                                      1 week




                    data
                    gap




               10-minute time steps


• Fast, full scale change
WRF Project 4182
• Reports of unacceptable numbers of false
  positives unless sensitivity reduced
  – defeats purpose
• Thesis - a more effective IEDS can be
  developed by incorporating the effects of
  operational parameters on water quality
  variability
  – reduce false positives
  – local ops params – Q, Ps, LVL
• Utility partners provided multi-year data from
  40+ monitoring sites
Definitions


• A contaminated slug flowing past a
  sensor array might only be detectable for
  a few minutes or less.
  – Here, target detection window  20 minutes
     • event is manifest and detectable


• “Normal” data – all the data here
  represents normal operations and
  normal data collection issues.
Causes of False
   Positives
Question 1
• Q1: If event detection relies on pattern matching vectors,
  how similar are “normal” new vectors to normal old
  vectors?
   – “old” = historical database
   – “new” = vectors streaming from process
• Expected A: If both are normal, they should be pretty
  similar.
                                                         CARTOON




                historical normal             new normal data
                       data
Experiment 1 – determine if old and new vectors
      cohabitate the same sub-spaces of feature space
CL2 (mg/l)
                                          1.    Divide 4 years
                   historical   new             of 10-min data
                                                into ~70% old
                                                and ~30% new




                                          2.    Define vector
              scalar CL2
                                                features for
                                                each WQ
                                                parameter
                              coarse           a.   Scalars -
                                                    divided into 5
                           segmentation             20% sub-ranges
cont 1: Experiment 1 - cohabitating hist. and new

                                     2. cont. - Create
              coarse
                                        features
           segmentation
                                        b. D1 = 1-time-
                                           step difference;
                                           sub-divide into
                                           6 sub-ranges
                                        c. D2 = D1 time-
                                           delayed 1 time
                                           step
       D1 & D2 CL2


   Process Dynamics – scalar+D1+D2 describe
   parameter’s current position+velocity+acceleration.
cont 2: Experiment 1 – cohabitating old and new
3.   Count cohabitating old and new in sub-spaces
     (hypercuboids) formed by 5 scalar, 6 D1, and 6 D2 sub-
     ranges
Combinatorial Explosion – even with coarse segmentation
  - 3 scalars = 5 x 5 x 5 = 125 cuboids
  - + D1 = 125 x 6 x 6 x 6 = 27k hypercuboids
  - + D2 = 27k x 6 x 6 x 6 = 5.8 million
                     tank site
cont 3: Experiment 1 – cohabitating hist. and new
booster pump station away from tanks




                                       next experiment
Back to Question 1
• Q1: How similar are “normal old” and “normal new”
  vectors?
• Expected A: If both are “normal”, they should be
  pretty similar.
• Real A: Not very - numerous false alarms may be
  unavoidable without desensitizing IEDS
Question 2


• Q2: What would happen if we periodically transfer
  “new” vectors to the historical database?
  – Experiment 1 – static old & new


• Expected A: False alarms should decrease.
Experiment 2 – simulate updating hist. database




                                 new vectors                           47,624
•   Simulations used the site away from tanks
    –    features = CL2, SC, COND, TURB scalars+D1s+D2s
•   dnn = distance of new vector to “nearest neighbor” old vector
    –    In IEDS dnn > specified limit triggers alarm
•   Findings
    1.   Transfer cases are high percentage of no-transfer case
    2.   Little difference between transfer cases
    3.   Indicates that successive “normal” vectors can be far apart
cont: Experiment 2


   ~1 false/wk

                                   ~1 false/10wk




• Weekly transfers
  – 1 false/wk: dnn = 42 x avg(dnn)
  – 1 false/10wks: dnn = 59 x avg(dnn)
Back to Question 2

• Q2: What would happen if we periodically
  transfer “new” vectors to the historical
  database?

• Expected A: False alarms should decrease.

• Real A: False alarms might not fall to
  acceptable levels.
Question 3


• Q3: Why are successive vectors so
  far apart?

• A: To come.
Experiment 3 – correlation matrices
• Cross-correlation matrix – correlates changes among multiple
  ops & WQ parameters
   – change = Dx = current value – value x time steps ago

• Utility B stand alone
  site
   • 86-sec time step                           1 time-step
   • Mix of WQ and
     operational
     parameters

                3 time-step
                 (4.3 min)
                  change


                7 time-step
                 (10 min)
                  change
Experiment 4 – autocorrelation of Dx
• Autocorrelation function correlates a signal to itself to
  determine how deterministic / random it is.
   – determinism = current behavior depends somewhat on past
   – randomness = current behavior unrelated to past


          1 time-step
                                              14 time-step
           (86 sec)
                                                (20 min)
            change
                                                 change
Back to Question 3
• Q3: Why are successive vectors so far
  apart?

• A: WQ change on time scales  20 minutes
  can be “apparently random”.
  – Exp. 3 (x-matrices) - WQ & ops parameter
    changes are poorly correlated
  – Exp. 4 (autocorr.) - individual WQ parameter
    change is non-deterministic
  – Same findings at multiple sites & utilities

• non-determinism = randomness = noise
Causes of WQ variability

                                     Stand
                                     Alone
                                      Site




• Unmeasured disturbances
  – pressure & flow transients
• Measurement errors
Current
Research
Alternative to stand-alone site
                                  monitoring




                                             Event
                                            Detector




• Upstream / downstream sites
• Upstream site provides
  – boundary conditions for downstream WQ
  – more operational parameters
Multi-Site Concept
• Event detection performed on filtered signals
  – model-based filtering of downstream WQ signals
  – modeling = accounting of causes of variability
  – filtered signals less variable

• Modeling technique
  – multivariate, nonlinear curve fitting by (multi-layer
    perceptron) artificial neural networks (ANN)
     • “machine learning” from AI
  – inputs - upstream and “local” WQ and ops
     • spectrally decomposed into components
     • autoregressive “local” WQ inputs time delayed to be outside
       detection window (e.g., 20 minutes)
     • co-linear inputs decorrelated
  – ANN “learns” best predictor components
upstream
               flow
                                        downstream         2-Site Example
COND (mS/cm)                                                               TEMP (deg. F)
                              COND



                                      TEMP




                                                   test data
                                                                                       PH

                                CL2

CL2 (mg/l)



                                  PH
                                             1-hour time steps (220 days, August to March)


• Raw WQ variability is similar but not identical
     – differences caused by unmeasured disturbances
• 1-hour time step too big for 20-minute detection window
     – exploratory research on multi-site
Results – detail
Downstream COND D1 (mS/cm)

                   D1 = 1 time-step difference
                                                    of test data




                             measured data
Downstream CL2 D1 (mg/l)     upstream only
                             upstream+auto
Downstream PH D1




                                                     1-hour time steps
More Complicated 4-Site Example
    Q? = unmeasured
                                          LVL,
      disturbances                       COND,   TANK
                                          CL2
                       Q, PSUC, PDIS,
                      COND, CL2, TEMP
                                                   A     Q, PSUC, PDIS,
                                                        COND, CL2, TEMP

                         BPS                               BPS
                          B                                 A

              LVL,                  Q? = unmeasured
    TANK     COND,                    disturbances
              CL2
      B
• BPS B is “target” site
• Utility operates multiple WTPs with different
  sources
• 1 year of data (1-min reduced to 4-min)
  – first 10 month = training
  – last 2 months = test
BPS B COND Process Model – training data
                                                                       Training Data
                                measured predicted residuals         N: 76,148
                                                                     R2: 0.847
                                                                     RMSE: 72 mS/cm




                                                                                       Residual Error (mS/cm)
BPS B COND (mS/cm)




                               4-minute training data observations


                      • Looks Good!
BPS B COND Process Model – test data
                          Test Data                              hump
                     N: 17,296
                     R2: 0.893
                     RMSE: 69 mS/cm




                                                                           Residual Error (mS/cm)
BPS B COND (mS/cm)




                        measured predicted residuals

                                         4-minute test data observations

                     • Hump may be from different WTP/source
                     • Looks Good!
BPS B COND Process Model – test data


                                                                          Detail
BPS B COND (mS/cm)




                        measured predicted


                                        4-minute test data observations

                     • Looks Bad!
                     • Process model misses some periods - maybe
                       from unmonitored flows through junctions
BPS B COND Dx autocorrelations
COND




                           • Dx = D
R           R2               over x
                             number of
                             minutes

CL2                        • R2s are
                             low

R           R2
BPS B CL2 Process Model – test data


                                                     trough




                                                                                             Residual Error (mg/l)
BPS B CL2 (mg/l)




                                                                               Test Data
                   measured predicted residuals                           N: 11,715
                                                                          R2: 0.912
                                                                          RMSE: 0.085 mg/l




                                        4-minute test data observations

                    • trough may be from different WTP/source
                    • Looks Good!
BPS B CL2 Process Model – test data
                      Detail
BPS B CL2 (mg/l)




                           measured predicted




                                   4-minute test data observations

                   • Looks Bad!
                   • Process model missing some periods - maybe
                     from unmonitored flows through junctions
Conclusions
IEDS - Conclusions
• Practical problems
  – data reliability
  – no guarantees that contamination event would “look”
    different than “normal” because
     • “normal” is so highly variable
     • WQ sensors being used might not provide the “information”
       necessary to discriminate
  – Where to put / how many?


• Stand-Alone Sites
  – face widely ranging random variability from unknown
    disturbances, a.k.a. normal operations
  – high alarm limits needed to reduce false positives -
    defeats purpose
cont - Conclusions


• Multi-Site approach
  – Can account for/explain 80-90% of downstream
    WQ variability
  – unproven on  20 min detection window
  – diminished when too much complexity
  – field testing to be done at GWS and SJWD
cont - Conclusions
             • Other reasons to monitor distribution system WQ
                – control processes at WTP to improve WQ at points of delivery
                – detect common problems - low total chlorine, nitrification, line
                  integrity, DBPs, biofilm sloughing, incipient complaint detection
CL2 (mg/l)




                                                                     BPS A TANK A
                                                                     BPS B TANK B



                                  1-minute time steps 1/1/05 – 11/16/09
Compare WTP with DS turbidity




• Little variability in WTP turbidity, < 0.1 NTU !
Correlate DS turbidity with WTP WQ




• ANN process model
• Inputs = finished water
  alkalinity, hardness, color, and
  source blend ratio
• R2 = 0.71

More Related Content

Similar to Wqtc2011 causes offalsealarms-20111115-final

Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910
John B. Cook, PE, CEO
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
John B. Cook, PE, CEO
 
Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18
Aritra Sarkar
 
QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28
Aritra Sarkar
 
Niko Neufeld "A 32 Tbit/s Data Acquisition System"
Niko Neufeld "A 32 Tbit/s Data Acquisition System"Niko Neufeld "A 32 Tbit/s Data Acquisition System"
Niko Neufeld "A 32 Tbit/s Data Acquisition System"
Yandex
 
Inside LoLA - Experiences from building a state space tool for place transiti...
Inside LoLA - Experiences from building a state space tool for place transiti...Inside LoLA - Experiences from building a state space tool for place transiti...
Inside LoLA - Experiences from building a state space tool for place transiti...
Universität Rostock
 
Overview of Photonics Research at Calit2: Scaling from Nanometers to the Earth
Overview of Photonics Research at Calit2: Scaling from Nanometers to the EarthOverview of Photonics Research at Calit2: Scaling from Nanometers to the Earth
Overview of Photonics Research at Calit2: Scaling from Nanometers to the Earth
Larry Smarr
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
John B. Cook, PE, CEO
 
409282776-5G-RAN2-0-KPI-Introduction.pptx
409282776-5G-RAN2-0-KPI-Introduction.pptx409282776-5G-RAN2-0-KPI-Introduction.pptx
409282776-5G-RAN2-0-KPI-Introduction.pptx
QasimQadir3
 
data-stream-processing-SEEP.pptx
data-stream-processing-SEEP.pptxdata-stream-processing-SEEP.pptx
data-stream-processing-SEEP.pptx
AhmadTawfigAlRadaide
 
Scalable membership management
Scalable membership management Scalable membership management
Scalable membership management Vinay Setty
 
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
Deltares
 
Reinhard (1).ppt
Reinhard (1).pptReinhard (1).ppt
Reinhard (1).ppt
akhilgupta391173
 
Qualitative model of transport
Qualitative model of transportQualitative model of transport
Qualitative model of transport
RokhitTharshini
 
Multidimensional wave digital filtering network
Multidimensional wave digital filtering networkMultidimensional wave digital filtering network
Multidimensional wave digital filtering network
jason Tseng
 
Implementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitationsImplementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitations
Luis Úbeda Medina
 
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
Junho Suh
 
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
Steve Cuddy
 
A Deep Learning use case for water end use detection by Roberto Díaz and José...
A Deep Learning use case for water end use detection by Roberto Díaz and José...A Deep Learning use case for water end use detection by Roberto Díaz and José...
A Deep Learning use case for water end use detection by Roberto Díaz and José...
Big Data Spain
 

Similar to Wqtc2011 causes offalsealarms-20111115-final (20)

Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
 
Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18
 
QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28
 
Niko Neufeld "A 32 Tbit/s Data Acquisition System"
Niko Neufeld "A 32 Tbit/s Data Acquisition System"Niko Neufeld "A 32 Tbit/s Data Acquisition System"
Niko Neufeld "A 32 Tbit/s Data Acquisition System"
 
Inside LoLA - Experiences from building a state space tool for place transiti...
Inside LoLA - Experiences from building a state space tool for place transiti...Inside LoLA - Experiences from building a state space tool for place transiti...
Inside LoLA - Experiences from building a state space tool for place transiti...
 
Overview of Photonics Research at Calit2: Scaling from Nanometers to the Earth
Overview of Photonics Research at Calit2: Scaling from Nanometers to the EarthOverview of Photonics Research at Calit2: Scaling from Nanometers to the Earth
Overview of Photonics Research at Calit2: Scaling from Nanometers to the Earth
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
 
409282776-5G-RAN2-0-KPI-Introduction.pptx
409282776-5G-RAN2-0-KPI-Introduction.pptx409282776-5G-RAN2-0-KPI-Introduction.pptx
409282776-5G-RAN2-0-KPI-Introduction.pptx
 
data-stream-processing-SEEP.pptx
data-stream-processing-SEEP.pptxdata-stream-processing-SEEP.pptx
data-stream-processing-SEEP.pptx
 
Scalable membership management
Scalable membership management Scalable membership management
Scalable membership management
 
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
 
Reinhard (1).ppt
Reinhard (1).pptReinhard (1).ppt
Reinhard (1).ppt
 
Qualitative model of transport
Qualitative model of transportQualitative model of transport
Qualitative model of transport
 
Multidimensional wave digital filtering network
Multidimensional wave digital filtering networkMultidimensional wave digital filtering network
Multidimensional wave digital filtering network
 
Implementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitationsImplementation of the fully adaptive radar framework: Practical limitations
Implementation of the fully adaptive radar framework: Practical limitations
 
Gray 110916 ns-fwkshp
Gray 110916 ns-fwkshpGray 110916 ns-fwkshp
Gray 110916 ns-fwkshp
 
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
Opensample: A Low-latency, Sampling-based Measurement Platform for Software D...
 
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
The Remarkable Benefits and Grave Dangers of using Artificial Intelligence in...
 
A Deep Learning use case for water end use detection by Roberto Díaz and José...
A Deep Learning use case for water end use detection by Roberto Díaz and José...A Deep Learning use case for water end use detection by Roberto Díaz and José...
A Deep Learning use case for water end use detection by Roberto Díaz and José...
 

More from John B. Cook, PE, CEO

Orange Co. Water District's Solution to Water Crisis
Orange Co. Water District's Solution to Water CrisisOrange Co. Water District's Solution to Water Crisis
Orange Co. Water District's Solution to Water Crisis
John B. Cook, PE, CEO
 
Scwrc2014 savannah basinresourceoptimization-20141021
Scwrc2014 savannah basinresourceoptimization-20141021Scwrc2014 savannah basinresourceoptimization-20141021
Scwrc2014 savannah basinresourceoptimization-20141021
John B. Cook, PE, CEO
 
Asset management-cda
Asset management-cdaAsset management-cda
Asset management-cda
John B. Cook, PE, CEO
 
Adm graphics-2003
Adm graphics-2003Adm graphics-2003
Adm graphics-2003
John B. Cook, PE, CEO
 
Neiwpcc nps 2010
Neiwpcc nps 2010Neiwpcc nps 2010
Neiwpcc nps 2010
John B. Cook, PE, CEO
 
Hic06 spatial interpolation
Hic06 spatial interpolationHic06 spatial interpolation
Hic06 spatial interpolation
John B. Cook, PE, CEO
 
Daamen r 2010scwr-cpaper
Daamen r 2010scwr-cpaperDaamen r 2010scwr-cpaper
Daamen r 2010scwr-cpaper
John B. Cook, PE, CEO
 
Wqtc2013 dist syswq-modeling-20131107
Wqtc2013 dist syswq-modeling-20131107Wqtc2013 dist syswq-modeling-20131107
Wqtc2013 dist syswq-modeling-20131107
John B. Cook, PE, CEO
 
Caw toronto presentation-20121031
Caw toronto presentation-20121031Caw toronto presentation-20121031
Caw toronto presentation-20121031
John B. Cook, PE, CEO
 
Ewri2009 big data_jbc
Ewri2009 big data_jbcEwri2009 big data_jbc
Ewri2009 big data_jbc
John B. Cook, PE, CEO
 
Ad mi floridan-aquiferwls-for-pps
Ad mi floridan-aquiferwls-for-ppsAd mi floridan-aquiferwls-for-pps
Ad mi floridan-aquiferwls-for-pps
John B. Cook, PE, CEO
 
Wrf4285 climate changepresentation-20121008
Wrf4285 climate changepresentation-20121008Wrf4285 climate changepresentation-20121008
Wrf4285 climate changepresentation-20121008
John B. Cook, PE, CEO
 

More from John B. Cook, PE, CEO (16)

Orange Co. Water District's Solution to Water Crisis
Orange Co. Water District's Solution to Water CrisisOrange Co. Water District's Solution to Water Crisis
Orange Co. Water District's Solution to Water Crisis
 
Scwrc2014 savannah basinresourceoptimization-20141021
Scwrc2014 savannah basinresourceoptimization-20141021Scwrc2014 savannah basinresourceoptimization-20141021
Scwrc2014 savannah basinresourceoptimization-20141021
 
Asset management-cda
Asset management-cdaAsset management-cda
Asset management-cda
 
Adm graphics-2003
Adm graphics-2003Adm graphics-2003
Adm graphics-2003
 
Neiwpcc nps 2010
Neiwpcc nps 2010Neiwpcc nps 2010
Neiwpcc nps 2010
 
Hic06 spatial interpolation
Hic06 spatial interpolationHic06 spatial interpolation
Hic06 spatial interpolation
 
Integrated river basin management
Integrated river basin managementIntegrated river basin management
Integrated river basin management
 
Daamen r 2010scwr-cpaper
Daamen r 2010scwr-cpaperDaamen r 2010scwr-cpaper
Daamen r 2010scwr-cpaper
 
Wqtc2013 dist syswq-modeling-20131107
Wqtc2013 dist syswq-modeling-20131107Wqtc2013 dist syswq-modeling-20131107
Wqtc2013 dist syswq-modeling-20131107
 
Caw toronto presentation-20121031
Caw toronto presentation-20121031Caw toronto presentation-20121031
Caw toronto presentation-20121031
 
Ewri2009 big data_jbc
Ewri2009 big data_jbcEwri2009 big data_jbc
Ewri2009 big data_jbc
 
Ad mi floridan-aquiferwls-for-pps
Ad mi floridan-aquiferwls-for-ppsAd mi floridan-aquiferwls-for-pps
Ad mi floridan-aquiferwls-for-pps
 
Wrf4285 climate changepresentation-20121008
Wrf4285 climate changepresentation-20121008Wrf4285 climate changepresentation-20121008
Wrf4285 climate changepresentation-20121008
 
Neiwpcc2010.ppt
Neiwpcc2010.pptNeiwpcc2010.ppt
Neiwpcc2010.ppt
 
Hic06 spatial interpolation
Hic06 spatial interpolationHic06 spatial interpolation
Hic06 spatial interpolation
 
Ad mi floridan-aquiferwls-for-pps
Ad mi floridan-aquiferwls-for-ppsAd mi floridan-aquiferwls-for-pps
Ad mi floridan-aquiferwls-for-pps
 

Recently uploaded

The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 

Recently uploaded (20)

The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 

Wqtc2011 causes offalsealarms-20111115-final

  • 1. Causes of False Alarms in Inferential Event Detection Systems for Distribution System Water Quality Monitoring Ed Roehl, John Cook, Ruby Daamen, and Uwe Mundry Advanced Data Mining International, LLC Greenville, South Carolina
  • 2. Early Results from WRF PROJECT 4182 Interpreting real-time online monitoring data for water quality event detection
  • 3. Acknowledgements ADMi gratefully acknowledges the Water Research Foundation as the joint owners of certain technical information upon which this presentation is based. ADMi thanks the Foundation for their financial, technical, and administrative assistance in funding the project through which much of this information was discovered.
  • 4. Thanks to Our Utility Partners • City of Columbus, Ohio, Division of Power and Water • Greenville Water System • Newport News Waterworks • Oklahoma City Water Department • Startex Jackson Wellford Duncan Water District (SJWD)
  • 6. Inferential Event Detection System (IEDS) • Focus on distribution system security • Real-time monitoring of “conventional” WQ parameters - CL2, PH, COND/SC, TURB, TOC, TEMP • Infers an “event” by detecting anomalous patterns of WQ behavior – Does not measure concentrations of specific compounds like liquid chromatography • Systems have been available for a number years
  • 7. Colorado State pilot loop from Project 3086 Hach “panel” SC, pH, Cl2, turbidity TOC data acquisition analyzer ventilation toxin flow injection pump pump flow direction flow loop injection point
  • 8. CSU pilot loop results chlorine residual chlorine residual response response to Aldicarb to Na Cyanide
  • 9. conductivity CSU pilot loop results, cont. pH conductivity response to pH response to Na Na Arsenate Cyanide
  • 10. HOW? - event detection event SCADA historical database of 1 “normal” behaviors” 5 feature vector 6 CL2 TURB distances to dCL2 pH SC neighbors 2 dt TOC CL2 4 vector track 3D projection of n-dimensional feature space 3
  • 12. Measurement errors, tank cycling, etc. 1 week data gap 10-minute time steps • Fast, full scale change
  • 13. WRF Project 4182 • Reports of unacceptable numbers of false positives unless sensitivity reduced – defeats purpose • Thesis - a more effective IEDS can be developed by incorporating the effects of operational parameters on water quality variability – reduce false positives – local ops params – Q, Ps, LVL • Utility partners provided multi-year data from 40+ monitoring sites
  • 14. Definitions • A contaminated slug flowing past a sensor array might only be detectable for a few minutes or less. – Here, target detection window  20 minutes • event is manifest and detectable • “Normal” data – all the data here represents normal operations and normal data collection issues.
  • 15. Causes of False Positives
  • 16. Question 1 • Q1: If event detection relies on pattern matching vectors, how similar are “normal” new vectors to normal old vectors? – “old” = historical database – “new” = vectors streaming from process • Expected A: If both are normal, they should be pretty similar. CARTOON historical normal new normal data data
  • 17. Experiment 1 – determine if old and new vectors cohabitate the same sub-spaces of feature space CL2 (mg/l) 1. Divide 4 years historical new of 10-min data into ~70% old and ~30% new 2. Define vector scalar CL2 features for each WQ parameter coarse a. Scalars - divided into 5 segmentation 20% sub-ranges
  • 18. cont 1: Experiment 1 - cohabitating hist. and new 2. cont. - Create coarse features segmentation b. D1 = 1-time- step difference; sub-divide into 6 sub-ranges c. D2 = D1 time- delayed 1 time step D1 & D2 CL2 Process Dynamics – scalar+D1+D2 describe parameter’s current position+velocity+acceleration.
  • 19. cont 2: Experiment 1 – cohabitating old and new 3. Count cohabitating old and new in sub-spaces (hypercuboids) formed by 5 scalar, 6 D1, and 6 D2 sub- ranges Combinatorial Explosion – even with coarse segmentation - 3 scalars = 5 x 5 x 5 = 125 cuboids - + D1 = 125 x 6 x 6 x 6 = 27k hypercuboids - + D2 = 27k x 6 x 6 x 6 = 5.8 million tank site
  • 20. cont 3: Experiment 1 – cohabitating hist. and new booster pump station away from tanks next experiment Back to Question 1 • Q1: How similar are “normal old” and “normal new” vectors? • Expected A: If both are “normal”, they should be pretty similar. • Real A: Not very - numerous false alarms may be unavoidable without desensitizing IEDS
  • 21. Question 2 • Q2: What would happen if we periodically transfer “new” vectors to the historical database? – Experiment 1 – static old & new • Expected A: False alarms should decrease.
  • 22. Experiment 2 – simulate updating hist. database new vectors 47,624 • Simulations used the site away from tanks – features = CL2, SC, COND, TURB scalars+D1s+D2s • dnn = distance of new vector to “nearest neighbor” old vector – In IEDS dnn > specified limit triggers alarm • Findings 1. Transfer cases are high percentage of no-transfer case 2. Little difference between transfer cases 3. Indicates that successive “normal” vectors can be far apart
  • 23. cont: Experiment 2 ~1 false/wk ~1 false/10wk • Weekly transfers – 1 false/wk: dnn = 42 x avg(dnn) – 1 false/10wks: dnn = 59 x avg(dnn)
  • 24. Back to Question 2 • Q2: What would happen if we periodically transfer “new” vectors to the historical database? • Expected A: False alarms should decrease. • Real A: False alarms might not fall to acceptable levels.
  • 25. Question 3 • Q3: Why are successive vectors so far apart? • A: To come.
  • 26. Experiment 3 – correlation matrices • Cross-correlation matrix – correlates changes among multiple ops & WQ parameters – change = Dx = current value – value x time steps ago • Utility B stand alone site • 86-sec time step 1 time-step • Mix of WQ and operational parameters 3 time-step (4.3 min) change 7 time-step (10 min) change
  • 27. Experiment 4 – autocorrelation of Dx • Autocorrelation function correlates a signal to itself to determine how deterministic / random it is. – determinism = current behavior depends somewhat on past – randomness = current behavior unrelated to past 1 time-step 14 time-step (86 sec) (20 min) change change
  • 28. Back to Question 3 • Q3: Why are successive vectors so far apart? • A: WQ change on time scales  20 minutes can be “apparently random”. – Exp. 3 (x-matrices) - WQ & ops parameter changes are poorly correlated – Exp. 4 (autocorr.) - individual WQ parameter change is non-deterministic – Same findings at multiple sites & utilities • non-determinism = randomness = noise
  • 29. Causes of WQ variability Stand Alone Site • Unmeasured disturbances – pressure & flow transients • Measurement errors
  • 31. Alternative to stand-alone site monitoring Event Detector • Upstream / downstream sites • Upstream site provides – boundary conditions for downstream WQ – more operational parameters
  • 32. Multi-Site Concept • Event detection performed on filtered signals – model-based filtering of downstream WQ signals – modeling = accounting of causes of variability – filtered signals less variable • Modeling technique – multivariate, nonlinear curve fitting by (multi-layer perceptron) artificial neural networks (ANN) • “machine learning” from AI – inputs - upstream and “local” WQ and ops • spectrally decomposed into components • autoregressive “local” WQ inputs time delayed to be outside detection window (e.g., 20 minutes) • co-linear inputs decorrelated – ANN “learns” best predictor components
  • 33. upstream flow downstream 2-Site Example COND (mS/cm) TEMP (deg. F) COND TEMP test data PH CL2 CL2 (mg/l) PH 1-hour time steps (220 days, August to March) • Raw WQ variability is similar but not identical – differences caused by unmeasured disturbances • 1-hour time step too big for 20-minute detection window – exploratory research on multi-site
  • 34. Results – detail Downstream COND D1 (mS/cm) D1 = 1 time-step difference of test data measured data Downstream CL2 D1 (mg/l) upstream only upstream+auto Downstream PH D1 1-hour time steps
  • 35. More Complicated 4-Site Example Q? = unmeasured LVL, disturbances COND, TANK CL2 Q, PSUC, PDIS, COND, CL2, TEMP A Q, PSUC, PDIS, COND, CL2, TEMP BPS BPS B A LVL, Q? = unmeasured TANK COND, disturbances CL2 B • BPS B is “target” site • Utility operates multiple WTPs with different sources • 1 year of data (1-min reduced to 4-min) – first 10 month = training – last 2 months = test
  • 36. BPS B COND Process Model – training data Training Data measured predicted residuals N: 76,148 R2: 0.847 RMSE: 72 mS/cm Residual Error (mS/cm) BPS B COND (mS/cm) 4-minute training data observations • Looks Good!
  • 37. BPS B COND Process Model – test data Test Data hump N: 17,296 R2: 0.893 RMSE: 69 mS/cm Residual Error (mS/cm) BPS B COND (mS/cm) measured predicted residuals 4-minute test data observations • Hump may be from different WTP/source • Looks Good!
  • 38. BPS B COND Process Model – test data Detail BPS B COND (mS/cm) measured predicted 4-minute test data observations • Looks Bad! • Process model misses some periods - maybe from unmonitored flows through junctions
  • 39. BPS B COND Dx autocorrelations COND • Dx = D R R2 over x number of minutes CL2 • R2s are low R R2
  • 40. BPS B CL2 Process Model – test data trough Residual Error (mg/l) BPS B CL2 (mg/l) Test Data measured predicted residuals N: 11,715 R2: 0.912 RMSE: 0.085 mg/l 4-minute test data observations • trough may be from different WTP/source • Looks Good!
  • 41. BPS B CL2 Process Model – test data Detail BPS B CL2 (mg/l) measured predicted 4-minute test data observations • Looks Bad! • Process model missing some periods - maybe from unmonitored flows through junctions
  • 43. IEDS - Conclusions • Practical problems – data reliability – no guarantees that contamination event would “look” different than “normal” because • “normal” is so highly variable • WQ sensors being used might not provide the “information” necessary to discriminate – Where to put / how many? • Stand-Alone Sites – face widely ranging random variability from unknown disturbances, a.k.a. normal operations – high alarm limits needed to reduce false positives - defeats purpose
  • 44. cont - Conclusions • Multi-Site approach – Can account for/explain 80-90% of downstream WQ variability – unproven on  20 min detection window – diminished when too much complexity – field testing to be done at GWS and SJWD
  • 45. cont - Conclusions • Other reasons to monitor distribution system WQ – control processes at WTP to improve WQ at points of delivery – detect common problems - low total chlorine, nitrification, line integrity, DBPs, biofilm sloughing, incipient complaint detection CL2 (mg/l) BPS A TANK A BPS B TANK B 1-minute time steps 1/1/05 – 11/16/09
  • 46. Compare WTP with DS turbidity • Little variability in WTP turbidity, < 0.1 NTU !
  • 47. Correlate DS turbidity with WTP WQ • ANN process model • Inputs = finished water alkalinity, hardness, color, and source blend ratio • R2 = 0.71