SlideShare a Scribd company logo
1 of 1
2012-07 ims-APRM@Tsukuba


                              Ellipsoidal Representations                                                                                                                                                                                                                        n

                                                                                                                                                                                                                                                                                 å( X        - X ) (Yi - Y )



                                Regarding Correlations
                                                                                                                                                                                                                                                                                         i
                                                                                                                                                                                                                                                           r [ X ,Y ] =          i=1
                                                                                                                                                                                                                                                                            n                       n

                                                                                                                                                                                                                                                                           å( X          - X)       å (Y - Y )
                                                                                                                                                                                                                                                                                                2                2
                                                                                                                                                                                                                                                                                       i                  i
                                                                                                                                                                                                                                                                           i=1                      i=1




                                                                                  The author likes to express certain basic theories that may touch
              Toshiyuki                                                           upon the very roots of statistics. One is the geometric representation
                                                                                  of ρ and the other is the mysterious robustness of ρ. Both are simple
             SHIMONO                                                              but their implications are very deep, which may even affect
     PhD. ( Info. Sci. and Tech. ), a freelance                                   epistemology i.e. how humans sense/judge this ambiguous real
      tshimono@05.alumni.u-tokyo.ac.jp
                                                                                  world.

       Please read „ρ‟ below as “correlation coefficient” developed since 1880‟s.
                                                                                                                                                                                      Spherical Model                                                           F. Galton                    K. Pearson

             Background                                                                              Main Findings                                                                                                       Prospects
Yˆ = a X ++ a X + b                                                                                                                                                                                       •
      1 1     p p                                                            0. A spherical triangle gives a view.                                                                                              Ellipsoidal representation of
Despite fundamental, multiple regression                                       cos( ) of the inner angles give partial-ρ.                                                                                       correlations and regression
analysis is hard to interpret the outcomes!!                                 1. The „correlation ellipsoid‟ directly gives                                                                                      by computer software would
                              By choosing the explanatory variables             multiple-ρ, partial-ρ, coefficients                                                                                             be useful.
How happens?                  intentionally among many candidate
                                                                             2. The mysterious robustness                                                                                                  •    Influences to statistics and data
-> multicolliearity           variables, you may freely invert the sign of
                              the regression coefficient of any variable             — but not yet fully developed                                                                                              analysis from these geometric views.
-> un-intuitive sign        inversion.invert.
                              you want to
-> perturbation by finite sampling

§1. An important empirical fact : the mysterious robustness regarding ρ
                                                                                                                                 Theoretical development is
                                                                                                                                                                                                                                     The relation between ρ
Quite often, |ρ[ X:Y ] −ρ[ f(X) : g(Y) ] | < 0.05                                                                                much more required,
                                                                                                                                 especially for higher                                                                               and Spearman‟s rank
                                                                                                                                                                                                                                     ρ for (infinitely many
if X, Y, f(X), g(Y) have no outliers, and if f, g are                                                                            dimensional cases
                                                                                                                                                                                                                                     pairs of) X,Y that are
increasing functions.                                                                                                                                                                                                                forming 2-dim
                                                                                                                                                                                                                                     Gaussian distribution
 This robustness seems comaparable to the                                                                                                                                                                 Despite strong
                                                                                                                                                                                                          the deformations
                                                                                                                                                                                                                                     along 0≤ρ≤1 is shown
                                                       Recall Fisher‟s z transformation                                                                                                                                              in red. The difference is
sampling error with N > 500. Note: peoples often judges            1         1+ r
                                                                                                                                                                                                          often                      as small as less than
                                                               z=      log                                                                                                                                cause very small           0.05.
things from only N=1,2 or 10 to see relations.                     2         1- r                                                                                                                         effects on ρ.


                                                                                                                                                                                                                                     観察        符号化
                                                                                                                                                                                                                               X           Y             ± 1
                                                                                                                                                                                                                                      r1
                                                                                                                                                                                                                                               r2
                                                                                                                                                                                                                                                    r3
                                                                                                                                                                                                                               ± 1


                                                                                                                                                                                                                                r2  2 /  r1
                                                                                                                                                                                                                                r3  2 /  arcsin r1




                                                                                                                                                                                                         When ρ is not so strong, deformations of
                                                                                                                                                                                                         observations may be a very minor issue. „N‟
                                                                                                                                                                                                         (sample size) is important.
                                                                                                                                                                                                         These facts deeply affect how a human
                                                                                                                                                                                                         recognizes relationship between/
                                                                                                                                                                                                         among multiple phenomena.


§2. The Correlation Ellipse & Correlation Ellipsoid                                                                                                                                                         6 teams of the Central League played 130 games in the each
                                                                                                                                                                                                            of past 31 years. Each dot below corresponds to each team
                                                                                                                                                                                                            and each year (N = 186 = 6 × 31).
                                                   Where does the champion                                         The two variables are assumed to
                                                                                                                   form 2-dim Gaussian distribution with
                                                   come from? The winner is                                        zero-centered. Note: quite many
                                                   approximately ρ times as                                        distribution cannot be distinguished
                                                   strong as the true guy.                                         from     Gaussian   distribution  by
                                                                                                                   Kolmogorov-Smirnov test unless N >
                                                                                                                   30 or 100.




                                                                                                                                                 result

                                                                                                                                                      ability


                                                                                                                                                                                                 This ellipsoid touches the unit cube at
                                                                                                                                                                                                 (±1,±ρ12,±ρ13), (±ρ12, ±1,±ρ23),(±ρ13,±ρ23,±1)


This ellipse touches                                                                                                                                                                         The (hyper-)ellipsoid touches the unit
the unit square at                                                                                                                                                                           (hyper-)cube at 2×k points of
(±1,±ρ), (±ρ,±1).                                                                                                                                                                             ±( ρ・1 , ρ・2 ,.., ρ・k ) with ・ =
                                                                                                                                                                                             1,2,..,k.
             ▲ The “correlation ellipse” for given ▲ The “correlation ellipsoid” for ρ-matrix
                                                   ρ
                                                                                                                                                                                                                                                                    Trivial example of the correspondence between the shape



§3. How are the multiple/partial ρ and coeff. a i drawn ?
                                                                                                                                                                                                                                                                    of distribution of N≤5 and the correlation ellipse.
                                                                                                                                                                                                                                                                    Coincidentally, the Spearman‟s rank ρ is 0.1 times integer
                                                                                                                                                                                                                                                                    when all the variables are different values.




                                                                                                                                                                The k-th partial ρ is read
    Assume ρ between                                              The partial ρ is read                                                                         by the ruler of a straight               The sign (plus/minus) can be determined by
                                                                  by a graduated ruler.                                                                         line parallel to
    variables Y, X1, X2                                                                                                                                         k-th axis of the space                   either of these geometric methods. You may
                                                                                 r -rr
                                                                 偏相関係数     r¢ =                                1         2 12
                                                                                                                                                                                                         consider how the coefficient a i changes its
    are given.                                                                  1- r 1- r
                                                                                         1
                                                                                                                   2
                                                                                                                    2                   2
                                                                                                                                       12
                                                                                                                                                                The k-th regression coefficient          value according to the number of variables X i
                                                                                                                                                                is read by
                                                                                                                                                                the linear scalar field inside the       increases.
  The multiple ρ                                                The standardized                                                                                hyper-ellipsoid,                         For SEM, how | a i*| > 1 happens is explained.
                                                                                                                                                                with ±1 valued at the tangential
  is the similarity ratio.                                      partial regression                                                                              points to the facets of xk=±1,
  重相関係数は相似比                                                     coefficients a i is                                                                             and 0 valued at the other
                                                                                                                                                                tangential points to the other
                                                                read by a linear                                                                                facets.
                                                                scalar field.
                                                            (標準化)偏回帰係数
                                                                                   r -rr
                                                                              a1 = 1 2 212
                                                                               *
                                                                                                                                                                                                        ANY REFERENCE?
                                                                                    1- r                                                    12



                                                                                0.5   色        0.0
                                                                                                                   色
                                                                                                                               0.833
                                                                                                                                                                                                     Those above are basically my original works
                                                                                 X1           X2                X1 , X2
                                                                                      - 0.8                                                                                                          except the definitions of correlations and
                                                                                                                                                                                                     regression analysis. The author sincerely
                                                                                                   R=0.833

                                                                                                             (0.5 , 0)                              Now you can grab                                 welcomes any related literature information.
 The ratio of the red arrow to the whole                                                                           (0.8, -1)
                                                                                                                                (1 , -0.8)
                                                                                                                                                    how Y depends on                                                                                                              and patron!
 red line section is equal to the multiple         Although X2 and the color Y is no correlated,
 correlation coefficient.                          the determinant coefficient Y from X1 and X2 is
                                                                                                                                                    *multiple* variables
                                                   much more than that of Y from only X1.
                                                   —A case when zero-correlation is not useless
                                                                                                                                                    X i by a bird‟s-eye.                               How can one distinguish
                      R = r12 + r22                                                                                                                 -> how multicollinearity                          things from multiple features?
                     or R = r12 + r22 ++ rp2                                                                                                       occurs / how unintuitive sign                     What is distinguishing in the world?
                                                                                                                                                    inversion occurs / etc.                           There seems to be several theoretical
When the correlation between/among                                           This is when X2 is useless                                             -> A theoretical framework
explanatory variables is/are zero, then                                      to explain Y because X1                                                                                                  principles to be
                                                                                                                                                    telling whether any action                            developed..
the multiple correlation coefficient                                         conceals the effect of X2.
becomes √r12+r22 or √ r12+..+rp2 .                                           This happens when                                                      causes positive or negative
                                                                                                                                                    effect in daily/social real.                                                                                   Nara Great Buddha
— determinant coefficient additivity                                         r2 = r1 r12 thus r = r1 .

More Related Content

What's hot

2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia Beach2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia BeachCheckered Flag Honda
 
세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록
세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록
세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록cbs15min
 
2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San Leandro2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San LeandroSan Leandro Honda
 
2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los AngelesMiller Honda Van Nuys
 
Dental amalgam
Dental amalgamDental amalgam
Dental amalgamZirgi Rana
 
Fringe eu procurement - sara piller
Fringe   eu procurement - sara pillerFringe   eu procurement - sara piller
Fringe eu procurement - sara pillerlgconf11
 

What's hot (10)

2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia Beach2010 Honda Insight Hybrid Virginia Beach
2010 Honda Insight Hybrid Virginia Beach
 
세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록
세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록
세바시 15분 조재형 가톨릭의과대학 서울성모병원 내분비내과 조교수 - 우리 의학교과서의 세계진출, 그 15년의 기록
 
2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San Leandro2010 Honda Insight Hybrid San Leandro
2010 Honda Insight Hybrid San Leandro
 
2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles2010 Honda Insight Hybrid Los Angeles
2010 Honda Insight Hybrid Los Angeles
 
Dental amalgam
Dental amalgamDental amalgam
Dental amalgam
 
Fringe eu procurement - sara piller
Fringe   eu procurement - sara pillerFringe   eu procurement - sara piller
Fringe eu procurement - sara piller
 
How to Create an Innovation Culture
How to Create an Innovation CultureHow to Create an Innovation Culture
How to Create an Innovation Culture
 
Magazine flat plan
Magazine flat planMagazine flat plan
Magazine flat plan
 
APA climate change planning presentation
APA climate change planning presentationAPA climate change planning presentation
APA climate change planning presentation
 
2010 Mazda Mx5 Baton Rouge
2010 Mazda Mx5 Baton Rouge2010 Mazda Mx5 Baton Rouge
2010 Mazda Mx5 Baton Rouge
 

Viewers also liked

Chapter36b
Chapter36bChapter36b
Chapter36bYing Liu
 
Properties of Geometric Figures
Properties of Geometric FiguresProperties of Geometric Figures
Properties of Geometric FiguresMai Nicole Olaguer
 
Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...
Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...
Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...j3di79
 
Ridge regression, lasso and elastic net
Ridge regression, lasso and elastic netRidge regression, lasso and elastic net
Ridge regression, lasso and elastic netVivian S. Zhang
 

Viewers also liked (6)

Chapter36b
Chapter36bChapter36b
Chapter36b
 
Properties of Geometric Figures
Properties of Geometric FiguresProperties of Geometric Figures
Properties of Geometric Figures
 
Ridge regression
Ridge regressionRidge regression
Ridge regression
 
Autocorrelation
AutocorrelationAutocorrelation
Autocorrelation
 
Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...
Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...
Chapter 3 Movement of Substances Lesson 2 - Effects of isotonic, hypotonic & ...
 
Ridge regression, lasso and elastic net
Ridge regression, lasso and elastic netRidge regression, lasso and elastic net
Ridge regression, lasso and elastic net
 

Similar to Ellipsoidal Representations Regarding Correlations

VMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload MobilityVMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload MobilitySal Lopez
 
Re on farm survey outline results
Re on farm  survey outline resultsRe on farm  survey outline results
Re on farm survey outline resultsBioenergy Crops
 
Re on farm survey outline results
Re on farm  survey outline resultsRe on farm  survey outline results
Re on farm survey outline resultsBioenergy Crops
 
Housing Exclusion of the Elderly in Slovenia
Housing Exclusion of the Elderly in SloveniaHousing Exclusion of the Elderly in Slovenia
Housing Exclusion of the Elderly in SloveniaFEANTSA
 
High stakes world of Mobile Payments
High stakes world of Mobile PaymentsHigh stakes world of Mobile Payments
High stakes world of Mobile PaymentstxtNation
 
High stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicHigh stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicTyson Hackwood
 
VocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification BrochureVocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification BrochureGailMaynard
 
Print Portfolio1
Print Portfolio1Print Portfolio1
Print Portfolio1cooeesigns
 
The 35th Travelweekly Digital Issue
The 35th Travelweekly Digital Issue The 35th Travelweekly Digital Issue
The 35th Travelweekly Digital Issue vietnamtw
 
Regression analysis
Regression analysisRegression analysis
Regression analysisSubin Raj
 
Wetland Mapping & Conservation Efforts in the Fort Nelson area
Wetland Mapping & Conservation Efforts in the Fort Nelson areaWetland Mapping & Conservation Efforts in the Fort Nelson area
Wetland Mapping & Conservation Efforts in the Fort Nelson areafnfnlands
 
Ipsos ASI: Once Upon a Brand
Ipsos ASI: Once Upon a BrandIpsos ASI: Once Upon a Brand
Ipsos ASI: Once Upon a BrandIpsos UK
 
Seattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media KitSeattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media Kitjkueber
 

Similar to Ellipsoidal Representations Regarding Correlations (20)

Hntbmvva panels low
Hntbmvva panels lowHntbmvva panels low
Hntbmvva panels low
 
TV Connect London 2013 Preview Brochure
TV Connect London 2013 Preview BrochureTV Connect London 2013 Preview Brochure
TV Connect London 2013 Preview Brochure
 
Exerpt From Exec Overview
Exerpt From Exec OverviewExerpt From Exec Overview
Exerpt From Exec Overview
 
VMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload MobilityVMware vCloud Director and Nexus 1000V / Workload Mobility
VMware vCloud Director and Nexus 1000V / Workload Mobility
 
Reaching Vulnerable Children in Georgia
Reaching Vulnerable Children in GeorgiaReaching Vulnerable Children in Georgia
Reaching Vulnerable Children in Georgia
 
The Singles Lifecycle
The Singles LifecycleThe Singles Lifecycle
The Singles Lifecycle
 
Re on farm survey outline results
Re on farm  survey outline resultsRe on farm  survey outline results
Re on farm survey outline results
 
Re on farm survey outline results
Re on farm  survey outline resultsRe on farm  survey outline results
Re on farm survey outline results
 
Housing Exclusion of the Elderly in Slovenia
Housing Exclusion of the Elderly in SloveniaHousing Exclusion of the Elderly in Slovenia
Housing Exclusion of the Elderly in Slovenia
 
High stakes world of Mobile Payments
High stakes world of Mobile PaymentsHigh stakes world of Mobile Payments
High stakes world of Mobile Payments
 
High stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographicHigh stakes-world-of-mobile-payments-infographic
High stakes-world-of-mobile-payments-infographic
 
VocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification BrochureVocaLight Infrared Classroom Amplification Brochure
VocaLight Infrared Classroom Amplification Brochure
 
Print Portfolio1
Print Portfolio1Print Portfolio1
Print Portfolio1
 
The 35th Travelweekly Digital Issue
The 35th Travelweekly Digital Issue The 35th Travelweekly Digital Issue
The 35th Travelweekly Digital Issue
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Wetland Mapping & Conservation Efforts in the Fort Nelson area
Wetland Mapping & Conservation Efforts in the Fort Nelson areaWetland Mapping & Conservation Efforts in the Fort Nelson area
Wetland Mapping & Conservation Efforts in the Fort Nelson area
 
Ipsos ASI: Once Upon a Brand
Ipsos ASI: Once Upon a BrandIpsos ASI: Once Upon a Brand
Ipsos ASI: Once Upon a Brand
 
Seattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media KitSeattle Magazine 2012 Media Kit
Seattle Magazine 2012 Media Kit
 
Timeline 1
Timeline 1Timeline 1
Timeline 1
 
Finals
FinalsFinals
Finals
 

More from Toshiyuki Shimono

国際産業数理・応用数理会議のポスター(作成中)
国際産業数理・応用数理会議のポスター(作成中)国際産業数理・応用数理会議のポスター(作成中)
国際産業数理・応用数理会議のポスター(作成中)Toshiyuki Shimono
 
インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装
インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装
インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装Toshiyuki Shimono
 
extracting only a necessary file from a zip file
extracting only a necessary file from a zip fileextracting only a necessary file from a zip file
extracting only a necessary file from a zip fileToshiyuki Shimono
 
A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021
A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021
A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021Toshiyuki Shimono
 
新型コロナの感染者数 全国の状況 2021年2月上旬まで
新型コロナの感染者数 全国の状況 2021年2月上旬まで新型コロナの感染者数 全国の状況 2021年2月上旬まで
新型コロナの感染者数 全国の状況 2021年2月上旬までToshiyuki Shimono
 
Multiplicative Decompositions of Stochastic Distributions and Their Applicat...
 Multiplicative Decompositions of Stochastic Distributions and Their Applicat... Multiplicative Decompositions of Stochastic Distributions and Their Applicat...
Multiplicative Decompositions of Stochastic Distributions and Their Applicat...Toshiyuki Shimono
 
Theory to consider an inaccurate testing and how to determine the prior proba...
Theory to consider an inaccurate testing and how to determine the prior proba...Theory to consider an inaccurate testing and how to determine the prior proba...
Theory to consider an inaccurate testing and how to determine the prior proba...Toshiyuki Shimono
 
Interpreting Multiple Regression via an Ellipse Inscribed in a Square Extensi...
Interpreting Multiple Regressionvia an Ellipse Inscribed in a Square Extensi...Interpreting Multiple Regressionvia an Ellipse Inscribed in a Square Extensi...
Interpreting Multiple Regression via an Ellipse Inscribed in a Square Extensi...Toshiyuki Shimono
 
BigQueryを使ってみた(2018年2月)
BigQueryを使ってみた(2018年2月)BigQueryを使ってみた(2018年2月)
BigQueryを使ってみた(2018年2月)Toshiyuki Shimono
 
既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案
既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案
既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案Toshiyuki Shimono
 
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...Toshiyuki Shimono
 
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...Toshiyuki Shimono
 
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)Toshiyuki Shimono
 
企業等に蓄積されたデータを分析するための処理機能の提案
企業等に蓄積されたデータを分析するための処理機能の提案企業等に蓄積されたデータを分析するための処理機能の提案
企業等に蓄積されたデータを分析するための処理機能の提案Toshiyuki Shimono
 
新入社員の頃に教えて欲しかったようなことなど
新入社員の頃に教えて欲しかったようなことなど新入社員の頃に教えて欲しかったようなことなど
新入社員の頃に教えて欲しかったようなことなどToshiyuki Shimono
 
ページャ lessを使いこなす
ページャ lessを使いこなすページャ lessを使いこなす
ページャ lessを使いこなすToshiyuki Shimono
 
Guiを使わないテキストデータ処理
Guiを使わないテキストデータ処理Guiを使わないテキストデータ処理
Guiを使わないテキストデータ処理Toshiyuki Shimono
 
データ全貌把握の方法170324
データ全貌把握の方法170324データ全貌把握の方法170324
データ全貌把握の方法170324Toshiyuki Shimono
 
Macで開発環境を整える170420
Macで開発環境を整える170420Macで開発環境を整える170420
Macで開発環境を整える170420Toshiyuki Shimono
 

More from Toshiyuki Shimono (20)

国際産業数理・応用数理会議のポスター(作成中)
国際産業数理・応用数理会議のポスター(作成中)国際産業数理・応用数理会議のポスター(作成中)
国際産業数理・応用数理会議のポスター(作成中)
 
インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装
インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装
インターネット等からデータを自動収集するソフトウェアに必要な補助機能とその実装
 
extracting only a necessary file from a zip file
extracting only a necessary file from a zip fileextracting only a necessary file from a zip file
extracting only a necessary file from a zip file
 
A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021
A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021
A Hacking Toolset for Big Tabular Files -- JAPAN.PM 2021
 
新型コロナの感染者数 全国の状況 2021年2月上旬まで
新型コロナの感染者数 全国の状況 2021年2月上旬まで新型コロナの感染者数 全国の状況 2021年2月上旬まで
新型コロナの感染者数 全国の状況 2021年2月上旬まで
 
Multiplicative Decompositions of Stochastic Distributions and Their Applicat...
 Multiplicative Decompositions of Stochastic Distributions and Their Applicat... Multiplicative Decompositions of Stochastic Distributions and Their Applicat...
Multiplicative Decompositions of Stochastic Distributions and Their Applicat...
 
Theory to consider an inaccurate testing and how to determine the prior proba...
Theory to consider an inaccurate testing and how to determine the prior proba...Theory to consider an inaccurate testing and how to determine the prior proba...
Theory to consider an inaccurate testing and how to determine the prior proba...
 
Interpreting Multiple Regression via an Ellipse Inscribed in a Square Extensi...
Interpreting Multiple Regressionvia an Ellipse Inscribed in a Square Extensi...Interpreting Multiple Regressionvia an Ellipse Inscribed in a Square Extensi...
Interpreting Multiple Regression via an Ellipse Inscribed in a Square Extensi...
 
BigQueryを使ってみた(2018年2月)
BigQueryを使ってみた(2018年2月)BigQueryを使ってみた(2018年2月)
BigQueryを使ってみた(2018年2月)
 
Seminar0917
Seminar0917Seminar0917
Seminar0917
 
既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案
既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案
既存分析ソフトへ
データを投入する前に
簡便な分析するためのソフトの作り方の提案
 
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
 
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...
 
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)
 
企業等に蓄積されたデータを分析するための処理機能の提案
企業等に蓄積されたデータを分析するための処理機能の提案企業等に蓄積されたデータを分析するための処理機能の提案
企業等に蓄積されたデータを分析するための処理機能の提案
 
新入社員の頃に教えて欲しかったようなことなど
新入社員の頃に教えて欲しかったようなことなど新入社員の頃に教えて欲しかったようなことなど
新入社員の頃に教えて欲しかったようなことなど
 
ページャ lessを使いこなす
ページャ lessを使いこなすページャ lessを使いこなす
ページャ lessを使いこなす
 
Guiを使わないテキストデータ処理
Guiを使わないテキストデータ処理Guiを使わないテキストデータ処理
Guiを使わないテキストデータ処理
 
データ全貌把握の方法170324
データ全貌把握の方法170324データ全貌把握の方法170324
データ全貌把握の方法170324
 
Macで開発環境を整える170420
Macで開発環境を整える170420Macで開発環境を整える170420
Macで開発環境を整える170420
 

Recently uploaded

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Quarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayQuarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayMakMakNepo
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 

Recently uploaded (20)

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Quarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up FridayQuarter 4 Peace-education.pptx Catch Up Friday
Quarter 4 Peace-education.pptx Catch Up Friday
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 

Ellipsoidal Representations Regarding Correlations

  • 1. 2012-07 ims-APRM@Tsukuba Ellipsoidal Representations n å( X - X ) (Yi - Y ) Regarding Correlations i r [ X ,Y ] = i=1 n n å( X - X) å (Y - Y ) 2 2 i i i=1 i=1 The author likes to express certain basic theories that may touch Toshiyuki upon the very roots of statistics. One is the geometric representation of ρ and the other is the mysterious robustness of ρ. Both are simple SHIMONO but their implications are very deep, which may even affect PhD. ( Info. Sci. and Tech. ), a freelance epistemology i.e. how humans sense/judge this ambiguous real tshimono@05.alumni.u-tokyo.ac.jp world. Please read „ρ‟ below as “correlation coefficient” developed since 1880‟s. Spherical Model F. Galton K. Pearson Background Main Findings Prospects Yˆ = a X ++ a X + b • 1 1 p p 0. A spherical triangle gives a view. Ellipsoidal representation of Despite fundamental, multiple regression cos( ) of the inner angles give partial-ρ. correlations and regression analysis is hard to interpret the outcomes!! 1. The „correlation ellipsoid‟ directly gives by computer software would By choosing the explanatory variables multiple-ρ, partial-ρ, coefficients be useful. How happens? intentionally among many candidate 2. The mysterious robustness • Influences to statistics and data -> multicolliearity variables, you may freely invert the sign of the regression coefficient of any variable — but not yet fully developed analysis from these geometric views. -> un-intuitive sign inversion.invert. you want to -> perturbation by finite sampling §1. An important empirical fact : the mysterious robustness regarding ρ Theoretical development is The relation between ρ Quite often, |ρ[ X:Y ] −ρ[ f(X) : g(Y) ] | < 0.05 much more required, especially for higher and Spearman‟s rank ρ for (infinitely many if X, Y, f(X), g(Y) have no outliers, and if f, g are dimensional cases pairs of) X,Y that are increasing functions. forming 2-dim Gaussian distribution This robustness seems comaparable to the Despite strong the deformations along 0≤ρ≤1 is shown Recall Fisher‟s z transformation in red. The difference is sampling error with N > 500. Note: peoples often judges 1 1+ r often as small as less than z= log cause very small 0.05. things from only N=1,2 or 10 to see relations. 2 1- r effects on ρ. 観察 符号化 X Y ± 1 r1 r2 r3 ± 1 r2  2 /  r1 r3  2 /  arcsin r1 When ρ is not so strong, deformations of observations may be a very minor issue. „N‟ (sample size) is important. These facts deeply affect how a human recognizes relationship between/ among multiple phenomena. §2. The Correlation Ellipse & Correlation Ellipsoid 6 teams of the Central League played 130 games in the each of past 31 years. Each dot below corresponds to each team and each year (N = 186 = 6 × 31). Where does the champion The two variables are assumed to form 2-dim Gaussian distribution with come from? The winner is zero-centered. Note: quite many approximately ρ times as distribution cannot be distinguished strong as the true guy. from Gaussian distribution by Kolmogorov-Smirnov test unless N > 30 or 100. result ability This ellipsoid touches the unit cube at (±1,±ρ12,±ρ13), (±ρ12, ±1,±ρ23),(±ρ13,±ρ23,±1) This ellipse touches The (hyper-)ellipsoid touches the unit the unit square at (hyper-)cube at 2×k points of (±1,±ρ), (±ρ,±1). ±( ρ・1 , ρ・2 ,.., ρ・k ) with ・ = 1,2,..,k. ▲ The “correlation ellipse” for given ▲ The “correlation ellipsoid” for ρ-matrix ρ Trivial example of the correspondence between the shape §3. How are the multiple/partial ρ and coeff. a i drawn ? of distribution of N≤5 and the correlation ellipse. Coincidentally, the Spearman‟s rank ρ is 0.1 times integer when all the variables are different values. The k-th partial ρ is read Assume ρ between The partial ρ is read by the ruler of a straight The sign (plus/minus) can be determined by by a graduated ruler. line parallel to variables Y, X1, X2 k-th axis of the space either of these geometric methods. You may r -rr 偏相関係数 r¢ = 1 2 12 consider how the coefficient a i changes its are given. 1- r 1- r 1 2 2 2 12 The k-th regression coefficient value according to the number of variables X i is read by the linear scalar field inside the increases. The multiple ρ The standardized hyper-ellipsoid, For SEM, how | a i*| > 1 happens is explained. with ±1 valued at the tangential is the similarity ratio. partial regression points to the facets of xk=±1, 重相関係数は相似比 coefficients a i is and 0 valued at the other tangential points to the other read by a linear facets. scalar field. (標準化)偏回帰係数 r -rr a1 = 1 2 212 * ANY REFERENCE? 1- r 12 0.5 色 0.0 色 0.833 Those above are basically my original works X1 X2 X1 , X2 - 0.8 except the definitions of correlations and regression analysis. The author sincerely R=0.833 (0.5 , 0) Now you can grab welcomes any related literature information. The ratio of the red arrow to the whole (0.8, -1) (1 , -0.8) how Y depends on and patron! red line section is equal to the multiple Although X2 and the color Y is no correlated, correlation coefficient. the determinant coefficient Y from X1 and X2 is *multiple* variables much more than that of Y from only X1. —A case when zero-correlation is not useless X i by a bird‟s-eye. How can one distinguish R = r12 + r22 -> how multicollinearity things from multiple features? or R = r12 + r22 ++ rp2 occurs / how unintuitive sign What is distinguishing in the world? inversion occurs / etc. There seems to be several theoretical When the correlation between/among This is when X2 is useless -> A theoretical framework explanatory variables is/are zero, then to explain Y because X1 principles to be telling whether any action developed.. the multiple correlation coefficient conceals the effect of X2. becomes √r12+r22 or √ r12+..+rp2 . This happens when causes positive or negative effect in daily/social real. Nara Great Buddha — determinant coefficient additivity r2 = r1 r12 thus r = r1 .