1. GEX: A flexible method for approximating influence
via Geometric Ensemble (NeurIPS 2023)
Sung-Yub Kim (presenter), Kyungsu Kim, Eunho Yang
Oct. 6th, 2023
2. TL;DR
• We empirically verify that the proposed method improves the downstream task performance and
relieves the framework dependency of IF (and its approximations).
12. Distributional bias in IF and its approximations
<latexit sha1_base64="lL3vvGifEpow471EGLQ7F4WhACM=">AAACR3icbVBNaxsxFNQ6Seu4X25z7EXElCRQzG7pRxowGHxJbgnEdsDrLFpZ64hotYv0NmAL/btccs2tf6GXHhJCjtFufGjtDgjNm3kP6U2cC67B9395tbX1jRcv65uNV6/fvH3XfP9hoLNCUdanmcjUWUw0E1yyPnAQ7CxXjKSxYMP4slf6wyumNM/kKcxyNk7JVPKEUwJOiprnYUrgghJhjmxUcQBzqgg9knZ3/nm+s9fADged6goT55jAmp4NdZFGhnYCe+6qaWTmO47RELLc4rIsKxs1W37br4BXSbAgLbTAcdS8DScZLVImgQqi9SjwcxgbooBTwWwjLDTLCb0kUzZyVJKU6bGpcrD4k1MmOMmUOxJwpf49YUiq9SyNXWe5qV72SvF/3qiAZH9suMwLYJI+P5QUAkOGy1DxhCtGQcwcIVRx91dML4jLClz0DRdCsLzyKhl8aQff299Ovra6Pxdx1NFHtI12UYB+oC46RMeojyi6Rr/RHbr3brw/3oP3+Nxa8xYzW+gf1LwnZf2xHw==</latexit>
ITracIn(z, z0
) :=
1
C
C
X
c=1
gc>
z0 gc
z
<latexit sha1_base64="Xjnu88gy+RPwO3H+1aB3E6C0IIg=">AAACVnicbVFNaxsxENWumyZ1mnbbHnsRNSEpBLNb+g2GQC71zQ5xEvA6i1bWOiKSdpFmC7bQn2wv7U/ppVS79qF1OjDozXszSPOUV4IbiOOfQdh5sPNwd+9Rd//xwZOn0bPnl6asNWUTWopSX+fEMMEVmwAHwa4rzYjMBbvK784a/eor04aX6gKWFZtJslC84JSAp7JIppLALSXCDl3WYgB7oQkdqvORO16drI5ed7GPz4P2SAuv2cTZM5eaWmaWDhJ346tFZldHHtEUysrhcXbe5E1T4UZrJJdFvbgft4Hvg2QDemgToyz6ls5LWkumgApizDSJK5hZooFTwVw3rQ2rCL0jCzb1UBHJzMy2tjh86Jk5LkrtUwFu2b8nLJHGLGXuO5vFzbbWkP/TpjUUH2eWq6oGpuj6oqIWGErceIznXDMKYukBoZr7t2J6S7xx4H+i601Itle+Dy7f9JP3/Xfjt73TTxs79tBL9AodowR9QKfoCxqhCaLoO/oVhEEn+BH8DnfC3XVrGGxmXqB/Ioz+AJLfsqw=</latexit>
ITracInRP(z, z0
) :=
1
C
C
X
c=1
gc>
z0 QRQ>
Rgc
z
<latexit sha1_base64="C5MaCNLjqlxsGTzGITmp5Mmztak=">AAACRnicbVBNaxRBEK1Zv5L1I6s5emlcJBF0mRE1MSBEcongIQY3CexMhp6e3k2Tnu6hu0bYbebXefGcmz/Bi4eEkKs9s3vQxAdNP96roqpeVkphMQx/Bp1bt+/cvbe03L3/4OGjld7jJwdWV4bxIdNSm6OMWi6F4kMUKPlRaTgtMskPs9Odxj/8xo0VWn3FacmTgk6UGAtG0UtpL4kLiieMSvepTluO6D4apWUu6vXZy9naiy7x2PrQfpPUzdbq4xh1SYbpPok/+1E5Td1+fexeRXUjzt2msk57/XAQtiA3SbQgfVhgL+2dxblmVcEVMkmtHUVhiYmjBgWTvO7GleUlZad0wkeeKlpwm7g2hpo890pOxtr4p5C06t8djhbWTovMVzaH2uteI/7PG1U43kycUGWFXLH5oHElCWrSZEpyYThDOfWEMiP8roSdUEMZ+uS7PoTo+sk3ycHrQfRu8PbLm/72+0UcS/AUnsE6RLAB27ALezAEBt/hF5zDRfAj+B1cBlfz0k6w6FmFf9CBPyenry4=</latexit>
IArnoldi(z, z0
) := g>
z0 UR⇤ 1
R U>
R gz
13. Distributional bias in IF and its approximations
<latexit sha1_base64="f47u6XDbCSPi2grflbpZy6fnKhM=">AAAFzHicjVTbbhMxEJ1CAqXcWnjkxaJFaqU0Soq4PSBVKgIqAQqoN6kJkdfrTVb17ka2tzRd5ZUP5I1P4JE/4NjZtuklFSt5PT4z9sw5491goGJjG43fMzduVqq3bs/embt77/6Dh/MLj3ZMlmsht0WmMr0XcCNVnMptG1sl9wZa8iRQcjc42HD+3UOpTZylW3Y4kJ2E99I4igW3gLKFmb/UJkuSjvAOKKKCWqQpowGGoRhoDCslRs+pTk0awVqmd0ANfBpzQPlpFCcFf4AVh59hTsu18rYEroEYWArZVj0awc6BpCTwXkEeRp9gWcy5P8edZmAnQBktUY+6dIyZYW+G3QrvH2Wk87cwVinE6Yk/15yrz1XOUZXyp4VTuDieErX0MGpYfcCe3Nfh2KUe2/CYoD4N/drhoa/+Dx0Cm376ykT0En3Gu12q1UbNHHF9xAfoxzdo/t33hflejTk5ri14lrzKJ8wnu3bo2S17rROsV5EzhB35PLHvOitzueqOvO5byCtLdtd16aS31seP+6BxGsMceU5nPITXvaBN1NsF12X0robhNHiL4bpZYO14Oobu9jGvyZlnWq9zzyUBFpbdnaa4Y/cea13WrP0uDavm2UT/cbMu3oHp2Wqn2kyqdL2iV2Xseb+rlvtv6Bh2WOrr7l0MzdZOb/uVvLvzi416wz/sstEsjUUqn1Z3/lc7zESeyNQKxY3ZbzYGtlNwbWOh5GiunRs54OKA9+Q+zJQn0nQK/zMasWdAQhZlGiO1zKOTOwqeGDNMAkQm3PbNRZ8Dr/Lt5zZ63SnidJBbmYpxoihXzGbM/dlYGGsprBrC4ELHqJWJPtdcWPz/5iBC8yLly8bOWr35sv7i69ri+ptSjll6Qk9xZZv0itbpIz66bRKVzUpWOaoMq1+qtlpUR+PQGzPlnsd07qn+/AdL/z9U</latexit>
Proposition 3.1 (Distributional bias in bilinear self-influence). Let us as-
sume gz follows a P-dimensional stable distribution (e.g., Gaussian, Cauchy, and
Lévy distribution) and M 2 RP ⇥P
is a positive (semi-)definite matrix. Then,
self-influence in the form of IM (z, z) = g>
z Mgz follows a unimodal distribution.
Furthermore, if gz follows a Gaussian distribution, then the self-influence follows
a generalized 2
-distribution.
14. A simple case study in a modified two-circle dataset
<latexit sha1_base64="tEVn5IcyeMefnGdLs/hWXGoL6PM=">AAACAXicbZDLSsNAFIYn9VbrLepGcBMsgquSSL0sC24UhFawF2hDmEwn7dDJhZkToYS48VXcuFDErW/hzrdxkmahrT8MfPznHOac3404k2Ca31ppaXllda28XtnY3Nre0Xf3OjKMBaFtEvJQ9FwsKWcBbQMDTnuRoNh3Oe26k6us3n2gQrIwuIdpRG0fjwLmMYJBWY5+MPAxjAnmyU3q5AyQ3DabqaNXzZqZy1gEq4AqKtRy9K/BMCSxTwMgHEvZt8wI7AQLYITTtDKIJY0wmeAR7SsMsE+lneQXpMaxcoaGFwr1AjBy9/dEgn0pp76rOrMd5XwtM/+r9WPwLu2EBVEMNCCzj7yYGxAaWRzGkAlKgE8VYCKY2tUgYywwARVaRYVgzZ+8CJ3TmnVeO7urVxv1Io4yOkRH6ARZ6AI10DVqoTYi6BE9o1f0pj1pL9q79jFrLWnFzD76I+3zBwqwlz0=</latexit>
ILOO
<latexit sha1_base64="sRYssw0GitVG1c+smryccc5XV6I=">AAACA3icbVDLSsNAFJ34rPUVdaebwSLUTUnEF4hQdKO7CvYBTSg300k7dPJgZiKUUHDjr7hxoYhbf8Kdf+OkzUJbD1w4nHMv997jxZxJZVnfxtz8wuLScmGluLq2vrFpbm03ZJQIQusk4pFoeSApZyGtK6Y4bcWCQuBx2vQG15nffKBCsii8V8OYugH0QuYzAkpLHXPXCUD1CfD0doSdi7IDPO7DpW1Zhx2zZFWsMfAssXNSQjlqHfPL6UYkCWioCAcp27YVKzcFoRjhdFR0EkljIAPo0bamIQRUuun4hxE+0EoX+5HQFSo8Vn9PpBBIOQw83ZldLKe9TPzPayfKP3dTFsaJoiGZLPITjlWEs0BwlwlKFB9qAkQwfSsmfRBAlI6tqEOwp1+eJY2jin1aObk7LlWv8jgKaA/tozKy0RmqohtUQ3VE0CN6Rq/ozXgyXox342PSOmfkMzvoD4zPH+s/lmg=</latexit>
I (↵ = 100)
<latexit sha1_base64="ZmlcEXoPjl2BxdNi7DFGhbT3CY8=">AAACBHicbVDLSsNAFJ34rPUVddnNYBHqpiTiC0QoutFdBfuAJpSb6aQdOnkwMxFK6MKNv+LGhSJu/Qh3/o2TNgttPTDM4Zx7ufceL+ZMKsv6NhYWl5ZXVgtrxfWNza1tc2e3KaNEENogEY9E2wNJOQtpQzHFaTsWFAKP05Y3vM781gMVkkXhvRrF1A2gHzKfEVBa6polJwA1IMDT2zF2LioO8HgAl1bVsg+7Zln/E+B5YuekjHLUu+aX04tIEtBQEQ5SdmwrVm4KQjHC6bjoJJLGQIbQpx1NQwiodNPJEWN8oJUe9iOhX6jwRP3dkUIg5SjwdGW2spz1MvE/r5Mo/9xNWRgnioZkOshPOFYRzhLBPSYoUXykCRDB9K6YDEAAUTq3og7Bnj15njSPqvZp9eTuuFy7yuMooBLaRxVkozNUQzeojhqIoEf0jF7Rm/FkvBjvxse0dMHIe/bQHxifP17olqA=</latexit>
I (↵ = 0.01)
<latexit sha1_base64="AoSyz/4d860DiF5Wn/pRjBOqz3U=">AAACCnicbVC7SgNBFJ2Nrxhfq5Y2o0GICGFXfDVC0CZ2EcwDsku4O5lNhsw+mJkVwpLaxl+xsVDE1i+w82+cTVJo4oGBM+fcy733eDFnUlnWt5FbWFxaXsmvFtbWNza3zO2dhowSQWidRDwSLQ8k5SykdcUUp61YUAg8Tpve4Cbzmw9USBaF92oYUzeAXsh8RkBpqWPuV0sO8LgPR/gKV/ExnvywE4Dqe356O+qYRatsjYHniT0lRTRFrWN+Od2IJAENFeEgZdu2YuWmIBQjnI4KTiJpDGQAPdrWNISASjcdnzLCh1rpYj8S+oUKj9XfHSkEUg4DT1dmG8pZLxP/89qJ8i/dlIVxomhIJoP8hGMV4SwX3GWCEsWHmgARTO+KSR8EEKXTK+gQ7NmT50njpGyfl8/uTouV62kcebSHDlAJ2egCVVAV1VAdEfSIntErejOejBfj3fiYlOaMac8u+gPj8wcybpi7</latexit>
H(↵) = H + ↵I
17. Utilization of Geometric Ensemble
<latexit sha1_base64="gRUshho/+GpXwFSAGbUgD/oKvMw=">AAAEIXicbVNLbxMxEN4mPEp4tXDkMqKLlJY0SipBOVCpAglVahUF+pS6beR1ZrNWvfZie0uT1f4VLvwVLhxAqDfEn2GyaUtbmIM1+jyPb8afw1QK61qtX1OV6o2bt25P36ndvXf/wcOZ2Uc7VmeG4zbXUpu9kFmUQuG2E07iXmqQJaHE3fDo7fh+9xiNFVptuWGKBwkbKBEJzhxBvdnKcuDwxIVR3jU61VaM4QLqa2itYAqsUINMMiPcECJtQFOxxZQZlqBDI0bYh07HzjdhAx1kFpi1WYLAgFgsOsOIVh8u4sEPXIyOHS5AIBQECXNxGOYfisO8W/jAeCzwGC2M0Ggos6k9SG0t+Bv1zQacp8/DCrT8kpH9mDFDTdAYbZqwFaNqgL/mQ8yIjgOJzDrwu4uddb8svIhigOqYyYw6jSvQAGAzHoOLKd7vrMNr6PpNeJcZamcSbbABIgL/xAdBNRUIlWYOdPSXo2VJKhH8UTnYpt+gYqjGB7WQUn8aB8Va9ictx/iEBnKnDY0X5JD1BARFLxcrnfXn7clOakGIA6FyJsVALRQ1IBv0RoeB02mZsAKBYqFkJdLLAxohHxYFBChlfXRpY0Gm+mhCwzjmV1PKiCKqnzQugqkyEZm8T5S3qB7tuxag6p8z6c3MtZqt0uBfp33mzHln1u3NnAZ9zUkcynFJMtlvt1J3kDPjBJdY1ILMYsr4ERvgPrmKFGMP8lLhBTwjpF9uLtLKQYlezshZYu0wCSlyzNlevxuD/7vbz1z06iAv3xMVnzSKMgmO9EffBfrC0AvJITmM0y8QHHhMcuakZlujJbSvj/yvs7PUbL9svni/NLf65mwd094T76lX99resrfqrXldb9vjlc+Vr5XvlR/VL9Vv1Z/V00loZeos57F3xaq//wD+QVa0</latexit>
Proposition (Hessian singularity for over-parameterized NNs). Let us as-
sume a pre-trained parameter ✓⇤
2 RP
achieves zero training loss L(S, ✓⇤
) = 0
for squared error. Then, H has at least P NK zero-eigenvalues for NNs such
that NK < P. Furthermore, if x is an input of training sample z 2 S, then the
following holds for the eigenvectors {ui}P
i=NK+1
g>
z ui = r>
ŷ `(z, ✓⇤
) r>
✓ f(x, ✓⇤
)ui
| {z }
0
= 0
<latexit sha1_base64="PpgwH7Ake9PCmrRDLUlAeqSlFn8=">AAACEHicbVDJSgNBEO1xjXGLevTSGMR4CTPiBiJEvSh4iGAWyIRQ0+kkTXoWumuEMOQTvPgrXjwo4tWjN//GznLQxAdNP96roqqeF0mh0ba/rZnZufmFxdRSenlldW09s7FZ1mGsGC+xUIaq6oHmUgS8hAIlr0aKg+9JXvG6VwO/8sCVFmFwj72I131oB6IlGKCRGpk91wfsMJDJTb8x5IjJ7UWfumc5F2TUgXM7bzv7jUzW/EPQaeKMSZaMUWxkvtxmyGKfB8gkaF1z7AjrCSgUTPJ+2o01j4B1oc1rhgbgc11Phgf16a5RmrQVKvMCpEP1d0cCvtY93zOVg5X1pDcQ//NqMbZO64kIohh5wEaDWrGkGNJBOrQpFGcoe4YAU8LsSlkHFDA0GaZNCM7kydOkfJB3jvNHd4fZwuU4jhTZJjskRxxyQgrkmhRJiTDySJ7JK3mznqwX6936GJXOWOOeLfIH1ucPChab6g==</latexit>
ILA (↵ = 0.01)
<latexit sha1_base64="JY+U0ZmSLnSoGiMXkCJ3WQ+ilXc=">AAACD3icbZDJSgNBEIZ7XGPcoh69NAYlXsKMuIEIUS8KHiKYBTIh1HQ6SZOehe4aIQx5Ay++ihcPinj16s23sbMcNPGHho+/quiq34uk0Gjb39bM7Nz8wmJqKb28srq2ntnYLOswVoyXWChDVfVAcykCXkKBklcjxcH3JK943atBvfLAlRZhcI+9iNd9aAeiJRigsRqZPdcH7DCQyU2/MWTE5PaiT92znAsy6sC5Y9v7jUzWzttD0WlwxpAlYxUbmS+3GbLY5wEyCVrXHDvCegIKBZO8n3ZjzSNgXWjzmsEAfK7ryfCePt01TpO2QmVegHTo/p5IwNe653umc7CxnqwNzP9qtRhbp/VEBFGMPGCjj1qxpBjSQTi0KRRnKHsGgClhdqWsAwoYmgjTJgRn8uRpKB/kneP80d1htnA5jiNFtskOyRGHnJACuSZFUiKMPJJn8krerCfrxXq3PkatM9Z4Zov8kfX5A5GDm7I=</latexit>
ILA (↵ = 100)
<latexit sha1_base64="s0zEbOYkkBwPLESPadaN5uCYuU0=">AAACCXicbZC7SgNBFIZn4y3G26qlzWAQYhN2xaggQtRGwSKKuUASwuxkkgyZvTBzVgjLtja+io2FIra+gZ1v42SzhSb+MPDxn3OYc34nEFyBZX0bmbn5hcWl7HJuZXVtfcPc3KopP5SUVakvfNlwiGKCe6wKHARrBJIR1xGs7gwvx/X6A5OK+949jALWdknf4z1OCWirY+KWS2BAiYiu407CANHNeYxbp4W7s5K13zHzVtFKhGfBTiGPUlU65ler69PQZR5QQZRq2lYA7YhI4FSwONcKFQsIHZI+a2r0iMtUO0ouifGedrq450v9PMCJ+3siIq5SI9fRneNd1XRtbP5Xa4bQO2lH3AtCYB6dfNQLBQYfj2PBXS4ZBTHSQKjkeldMB0QSCjq8nA7Bnj55FmoHRfuoWLo9zJcv0jiyaAftogKy0TEqoytUQVVE0SN6Rq/ozXgyXox342PSmjHSmW30R8bnDwB/mTo=</latexit>
ILA (R = 50)
<latexit sha1_base64="IF4DsSdOVW+Hs34gt59F9n075OY=">AAAB/XicbVDLSgMxFL3js9bX+Ni5CRahbsqMWBVEKLrRXRX7gM5QMmnahmYeJBmhDsVfceNCEbf+hzv/xkw7C209EDiccy/35HgRZ1JZ1rcxN7+wuLScW8mvrq1vbJpb23UZxoLQGgl5KJoelpSzgNYUU5w2I0Gx73Ha8AZXqd94oEKyMLhXw4i6Pu4FrMsIVlpqm7uOj1WfYJ7cjJBzXry7KFuHbbNglawx0CyxM1KADNW2+eV0QhL7NFCEYylbthUpN8FCMcLpKO/EkkaYDHCPtjQNsE+lm4zTj9CBVjqoGwr9AoXG6u+NBPtSDn1PT6ZZ5bSXiv95rVh1z9yEBVGsaEAmh7oxRypEaRWowwQlig81wUQwnRWRPhaYKF1YXpdgT395ltSPSvZJqXx7XKhcZnXkYA/2oQg2nEIFrqEKNSDwCM/wCm/Gk/FivBsfk9E5I9vZgT8wPn8Ad7eT8A==</latexit>
I (R = 50)