ロジスティック回帰分析の書き方

公益社団法人日本心理学会
心理・医学系研究者のためのデータ解析R境Rによる統計学の研究会

第12回研究集会ロジスティック回帰分析

「ロジスティック回帰分析」
の書き⽅
医療経済研究機構
清水沙友里
1

書き方セクションが存在する理由
最良の予測モデル構築のため
普遍的なモデル構築手法を知る
解析をしながらモデル構築はしない
結果的に論文が書きやすくなる→accept!

示された結果の妥当性を評価するため
 多くの論文では報告の質に問題がある
 何を報告すれば他の研究者がモデル全体を正
しく理解できるかを知る
2

臨床疫学に関する指針・声明・ガイドライン
広義の指針：研究倫理や科学性
・ヘルシンキ宣言など
全般的な指針：論文執筆の原則
・Uniform Requirements,学会規定など

研究法別の標準報告様式
・STROBE：観察的疫学研究
・CONSORT：ランダム化比較試験
・PRISMA,MOOSE,STARD …etc
研究手法別のガイドライン
・How to Report Statistics in Medicine

…etc

投稿規定：学会誌毎
原文は：Enhancing

the QUAlity and Transparency Of health Research(EQUATOR)
http://www.equator-network.org/ など

3

この発表の主たる目的
ロジスティック回帰分析を用いた研究の論
文を書く際のTIPsをお伝えすること
ロジスティック回帰の分析ガイドライン
×
各種執筆ガイドライン
×
実際の執筆事例の紹介
を強迫的タイムスケジュールで・・・？？

((+_+))

4

この発表のフォーカス

2011/6/25第１回奥村さん資料より・・・

→ロジスティック回帰を使った「予測」

5

ロジスティック回帰分析とは
[論文上の表記例]
•(Multivariable/Multivariate)
Logistic regression analysis/model
•Binomial/Binary logit analysis/model

LOSIGTIC＝ロジスティックモデルを用いた
←
『対数(ロジット)変換』
7

線形回帰分析の基本

y=ax+b

ロジスティック回帰分析で求めたいのは
YES or NOの確率値 (0％~100％) しかし、
上の式では確率値がこの範囲に収束しない
Yの方をどうにかしてこの範囲に収めたい
→Yを対数で飾ろう
8

神童だったころ聞いたけど忘れた
Wikipedia によると
対数（たいすう、英語: logarithm）とは、任意の数 x を a を底とする指数関数により x = ap と表したときの冪指数 p の事で
ある。 p = loga(x) と書いて pはa を底とする x の対数という。対数関数 loga(x) は、指数関数 ax の逆関数として定義される。

23=8 ：2の3乗は8です
←
2=log38 ：３は２を８にする指数です

Y=ax であるとき、x=logaY
9

2を７にする指数(log27)は2~3の間だけれど数字に
は表せない・・・むむむむ
※常用対数表はあるけどね

2log27=7

でいいんじゃない？
log x=x
a a

どんな数であっても任意の底の指数表示
に変えることができる
10


loge{y/(1-y)}=ax+b
説明変数をS字のロジスティック曲線に回
帰させる＝線形関数としてモデル化
（目的変数が1になる確率を推定）
11

一般化線形モデル
 階層性のあるデータはNG

※混合ロジスティックモデル(MLM)は対象外

ロジスティックモデルは追跡研究を志向
・症例対象研究→ロバストな２条件※を充たす
・横断研究→実は明確な証明は無い
（データを追跡研究から得たものとして扱う）
個人のリスクやRRは計算できない（ORはOK）
つまり、αが必要な個人のリスクは推定できず、βが必要なOR
は計算できる
Prentice RL, Pyke R. Logistic disease incidence models and case-control studies. Biometrika 1979;66:403–11. doi:10.1093/biomet/66.3.403
Breslow NE, Day NE. Statistical methods in cancer research. Vol. 1. The analysis of case-control studies. 1980;1:338 pp.

12

研究計画を練り、データを集め、Rで計算し、

ロジスティック回帰モデルを用い
た論文を書き始めた

13

適正報告とは：STROBE声明チェックリスト
タイトル・抄録

デザインを明示、十分な情報でバラ
ンスのよい要約

方法：統計手法

全ての統計学的方法,欠損,脱落,マッチ
ング,サンプリング,感度分析

イントロダクション

科学的な背景と論拠

結果：参加者

各段階における人数,非参加の理由,フ
ローチャート

目的

仮説を含む目的

結果：記述的データ

人口統計学的、臨床的、社会学的特徴
と暴露や潜在的交絡因子の情報、欠損
数、追跡期間(平均と合計)

方法：デザイン

研究デザインの重要な要素を明示

結果：アウトカム
データ

発生数や集約尺度数値(経時的),各暴露
カテゴリーの数、暴露の集約尺度

方法：セッティング

セッティング,場所,基準となる日時
(登録、暴露、追跡、データ収集期
間)

結果：主な結果

調整前後の推定値と精度(信頼区間な
ど),カテゴリ化された連続変数はカテ
ゴリ境界、relevantな場合は相対リス
クを絶対リスクに換算

方法：参加者

研究方法により適格基準、母集団、
選定・追跡方法、マッチング基準等

結果：他の解析

感度分析などその他の分析全て

方法：変数

全てを明確に(アウトカム、暴露、予
測因子、潜在的交絡因子、潜在的な
効果修飾因子、診断方法など）

考察：keyとなる結
果

研究目的に対する結果

方法：データソース
/測定方法

データソース、測定・評価方法、2群
以上の場合比較可能性

考察：限界

潜在的なバイアスや精度の問題、バイ
アスの方向性を大きさ

方法：バイアス

バイアスへの措置

考察：解釈

目的、限界、多重性、他の研究結果を
考慮

方法：サイズ

どのように算出したか

考察：一般化可能性

一般化可能性を議論

方法：量的変数

分析方法、グルーピングの詳細

その他

研究の財源、元となる研究

STROBE声明：観察研究の報告において記載するべきチェックリスト上岡広晴、津谷喜一郎訳より大幅に抜粋して転載。利用時は原典を確認のこと 14

ロジスティック回帰の適正報告研究
要約すると…ほとんどの論文は必要な情報を報告していない
•
•
•
•

•
•

Bagley et al. Logistic regression in the medical literature:
standards for use and reporting, with particular attention to
one medical domain (J ClinEpidemiol54: 979-985, 2001)
Peng et al. An Introduction to Logistic Regression Analysis and
Reporting (J Edu Res 96:3-12,2002)
Moss et al. An appraisal of multivariable logistic models in the
pulmonary and critical care literature (Chest 123: 923-928,
2003)
Ottenbacher et al. A review of two journals found that articles
using multivariable logistic regression frequently did not report
commonly recommended assumptions (J ClinEpidemiol57: 11471152, 2004)
Mikolajczyk et al. Evaluation of logistic regression reporting in
current obstetrics and gynecology literature (ObstetGynecol111:
413-419, 2008)
Kalil et al. Recommendations for the Assessment and Reporting
of Multivariable Logistic Regression in Transplantation Literature.
(Am J Transplant 10:1686–94,2010)
15

ロジスティック回帰の適正報告研究
適切

クライテリア

不適切

1.線形性

線形性の仮定の記載

（加えて、臨床的知識やデータプロット）を報告する

報告なし

2.モデルの報告

係数と標準誤差を報告する

報告なし

3.交互作用

統計学的もしくは臨床的な情報の理由を報告し交互作用項を投入

報告なく投入

4.95%CLと有意性

ファイナルモデルに含まれた各変数のオッズ比と95%CL(もしくはp値)を報告する報告なし

5.過剰適合

イベント比10以上を保つ

10以下

6.外れ値

外れ値がある場合、どのように扱ったかを報告する

報告なし

7.適合度

HL値等の適合度の検定を報告する。予測モデルの場合、AUCを報告する

報告なし

8.多重共線性

独立変数間の相関が高い場合、論文中で議論し、検定の結果を報告する

議論あるが検定結果なし

9.内的妥当性

予測モデルの場合、データスプリット(クロスバリデーション含む),ブートスト
ラップ、ジャックナイフのうち少なくとも１つを実施する

行わない

10.外的妥当性

予測モデルの場合、異なった集団でも追試する

行わない

11.エビデンスレベル
レベル１少なくとも１つの前向きな外的妥当性の確認とインパクト分析.臨床行動と患者アウトカムの改善を実証
レベル２

少なくとも１つの広範囲の患者に対する前向きな外的妥当性の確認か、複数の異なった小集団でのバリデー
ション

レベル３少なくとも１つの狭い範囲の(導出サンプルに類似した)患者に対する前向きな外的妥当性の確認
レベル４内的妥当性のみの確認
レベル５妥当性の確認が無い
Kail et al: Am J Transprant 10: 1695-1703, 2010

16

書き方の説明のながれ

17

必須記載項目①

変数の選択基準

• 変数選択の合理的な意義を記載すること
• アウトカム(詳細な診断基準),曝露,予測因子,潜在的な交絡因子,潜在
的な効果指標の修飾因子等、考慮した全ての変数の定義付け
• 初期の探索段階で多くの変数を考慮した場合は、付録や追加の表、その
他の発行物で提示することを考慮する
• 最終的にモデルに投入された変数だけでなく、全ての候補となった変
数について言及することが推奨される
>定義
-予測因子(predictor)
-決定因子(determinant)

>方法：どの変数を含むべきか？
先行研究や理論：臨床的・生物学的に意義のある変数
研究の目的：まず最も変数の多い意味のあるモデルを作成
18

必須記載項目①

変数の選択基準

＞事例
-Diagnosis of Lung Cancer
The diagnosis of lung cancer was made by histopathological examination of
resection specimens or cytopathological examination of needle-aspiration biopsy
samples. A microcoil localization technique was used to mark the nodule under CT
guidance before surgical resection. Resected tumors were classified with the use of
the World Health Organization classification of lung neoplasms.

-Statistical Analysis
Multivariable logistic-regression models were prepared to estimate the risk of lung
cancer associated with potential predictors, including sociodemographic variables
and clinical variables such as smoking exposure and nodule characteristics.
Inclusion of variables in the models was based on existing knowledge of risk factors
for lung cancer and on nodule characteristics that are readily discernible on lowdose CT images.

-Results

Predictive Model

The variables listed in Table 1 and in Table S1 in the Supplementary Appendix were
evaluated for inclusion in the models.
Annette McWilliams et al. N Engl J Med 2013;369:910-9.

19


20

必須記載項目②

過剰適合

過剰適合の対応を記載すること

>方法
従属変数のイベント/独立変数の数≥10にする
Peduzzi P. J Clin Epidemiol. 49(12):1373-9. 1996

＞事例
-Methods
We followed standard methods to estimate sample size for
multiple logistic regression, with at least ten outcomes needed
for each included independent variable. With an expected PV
catheterization success rate of 50%, we required 120 attempts
(60 successes) to appropriately perform multiple logistic
regression with six variables.
Schnadower D et al. Acad Emerg Med 14(5) 483–485. 2007

21


22

必須記載項目③

線形性

線形性の対応を記載すること
＞定義
連続変数とロジット変換後の従属変数が線形性を満たしている

>方法
散布図平滑化
パラメトリックな手法。重み付き平均で平滑化
変数のカテゴリ化
連続変数を四分位等でカテゴリ化し予測モデルに投入。推定した偏回
帰係数が線形に上昇しているかをプロットし比較
一般化加法モデル (generalized additive noidel:GAM)
散布図平滑化の手法をカテゴリカルデータにも応用
LOESS (locally weighted least squares) smoothing curves
局所重み付き最小二乗推定→推定精度は良いがバイアスにやや難(バンド幅）

スプライン関数
ペナルティ付き残差平方和を最小とする推定量を求める
23

必須記載項目③

線形性

>事例
First, a linear model was determined. This was followed by assessment of the fit of the model
and its performance characteristics. Finally, to assess whether the dependent variable was linear
in the logit, three methods, as proposed by Hosmer and Lemeshow, were used: lowess (locally
weighted least squares) smoothing curves, design variables and fractional polynomials.

-Results
Linearity of the dependent variable (SCA or SAD) for both erect and supine radiographs in the logit of
the independent variable was assessed using three methods. This was done to confirm that binomial
logistic regression was the appropriate method of analysis.
• Lowess smoothing curves showed that linearity varied over the interval of the dependent variable.
…(略) This is illustrated in Fig. 1.
• Logistic regression coefficients were plotted against the approximate quartile midpoints of dependent
variables as shown in Fig. 2. …(略) By contrast, the results suggest linearity in the logit for both SCA
and SAD in the first three quartiles for erect radiographs.
• Fractional polynomial model comparisons showed that the best non-linear transformations were not
significantly different from the linear model. Therefore, the fractional polynomial analysis supported
treating both variables as linear in the logit in general, with one exception: a significant p-value of 0.04
for the variable SCA suggested that the fit of the model might be improved if the variable was
transformed by its inverse square. This in turn suggested that the use of the transformed variable in the
logistic regression analysis might result in a superior model.

Cardiovasc J Afr. 2010 Sep-Oct;21(5):274-9

24

必須記載項目④

変数投入法

変数投入法を記載すること
＞定義
変数をいつモデルに投入するか

＞方法
強制投入法(simultaneous/collective/direct)
–指定した全ての独立変数を同時にモデルに投入
–変数の重要性の順序などに仮説がない場合
逐次投入法：(sequential/hierarchical)
–投入の順序を因果関係などで規定する
変数選択法：変数増加法・変数減少法・ステップワイズ・総当たり法
–統計的観点から最良のモデルになるよう投入する変数を選択する
–説明モデルの場合はNG、上記の方法と並行して検討する価値はある
–先行研究が無く最初に検討している変数が多すぎる場合も利用可能
27

必須記載項目④

変数投入法

＞事例
-Methods
We analysed differences in outcomes after 12 and 18 months of follow up with
logistic and multiple linear regression (hierarchical backward elimination
method), adjusting for possible differences in baseline scores and background
characteristics (sex, age, educational level, income, composition of household,
and course of gait problems experienced).

Jolanda C M van Haastregt et al. BMJ 2000;321:994

28

番外編

交互作用と交絡因子

研究の目的が予測モデルの構築である時
交互作用の有無に関わらず、いくつかの要因を調
整した上で曝露の影響を推定するのが目的

交絡因子や効果修飾因子がモデルの中で果たす役割や
構造に対しては注意する必要がない。後方変数減少法
(backward elimination procedure)等のコンピュータ
プログラムによる変数選択法を用いるべき
kleinbaum 2007 (5th 2013)

30

番外編

交互作用と交絡因子

① 変数の決定：完全層化モデル(HWF)
② 交互作用：チャンク検定・階層的後方変数減少法
③ 交絡因子の評価：変数を除去した際の推定オッズ
比の変化
交絡因子は妥当性の問題
交互作用因子

→
→

統計学的解析を使わない
統計学的解析を使う

交絡因子：偶然誤差＜系統誤差
統計学的解析：偶然誤差＞系統誤差

結局は複数の係数・オッズ比・信頼区間などの主観的な
比較になるため、色々な派閥がある
31

必須記載項目⑤

交互作用の評価

交互作用を記載すること
>定義
説明変数Xの効果が他の変数Zによって変わること
＞方法
説明変数間の積をモデルに投入する(量的変数の場合はセンタリング
をする場合も)
𝑙𝑜𝑔𝑖𝑡 𝑞𝑖 = 𝛽1 +𝛽2 𝑥 𝑖 +𝛽3 𝑓𝑖 +𝜷 𝟒 𝒙 𝒊 𝒇 𝒊

＞対応
完全階層化モデルHierarchically well-formulated (HWF) model
積項はHierarchical backward elimination approach:HBWEで検定

32



＞事例
-Methods

Coenen S et al. Br J Gen Pract. 56(524):183-90 2006.

we estimated a logistic model which contained all covariates as possible confounders
and all interaction terms between perceived patient demand and the covariates as
possible effect modifiers. …（略）We also added the interaction terms of patient sex
and age, and GP’s sex and year of birth…(略) If some of the covariates or interaction
terms dropped out of the starting model due to co-linearity, confounding and
interaction were evaluated in a stratified analysis of perceived patient demand versus
GPs’ antibiotic prescribing, controlling for each covariate separately.

-Results
This resulted in a model containing seven interaction terms (patient age, smoking, …(略).
After eliminating interaction terms with a P-value greater than 0.01, only one interaction
term was retained in the model.



We evaluated interactions between important predictors in final models by including
interaction terms along with main-effect terms. None of the interactions we tested
were significant, and they are not discussed further in this article.
33



＞事例

34

必須記載項目⑥

交絡因子の評価

交絡因子を記載すること
>定義
ある説明変数Xが目的変数Yに与える効果が、他の説明
変数Zの値によって異なること
＞方法：統計学的な検定方法は無い
交互作用ある：再び階層原則に従って係数の評価。精度
は信頼区間から評価残すほうが妥当
交互作用ない：全ての変数を含むgold standardとサブ
セットのオッズ比と信頼区間を報告
36



＞事例
-Methods


the confounding effect of all covariates not in significant interaction terms in the full
model was assessed, followed by precision considerations. We looked for a subset of
covariates for which the model gave roughly the same parameter estimates for
perceived patient demand and the significant interaction terms, but with narrower
confidence intervals.

-Figure 2
The relationship between the effect of
perceived patient demand on antibiotic
Prescribing for acute cough and the
significant confounders of this relation.
Adjusted odds ratios and 95% confidence
limits.
37



＞事例
-Methods
Potential confounders in multivariable models included number of non-PCP visits,
sex,….(略) Based on a study by Lin and colleagues (29), we also examined the impact
of an unmeasured confounder on our estimated odds ratios (ORs) in post hoc
sensitivity analyses.

-Discussion
Therefore, residual confounding by unmeasured factors may influence results. In post
hoc sensitivity analyses, our results become nonsignificant or potentially reversed
only when the proportions of an unmeasured confounder differed greatly between
individuals who visit PCPs often and those who do not. An unmeasured confounder
seems to have a greater effect on the association of PCP with CRC incidence than
with mortality (Appendix Tables 3 and 4, available at www.annals.org).

-Limitation
This study used administrative data, which made it difficult to identify potential
confounders and prevented examination of the content of primary care visits.
J M. Ferrante et al. Ann Intern Med. 2013;159(7):437-446. 38

＞事例


点推定と95%信頼区間を報告

J M. Ferrante et al. Ann Intern Med. 2013;159(7):437-446.

39

必須記載項目⑦

多重共線性

多重共線性について記載すること
＞定義
１つ以上のxに非常に強い線形関係があり、単独の影響を分離したり、
効果を評価することが不可能ではないが、困難な状態 (Goldberger 1968)
- 単純な相関関係ではわからない
- 信頼出来ない回帰係数を返す
- モデルが実行されない
一般的:VIF10以上

＞方法
・分散拡大係数(variance Inflartion Factors:VIF)
・分散分解係数(variance decomposition proportion:VDP)
条件指数(condition index:CNI)
マニアック：CNI(共線性の有無)が30以上
かつ VDP(それを引き起こす変数)が２つ
以上0.5より大きい ※SASのマクロにあります

41

必須記載項目⑦

多重共線性

>事例
-Statistical analysis
Variance Inflation Factor (VIF) was used to check for multicollinearity. Predictive and
complexity characteristics of the model were considered during modelling.

-Results Model fit and Predictive power of the models
Variance inflation factor (VIF) was employed to check for multicollinearity. None of
the VIF values were up to 10 and the mean VIF of the model was less than 6. It
means there was no collinearity in the model (see additional file 1).

Kayode et al. BMC Pregnancy and Childbirth 2012, 12:10

42

必須記載項目⑧

影響力のある観測値

観測値を除外した場合は記載すること
>定義
結果に大きな影響を与える被験者

＞方法
Δβ(任意の被験者をデータから除外した時の１個以上の推定回帰係数の修正の程度)
パッケージ：Cookの距離の類似尺度 , difference of fits (DFFITS)
理想的：exp(β)の加重平均変化量
∗

𝐶𝐷 𝑖 =

𝑗

𝑤 𝑗 Δ 𝑒𝑥𝑝 𝛽

𝑗, 𝑖

/

𝑗

𝑊𝑗

・データの誤り/外れ値/不適切なモデル
・データは論理的に重要かもしれない
・ファイナルモデルまでは影響力の有無を判定できない

＞対策
①除外：値の誤りかつ修正が不可能な場合など
②対象範囲の修正
③値は含めるが、これらの値を報告し考察で説明を行う
44

必須記載項目⑨

適合度の評価

適合度(goodness of fit:GOF)の評価を記載すること
＞定義
モデルがデータセットのアウトカムをどの程度予測できるか(モデルがデータに
フィットするか)の評価

＞方法
①Receiver Operating Characteristic:ROC curve
②共変量のパターン:ピアソンのχ2 検定, 逸脱度, 正規近似検定(Osius & Rojek’s)
③確率推定:Hosmer-LameshowのĈ・Ĥ検定
④２モデルの比較に基づくスコアテスト:Stukel’s test, Tsiatis’s test
⑤残差分析
Hosmer-Lemeshow(HL)統計量：有意確率大＝適合度が良い
-共変量パターン数が３以上。6未満で有意差を持つことは殆ど無い
-サンプルサイズが大だと適合しないと判断されやすい
-変数に連続変数が含まれる場合、パターン数が観測値数に近づき最も機能する
le Cessie-van Howelingen検定
46

必須記載項目⑨

適合度の評価

>事例
We evaluated the predictive performance of the model by assessing its
discrimination (ability to classify correctly) and its calibration (whether
probabilities predicted by the model match observed probabilities).
Discrimination was measured with the use of the area under the
receiveroperating-characteristic curve (AUC). All AUCs reported are presented
with bootstrap biascorrected 95% confidence intervals, with bootstrapping
techniques based on 1000 bootstrapped samples. We evaluated calibration by
subtracting the model-estimated probability from the observed probability for
each study participant, placing these absolute errors in rank order, and evaluating
the magnitude of the median and 90th percentile of the absolute errors. In
addition, the mean absolute errors for each decile of model-predicted risk were
evaluated.


47

番外編

妥当性

妥当性とは？
“A test is valid if it measures what it
purports to measure” (Kelley, 1927)

信頼性との違い：ダーツ投げのアナロジー

http://www.socialresearchmethods.net/kb/relandval.php

49

必須記載項目⑩ 内的妥当性(交差妥当性)
内的妥当性について記載すること
＞定義
予測モデルは別のサンプルでも当てはまりが良いか
誤差が少なければ当てはまりが良い
-信頼性の低いモデルでは、高い妥当性は望めない
-均一な集団では妥当性係数が低くなる

＞方法
validation data とのAUCの比較
-bootstrap
-K-fold cross validation
-double cross validation
-モデル推定用/交差妥当化用/検証用に３分割など

50

必須記載項目⑩ 内的妥当性(交差妥当性)
＞事例
-Methods
To examine the degree of overfitting of the prediction model to the development sample, we
performed a cross-validation procedure. First, the sample was split at random into 10 equal groups.
Second, a logistic regression model predicting diagnosis of PE was developed on nine tenths of the
sample, and the resulting prediction equation was applied to the remaining tenth; this procedure was
repeated 10 times, each time rotating the cross-validation subset. Finally, the ability of the crossvalidated scores to predict PE was examined by comparing the area under the receiver operating
characteristic curve (AUC) with that obtained from the naive prediction scores, without cross
validation. This cross-validation procedure was performed for the full multivariable prediction model,
where each covariate was assigned a separate regression coefficient, and for the simple model, where
the clinical probability score was the sole predictor. Confidence intervals on AUCs were obtained by
the bootstrap method: 250 subsamples with sample sizes of 986 were taken with replacement from the
original sample, the AUC was computed in each, and the 95% confidence interval was derived from
percentiles 2.5 and 97.5 of the distribution of AUCs.

-Results
When the 8 variables in the prediction model were allowed to vary independently, the AUC was 0.79
(range, 0.76-0.81) for the naive equation and 0.77 (range, 0.74-0.80) after cross validation. When the 8
variables were added to form the diagnostic score, the AUC was 0.79 (range, 0.76-0.81) for naive
prediction and 0.78 (range, 0.75-0.80) after cross validation. Hence, this analysis allows us to rule out
substantial overfitting of the clinical score.

Wicki J et al. Arch Intern Med. 2001;161(1):92-97. 51

必須記載項目⑪ 外的妥当性
外的妥当性について記載すること

＞定義
予測モデルは別のデータソースのサンプルでも
当てはまりが良いか→一般化可能性
＞方法
Validation data とのAUC比較など
53

＞事例
Prediction models developed in the PanCan cohort (excluding spiculation as a
predictor) were validated externally by means of an assessment of discrimination and
calibration in BCCA data.

-Reuslts /Characteristics
In the BCCA validation study, 1090 participants had 5021 nodules, and 40 of the 1090
persons with nodules (3.7%) were ….(略). The characteristics of the participants are
described in Table S1 in the Supplementary Appendix. The PanCan and BCCA study
populations were similar with respect to age, sex, body-mass index, percentage of
patients with emphysema, and percent of predicted forced expiratory volume in 1
second (FEV1).

54

＞事例

validation cohortは詳細に記載


55

＞事例
-Results /Predictive Model
Both parsimonious and full models showed excellent discrimination in the
PanCan and BCCA (validation) data with all AUCs more than 0.90 (Fig. S1
and Table S2 in the Supplementary Appendix). In the PanCan and BCCA data
sets, model-predicted probabilities of lung cancer showed good separation
between participants in whom lung cancer was diagnosed and those in whom
it was not diagnosed, with only modest overlap (Fig. 2). …(略)For those
nodules, the AUCs in model 1a were 0.894 and 0.907 in the PanCan and
BCCA data, respectively (Fig. S1 in the Supplementary Appendix). ….(略)
In the BCCA validation data, the full model performed significantly better
than the parsimonious model: the AUC was 0.960 (95% CI, 0.927 to 0.980) in
model 1a as compared with 0.970 (95% CI, 0.947 to 0.986) in model 2a (P =
0.009 for the difference in AUC) ….(略)

-Discussion
Our models show excellent predictive accuracy, with AUCs of at least 0.94 in
an external validation cohort.

56

必須記載項目⑫コード化・オッズ比・信頼区間
独立変数のコード化について記載すること
オッズ比とオッズ比の信頼区間を記載すること
＞事例

※量的変数の場合は増分を記載

J M. Ferrante et al. Ann Intern Med. 2013;159(7):437-44658
.

必須記載項目⑫

ソフトウエア

・分析に使用したソフトウエアを記載すること
・統計ソフトのマニュアルを引用すると丁寧
ロジスティック回帰
ではありませんが..

＞事例
-METHODS Statistical Analysis
Statistical analyses were performed using R 2.14
(19), with the cmprsk package (20) for Fine and Gray modeling.

-REFFERENCES
19. R Development Core Team. R: A language and environment for statistical
computing. Vienna, Austria: R Foundation for Statistical Computing; 2011.
Accessed at www.R-project.org on 18 March 2013.
20. Gray RJ. cmprsk: Subdistribution Analysis of Competing Risks. R package
version 2.2-2. 2011.
Timothy J. Daskivich et al. Ann Intern Med. 2013;158:709-717.

59

Applied Logistic Regression
Third Edition
Logistic Regression: A Self-Learning Text
(Statistics for Biology and Health)
Applied Regression Analysis and Other
Multivariable Methods
60

予測にロジスティック回帰はもは
や使っていないんじゃないか疑惑

・・・いや、医学系論文ならいいんですよ

61

ANNとロジスティック回帰分析

62

ANN：Artificial Neural Network Models

人工ニューラルネットワーク(人工神経回路網)
人工知能分野のかなり初期（1940年代）から存在し、ブームのあとは死んだようになっていた手法。

ニューロン

http://en.wikipedia.org/wiki/Brodmann_area

入力

ユニット
出力

Felleman DJ, Essen DCV. Distributed Hierarchical Processing in the
Primate Cerebral Cortex. Cereb Cortex 1991;1:1–47.
doi:10.1093/cercor/1.1.1

大脳皮質のコラム構造

カラー臨床神経解剖学―機能的アプローチ M.J.T. フィッ
ツジェラルド (著), ジーンフォラン=カーラン (著),西村書
店刊（2009 林隆博改変）

結合荷重

ニューロンの出力 →「ある」or 「ない」 63

ANNとロジスティック回帰モデルの違い
パラメーター

ロジスティック回帰

ANN

モデルビルディング

統計知識がかなり必要

統計知識がやや必要

複雑な関係の予測力

解析者次第

自動的

交互作用

明示的にモデリング

検知可能

一般化可能性と過剰適
合

過剰適合は小さな問題

過剰適合しやすい

識別能

アプリケーションと
データ次第

アプリケーションと
データ次第

計算時間

あんまりいらない

結構いる

信頼区間の算出

簡単にできる

結構難しい

モデルのシェア

簡単にできる

簡単にできる

解釈

重要な予測子は簡単に
説明できる

ブラックボックス

Turgay Ayer at al. RadioGraphics 30:13–22 2010. 64

ANN or ロジスティックの予測精度の比較
AUCはANNの方がいけてました！

Turgay Ayer at al. RadioGraphics 30:13–22 2010. 65

ANNの一歩先

Deep Learning

2006年にブレークスルーが起き、神経科学的知見
と計算機科学を結合し、大脳皮質のアーキテク
チャの模倣を行うDeep Learningが人工知能・機
械学習分野の最先端として現在一大ブーム中。
現在の研究のほぼ最先端で何が出来るのか？
人工ニューラルネットワークにYOUTUBEのタグのついていない画像を1000
万枚取得し、教師なし学習（self-taught learning）で猫の存在を学習し、猫
を見分ける

66

おまけ：Deep Learning
Olshausen BA, Field DJ
Emergence of simple-cell receptive field properties by learning a sparse code for natural
images Nature 381 (6583): 607-609 JUN 13 1996
Hinton, G. E., Osindero, S. and Teh, Y. (2006)
A fast learning algorithm for deep belief nets. Neural Computation, 18, pp 1527-1554.
"Deep Auto-Encoder" ？
Hinton, G. E. and Salakhutdinov, R. R. (2006)
Reducing the dimensionality of data with neural networks. Science, Vol. 313. no. 5786, pp.
504 - 507, 28 July 2006.
Y. Bengio. Learning deep architectures for AI.
Foundations & Trends in Mach. Learn., 2(1):1--127, 2009.
Le, Q. V., Ranzato, M. A., Monga, R., Devin, M., Chen, K., Corrado, G. S., ... & Ng, A. Y.
(2011). Building high-level features using large scale unsupervised learning. arXiv preprint
arXiv:1112.6209.

67

ロジスティック回帰分析の書き方

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (11)

Similar to ロジスティック回帰分析の書き方

Similar to ロジスティック回帰分析の書き方 (20)

ロジスティック回帰分析の書き方