This is my Master thesis in statistics from Cairo university. It is about using a continuous time Markov chains in the analysis of non-alcoholic fatty liver disease or the newly proposed name (metabolic associates fatty liver disease) .
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Β
Analysis of chronic diseases progression using stochastic models.pdf
1. Cairo University
Faculty of Graduate Studies
for Statistical Research
Mathematical Statistics
Department
Analysis of Chronic Diseases Progression
Using Stochastic Models
M.Sc. Thesis
By
Iman Mohamed Attia Abd-Elkhalik
Under Supervision
Prof. Esaam Ali Amin
Prof. of mathematical statistics
Faculty of Graduate Studies for
Statistical Research
Cairo University
Dr. Mahmoud Aboagwa
Lecturer of mathematical statistics
Faculty of Graduate Studies for
Statistical Research
Cairo University
A Thesis Submitted to the Department of Mathematical
Statisitcs in Partial Fulfillment of the Requirement for the
Degree of Master of Science in Statisitcs
2022
2.
3. Papers and Codes Published from the Thesis
Attia, I. M. (2022). Novel Approach of Multistate Markov Chains to Evaluate Progression in the
Expanded Model of Non-alcoholic Fatty Liver Disease. Frontiers in Applied Mathematics and
Statistics, 7. https://www.frontiersin.org/article/10.3389/fams.2021.766085
Attia, I.M.(2022). Log-Linear Model and Multistate Model to Assess the Rate of Fibrosis in
Patient with NAFLD. Frontiers in Applied Mathematics and Statistics, 8.
https://www.frontiersin.org/article/10.3389/fams.2022.899247
A MATLAB code illustrating the calculations, of chapter 4, is published in the code Ocean site
at the URL:
https://codeocean.com/capsule/8641183/tree/v1 with DOI = 10.24433/CO.6022979.v2
A MATLAB code illustrating the calculations, of chapter 5, is published in the code Ocean site
at the URL
https://codeocean.com/capsule/7628018/tree/v2 with DOI = 10.24433/CO.7719785.v3
The hypothetical study, in chapter 6, is coded by STATA-14 and is published in code ocean site
with the following URL:
https://codeocean.com/capsule/4752445/tree/v3 , DOI=10.24433/CO.8778229.v3
The MATLAB code to estimate the Q transition rate matrix, for the observed transition counts
used in the hypothetical example in chapter 6, using continuous time Markov chains is published
in the code Ocean site with following URL:
https://codeocean.com/capsule/6377472/tree/v2 , DOI=10.24433/CO.2144346.v2
The MATLAB code for solving the forward Kolmogorov differential equations using the
estimated Q rate matrix, obtained for the example in chapter 6, is published in the code Ocean
site with following URL:
https://codeocean.com/capsule/7258626/tree/v1 , DOI=10.24433/CO.2985270.v1
The dataset, in chapter 6, is present on IEEE Data Port site with the following URL:
https://ieee-dataport.org/documents/fibrosis-nfld#files , with the following DOI: 10.21227/dr5j-
gs46
4. i
Acknowledgment
I thank God Allah, The All-Mighty, The most Generous and The most Glorious, for help,
easiness, and facility in making this research accomplished.
I want to express my appreciation and thanks to Prof. Dr. Elsayed Ahmed Elsherpieny, Prof.
Dr. Esaam Ali Amin, Dr. Yassmen Yousef Abdelall, and Dr. Mahmoud AboAgwa for their
tremendous support, encouragement, and guidance throughout this work.
I want to thank all my professors in the Faculty of Graduate Studies for Statistical
Researches, Cairo University, for their effort in clarifying lots of facts and ideas since I have
been in this faculty in 2015. I want to thank all who had helped me by all different means and
who had clarified ideas and knowledge that aided me in better understanding some of the
statistical methodologies.
I owe great gratitude to my family, especially my mother, father, and brother, for their
patience, support, trustworthiness, fidelity, faithfulness in my doing, and incredible graciousness
that triggered me to this achievement. I cannot thank them enough for their continuous
cooperation, delicacy, kindness, and reinforcement. I cannot offer sufficient thanks to them.
I express tremendous and profound grace to my mother. The adversity she had experienced a
few months before delivering my proposal for this thesis was the inspiring and motivating driver
for me to choose and be engaged in this research. She most probably had non-alcoholic
steatohepatitis (NASH), a complication of fatal cachexia developed due to her multiple sclerosis.
The disease had a rapidly progressive course, and she did not have a chance to be thoroughly
diagnosed before her death. NASH is a subtype of non-alcoholic fatty liver disease (NAFLD), a
prevalent disease with a rapidly increasing epidemic worldwide. Many questions have not yet
been answered, and they need more exploration and research. I hope this work will help throw
light on this disease process from the statistical point of view to help physicians and health
policymakers have better and more efficient management plans for the challenges they face
while assessing this worldwide rapidly rising epidemic.
5. ii
Abstract
Epidemiology is the science that studies the occurrence of the disease. Numerous
mathematical methods can analyze such diseases. Multistate models are one of these
mathematical methods that rely on solving differential equations to get some of the statistical
indices that describe the process of the disease. Thus multistate Markov model is a valuable tool
to model event data obtained from longitudinal studies. In medical research, this technique can
model disease evolution in which each patient starts in one initial state and eventually ends in an
absorbing or final one. Continuous-Time Markov Chain (CTMC) is one of these multistate
models. CTMCs can estimate transition intensities and probabilities between states, state
probability distribution at a specific time point, mean sojourn time in each state, life expectancy
for the patient, and the expected number of patients in each state. As the prevalence of obesity
and type 2 diabetes has reached epidemic levels that parallel the rates of the widely distributed
non-alcoholic fatty liver disease (NAFLD), CTMC can model NAFLD to get better insight into
the behavior of such a worldwide prevalent disease. CTMC helps improve the detection and
treatment of NAFLD stages to avoid morbid complications.
This work provides a new approach using maximum likelihood estimation (MLE) to predict
the transition rates among states. Once the rates are estimated, the transition probability matrix
can be estimated. This approach compensates for the missing values when patients do not
commit to the follow-up schedule by predicting the rate in each interval and taking weight from
each rate corresponding to the proportion of transition counts in each interval in relation to the
total transition counts.
The "health, disease, and death" model is the simplest form of the CTMCs to study disease
evolution. CTMC can model the expanded form of the disease constituting the nine states. Each
disease process has its unique stages and specific transitions among the states. Also, a subset of
the "nine states model" that defines the early reversible stages of the disease, pointing to how the
fibrosis evolves, was utilized to understand the factors that determine its existence, as fibrosis is
the ominous predictor of bad outcomes and death. The results have yielded that the observed
rates approximately equal the estimated rates obtained by MLE, as was the case when analyzing
the simplest and the expanded models. Exponentiation of the estimated rate matrix yielded The
transition probability matrix. The researcher used Poisson regression to relate these rates with the
covariate risk factors of the disease like age, body mass index (BMI), homeostasis measurement
assessment-insulin resistance (HOMA2-IR) reflecting insulin resistance, low-density lipoprotein
cholesterol (LDL-Chol), systolic, and diastolic blood pressure. The study results were that
insulin resistance was the most detrimental risk factor for disease progression. The more resistant
to insulin the cells were, the higher the transition rate to advanced liver fibrosis was. The study
contains hypothetical data for each model to highlight the statistical concepts used to analyze
such a widely spread disease.
Keywords: Continuous-time Markov chains, Life expectancy, Maximum Likelihood estimation,
Mean Sojourn Time, Non-Alcoholic Fatty Liver Disease, Panel Data, Poisson regression.
6. iii
List of Figures
Figure (2. 1): he alth,disease and death model ............................................................................................4
Figure (2. 2): general model for disease progression( reversible progression)............................................4
Figure (2. 3): general model for disease progression (irreversible progression)β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 5
Figure (3. 1): worldwide estimated prevalence of NAFLD and the distribution of PNPLA3 genotype.......17
Figure (3. 2): dynamic model of NAFLD ......................................................................................................21
Figure (3. 3): risk stratification and management of NAFLD patients ........................................................25
Figure (3. 4): EASL-EASD-EASO clinical practice guidelines for management of NAFLD in type 2 diabetes
....................................................................................................................................................................26
Figure (4. 1): general model structure........................................................................................................30
Figure (5. 1): expanded form of the disease model structure....................................................................63
Figure (6. 1):NAFLD with fibrosis stagesβ¦β¦β¦β¦β¦β¦β¦β¦β¦β¦..β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 106
Figure (6. 2.a):categorical group of patientsβ¦β¦β¦β¦β¦β¦β¦β¦β¦β¦..β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 112
Figure (6. 2.b): categorical group of patientsβ¦β¦β¦β¦β¦β¦β¦β¦β¦β¦..β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 113
Figure (6. 3.a): percentage of transition countsβ¦β¦β¦β¦β¦β¦β¦β¦..β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 114
Figure (6. 3.b): percentage of transition countsβ¦β¦β¦β¦β¦β¦β¦β¦..β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 115
Figure (6. 3.c): percentage of transition countsβ¦β¦β¦β¦β¦β¦β¦β¦..β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 116
Figure (6. 4): distribution of transition counts from 0 to 1......................................................................117
Figure (6. 5): distribution of transition counts from 1 to 2.......................................................................117
Figure (6. 6): distribution of transition counts from 2 to 3.......................................................................118
Figure (6. 7): distribution of transition counts from 3 to 4.......................................................................118
Figure (6. 8): distribution of transition counts from 1 to 0.......................................................................118
Figure (6. 9): distribution of transition counts from 2 to 1 ......................................................................119
Figure (6. 10): distribution of transition counts from 3 to 2β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 119
Figure (6. 11): distribution of transition counts from 2 to 0β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 119
Figure (6. 12): distribution of transition counts from 3 to 1β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦ 120
Figure (6. 13): lowess smoother for transition counts from 0 to 1...........................................................121
Figure (6. 14): lowess smoother for transition counts from 1 to 2...........................................................122
Figure (6. 15): lowess smoother for transition counts from 2 to 3...........................................................123
Figure (6. 16): lowess smoother for transition counts from 3 to 4...........................................................123
Figure (6. 17): lowess smoother for transition counts from 1 to 0...........................................................124
Figure (6. 18): lowess smoother for transition counts from 2 to 1...........................................................125
Figure (6. 19): lowess smoother for transition counts from 3 to 2...........................................................125
Figure (6. 20): lowess smoother for transition counts from 2 to 0...........................................................126
Figure (6. 21): lowess smoother for transition counts from 3 to 1β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦126
7. iv
List of Tables
Table (4. 1): number of observed transitions in different time intervals ...................................................46
Table (4. 2): total counts in the oeriod of study( 8 years) .........................................................................46
Table (4. 3): observed counts in the time interval t=1................................................................................46
Table (4. 4): observed counts in time interval t=2 .....................................................................................46
Table (4. 5): obsered counts in time interval t=3........................................................................................46
Table (4. 6 ) :layout of data for persons studied over 8 years....................................................................47
Table (6. 1): descriptive statistical summary of patientsβ characteristics ................................................109
Table (6. 2): definition of the categorical groups of the patients.............................................................109
Table (6. 3): descriptive summary of categorical groups of patients ......................................................110
Table (6. 4): correlation between continuous predictor variables...........................................................110
Table (6. 5.a): summary of transition counts among states .....................................................................110
Table (6. 5.b): correlation between the different response variables......................................................110
Table (6. 6): observed transition counts of patients over 28 years .........................................................111
Table (6. 7): location of knots for specified variables using Harrell approach .........................................121
Table (6. 8): correlation between the transformed variables ..................................................................121
Table (6. 9): estimated counts for each transition....................................................................................127
Table (6. 10): comparison between distribution of observable response rate and estimated rate ........127
Table (6. 11): comparison between null and full model (progressive transitions)...................................128
Table (6. 12): comparison between null and full model (regressive transitions).....................................128
Table (6. 13 ): results for transition from 0 to 1 .......................................................................................129
Table (6. 14): results for transition from 1 to 2 ........................................................................................130
Table (6. 15): results for transition from 2 to 3 ........................................................................................130
Table (6. 16): results for transition from 3 to 4 .......................................................................................131
Table (6. 17): results for transition from 1 to 0 ........................................................................................132
Table (6. 18): results for transition from 2 to 1 ........................................................................................132
Table (6. 19): results for transition from 3 to 2 ........................................................................................133
Table (6. 20): results for transition from 2 to 0 ........................................................................................133
Table (6. 21): results for transition from 3 to 1 ........................................................................................134
Table (6. 22): pearson dispersion statistics for different transitions .......................................................134
Table (Appendix-D. 1 ): patientsβ characteristics......................................................................................172
Table (Appendix-D. 2): transition counts for each patient .......................................................................178
Table (Appendix-D. 3): time line for each patientβ¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦..181
8. v
List of Abbreviations
Abbreviation Meaning
AAR AST to ALT ratio
ALT Alanine Transaminase
APRI AST to Platelet Ratio Index
AST Aspartate Transaminase
ATP Adenosine Triphosphate
BMI Body Mass Index= weight(kg) /height (m2
)
CAP Controlled Attenuation Parameter
CK-18 Cytokeratin-18
CT Computed Tomography
CTMC Continuous-time Markov chains
CVD Cardiovascular Disease
EASL-SASD-
EASO
European Association of Study of Liver Disease-European Association of Study of
Diabetes-European Association of Study of Obesity
ELF Enhanced Liver Fibrosis
FIB-4 Fibrosis Index Score
GGT Gamma Glutamyl Transferase
HCC Hepato-Cellular Carcinoma
HCV Hepatitis C Virus
HOMA-IR Homeostatic model Assessment of Insulin Resistance
LDL Low Density Lipoprotein-Cholesterol
LSM Liver Stiffness Measurement
MetS Metabolic Syndrome
MRE Magnetic Resonance Elastrography
MSIR Maternally Derived Immunity, Susceptible, Infectious, Recovery
NAFL Non-Alcoholic Fatty Liver
NAFLD Non-Alcoholic Fatty Liver Disease
NASH Non-Alcoholic steatohepatitis
NFS NAFLD Fibrosis Score
NIVs Non-Invasive Tests
PNPL-3 Patatin Like Phospholipase Domain Containing 3 Gene
SEIR Susceptible, Exposed, Infectious, Recovery
SEIS Susceptible, Exposed, Infectious, Susceptible
SIR Susceptible, Infectious, Recovery
SIRD Susceptible, Infectious, Recovery,Deceased
SIRV Susceptible, Infectious, Recovery,Vaccinated
SIS Susceptible, Infectious, Susceptible
U/S ultrasonography
VCTE Vibration Controlled Transient Elastography
9. vi
Table of Contents
Acknowledgment...........................................................................................................................................i
Abstract......................................................................................................................................................... ii
List of Figures ............................................................................................................................................... iii
List of Tables ................................................................................................................................................ iv
List of Abbreviations ..................................................................................................................................... v
Table of Contents......................................................................................................................................... vi
Chapter One: Introduction............................................................................................................................1
Chapter Two: Basic Definitions and Notation...............................................................................................4
2.1. Some Basic Definitions.......................................................................................................................6
2.2. Model Specification .........................................................................................................................10
2.3. Chapman-Kolmogorov Equation......................................................................................................12
2.4. Chapman-Kolmogorov Differential Equations.................................................................................12
2.5. The Mean Sojourn Time...................................................................................................................15
2.6. State Probability Distribution...........................................................................................................15
Chapter Three: Non-Alcoholic Fatty Liver Disease .....................................................................................17
3.1. Prevalence........................................................................................................................................17
3.2. Definition and Terminology .............................................................................................................19
3.3. Dynamic Model of NAFLD................................................................................................................20
3.4. Diagnosis of NAFLD..........................................................................................................................21
3.5. Treatment. .......................................................................................................................................23
Chapter Four: CTMC Analyzing NAFLD Progression (Small Model)............................................................30
4.1. Transition Probability Matrix ...........................................................................................................31
4.2. Maximum Likelihood Estimation of the Q Matrix............................................................................35
4.3. Mean Sojourn Time..........................................................................................................................40
4.4. State Probability Distribution: .........................................................................................................41
4.5. Life Expectancy of Patient in NAFLD Disease Process......................................................................43
4.6. Expected Number of Patients in Each State ....................................................................................45
4.7. Hypothetical Numerical Example.....................................................................................................45
Chapter Five: CTMC Analyzing NAFLD Progression (Big Model).................................................................63
5.1.Transition Probability matrix ............................................................................................................63
10. vii
5.2. Estimation Of The Q transition Rate Matrix.....................................................................................79
5.3. Mean Sojourn Time..........................................................................................................................88
5.4. State Probability Distribution...........................................................................................................88
5.5. Expected Number of Patients in Each State: ...................................................................................91
5.6. Life Expectancy of a Patient Suffering from NAFLD in Various Stages ............................................91
5.7. Hypothetical Model .........................................................................................................................92
Chapter six: Incorporation of Covariates in the CTMC .............................................................................106
6.1.Study Design ...................................................................................................................................108
6.2.Results .............................................................................................................................................109
6.3. Discussion.......................................................................................................................................135
6.4.Conclusions......................................................................................................................................139
Chapter Seven: Conclusions and Recommendations ...............................................................................140
7.1. Conclusions ....................................................................................................................................140
7.2. Recommendations.........................................................................................................................141
Appendix A: MATLAB Code for PDFs of the Small Model.........................................................................142
Appendix B: MATLAB Code for Testing Markovian Assumption of Small Model .....................................143
Appendix C: Differentiation of Eigenvalue Functions with Respect to Theta for Big Model....................143
Appendix D: MATLAB Code for Estimation of Rate Matrix of the Big Model ..........................................145
Appendix E: MATLAB Code for Estimation of Probability Matrix of the Big Model .................................158
Appendix F: Goodness of Fit .....................................................................................................................172
Appendix G: Selected Tables as Referred in Chapter 6 ............................................................................176
References ................................................................................................................................................188
Arabic Summary........................................................................................................................................194
11. 1
Chapter One: Introduction
The study of a disease occurrence is the scope of epidemiology. It is a science that studies the
distributions, forms, and factors of a disease in a specific population. Many mathematical models
can describe the disease process according to the stages of the disease. Ordinary differential
equations can describe the disease process. They are deterministic, but a stochastic frame close to
reality can formulate the disease process but with much more complexity to analyze.
Some of these models used in infectious diseases are the basic model, described with three
states susceptible, infectious, and recovered, thus named (SIR). SIR reflects the stages that the
patient can pass through when acquiring an infection. There are some variations of this model.
A model formulated with two states, susceptible and infectious. There is no recovery state as
in the common cold because the patient does not develop immunity. Thus this model describes
three states susceptible, infectious, and susceptible, and thus it is called (SIS) model.
Another variant describes four stages susceptible, infectious, recovered, and deceased. The
recovery stage means immunity, while the deceased stage means the chronic complications and
sequels of the disease. This variant is called the susceptible, infectious, recovered, and deceased
(SIRD) model.
The extension of the SIR model, taking into account the vaccinated stage, describes four
stages. These stages are susceptible, infectious, recovered, and vaccinated. Thus this model is
called (SIRV) model.
Another extension of the SIR model describes four stages. The extension has a first stage
reflecting the "maternally derived immunity" that the newborns have from their mothers in the
first few months after birth. Thus the model describes the following stages "maternally derived
immunity," susceptible, infectious, and recovered; thus, it is called (MSIR) model.
The model describing the following states susceptible, exposed, infectious, and recovered,
has the so-called exposed state. This state reflects the latency or incubation period during which
the patient has been infected, but the manifestations of the disease are still not apparent, and the
ability to infect others is still not acquired. Thus this model is called the susceptible, exposed,
infectious, and recovered (SEIR) model.
The model described with the states susceptible, exposed, infectious, and susceptible is the
same as the directly preceded model but with no recovery stage, and so the model name is
susceptible, exposed, infectious, and susceptible (SEIS) model. (Bailey, 1975).
Other models can describe chronic diseases other than infectious diseases like the diseases
caused by disturbed immunity, cancer, and genetic and metabolic diseases. One of these models
12. 2
is the DisMod II model with four states (susceptible, diseased, death due to illness, and death
from any other cause) (Barendregt et al., 2003). Multistate models like the continuous-time
Markov chains (CTMCs) are elaborative in analyzing chronic diseases. CTMC is frequently used
to model panel data in various fields of science, including medicine, sociology, biology, physics,
and finance. In medical studies, CTMCs model the illness-death process in which each patient
starts in one initial state and eventually ends in an absorbing or final one. The patient may
experience several events that are related to their original disease. There may be several
intermediate states in-between the initial and the final state. The patient can or cannot visit these
intermediate states. (Kruijshaar et al., 2002).
Continuous-Time Markov Chain (CTMC) has the main objective to identify all the possible
movements among the states to estimate the following:
ο· Transition intensities and probabilities between the states.
ο· State transition probabilities at a particular time point.
ο· Mean sojourn time in each state.
ο· Life expectancy for the patient.
ο· Expected number of the patients in each state at a specific time point.
Also, CTMC is used to incorporate and thus identify covariates that affect the transition
intensities aiming to evaluate the factors that influence various movements among these states.
In this work, the researcher throws light on some of the statistical concepts and indices
derived using CTMC to analyze the nonβalcoholic fatty liver disease (NAFLD). This rapidly
increasing metabolic derangement of the liver and its associated complications have an economic
burden on society. The information obtained by this analysis helps health policymakers to
allocate human and financial resources for investigating this disease process, preventing, and
treating it. The study contains the following chapters:
ο In chapter two, the researcher highlights the basic statistical and mathematical concepts.
ο In chapter three, the researcher explains the medical definition of the NAFLD disease and
how to investigate it. These concepts help understand the states of the NAFLD disease. The
researcher also mentions some of the ongoing studies concerning the treatments.
ο In chapter four, the researcher demonstrates the simple model of the "health, disease, and
death" process. The contained applied hypothetical numerical example illustrates how the
statistical indices can be mathematically derived.
ο In chapter five, the researcher expands the disease process into nine states to describe the
states for NAFLD, the transitions among the states, and the ultimate fate of the process.
Finally, a supplemented applied hypothetical example explains these statistical indices.
ο In chapter six, the researcher uses covariates in a Poisson regression model to relate the
transition rates among the states to these covariates. The researcher uses a subset of the
13. 3
disease states, taken from the extended model describing the early reversible stages of the
disease, in an applied hypothetical example.
The researcher summarizes the mathematical approach used in this work in the following items:
ο· Maximum-likelihood function ( MLE) followed by applying the quasi-newton method to
estimate the transition rate matrix (Q matrix).
ο· Exponentiation of this Q matrix to obtain the transition probability matrix. Solve the
Kolmogorov differential equations to get empirical PDFs and then substitute the estimated Q
matrix in these functions to get the final PDFs. Comparing the results of both methods,
especially if the results are equal, can support the time homogeneity of the process.
ο· The Poisson regression model incorporates the covariates, thus relating these covariates with
the transition rates.
14. 4
Chapter Two: Definitions and Notation
Review of literature
Many chronic diseases have a progressive course over time. The disease process passes
through successive stages comprising the disease process. CTMC can model this course of the
disease. The βhealth, disease, and deathβ model is the simplest disease model. It has three states
representing health, disease and death. Transitions are allowed to occur from health to disease
from disease to death, or from health to death. Recovery from disease to health is also permitted,
as shown in Figure (2.1):
FIGURE (2. 1): health disease death model
The model is expressed by a series of successive more sever disease stages and an absorbing
state, often a death state. The patient may advance into or recover from adjacent disease stages or
die at any disease stage. Observations of the state ππ(π‘) are made on many individuals π at
arbitrary time points π‘ , which may vary between individuals, as shown in the Figure (2.2) and
Figure (2.3):
FIGURE (2. 2): A general model for disease progression (reversible progression)
18. 8
Definition 3: Continuous Time Markov Process (CTMC):
Let {ππ‘; π‘ β₯ 0} be a stochastic process with countable state space S. A process is a continuous
time Markov chain if: π(ππ‘π
= π|ππ‘1
= π1, β¦ , ππ‘πβ1
= ππβ1) = π(ππ‘π
= π|ππ‘πβ1
= ππβ1)
For all π, π1, β¦ , ππβ1 β π and for all 0 β€ π‘1 β€ π‘2 β€ β― β€ π‘π
Definition 4: Homogenous Continuous Time Markov Chain:
CTMC is homogenous if and only if π(ππ‘+π = π|ππ = π) is independent of π for all t .
Definition 5: Transition Probability:
Let πππ(π‘) be the probability of the transition from state π to state π in an interval of length π‘.
πππ(π‘) = (ππ‘+π = π|ππ = π) where π , π‘ β₯ 0. In matrix notation : π(π‘) = [πππ(π‘)] πππ πππ π, π β π
It satisfies the following conditions:
1. πππ(π‘ + π ) = β πππ(π‘)πππ(π )
π.π.πβπ , β π‘ β₯ 0, π β₯ 0, πππ π, π, π β π
ππππ¦πππ πππππππππ£ πππ’ππ‘ππππ
2. β πππ(π‘) = 1
π
3. πππ(π‘) β₯ 0 , β π‘ β₯ 0 πππ π, π β π
Definition 6: Chapman-Kolmogorov equation for CTMC
πππ(π‘ + π ) = β πππ(π‘)πππ(π ) π π π(π‘ + π ) = π(π‘)π(π )
πβπ
πππ πππ π, π , π β π, & πππ πππ¦ π , π‘ β₯ 0
Definition 7: Transition Rate Matrix or Intensity Matrix or Infinitesimal Generator
Let {ππ‘; π‘ β₯ 0} be a CTMC , πππ(π‘) is the rate at which transition occur from state π to state π at
time π‘ or π(π‘) = lim
βπ‘β0
{
π(π‘,π‘+βπ‘)βπΌ
βπ‘
} = πβ²(π‘ = 0) πππππ π(0) = πΌ .
The Q matrix fulfills the criteria for the Q matrix which are:
1. β πππ(π‘) = 0
π
2. πππ(π‘) β₯ 0 , π β π
3. β β πππ(π‘)
π = πππ , π = π
Definition 8: Stationary Probability Distribution
Let {ππ‘; π‘ β₯ 0} be a CTMC with generator matrix π and transition probability matrix π(π‘) .
Suppose
19. 9
π = (π0, π1, β¦ , ππ)π‘
is nonnegative i.e. ππ β₯ 0 for π = 0,1,2, β¦ , π , where m is the number of
states. So ππ = 0 πππ β ππ = 1
π
π=0 . This ππ is called stationary probability distribution. It can
also be defined in terms of π(π‘) such that
ππ(π‘) = π , πππ π‘ β₯ 0 , β ππ = 1
π
π=0 , ππ β₯ 0 πππ π = 0,1,2, β¦ , π.
Definition 9: Embedded Markov Chain (EMC)
Let CTMC {ππ‘; π‘ β₯ 0} be a CTMC. The set of the random variables {ππ}π=0
β
is known as the
embedded Markov chain or the jump chain at the ππ‘β
jump associated with the CTMC {ππ‘; π‘ β₯
0} with a transition matrix π = π‘ππ , where ππ = π(π
π), π = 0,1,2, .. , such that ππ is the time at
which the ππ‘β
jump occurs and the ππ = ππ+1 β ππ is the holding time or the time spent in the
state until the next jump occurs at ππ.
Definition 10: Accessibility
State π can be reached from state, π β π , if πππ(π‘) > 0 for some π‘ β₯ 0.
Definition 11: Communicating State
State π communicate with state π , (π β π), if π β π and π β π
The set of states that communicate is called a communication class.
Definition 12: Irreducibility.
If every state can be reached from every other state, the Markov chain is irreducible; otherwise, it
is said to be reducible.
Definition 13: Closed Class
A set of states C is closed if it is impossible to reach any state outside of C from a state inside C,
πππ(π‘) = 0 ; πππ π‘ β₯ 0 πππ ππ ; π β πΆ πππ π β πΆ
Definition 14: First Return Time
πππ is the first time the chain is in state π after leaving state π , it can occur for π‘ > 0
Definition 15: Recurrent State
State π is recurrent in a CTMC {ππ‘; π‘ β₯ 0} , if the first return time is finite π{πππ < β|π(0) =
π} = 1
State π is recurrent in a CTMC {ππ‘; π‘ β₯ 0} if and only if state π in the corresponding embedded
Markov chain {ππ}π=0
β
is recurrent.
20. 10
Definition 16: Transient State
State π is transient in a CTMC {ππ‘; π‘ β₯ 0} , if the first return time is finite π{πππ < β|π(0) =
π} < 1
State π is transient in a CTMC {ππ‘; π‘ β₯ 0} if and only if state π in the corresponding embedded
Markov chain {ππ}π=0
β
is transient.
2.1.2. Some basic medical definitions:
Definition 17 : Fibro-genesis is the mechanism through which fibrous tissue is formed i.e the
process of fibrous tissue formation.
2.2. Model Specification:
2.2.1. Transition Probability Matrix:
According to (Allen 2010), the model is specified by a transition probability matrix π(π‘)
whose (π, π)π‘β entry ,πππ(π‘), is the probability of a transition from state π at time π‘ to some other
state π at rate πππ(π‘) per specified unit of time according to the studied process or system. So
continuous time Markov chain is modeled by its matrix of transition rates π(π‘) at time π‘ . The
probability that a transition occurs from a given source state to a specific destination state
depends on both the source and the length of the interval of observation. That is to say, if the
period of observation π = Ξπ‘ has a minimal duration so the probability of observing a transition
from state π at time π‘ to state π at time π‘ + Ξπ‘ during this interval [π‘, π‘ + Ξπ‘) i.e. πππ (π‘, π‘ + Ξ t)
is minimal.
So as Ξπ‘ β 0, πππ (π‘, π‘ + Ξ t) β 0 πππ π β π ,
and from the conservation of probability:
πππ (π‘, π‘ + Ξ t) β 1 ππ Ξπ‘ β 0 .
On the other hand, as Ξπ‘ enlarges this probability πππ(π‘, π‘ + Ξπ‘) increases to the level that the
larger the period is, the more probable multiple events will be observed. Nevertheless, the
observation periods are adequately selected to be small enough so that the probability of
observing multiple events in such a small observation period is of order π(Ξπ‘) , a quantity for
which
lim
Ξβ0
π(Ξπ‘)
Ξπ‘
= 0
Continuous Markov chain exhibits Markov (memoryless) property:
π[π(π‘π+1) = π₯π+1 | π(π‘π) = π₯π, π(π‘πβ1) = π₯πβ1, β¦ , π(π‘0) = π₯0 ]
= π[π(π‘π+1) = π₯π+1|π(π‘π) = π₯π]
21. 11
for any π‘0 β€ π‘1 β€ β― β€ π‘π β€ π‘π+1 . So if the current state π₯π is known, then the value taken by
π(π‘π+1) depends only on π₯π and not on any past history of the state (no state memory). Also,
the amount spent in the current state does not determine the next state (no age
memory)(Cassandras and Lafortune 2009).
2.2.2. Generator Matrix or the Transition Rate Matrix:
According to (Allen 2010), the transition probabilities πππ(π‘) are used to obtain transition
rates πππ(π‘). A rate of transition does not depend on the length or duration of observation period,
it is an instantaneously defined quantity that indicates the number of transitions that occur per
unit of time. The πππ(π‘) is the rate of transition from state π to state π at time t . In non-
homogenous Markov chain both, πππ(π‘) and πππ(π‘) may depend on the time π‘ not the interval Ξπ‘.
The transition probabilities πππ(π‘) are continuous and differentiable for > 0 .
At π‘ = 0, the transition probabilities equal:
πππ(0) = 0 πππ πππ(0) = 1
While defining:
πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘) β πππ(0)
Ξπ‘
} = lim
Ξπ‘β0
{
πππ(Ξπ‘)
Ξπ‘
} πππ π β π
And
πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘) β πππ(0)
Ξπ‘
} = lim
Ξπ‘β0
{
πππ(Ξπ‘) β 1
Ξπ‘
} πππ π = π
As well as
πππ(Ξπ‘) = πππ(π‘)Ξπ‘ πππ β πππ(Ξπ‘)
β
πβ π
= β πππ(π‘)Ξπ‘ + π(Ξπ‘)
β
πβ π
And from conservation of probability:
1 β πππ(Ξπ‘) = β πππ(Ξπ‘)
πβ π
= β πππ(π‘)Ξπ‘ + π(Ξπ‘)
πβ π
1 β πππ(Ξπ‘) = lim
Ξπ‘β0
β {
πππ(π‘)Ξπ‘ + π(Ξπ‘)
Ξπ‘
}
β
πβ π
= β β
πππ Ξπ‘
Ξπ‘
β
πβ π
πππ = β β πππ(π‘)
β
π=0,πβ π
In homogenous continuous time Markov chain:
πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘)
Ξπ‘
} πππ π β π ; πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘) β 1
Ξπ‘
} πππ π = π
22. 12
Or in matrix notation:
π = lim
Ξπ‘β0
{
π(Ξπ‘) β πΌ
Ξπ‘
}
It is called infinitesimal generator matrix or rate transition matrix of Markov chain
π = πβ²(0) ππ‘ π‘ = 0 , π ππππ π(0) = πΌ
If π is a finite or countable state space and π = (πππ)
π,πβπ
then it satisfies the following
properties:
πππ β₯ 0 πππ π β π πππ πππ β€ 0 πππ πππ π , β πππ
πβ π
= βπππ πππ β πππ
π
= 0 πππ πππ π
2.3. Chapman-Kolmogorov Equations:
According to (Cassandras and Lafortune 2009), the transition probability is called πππ(π , π‘) and is
defined as πππ(π , π‘) = π[π(π‘) = π|π(π ) = π], π β€ π‘
To derive the equation; the events of transitions [π(π‘) = π|π(π ) = π] are conditioned on
[ππ’ = π]
For some π’ such that π β€ π’ β€ π‘ and that the rule of total probability implies
πππ(π , π‘) = β π[π(π‘) = π|π(π’) = π, π(π ) = π]
πππ π
. π[π(π’) = π| π(π ) = π],
in addition the memory-less property ensures the following:
π[π(π‘) = π|π(π’) = π, π(π ) = π] = π[π(π‘) = π|π(π’) = π] = πππ(π’, π‘)
Moreover , π[π(π’) = π| π(π ) = π] = πππ(π , π’). Therefore,
πππ(π , π‘) = β πππ(π , π’).
πππ π
πππ(π’, π‘) , π β€ π’ β€ π‘
The above Chapman-Kolmogorov equation can be rewritten in matrix notation as:
π»(π , π‘) = π»(π , π’)π»(π’, π‘) , π β€ π’ β€ π‘ βΆ π€βπππ
π»(π , π‘) = [πππ(π , π‘)] πππ π»(π , π ) = πΌ ( πππππ‘ππ‘π¦ πππ‘πππ₯)
2.4. Chapman-Kolmogorov Differential Equations:
2.4.1. Forward Chapman-Kolmogorov differential equations:
The Chapman-Kolmogorov equations for time instants π β€ π’ β€ π‘ + Ξπ‘ πππ Ξπ‘ > 0 is
π»(π , π‘ + Ξ t) = π»(π , π‘)π»(π‘, π‘ + Ξπ‘ ) subtracting π»(π , π‘) from both sides of this equation yields
23. 13
π»(π , π‘ + Ξ t) β π»(π , π‘) = π»(π , π‘)[π»(π‘, π‘ + Ξπ‘ ) β πΌ] , then dividing by Ξπ‘ and taking the limit as
Ξπ‘ β 0 gives
lim
βπ‘β0
π»(π , π‘ + Ξπ‘ ) β π»(π , π‘)
βπ‘
= π»(π , π‘) lim
βπ‘β0
π»(π‘, π‘ + Ξπ‘ ) β πΌ
βπ‘
lim
βπ‘β0
π»(π , π‘ + Ξπ‘ ) β π»(π , π‘)
βπ‘
=
ππ»(π , π‘)
ππ‘
This limit represents the partial derivative of Pij(s,t) with respect to (t) if it exists .
π(π‘) = lim
βπ‘β0
π»(π‘, π‘ + Ξπ‘ ) β πΌ
βπ‘
lim
βπ‘β0
π»(π , π‘ + Ξπ‘ ) β π»(π , π‘)
βπ‘
= π»(π , π‘)π(π‘)
lim
βπ‘β0
π»(π , π‘ + Ξπ‘ ) β π»(π , π‘)
βπ‘
=
ππ»(π , π‘)
ππ‘
= π»(π , π‘)π(π‘) , π β€ π‘
This is called forward differential equation and in a similar fashion, the backward differential
equation can be derived. (Cassandras and Lafortune 2009)
According to (Stewart 2009) the forward differential equation can also be derived like this :
The Chapman-Kolmogorov equations for CTMC are derived from Markov property that states
πππ(π , π‘) = β πππ(π )
πππ π
πππ(π‘) πππ π, π = 0,1,2, β¦ πππ π β€ π’ β€ π‘
To transfer from state i at time s to state j at time t , some state k will be visited as an
intermediate state between states i and j at an intermediate time u . When the CTMC is
homogenous, this is written as follows:
πππ(π‘ + Ξπ‘) = β πππ(π‘)
πππ π
πππ(βπ‘) = β πππ(π‘)
πβ π
πππ(βπ‘) + πππ(π‘)πππ(βπ‘) πππ π‘, βπ‘ β₯ 0
Thus
πππ(π‘ + Ξπ‘) β πππ(π‘)
Ξπ‘
= β (πππ(π‘)
πππ(βπ‘)
Ξπ‘
+ πππ(π‘)
πππ(βπ‘)
Ξπ‘
β
πππ(π‘)
Ξπ‘
)
πβ π
= β ( πππ(π‘)
πππ(βπ‘)
Ξπ‘
+ πππ(π‘) (
πππ(βπ‘) β 1
Ξπ‘
))
πβ π
Taking the limit as Ξπ‘ β 0 and recalling
24. 14
πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘)
Ξπ‘
} πππ π β π ; πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘) β 1
Ξπ‘
} πππ π = π
ππππ(π‘)
ππ‘
= β πππ(π‘)
πβ π
πππ + πππ(π‘)πππ
That is to mean the forward differential equation is:
ππππ(π‘)
ππ‘
= β πππ(π‘)
πβ π
πππ πππ π, π = 0,1, β¦ ; πππ ππ πππ‘πππ₯ πππ‘ππ‘πππ:
ππ(π‘)
ππ‘
= π(π‘)π
2.4.2. The backward differential equation is derived as in the following steps:
πππ(π‘ + Ξπ‘) = β πππ(βπ‘)
πππ π
πππ(π‘) πππ π‘, βπ‘ β₯ 0
= β ( πππ(βπ‘) πππ(π‘) + πππ(βπ‘)πππ(π‘))
πβ π
πππ(π‘ + Ξπ‘) β πππ(π‘)
Ξπ‘
= β (
πππ(βπ‘)
Ξπ‘
πππ(π‘) +
πππ(βπ‘)
βπ‘
πππ(π‘) β
πππ(π‘)
Ξπ‘
)
πβ π
= β (
πππ(βπ‘)
Ξπ‘
πππ(π‘) + (
πππ(βπ‘) β 1
Ξπ‘
) πππ(π‘))
πβ π
Taking the limit as Ξπ‘ β 0 and recalling
πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘)
Ξπ‘
} πππ π β π ; πππ = lim
Ξπ‘β0
{
πππ(Ξπ‘) β 1
Ξπ‘
} πππ π = π
Thus:
ππππ(π‘)
ππ‘
= β (πππ πππ(π‘) + πππ πππ(π‘))
πβ π
That is to mean, according to (Stewart 2009), the backward differential equation is:
ππππ(π‘)
ππ‘
= β πππ
πβ π
πππ(π‘) πππ π, π = 0,1, β¦ ; πππ ππ πππ‘πππ₯ πππ‘ππ‘πππ βΆ
ππ(π‘)
ππ‘
= ππ(π‘)
And the solution for this differential equation is by matrix exponential:
π(π‘) = ππππ‘
= πππ‘
= πΌ + β
ππ
π‘π
π!
β
π=1
25. 15
The constant is π = π(0) = πΌ .
2.5. The mean sojourn time of the continuous time Markov chain:
It is the average time spent in the state .Let {π(π‘), π‘ β₯ 0} be a homogenous continuous-time
Markov chain and it is in a non-absorbing state π at time π‘ = 0 . Let ππ be the time until a
transition out of state π occurs, if Markov chain started at time π‘ = 0 in state π and has not moved
from state π by time π which is equivalent to saying that π{ππ > π |π(0) = π} = 1, then π{ππ >
π + π‘|ππ > π } = π{ππ > π‘} and the continuous random variable time ππ is memoryless. Since the
only continuous distribution that has the memoryless property (the distribution of residual time
being equal to the distribution itself) is the exponential distribution, this leads to the fact that the
duration of the time until a transition occurs from state π is exponentially distributed. In a
homogenous continuous-time Markov chain with a non-absorbing π state which can move to one
or more states β π , the memoryless property of the chain compels the duration of time, until this
transition takes place, to be exponentially distributed with a rate of transition πππ . So the time to
reach some state π β π has an exponential distribution with rate πππ . More to say is that upon
exiting state , more than one state can be reached, and subsequently, a race condition is started
to take place and the transition to the winning state occurs, the state which minimizes the sojourn
time in state i . Because the minimum value of several exponentially distributed random
variables is also an exponentially distributed random variable with a rate equal to the sum of the
original rates, this drives the conclusion that the time spent in state π of a homogenous
continuous time Markov chain is exponentially distributed. This sojourn time is π’π = β πππ
πβ π .
Therefore, the probability distribution of the mean sojourn time in state π is given by:
πΉπ(π₯) = 1 β πβπ’ππ₯
, π₯ β₯ 0 & π’π = β πππ
πβ π
= βπππ
Putting these concepts together leads to realization that the mean sojourn time in any state of a
homogenous continuous time Markov chain must be exponentially distributed. This is violated in
non-homogenous chain as it is not exponentially distributed (Stewart 2009).
2.6. State Probability Distribution:
According to (Cassandras and Lafortune 2009), it is the probability vector π(π‘) that
represents the probability that a system will be in a particular state at a specific time point given
its initial state probability vector π(0), transition rate matrix π and the state space π. This
analysis can be conducted in two approaches:
2.6.1. Transient Analysis:
According to (Cassandras and Lafortune 2009), define state probability ππ(π‘) = π[π(π‘) = π]
and condition the event [π(π‘) = π] on the event [π(0) = π] with a defined ππ(0) = π[π(0) = π].
Thereafter the rule of total probability implies that:
26. 16
ππ(π‘) = π[π(π‘) = π] = β π[π(π‘) = π|π(0) = π]. π[π(0) = π]
πππ π = β πππ(π‘)ππ(0)
πππ π
This relation can be rewritten in matrix notation as: π(π‘) = π(0)π(π‘)
And since π(π‘) = πππ‘
thus the state probability vector at time t is given by π(π‘) = π(0)πππ‘
2.6.2. Steady State Analysis:
According to (Cassandras and Lafortune 2009), this approach, the steady state behavior of
the system is of much interest, a great advantage and a huge benefit to be calculated. The system
is turned on and has been working for some time; then its performance is tested in the long run to
see how all state probabilities have reached some fixed, unchangeable values and no longer vary
as time elapses. This relies on some basic requisites such as:
Presence and evaluation of the limit: ππ = β ππ(π‘)
π‘ββ . If this limit exists, ππ is called steady
state, equilibrium, or stationary state probability.
If differentiating π(π‘) = π(π‘)πππ‘
with respect to π‘ and substitute π‘ = 0 the differential equation is
π
ππ‘
π(π‘) = π(π‘)π. Solving such a system, to obtain an explicit solution even for a simple Markov
chain, is cumbersome. So if the ππ = β ππ(π‘)
π‘ββ exists, this implies that as π‘ β β this quantity
π
ππ‘
π(π‘) β 0, since π(π‘) no longer depends on π‘.Therefore,
π
ππ‘
π(π‘) = π(0)π reduces to π(π‘)π =
0.
In an irreducible continuous-time Markov chain consisting of positive recurrent states, a unique
stationary state probability distribution vector π exists such that ππ > 0 and is independent of
the initial state probability vector. Moreover, π is determined by solving ππ = 0 subject to
β ππ
πππ π = 1 .
27. 17
Chapter Three: Non-Alcoholic Fatty Liver Disease
3.1. Prevalence
NAFLD is one of the most common chronic liver diseases .Changing lifestyle behavior
during the last few decades attributed to the bad eating habits with consumption of high fat and
fructose diet resembling the western diets and the sedentary life with lack of exercise have
increased dramatically worldwide. These factors globally increase the prevalence of obesity and
type II diabetes worldwide. NAFLD is accidentally discovered during routine ultrasonography
follow-up or using magnetic resonance spectroscopy. Tests of liver enzymes underrate the true
prevalence (Younossi et al., 2017). Figure (3.1) illustrates the prevalence of the NAFLD
worldwide and the distribution of the PNPLA3 genotype:
FIGURE (3. 1): The worldwide estimated prevalence of NAFLD and the distribution of PNPLA3 genotype
3.1.1. NAFLD in USA:
The prevalence is estimated to be 24% using U/S in the USA. It is 21% if noninvasive
methods like Fatty Liver Index are used. It varies according to ethnicity. So Hispanic Americans
are first to come preceding Americans of European origin. Lastly, the African Americans are the
least affected group. The last group has the highest levels of hypertension and obesity. Some
studies reveal that the Latino patients having NASH were less aged and less physical exercise
than the white ptients of non-Latino descent; however, these patients with NASH were more
susceptible and liable to insulin resistance than the white Latino patients. Within the ethnicity
group, the prevalence exhibits variations according to the parent country: it is higher at 33% in
Americans of Mexican descent, at 16% in those of Dominican descent, and it has a prevalence of
18% in people of Puerto Rican descent, even after controlling for other risk factors (sex, age,
28. 18
waist circumference, BMI, serum HOMA, serum HDL, hypertension, serum C-reactive protein
and triglycerides as well as level of education) using multivariate analysis (Younossi et al. 2017).
3.1.2. NAFLD in South America:
in Brazil, the estimated prevalence was 30% using U/S. in Chile, it was 23% detected by U/S.
In Columbia, it was 26.6% of men using U/S. However, countries like Paraguay, Peru, Uruguay,
Ecuador, and Argentina had reported prevalence rates to be as minimum as 13% (Peru) up to as
maximum as 24 %( Uruguay) (Younossi et al. 2017).
3.1.3. NAFLD in Europe:
Although prevalence has wide variations according to the method used to reveal NAFLD,
nearly one-quarter of the European population suffers from NAFLD. A Meta-analysis study
released in 2016 documented a prevalence of 23.71% in Europe, ranging from 5% to 44% in
various countries. The data from Germany estimated a 30% prevalence rate using U/S. NAFLD
accounted for deranged liver tests is 26.4% in England. In France simple steatosis was observed
in 26.8% of liver biopsies due to questionable deranged liver tests, with 32.7 % of those biopsies
having NASH. In northern Italy, NAFLD evaluated by U/S was similar in those having liver
disease and those free of liver disease (25% in contrast with 20%, p=0.203). Statistical data from
Spain reported a resembling rate of 25.8%. From Romania, a study conducted on 3005 in-
patients population not suffering from any liver diseases revealed that NAFLD evaluated by
sonography was present in 20% of those hospitalized patients. A Hungarian study reported a
22.6% prevalence rate of fatty liver detected by U/S. (Younossi et al. 2017).
3.1.4. NAFLD in Asia-Pacific and Africa:
There are enormous variations between countries comprising this region due to huge
differences between its countries in economic, educational and political aspects and health-care
systems. These factors impact the individualsβ lifestyle, nutritional culture and sedentary
behavior. Data are not available in a comprehensive form due to the lack of statistical surveys
extending throughout the country to assess fatty liver. Subsequently, there are marked
dissimilarities in the NAFLD prevalence between different regions within the same nation.
Prevalence in Chengdu (Southwest China) was 12.5%. It was 15% in Shanghai (east china) and
17% in Guangdong (south China), while 24.5%. in central China. A study of 7152 employees in
shanghai utilizing U/S for detecting fatty liver released in 2012 estimated NAFLD prevalence to
be as great as 38.17%. In a study from Hong Kong, βproton magnetic resonance spectroscopyβ
determined the quantity of fat in the liver and estimated its prevalence rate as 28.8%; it was
19.3% in non-obese individuals, and it was 60.5% among the obese. A study from Taiwan
reported that the prevalence was 11.4% in the general population. It was 50% in the elderly and
66.4% in persons with a sedentary lifestyle like taxi drivers. It was 25% in Japan in 2005 as
diagnosed by U/S. Using the same modality, the prevalence in 141610 individuals from South
Korea was 27.3% in 2013 (Younossi et al. 2017).
29. 19
South Asia and Indian regions are towards quick urbanized changes in the social and
economic aspects. In the rural India, preserving their cultural diet and traditional lifestyles, the
prevalence rate is low (9%). In contrast, the prevalence rate fluctuates between 16 and 32% in
urban areas as the lifestyle mimics the trends in western countries. Similar variations in rates
between rural and urban areas (5-30%) were obtained from smaller surveys in Indonesia,
Malaysia, Singapore, and Sri Lanka. Data from Africa are scant. In Nigeria, the rate was 9.5-
16.6% in diabetic individuals and 1.2-4.5% in non-diabetic individuals. Comparably, the
prevalence in overweight or obese South African individuals was 45-50%, while in 2014; it was
20% in the population studied from Sudan (Younossi et al. 2017).
The data about the prevalence in Egypt is scarce and it needs further studies. (Fouad et al.,
2022)
3.2. Definition and Terminology:
NAFLD covers a spectrum of hepatic diseases containing two essential disease phenotypes.
The first is nonalcoholic fatty liver (NAFL; simple steatosis), where triglycerides accumulate in
more than 5% of the hepatocytes without histologic evidence of inflammation, cellular injury, or
fibrosis. The second one is non-alcoholic steato-hepatitis (NASH), where steatosis comes with
histologic features of necro-inflammation and hepatocyte ballooning degeneration (with or
without evidence of fibrosis). These phenotypes may evolve to cirrhosis, liver-related mortality
and hepatocellular carcinoma (HCC). NAFLD, by definition, occurs when excessively
consuming alcohol is absent; a cutoff point of not more than 20g/day for women and not more
than 30g/day for men is used to distinguish and separate it from alcohol-related liver disease.
Primary NAFLD is a disease related to obesity, insulin resistance, hypertension, and metabolic
syndrome criteria. Most NAFLD patients reside in this category. The secondary NAFLD points
toward patients with causes other than metabolic syndrome-associated conditions, such as drug
or toxin-induced fatty liver and rare inherited genetic metabolic diseases (Boyer and Lindor
2016). Table (3.1) lists risk factors of NAFLD.
NAFLD is a disease characterized by wide variation in the disease seriousness and disease
consequences reflecting the interactions between extrinsic and intrinsic factors. Although it is
initiated by high fat and carbohydrate diet associated with a sedentary lifestyle composing some
of the extrinsic factors, the contributors of intrinsic factors presented in genetic background
crucially determine how the patient responds to the excess caloric intake and metabolic stressors.
These factors are known as modifiers or adaptors (Ribeiro et al. 2004)(Brand and Esteves 2005).
30. 20
Table (3. 1): Risk factors for NAFLD
Risk factor Effect
Age High risk for NAFLD and advanced fibrosis if age>45 years
Metabolic
syndrome
The more criteria there are , the higher the probability of NASH and advanced fibrosis is .
sex Male > famale
ethnicity High in Hispanics, intermediate in white , low in black
Dietary factors Diets with high contents of saturated fats, fructose and cholesterol but low in carbohydrates
increase the risk. caffeine is beneficial
Obstructive sleep apnea It elevates risk for NASH and advanced fibrosis
Genetic factors PNPLA 3 and TM6SF2 increase the risk of NASH and advanced fibrosis, PNPLA3 increase the risk for
HCC.
βMetabolic syndromeβ is defined as having abdominal adiposity distinguished by waist
circumference > 94 cm for males and > 80 for females in eastern countries. It is > 102 cm for
males and > 88 cm for females in western countries. Plus two or more of the following:
1. Blood glucose β₯100 mg/dL or drug treating diabetes.
2. Arterial pressure β₯ 130/85 mmHg or drug treating hypertension.
3. Triglyceride levels β₯150 mg/dL or drug treating high level in the blood.
4. HDL cholesterol levels < 40mg/dL for males and <50 mg/dL for females or drug treating
this condition.
3.3. Dynamic model of NAFLD:
As discussed above, some NAFL patients can evolve to NASH and advanced fibrosis. On the
other hand, some of those NASH patients can regress to NAFL as time elapses. It is
hypothesized that the patient cycles between NAFL and NASH in early stages of NAFLD.
Regardless of the biopsy results being NAFL or NASH, about 80% of them are slow progressors.
That means they are unlikely to progress further beyond mild fibrosis (F0 to F2). However,
approximately 20% manifest rapid fibrosis progression and develop advanced fibrosis and
cirrhosis(F3 to F4) within a few years (De and Duseja 2020). Most NAFLD patients are slow
progressors in evolving from F0 to F1. However, a subset of these patients is rapid progressors,
evolving from F0 to F3 or F4. In a study on 108 patients having sequential liver biopsy with a
median interval of 6.6 years, 42% were progressors, 40% had stable fibrosis, while 18% were
regressors (McPherson et al. 2015). Figure (3.2) shows the dynamic model of NAFLD.
31. 21
FIGURE (3. 2): A dynamic model of NAFLD
3.4. Diagnosis of NAFLD:
The practice guidelines of the American Association for the study of Liver Disease define
NAFLD by the presence of the following features (Chalasani et al. 2018):
ο· The existance of hepatic steatosis established by histology or imaging.
ο· Exclusion of other secondary causes of hepatic steatosis, especially significant alcohol
consumption.
ο· Exclusion of any other causes of chronic liver disease like a drug, infection, autoimmune
diseases, hereditary, genetic, or metabolic causes like hepatitis B and C infection, HIV,
autoimmune hepatitis, primary sclerosing cholangitis, primary biliary cirrhosis, Celiac
disease, Wilsonβs disease, hemochromatosis, Ξ±-1 antitrypsin deficiency, cystic fibrosis
and porphoryia .
Assessment of the patient in clinical practice should evaluate the disease activity, that is to
mean, whether the disease is NAFL or NASH, the stage of fibrosis and the severity of the risk
factors such as insulin resistance (IR) and metabolic syndrome (MetS) components.
3.4.1. Liver biopsy:
Liver biopsy permits direct examination of liver tissue to evaluate inflammatory disease
activity (grade of disease) and evolution of fibrosis to cirrhosis (stage of fibrosis). However, one
of the major limitations of liver biopsy is sampling error because < 1/50000 of the total hepatic
32. 22
volume is sampled at a specific single time point with the heterogeneity of NAFLD distribution
throughout the hepatic parenchyma. Also, accurate diagnosis is observer-dependent and
influenced by pathologist experience. (Castera et al. 2019).
The noninvasive tests can be subdivided into two major groups that can be used to diagnose
steatosis, activity, and fibrosis. Noninvasive techniques include serological or biological tests
and imaging or physical techniques.
3.4.2. Noninvasive Tests for Diagnosis of NAFLD:
3.4.2.1. Imaging Tests:
The existence of NAFLD can be detected radio-logically using the ultrasonography,
computed tomography (CT), magnetic resonance imaging (MRI), and Transient elastography.
βMagnetic resonance elastographyβ (MRE) is a quantitative MRI-based methodology to image
the liver stiffness. MRE is better than other biomarkers, scoring systems and US-based
elastography for the diagnosis of liver fibrosis (Yoneda et al. 2018)(Boyer and Lindor 2016).
3.4.2.2. Serological Tests:
The serological markers for recognizing stages of liver fibrosis are classified into indirect
markers that reflect the reduction in hepatic functions (AST/ALT and platelet levels) and direct
markers concerned with fibro-genesis. Serum cytokeratin (CK)-18 is an indicator or sign for
βhepatocyte apoptosis or deathβ. (Paul 2020).
There are many scoring systems to detect steatosis and fibrosis; the followings are some of
them:
1) One of the most commonly used systems for detecting fibrosis is the AST to platelet ratio
index (APRI). (Tapper et al. 2014)
2) Another scoring system is the FIB-4 to detect fibrosis. (Shah et al. 2009)
3) NAFLD fibrosis score (NFS) is a test for fibrosis detection. (Angulo, Jason M Hui, et al.
2007)
4) Enhanced liver fibrosis (ELF) test is a commercial panel of markers concentrating on matrix
turnover for fibrosis detection. (Lichtinghagen et al. 2013).
5) Fibro-test can be used to exclude advanced fibrosis (F3-F4). It is a commercial marker panel
with a patent algorithm. (Boyer and Lindor 2016).
6) Fatty liver index (FLI) is a test to assess hepatic steatosis.(Bedogni et al. 2006).
33. 23
7) NAFLD liver fat score and liver fat equation supply the general practitioner with a simple,
inexpensive, and non-invasive tool to predict liver fat content in susceptible individuals.
(Kotronen et al. 2009).
3.4.3. Diagnostic Algorithms of NAFLD:
Liver biopsy, as previously mentioned, is the standard gold method to diagnose the degree of
steatosis, inflammation and fibrosis as well as to determine the presence or absence of liver-
related morbidities like hepatocellular carcinoma (HCC). Many algorithms have been proposed
by many societies worldwide to illustrate the suggested plan for physicians to follow, aiming to
achieve early diagnosis, treatment and follow-up of persons with NAFLD, thereby reducing
liver-related morbidities and mortalities. Here are some of these algorithms; almost all start with
risk stratification of individuals to identify patients with advanced fibrosis who are liable for
progression to severely morbid complications. The extent of liver fibrosis is the most prognostic
factor for such progression.
* Figure (3.3) points out a suggested algorithm for risk stratification in NAFLD patients utilizing the noninvasive
tests (Castera et al. 2019).
* Figure (3.4.1), Figure (3.4.2), Figure (3.4.3), and Figure (3.4.4) demonstrate the EASL-EASD-EASO clinical
practice guidelines for the management of NAFLD in Type 2 Diabetes (Sberna et al. 2018).
3.5. Treatment.
The keystone in treatment of NAFLD must start with treating the risk factors like obesity,
dyslipidemia, diabetes and hypertension. Initially, the patients must continue to change lifestyles
as diet habits and avoidance of sedentary life
Currently, no approved drug treatment for NAFLD/NASH is available; however, abundant
drugs are being scrutinized in phase II and phase III clinical trials. The outcome from these trails
has been favorable and hopeful regarding amelioration in histological features of the disease like
steatosis, inflammation and fibrosis. As NAFLD is a heterogeneous complex disease process
with a chronic prolonged course evolving, examining the effectiveness of these drugs in the long
run and their safety and effectiveness is crucial, especially when considering that some of these
drugs have serious and possible metabolic side effects. Heading the substantial and remarkable
impediments to registration (by increasing the awareness and alertness of the seriousness of the
disease by both physicians and patients) into clinical trials is paramount, as well as improving the
design of these trials will play a tremendous role in achieving success in these trials, and hence
approval of such therapeutic drugs. Utilizing the non-invasive markers of steatosis,
inflammation, and fibrosis and the clinical trial designs can accomplish this goal.
In the FLINT study conducted on 283 non-cirrhotic NASH patients taking 25 mg daily
obeticholic acid; the improvement in histology detected by NAS was 2 points or more with no
34. 24
deterioration of fibrosis, and 35% of patients taking OCA had a decrement in fibrosis score by at
least one stage in comparison with 19 % in the placebo arm.
REGENERATE study (still in progress with an estimated primary completion date of
September 2025 as shown on clinicaltrials.gov official site) will evaluate the safety and efficacy
of Obeticholic Acid(OCA) in NASH patients with fibrosis who are randomizes to a daily dose of
25 mg, 10 mg, and placebo, with endpoints like amelioration of fibrosis by at least one stage and
decaying of NASH with no deterioration of fibrosis. At 18 month of randomization, liver biopsy
revealed statistically significant histological amelioration of fibrosis and decaying of NASH with
no deterioration in fibrosis for both 10 mg and 25 mg doses. In the GOLDEN study, conducted
on 274 NASH patients, 120 mg Elafibranor taken daily for 52 weeks induced decaying of
moderate to severe NASH in meaningfully higher percentage of patients than to the placebo,
moreover; these patients also showed lowering in fibrosis stage compared to non-resolving
NASH patients. RESOLVE-IT trial (the last update was on November 30, 2020, as shown on
clinicalrials.gov official site, but the study is still in progress according to(Guirguis et al. 2020))
emerged in May 2020 had shown that 19.2% of patients, on 120 mg daily Elafibranor, had
NASH decay without deterioration of fibrosis compared to 14.7% in the placebo group, which
was not statistically significant. Furthermore, 24.5% of patients had shown fibrosis amelioration
of more than one stage compared to 22.4% in the placebo group, which was also not statistically
significant.
In the CENTAUR trial, conducted over 289 patients taking cenicriviroc (CVC), 150 mg daily
and placebo for 52 weeks, no comparative betterment in NAS between the NASH group and
placebo was seen, however; there was one stage or more amelioration of fibrosis with no
deterioration of NASH in the group taking the CVC compared with placebo group. The
AURORA trial (primary completion dates were October 2021 according to clinicaltrails.gov site
and October 2028 according to (Guirguis et al. 2020)) will evaluate long-term safety and efficacy
of 150 mg daily CVC for the treatment of fibrosis in NASH adult patients at 2 phases, the first
has an endpoint of at least one stage amelioration of fibrosis without deterioration of NASH at
month 12, and phase 2 has an endpoint that is cirrhosis, liver-related outcome as HCC, and all
causes of mortality. In a small, open-label, randomized phase II trial including 72 biopsy-proven
NASH patients (NAS β₯ 5 and stage 2-3 liver fibrosis) receiving 18 mg daily Selonsertib for 24
weeks ,there was significant improvement in liver disease activity, fibrosis, stiffness, liver fat
content, and progression to cirrhosis (Alkhouri, Poordad, and Lawitz 2018).
FLINT, GOLDEN, and CENTAUR are phase IIb placebo-controlled RCT( randomized
control trial), while REGENERATE, RESOLVE-IT, and AURORA are randomized, placebo-
controlled, double-blinded, multicenter phase III trials
35. 25
FIGURE (3. 3): risk stratification and management of NAFLD patients
NALF is suspected if steatosis is detected by U/S or abnormal elevation of liver enzymes in
high-risk patients ( type 2 diabetes, obesity, or metabolic syndrome) excluding other causes of
chronic liver disease. Sequential tests are performed according to local availability and the
situation of use: in primary health care units, the first line tools that are inexpensive, simple,
noninvasive, and widely available are the serum biomarkers like FIB-4 or NFS with high
negative predictive value (88%-95%) to negate advanced fibrosis. Low-risk fibrosis patients (55-
58% of cases, with FIB-4<1.3 or NFS<-1.455) are given no further assessments other than
lifestyle modifications and exercises. Intermediate-risk patients (30% of cases with FIB-4 =1.3 to
3.25 or NFS=-1.455 to 0.672) as well as high-risk patients of advanced fibrosis (12%-15% of
cases with FIB-4>3.25 or NFS >0.672 and positive predictive value 75%-90%) are referred to a
specialized center for LSM( liver stiffness measurements) using transient elastography (TE) in
fasting state with M probe for patients with βskin-liver capsule distanceβ <25 mm otherwise XL
probe is used for more obese patients. Low-risk patients for advanced fibrosis( LSM <8 kPa;
NPV= 94%-100%) should repeat the assessment within one year . Patients with intermediate-risk
( LSM =8-10 kPa) or high-risk ( LSM β₯10 kPa; PPV=47%-70%) of having advanced fibrosis
should undergo liver biopsy. According to availability, there are alternatives to the scoring
system like the commercial patent tests such as Fibro-test or Fibro-meter. In case of failure of
XL-probe of TE, there are alternatives such as MRE, especially in patients with BMI >35 kg/m2
.
All patients should be offered lifestyle modifications and exercise.
36. 26
Sberna et al. 2017 conducted a single-center retrospective observational study to evaluate the
application of the EASL-EASD-EASO in type 2 diabetic patients; they showed up that there was
an excessive referral rate to hepatologists in liver clinic. As illustrated in the above figure, 179
diabetic patients were screened for the presence of steatosis with FLI. Any patient with increased
liver enzymes, whether with steatosis absent [FLI < 60 ,(n=1)], or with steatosis present [FLI β₯
60 ,(n=55)] was referred to a liver clinic (a total of 56). Sixteen patients with FLI <60 and with
normal liver enzymes were assigned to follow up every 3-5 years. The remaining 107 patients
with FLI β₯ 60 and with normal liver enzymes were evaluated for liver fibrosis with NFS. Of
those, 11 patients with NFS β€ -1.455 were decided to be followed up every 2 years, 68 patients
with NFS levels between -1.455 and 0.675, and 28 patients with NFS β₯ 0.675 were referred to a
liver clinic. Thus the total referral rate was approximately 85%. Further assessment of fibrosis
score of those 56 patients with elevated liver enzyme could reduce the number of the patients to
be referred to a liver clinic.
People with T2DM(n=179)
Evaluation of Steatosis
With FLI
FLI β₯ 60 (n=162) FLI < 60 (n=17)
Any increase in ALT, AST,
GGT ( n= 55+1=56 )
Normal Liver
Enzymes (n=107)
Normal Liver
Enzymes (n=16)
Evaluation of Fibrosis With
NFS
NFS β€ -1.455
(n=11)
-1.455 < NFS < 0.675
(n=68)
NFS β₯ 0.675
(n=28)
Follow-Up Specialist Referral (n=56+68+28=152) Follow-Up
FIGURE (3. 4.1): EASL-EASD-EASO clinical practice guidelines for management of NAFLD in type 2 diabetes
37. 27
FIGURE (3.4.2):EASL-EASD-EASO clinical practice guidelines for the management of NAFLD in type 2 diabetis using
steatotest and fibrotest
But when Sberna et al. 2017 used another system to evaluate the application of the EASL-
EASD-EASO on type 2 diabetic patients, they showed up that the referral rate to liver clinic
decreased but still high. As illustrated in the above figure, 179 diabetic patients were screened
for the presence of steatosis with steatoTest. Any patient with increased liver enzymes, whether
with steatosis absent [steatoTest < 0.57, (n=5]), or with steatosis present [steatoTest β₯ 0.57,
(n=51)] was referred to a liver clinic (a total of 56). Thirty five patients with steatoTest < 0.57
and with normal liver enzymes were assigned to follow up every 3-5 years. The remaining 88
patients with steatoTest β₯ 0.57 and with normal liver enzymes were evaluated for liver fibrosis
with FibroTest. Of those, 82 patients with FibroTest < 0.58 were decided to be followed up every
two years while 6 patients with FibroTest β₯ 0.58 were referred to a liver clinic. Thus the total
referral rate was approximately 34.6% (62 patients). Sberna et al. reported that:ββit would not be
possible to refer such a high proportion of people with Type 2 diabetes to a liver clinic.The
application of EASL-EASD-EASO guidelines cannot be used in clinical practice in people with
Type 2 diabetes. It is essential to develop specific steatosis scores and fibrosis scores for people
with Type 2 diabetes in order to improve the selection of patients to be referred to a liver clinic
ββ.
People with T2DM(n=179)
Evaluation of Steatosis
With steato-Test
Steato-Test β₯ 0.57 (n=139)
Steato-Test < 0.57 (n=40)
Any increase in ALT, AST,
GGT (n=51+5=56)
Normal Liver
Enzymes (n=88)
Normal Liver
Enzymes (n=35)
Evaluation of Fibrosis with
Fibro-Test
Fibro-Test <0.58 (n=82)
Fibro-Test β₯0.58 (n=6)
Follow-Up
Specialist Referral (n=56+6=62)
Follow-Up
38. 28
FIGURE (3.4.3):EASL-EASD-EASO clinical practice guidelines for the management of NAFLD in type 2 diabetis using 1H-
MRS and NFS
Sberna et al. 2017 used another system. As illustrated in the above figure, 179 diabetic
patients were screened for the presence of steatosis with 1H-MRS (proton magnetic resonance
spectroscopy); it is very sensitive in detecting steatosis and can reproducibly quantify fat content
in the liver. Any patient with increased liver enzymes, whether with steatosis absent [hepatic fat
content β€ 5.5%, (n=9)], or with steatosis present [hepatic fat content > 5.5%, (n=47)] was
referred to a liver clinic (a total of 56). Forty seven patients with hepatic fat content β€ 5.5% and
with normal liver enzymes were assigned to follow up every 3-5 years. The remaining 76
patients with hepatic fat content > 5.5% and with normal liver enzymes were evaluated for liver
fibrosis with NFS. Of those, 9 patients with NFS β€ -1.455 were decided to be followed up every
2 years, 45 patients with NFS levels between -1.455 and 0.675 and 22 patients with NFS β₯ 0.675
were referred to a liver clinic. So the total referral rate was 68.7%, less than 85% obtained when
FLI was initially used to detect steatosis, as; FLI was mainly designed to detect steatosis in
general population not in diabetic patients and spectroscopy is more sensitive than FLI to detect
the hepatic fat content.
People with T2DM(n=179)
Evaluation of Steatosis
With 1H-MRS
1H-MRS > 5.5 % (n=123) 1H-MRS β€ 5.5 % (n=56)
Any increase in ALT, AST,
GGT (n=47+9=56)
Normal Liver
Enzymes (n=76)
Normal Liver
Enzymes (n=47)
Evaluation of Fibrosis With
NFS
NFS β€ -1.455(n=9) -1.455 < NFS <0.675
(n=45)
NFS β₯ 0.675
(n=22)
Follow-Up Specialist Referral (n= 56+45+22=123) Follow-Up
39. 29
FIGURE (3.4.4):EASL-EASD-EASO clinical practice guidelines for the management of NAFLD in type II diabetis using 1H-
MRS and fibrotest
Sberna et al. 2017 used another system. As illustrated in the above figure, 179 diabetic
patients were screened for the presence of steatosis with 1H-MRS, as in the previous figure. Any
patient with increased liver enzymes, whether with steatosis absent [hepatic fat content β€ 5.5%,
(n=9)], or with steatosis present [hepatic fat content > 5.5%, (n=47)] was referred to a liver clinic
(a total of 56). Forty seven patients with hepatic fat content β€ 5.5% and with normal liver
enzymes were assigned to follow up every 3-5 years. The remaining 76 patients with hepatic fat
content > 5.5% and with normal liver enzymes were evaluated for liver fibrosis with FibroTest.
Of those, 72 patients with FibroTest < 0.58 were decided to be followed up every 2 years while 4
patients with FibroTest β₯ 0.58 were referred to a liver clinic. Thus the total referral rate was
33.5% (60 patients). This is the same rate when SteatoTest was initially used to detect steatosis
followed by FibroTest to detect fibrosis, which was still high.
People with T2DM(n=179)
Evaluation of Steatosis
With 1H-MRS
1H-MRS > 5.5 % (n=123) 1H-MRS β€ 5.5 %(n=56)
Any increase in ALT, AST,
GGT (n=56)
Normal Liver
Enzymes (n=76)
Normal Liver
Enzymes(n=47)
Evaluation of Fibrosis with
Fibro-Test
Fibro-Test < 0.58 (n=72) Fibro-Test β₯ .58 (n=4)
Follow-Up Specialist Referral (n=56+4=60) Follow-Up
40. 30
Chapter Four: CTMC Analyzing NAFLD Progression (Small
Model)
Studying the natural history of disease during which individuals start at one initial state then
as time passes the patients move from one state to another, can be investigated by using
multistate Markov chains. The evolution of the disease over different phases can be monitored
by taking repeated observations of the disease stage at pre-specified time points following entry
into the study. The disease stage is recorded at the time of observation while the exact time of
state change is unobserved. NAFLD is a multistage disease process; in its simplest form has a
general structure model as depicted in Figure (4.1).
FIGURE (4. 1): general model structure .(Younossi et al. 2016)
NAFLD stages are modeled as time homogenous CTMC, meaning, πππ(βπ‘) depends on
βπ‘ and not on π‘ ,with constant transition intensities πππ over time, exponentially distributed time
spent within each state and patientsβ events follow the Poisson distribution. The states are one for
the susceptible cases (state 1) and one for NAFLD cases (state 2), and two absorbing states ; one
for the death due to NAFLD (state 3) and one for death due to any other cause (state 4). The
transition rate π12 is the rate of progression from state 1 to state 2, while the transition rate π12 is
the regression rate from state 2 to state 1. The transition rate π23 is the progression rate from
state 2 to state 3 and π24 is the progression rate from state 2 to state 4. For simplicity, all
individuals are assumed to enter the disease process at stage one, and they are all followed up
with the same length of the time interval between measurements.
In this chapter the transition probabilities and transition rates are thoroughly discussed. Also,
the mean sojourn time and its variance are reviewed as well as the state probability distribution
and its covariance matrix. This is followed by an exploration of the life expectancy of the
patients and the expected numbers of patients in each state. Lastly, a hypothetical numerical
example is used to illustrate these concepts.
41. 31
4.1. Transition Probability Matrix
The transitions can occur at any point in time. The rates at which these transitions occur are
constant over time and thus are independent of t, that is to say, the transition of patient from
π π‘ππ‘π π ππ‘ π‘πππ = π‘ π‘π π π‘ππ‘π π ππ‘ π‘ = π‘ + π π€βπππ π = βπ‘ depends on difference between two
consecutive time points. And itβs defined as πππ (π‘) = lim
βπ‘β0
πππ(βπ‘)βπΌ
βπ‘
or the Q matrix.
For the above multistate Markov model demonstrating the NAFLD disease process; the
forward Kolmogorov differential equations are represented by (4.1)
π
ππ‘
πππ(π‘) = [
π11 π12
π21 π22
π13 π14
π23 π24
0 0
0 0
π33 0
0 π44
] [
β(π12 + π14)
π21
π12
β(π21 + π23 + π24)
0
0
0
0
0
π23
π14
π24
0
0
0
0
] (4.1)
The Kolmogorov differential equations:
ππ11
ππ‘
= βπ11(π12 + π14) + π12π21 (4.2 )
ππ12
ππ‘
= π11π12βπ12(π21 + π23 + π24) (4.3)
ππ13
ππ‘
= π12π23 (4.4)
ππ14
ππ‘
= π11π14+ π12π24 (4.5)
ππ21
ππ‘
= βπ21(π12 + π14) + π22π21 (4.6)
ππ22
ππ‘
= π21π12βπ22(π21 + π23 + π24) (4.7)
ππ23
ππ‘
= π22π23 (4.8)
ππ24
ππ‘
= π21π14+ π22π24 (4.9)
π33 = 1 , π44 = 1
The solution of this system of equations will give the πππ(π‘)
To get P11 using equations (4.2) and ( 4.3):
πππ‘: πΎ1 = π12 + π14 , πΎ2 = π21 + π23 + π24
π·π11 + πΎ1π11 β π21π12 = 0 (4.10)
π·π12 + πΎ2π12 β π12π11 = 0 (4.11)