Covid Data Analytics : Case Study on India


Vineet Kumar

Presented at SMILES 2020 Skoltech, Russia.

Healthcare

Covid Data Analytics : Case Study on India
Bankar Prasad Vilas Subhasis Panda Vaibhav Anand Vineet Kumar
Indian Statistical Institute, Kolkata
https://covid-isical.tech contact@covid-isical.tech
Introduction
We provide a macro-level, India-specific analysis of COVID-related
data obtained from [1]. We built a dashboard, available at [2],
that gives a multi-dimensional view and a risk summary for India.
Spread Assessment Metrics
An increasing trend in the number of new cases likely
indicates that the virus is spreading.
Figure 1: Average confirmed cases / day in India for each
week (last 7 weeks)
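The weekly average behind Figure 1 can be sketched as follows. This is a minimal illustration, assuming the input is a list of daily new confirmed cases split into consecutive 7-day blocks; the function names and the sample counts are hypothetical and are not taken from the dashboard.

```python
def weekly_daily_average(daily_new_cases, days_per_week=7):
    """Average new cases per day for each complete block of days_per_week days."""
    averages = []
    last_start = len(daily_new_cases) - days_per_week
    for start in range(0, last_start + 1, days_per_week):
        week = daily_new_cases[start:start + days_per_week]
        averages.append(sum(week) / days_per_week)
    return averages


def is_increasing(values):
    """True if every value is strictly larger than the one before it."""
    return all(b > a for a, b in zip(values, values[1:]))


# Hypothetical daily counts for two weeks (illustrative only).
daily = [10, 12, 14, 16, 18, 20, 22, 30, 32, 34, 36, 38, 40, 42]
weekly = weekly_daily_average(daily)
spreading = is_increasing(weekly)
```

A strictly increasing check is the simplest possible trend test; a real analysis would more likely fit a slope or smooth the series first.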
An increasing trend in active cases indicates possible
increased stress on resources, at present or in the near
future.
Figure 2: Active cases in India for the last 7 weeks
Risk Assessment Metrics
Ratio of Avg. Confirmed Cases to Avg. Recoveries in Consecutive
Weeks: Traffic Intensity (a term from queuing theory)
• Leading indicator of stress on resources
• Values > 1 imply faster arrival of new cases
compared to recovery
Figure 3: Traffic Intensity for India for the last 7 weeks
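In queuing theory, traffic intensity is the arrival rate divided by the service rate. The sketch below applies that to the weekly series, assuming the ratio is taken within the same week (average confirmed cases over average recoveries); the poster does not spell out the exact pairing of weeks, so this pairing, the function name, and the sample numbers are assumptions.

```python
def traffic_intensity(avg_confirmed, avg_recovered):
    """Weekly ratio of arrivals (new cases) to service completions (recoveries).

    Returns None for a week with zero recoveries, where the ratio is undefined.
    """
    return [
        c / r if r > 0 else None
        for c, r in zip(avg_confirmed, avg_recovered)
    ]


# Hypothetical weekly averages (illustrative only).
weekly_avg_confirmed = [100.0, 150.0, 120.0]
weekly_avg_recovered = [80.0, 100.0, 160.0]

rho = traffic_intensity(weekly_avg_confirmed, weekly_avg_recovered)
# A value above 1 means cases arrive faster than they are resolved.
under_stress = [r is not None and r > 1 for r in rho]
```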
Point plot of tests in each week and % positive in each
week: A higher proportion of positive test results on a larger
test population (possibly less targeted) is anomalous
and needs investigation.
There are at least three possibilities:
• Rapid increase of rate of infection
• Carrying out tests in new (hitherto untested) areas
with high but unknown rate of infection
• Improper testing leading to many incorrectly positive
results
Figure 4: Assessing adequacy of testing in India
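The anomaly described above (more tests and a higher share of positives at the same time) can be flagged with a simple week-over-week comparison. This is a minimal sketch under the assumption that weekly test counts and weekly positive counts are available as two aligned lists; the function names and sample numbers are hypothetical.

```python
def positivity(tests, positives):
    """Percentage of positive results per week (assumes every week has tests > 0)."""
    return [100.0 * p / t for t, p in zip(tests, positives)]


def flag_anomalous_weeks(tests, positives):
    """Indices of weeks where both test volume and % positive rose vs. the previous week."""
    pct = positivity(tests, positives)
    return [
        i
        for i in range(1, len(tests))
        if tests[i] > tests[i - 1] and pct[i] > pct[i - 1]
    ]


# Hypothetical weekly counts (illustrative only).
tests = [1000, 2000, 4000, 4000]
positives = [50, 80, 240, 200]

pct_positive = positivity(tests, positives)
flagged = flag_anomalous_weeks(tests, positives)
```

A flagged week does not say which of the three possibilities applies; it only marks where to look.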
Comparative Assessment
Heat maps showing the top 10 states in India with respect to
confirmed (blue), recovered (green), and deceased (red) cases,
respectively.
Figure 5: Heat map of last 15 days
Bounds on Death Rate
Crude Fatality Rate: Calculated on a daily basis using
$$\text{CFR} = \frac{\sum \text{Death}}{\sum \text{Confirmed}}.$$
Figure 6: Crude Fatality Rate
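A daily crude fatality rate can be sketched as below. It assumes the sums in the formula run over all days up to and including the current day (cumulative deaths over cumulative confirmed cases); the poster does not state the summation range, so that reading, the function name, and the sample counts are assumptions.

```python
from itertools import accumulate


def crude_fatality_rate(daily_deaths, daily_confirmed):
    """Cumulative deaths divided by cumulative confirmed cases, one value per day.

    Returns None for days before the first confirmed case.
    """
    cumulative_deaths = accumulate(daily_deaths)
    cumulative_confirmed = accumulate(daily_confirmed)
    return [
        d / c if c > 0 else None
        for d, c in zip(cumulative_deaths, cumulative_confirmed)
    ]


# Hypothetical daily counts (illustrative only).
daily_deaths = [0, 1, 1, 2]
daily_confirmed = [10, 10, 20, 40]

cfr = crude_fatality_rate(daily_deaths, daily_confirmed)
```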
Kaplan-Meier Analysis: The estimator of the survival function
$S(t)$ (the probability that life is longer than $t$) is given
by
$$\hat{S}(t) = \prod_{i:\, t_i \le t} \left(1 - \frac{d_i}{n_i}\right)$$
with $t_i$ a time when at least one event happened, $d_i$ the
number of events (e.g., deaths) that happened at time $t_i$,
and $n_i$ the number of individuals known to have survived (have
not yet had an event or been censored) up to time $t_i$.
Figure 7: KM Curve for Survival Estimation
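The product formula above can be evaluated directly from per-individual follow-up records. The sketch below is a minimal pure-Python version, assuming each individual has a follow-up duration and a flag saying whether the event was observed (True) or the record was censored (False); the function name and the sample records are hypothetical.

```python
def kaplan_meier(durations, event_observed):
    """Return [(t_i, S_hat(t_i))] for each distinct time with at least one event."""
    records = sorted(zip(durations, event_observed))
    at_risk = len(records)  # n_i: individuals still under observation
    survival = 1.0
    curve = []
    i = 0
    while i < len(records):
        t = records[i][0]
        events = 0   # d_i: events at time t
        leaving = 0  # events plus censored records at time t
        while i < len(records) and records[i][0] == t:
            if records[i][1]:
                events += 1
            leaving += 1
            i += 1
        if events > 0:
            survival *= 1.0 - events / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Hypothetical follow-up times in days (illustrative only).
durations = [2, 3, 3, 5, 8]
event_observed = [True, True, False, True, False]

curve = kaplan_meier(durations, event_observed)
```

Individuals censored at the same time as an event are counted as at risk for that event, which is the usual convention.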
Spread-based Clustering
Districts were clustered based on the number of active cases and
deaths using k-means and agglomerative clustering techniques.
[Cluster plot: states MH, TN, DL, GJ, KA, UP, TG, AP, WB, RJ, HR, BR, MP, AS, OR, JK, PB, KL on axes Dim1 (94.8%) and Dim2 (3.5%), grouped into clusters 1 to 4]
[Cluster dendrogram: same states, hclust (*, "ward.D2"), vertical axis Height]
Figure 8: Clustering states based on infection spread
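The k-means step can be illustrated with a small pure-Python version of Lloyd's algorithm. This is a sketch only: the figure labels suggest the original analysis used R (hclust with ward.D2), and the code below does not reproduce it or the agglomerative step. The feature pairs stand in for (active cases, deaths) on an already standardized scale, and the initialization from the first k points is an assumption made to keep the example deterministic.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iterations=100):
    """Lloyd's algorithm; the first k points are the initial centroids (assumption)."""
    centroids = [list(p) for p in points[:k]]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(axis) / len(members) for axis in zip(*members)]
    return labels, centroids


# Hypothetical standardized (active cases, deaths) pairs (illustrative only).
points = [(1.0, 1.0), (9.0, 9.0), (1.2, 0.8), (8.8, 9.2), (0.9, 1.1), (9.1, 8.9)]
labels, centroids = kmeans(points, k=2)
```

Because active cases and deaths sit on very different scales, the features would normally be standardized before clustering so that neither dominates the distance.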
References
[1] https://api.covid19india.org/
[2] https://covid-isical.tech/
Summer School of Machine Learning at Skoltech, SMILES – Aug’ 2020
