SlideShare a Scribd company logo
1 of 36
Semi-covariance coefficient analysis of spike proteins
from SARS-CoV-2 and other coronaviruses for viral
evolution and characteristics associated with fatality
By
Jun Huang, Rebecca Spencer, Wandong Zhang
December 25 2020 – January 25 2021
Data Processing Training for 401 Lab
3.36%
Agenda
1. Introduction
2. Materials and Methods
3. Results and Conclusion
401 Lab 2020 Math Training
3.36%
1. Introduction
401 Lab 2020 Math Training
• Complex modeling has received significant attention in recent
years and is increasingly used to explain the statistical
phenomenon with increasing and decreasing fluctuations such
as the similarity or difference of spike protein charge patterns
of coronaviruses.
• Different from the existing covariance or correlation coefficient
methods in traditional integer dimension construction, this
study proposes a simplified novel fractional dimension
derivation with the exact Excel tool algorithm.
• It involves the fractional center moment extension to
covariance, which ends up as a complex covariance coefficient
that is better than the Pearson correlation coefficient, in the
sense that the nonlinearity relationship can be further depicted.
3.36%
Positive or Negative Charge
401 Lab 2020 Math Training
• The spike protein sequences of coronaviruses were obtained
from the GenBank and GISAID database, including the
coronaviruses from pangolin, bat, canine, porcine (three
variations), feline, tiger, SARS-CoV-1, MERS, and SARS-CoV-
2 (Wuhan, Beijing, New York, German and UK) were used as
the representative examples in this study.
• By examining the values above and below the average/mean
based on the positive and negative charge patterns of the
amino acid residues of the spike proteins from coronaviruses,
the proposed algorithm provides deep insights into the
nonlinear evolving trends of spike proteins for understanding
the viral gene sequence evolution and identifying the protein
characteristics associated with viral fatality.
• SARS-CoV-1 is negative charged, SARS-CoV-2 is 10 times
more positive charged. UK version 20% more charging!
3.36%
Viability
401 Lab 2020 Math Training
• The calculation results demonstrate that the complex
covariance coefficient analyzed by this algorithm is capable of
distinguishing the subtle nonlinear differences in the spike
protein charge patterns with reference to the Wuhan SARS-
CoV-2 for which the Pearson correlation coefficient may
overlook.
• Our analysis reveals the unique convergent (positive
correlative) to divergent (negative correlative) domain center
positions of each virus.
• The convergent or conserved region may be critical to the viral
evolution stability or viability; while the divergent region is
highly variable between coronaviruses suggesting high
frequency of mutations in this region.
3.36%
Residues
401 Lab 2020 Math Training
• The analysis shows that the conserved center region of SARS-
CoV-1 spike protein is located at amino acid residues 900, but
shifted to the amino acid residues 700 in MERS spike protein,
and then to amino acid residues 600 in SARS-CoV-2 spike
protein, indicating the evolvement of the coronaviruses.
• Another important characteristic our study reveals that the
distance between the divergent mean and the maximal
divergent point in each of the viruses (MERS>SARS-CoV-
1>SARS-CoV-2) is proportional to viral fatality rate.
• This algorithm may help to understand and analyze the
evolving trends and critical characteristics of other coronaviral
proteins and viruses.
Number Matters.
2. Materials and Methods
401 Lab 2020 Math Training
The coronavirus spike protein sequences used in this study were obtained
from the NCBI GenBank and the GISAID database, including SARS-
CoV-2 (the sequences isolated in Wuhan, Beijing Xinfadi wholesale
market, Germany, New York, UK and New York Zoo tiger), SARS-CoV-1,
Middle East respiratory syndrome (MERS), bat coronavirus (RaTG13),
pangolin coronavirus, feline coronavirus, canine coronavirus, and swine
coronaviruses [Swine Transmissible gastroenteritis virus (Swine-stomach),
swine enteric coronavirus (Swine-Ent), and porcine respiratory
coronavirus (Swine-Res)]. The sequence ID from the GenBank and
GISAID database are listed in Table 1.
Number Matters.
Hypo, Hyper or Gauss Variances and Covariance
401 Lab 2020 Math Training
3.36%
3. Results and Conclusion
• To compare and prove the usefulness of the simplified
complex variances, we compare the correlation of SARS-
CoV-2 viral spike protein sequence with other coronavirus
spike protein sequences.
• Since Excel is not capable of handling the imaginary
number, we simplify the calculation with integer power,
but separate the positive and negative covariance signs.
• Because coronaviruses spike proteins have different
electrical charge levels, we normalize the covariance by the
variance respectively just as the Pearson calculation does.
401 Lab 2020 Math Training
3.36%
Figures and Tables
• Figures 1-6 are the calculation results from our algorithm of semi-
covariance coefficient for spike protein Wuhan SARS-CoV-2 in
comparison with spike proteins of other coronaviruses listed in Table 1.
• Figure 7-9 are a combination of linear and nonlinear relationships
baselined on Wuhan.
• Figure 10-15 are nonlinear relationships baselined on Wuhan.
• Figure 16-19 are linear relationships baselined on Wuhan.
• Figure 20 is a combination of linear and nonlinear relationships again.
Nonlinear relationship is piece wised linear, that means only partial
proteins are related.
• It is evident that the fatality rate caused by the virus is highly related to
the distance between the divergent center (mean) and the maximal
divergent point (Table 2).
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
Scatter Plot
• A scatter graph (also called a scatter plot, scatter chart or scatter
diagram) is a type of plot or mathematical diagram using Cartesian
coordinates (with 4 quadrants) to display values for two variables for a
set of data.
• The data are displayed as a collection of points, each having the value
of one variable (charge value from Wuhan sequence) determining the
position on the horizontal axis (for Wuhan) and the value of the other
variable (charge value from others like Pangolin etc) determining the
position on the vertical axis (for other).
• A scatter plot can suggest various kinds of correlations between
variables with a certain linear or nonlinear pattern. Correlations may
be positive (rising), negative (falling), or neither (uncorrelated). If the
pattern of dots slopes from lower left to upper right, it indicates a
positive correlation between the variables being studied. If the
pattern of dots slopes from upper left to lower right, it indicates a
negative correlation.
401 Lab 2020 Math Training
3.36%
Linear vs Nonlinear
• If the dots are continuously connected one after another, we have a
simple linear relationship. If the dots form a few islands, we have the
nonlinear pattern. If both patterns are there, we have the mixed of
linear and nonlinear.
• If within the islands, it is linear, we can call it local linear, globally
nonlinear, or piece wised linear. It means only a particular charged
piece of the entire sequence is linear correlated within that piece.
• If we view the island as a super dot, and super dots forming a linear
relationship, we call it global linear, locally nonlinear. It means the
specially charged pieces of the entire sequence are linear
correlated among the pieces. Each piece has its unique electro-
biological functions.
• The 1st and 3rd quadrants are pieces where Wuhan sequence have
the same charge as the Pangolin's. The 2nd and 4th quadrants are
pieces where Wuhan sequence have the opposite charge as the
Pangolin's.
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
401 Lab 2020 Math Training
3.36%
Conclusion
• We have analyzed spike protein charge patterns of
coronaviruses by using our algorithm of semi-covariance
(nonlinear) coefficient as compared to Pearson (linear)
correlation.
• The analysis reveals additional performance index over Pearson
analysis, such as both positive- and negative-correlative
centers/regions in the spike proteins.
• The analysis provides in-depth understanding for the nonlinear
viral evolution pattern and identifies the protein characteristics
associated with viral fatality.
• The example code is available from the Excel file on the github
server (https://github.com/steedhuang/covid-19-gene-convertor).
• Our future work will pay more attention on the relationship
between positive charges to infectivity. As UK version has 20%
more positive charges!
401 Lab 2020 Math Training
3.36%
Acknowledgement
The work in Dr. Zhang’s lab is supported by a team
grant on the Rapid Research Response to COVID-19
Outbreak awarded from the Canadian Institute of Health
Research (CIHR) and by funding from the National
Research Council of Canada.
Thanks go to Lishen Wang from Jiangsu University for
writing Python code to covert sequences into charges.
Thanks also go to Mei Huang from Ottawa Hospital
COVID-19 patient unit for proof reading and editing the
final version.
401 Lab 2020 Math Training

More Related Content

What's hot

Finding important nodes in social networks based on modified pagerank
Finding important nodes in social networks based on modified pagerankFinding important nodes in social networks based on modified pagerank
Finding important nodes in social networks based on modified pagerankcsandit
 
Piecewise Controller Design for Affine Fuzzy Systems
Piecewise Controller Design for Affine Fuzzy SystemsPiecewise Controller Design for Affine Fuzzy Systems
Piecewise Controller Design for Affine Fuzzy SystemsISA Interchange
 
Bioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisBioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisSangeeta Das
 
Inference of the JAK-STAT Gene Network via Graphical Models
Inference of the JAK-STAT Gene Network via Graphical ModelsInference of the JAK-STAT Gene Network via Graphical Models
Inference of the JAK-STAT Gene Network via Graphical ModelsSSA KPI
 
Quantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingQuantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingAkshay Kank
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data miningSangeeta Das
 
Review On Molecular Modeling
Review On Molecular ModelingReview On Molecular Modeling
Review On Molecular Modelingankishukla000
 
Lecture 5 pharmacophore and qsar
Lecture 5  pharmacophore and  qsarLecture 5  pharmacophore and  qsar
Lecture 5 pharmacophore and qsarRAJAN ROLTA
 
what is Correlations
what is Correlationswhat is Correlations
what is Correlationsderiliumboy
 
Molecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabusMolecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabusShikha Popali
 
An Improved AC-BM Algorithm for Monitoring Watch List
An Improved AC-BM Algorithm for Monitoring Watch ListAn Improved AC-BM Algorithm for Monitoring Watch List
An Improved AC-BM Algorithm for Monitoring Watch ListNooria Sukmaningtyas
 

What's hot (20)

Finding important nodes in social networks based on modified pagerank
Finding important nodes in social networks based on modified pagerankFinding important nodes in social networks based on modified pagerank
Finding important nodes in social networks based on modified pagerank
 
final paper1
final paper1final paper1
final paper1
 
Piecewise Controller Design for Affine Fuzzy Systems
Piecewise Controller Design for Affine Fuzzy SystemsPiecewise Controller Design for Affine Fuzzy Systems
Piecewise Controller Design for Affine Fuzzy Systems
 
Data handling metabolomics
Data handling metabolomicsData handling metabolomics
Data handling metabolomics
 
Bioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisBioinformatics_Sequence Analysis
Bioinformatics_Sequence Analysis
 
15-088-pub
15-088-pub15-088-pub
15-088-pub
 
Inference of the JAK-STAT Gene Network via Graphical Models
Inference of the JAK-STAT Gene Network via Graphical ModelsInference of the JAK-STAT Gene Network via Graphical Models
Inference of the JAK-STAT Gene Network via Graphical Models
 
Quantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modelingQuantum Mechanics in Molecular modeling
Quantum Mechanics in Molecular modeling
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Ijetr042111
Ijetr042111Ijetr042111
Ijetr042111
 
Review On Molecular Modeling
Review On Molecular ModelingReview On Molecular Modeling
Review On Molecular Modeling
 
Scoring function
Scoring functionScoring function
Scoring function
 
Lecture 5 pharmacophore and qsar
Lecture 5  pharmacophore and  qsarLecture 5  pharmacophore and  qsar
Lecture 5 pharmacophore and qsar
 
25.qsar
25.qsar25.qsar
25.qsar
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
 
what is Correlations
what is Correlationswhat is Correlations
what is Correlations
 
Final Version
Final VersionFinal Version
Final Version
 
Molecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabusMolecular modelling for M.Pharm according to PCI syllabus
Molecular modelling for M.Pharm according to PCI syllabus
 
An Improved AC-BM Algorithm for Monitoring Watch List
An Improved AC-BM Algorithm for Monitoring Watch ListAn Improved AC-BM Algorithm for Monitoring Watch List
An Improved AC-BM Algorithm for Monitoring Watch List
 

Similar to Semi-covariance analysis of spike protein charge patterns distinguishes coronaviruses

Predicted COVID-19 Ending Time
Predicted COVID-19 Ending TimePredicted COVID-19 Ending Time
Predicted COVID-19 Ending TimeJun Steed Huang
 
cannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfcannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfJermaeDizon2
 
X18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalyticsX18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalyticsShantanu Deshpande
 
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical MethodsMCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical MethodsRai University
 
Statistics for Data Analytics
Statistics for Data AnalyticsStatistics for Data Analytics
Statistics for Data AnalyticsTushar Dalvi
 
Exercise 29Calculating Simple Linear RegressionSimple linear reg.docx
Exercise 29Calculating Simple Linear RegressionSimple linear reg.docxExercise 29Calculating Simple Linear RegressionSimple linear reg.docx
Exercise 29Calculating Simple Linear RegressionSimple linear reg.docxAlleneMcclendon878
 
Stats ca report_18180485
Stats ca report_18180485Stats ca report_18180485
Stats ca report_18180485sarthakkhare3
 
Cannonical Correlation
Cannonical CorrelationCannonical Correlation
Cannonical Correlationdomsr
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlationdomsr
 
Predicting the age of abalone
Predicting the age of abalonePredicting the age of abalone
Predicting the age of abalonehyperak
 
A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENT
A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENTA NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENT
A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENTSavas Papadopoulos, Ph.D
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionSumit Prajapati
 
30REGRESSION Regression is a statistical tool that a.docx
30REGRESSION  Regression is a statistical tool that a.docx30REGRESSION  Regression is a statistical tool that a.docx
30REGRESSION Regression is a statistical tool that a.docxtarifarmarie
 
ANTIC-2021_paper_95.pdf
ANTIC-2021_paper_95.pdfANTIC-2021_paper_95.pdf
ANTIC-2021_paper_95.pdfDrGRevathy
 

Similar to Semi-covariance analysis of spike protein charge patterns distinguishes coronaviruses (20)

Predicted COVID-19 Ending Time
Predicted COVID-19 Ending TimePredicted COVID-19 Ending Time
Predicted COVID-19 Ending Time
 
cannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfcannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdf
 
X18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalyticsX18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalytics
 
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical MethodsMCA_UNIT-4_Computer Oriented Numerical Statistical Methods
MCA_UNIT-4_Computer Oriented Numerical Statistical Methods
 
Statistics for Data Analytics
Statistics for Data AnalyticsStatistics for Data Analytics
Statistics for Data Analytics
 
Exercise 29Calculating Simple Linear RegressionSimple linear reg.docx
Exercise 29Calculating Simple Linear RegressionSimple linear reg.docxExercise 29Calculating Simple Linear RegressionSimple linear reg.docx
Exercise 29Calculating Simple Linear RegressionSimple linear reg.docx
 
Stats ca report_18180485
Stats ca report_18180485Stats ca report_18180485
Stats ca report_18180485
 
Cannonical Correlation
Cannonical CorrelationCannonical Correlation
Cannonical Correlation
 
Cannonical correlation
Cannonical correlationCannonical correlation
Cannonical correlation
 
Predicting the age of abalone
Predicting the age of abalonePredicting the age of abalone
Predicting the age of abalone
 
A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENT
A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENTA NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENT
A NEW CORRELATION COEFFICIENT AND A DECOMPOSITION OF THE PEARSON COEFFICIENT
 
bayes_proj
bayes_projbayes_proj
bayes_proj
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And Regression
 
Regression -Linear.pptx
Regression -Linear.pptxRegression -Linear.pptx
Regression -Linear.pptx
 
Characteristics and simulation analysis of nonlinear correlation coefficient ...
Characteristics and simulation analysis of nonlinear correlation coefficient ...Characteristics and simulation analysis of nonlinear correlation coefficient ...
Characteristics and simulation analysis of nonlinear correlation coefficient ...
 
30REGRESSION Regression is a statistical tool that a.docx
30REGRESSION  Regression is a statistical tool that a.docx30REGRESSION  Regression is a statistical tool that a.docx
30REGRESSION Regression is a statistical tool that a.docx
 
Inferential statistics correlations
Inferential statistics correlationsInferential statistics correlations
Inferential statistics correlations
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysis
 
-P-M_118_17ICSB
-P-M_118_17ICSB-P-M_118_17ICSB
-P-M_118_17ICSB
 
ANTIC-2021_paper_95.pdf
ANTIC-2021_paper_95.pdfANTIC-2021_paper_95.pdf
ANTIC-2021_paper_95.pdf
 

More from Jun Steed Huang

Forest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic HealthForest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic HealthJun Steed Huang
 
Alphaba Smart Bus Autonomous Design
Alphaba Smart Bus Autonomous DesignAlphaba Smart Bus Autonomous Design
Alphaba Smart Bus Autonomous DesignJun Steed Huang
 
Analysis of 2020 USA Presidential Election
Analysis of 2020 USA Presidential Election Analysis of 2020 USA Presidential Election
Analysis of 2020 USA Presidential Election Jun Steed Huang
 
Cosmos Genesis Entanglement Speed
Cosmos Genesis Entanglement SpeedCosmos Genesis Entanglement Speed
Cosmos Genesis Entanglement SpeedJun Steed Huang
 
7 Safety +1 COVID-19 Protection Lines
7 Safety +1 COVID-19 Protection Lines7 Safety +1 COVID-19 Protection Lines
7 Safety +1 COVID-19 Protection LinesJun Steed Huang
 
Hyper variance and autonomous bus
Hyper variance and autonomous busHyper variance and autonomous bus
Hyper variance and autonomous busJun Steed Huang
 
From Hadamard to Langlands
From Hadamard to LanglandsFrom Hadamard to Langlands
From Hadamard to LanglandsJun Steed Huang
 
Homogeneous Cosmos Unique Genesis
Homogeneous Cosmos Unique GenesisHomogeneous Cosmos Unique Genesis
Homogeneous Cosmos Unique GenesisJun Steed Huang
 
Quantum Brain Storm Optimization
Quantum Brain Storm OptimizationQuantum Brain Storm Optimization
Quantum Brain Storm OptimizationJun Steed Huang
 

More from Jun Steed Huang (20)

ICFT-VNX-2022V2.pdf
ICFT-VNX-2022V2.pdfICFT-VNX-2022V2.pdf
ICFT-VNX-2022V2.pdf
 
Forest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic HealthForest Environment Analysis for the Pandemic Health
Forest Environment Analysis for the Pandemic Health
 
Alphaba Smart Bus Autonomous Design
Alphaba Smart Bus Autonomous DesignAlphaba Smart Bus Autonomous Design
Alphaba Smart Bus Autonomous Design
 
Analysis of 2020 USA Presidential Election
Analysis of 2020 USA Presidential Election Analysis of 2020 USA Presidential Election
Analysis of 2020 USA Presidential Election
 
Cosmos Genesis Entanglement Speed
Cosmos Genesis Entanglement SpeedCosmos Genesis Entanglement Speed
Cosmos Genesis Entanglement Speed
 
7 Safety +1 COVID-19 Protection Lines
7 Safety +1 COVID-19 Protection Lines7 Safety +1 COVID-19 Protection Lines
7 Safety +1 COVID-19 Protection Lines
 
Hyper variance and autonomous bus
Hyper variance and autonomous busHyper variance and autonomous bus
Hyper variance and autonomous bus
 
Mine Death Estimation
Mine Death EstimationMine Death Estimation
Mine Death Estimation
 
Steed Variance
Steed VarianceSteed Variance
Steed Variance
 
Power plant
Power plantPower plant
Power plant
 
Vtc9252019
Vtc9252019Vtc9252019
Vtc9252019
 
Quatum fridge
Quatum fridgeQuatum fridge
Quatum fridge
 
Hypo Variance
Hypo VarianceHypo Variance
Hypo Variance
 
From Hadamard to Langlands
From Hadamard to LanglandsFrom Hadamard to Langlands
From Hadamard to Langlands
 
Homogeneous Cosmos Unique Genesis
Homogeneous Cosmos Unique GenesisHomogeneous Cosmos Unique Genesis
Homogeneous Cosmos Unique Genesis
 
Complex Hurst for NDVI
Complex Hurst for NDVIComplex Hurst for NDVI
Complex Hurst for NDVI
 
Selabot Swarm
Selabot Swarm Selabot Swarm
Selabot Swarm
 
Autonomous bus
Autonomous busAutonomous bus
Autonomous bus
 
Deep Space Home
Deep Space HomeDeep Space Home
Deep Space Home
 
Quantum Brain Storm Optimization
Quantum Brain Storm OptimizationQuantum Brain Storm Optimization
Quantum Brain Storm Optimization
 

Recently uploaded

VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...narwatsonia7
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...narwatsonia7
 
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...Miss joya
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.MiadAlsulami
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatorenarwatsonia7
 
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000aliya bhat
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Nehru place Escorts
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...narwatsonia7
 
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near MeHi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Menarwatsonia7
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalorenarwatsonia7
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowRiya Pathan
 
Call Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service ChennaiCall Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service ChennaiNehru place Escorts
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaPooja Gupta
 
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiCall Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiNehru place Escorts
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...narwatsonia7
 
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...narwatsonia7
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 

Recently uploaded (20)

VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
VIP Call Girls Tirunelveli Aaradhya 8250192130 Independent Escort Service Tir...
 
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
Russian Call Girls Chickpet - 7001305949 Booking and charges genuine rate for...
 
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
VIP Call Girls Pune Vani 9907093804 Short 1500 Night 6000 Best call girls Ser...
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
 
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000Ahmedabad Call Girls CG Road 🔝9907093804  Short 1500  💋 Night 6000
Ahmedabad Call Girls CG Road 🔝9907093804 Short 1500 💋 Night 6000
 
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
Russian Call Girls Chennai Madhuri 9907093804 Independent Call Girls Service ...
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
 
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near MeHi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
Call Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service ChennaiCall Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
Call Girls Chennai Megha 9907093804 Independent Call Girls Service Chennai
 
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service NoidaCall Girls Service Noida Maya 9711199012 Independent Escort Service Noida
Call Girls Service Noida Maya 9711199012 Independent Escort Service Noida
 
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service ChennaiCall Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
Call Girl Chennai Indira 9907093804 Independent Call Girls Service Chennai
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
 
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
Russian Call Girls in Bangalore Manisha 7001305949 Independent Escort Service...
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 

Semi-covariance analysis of spike protein charge patterns distinguishes coronaviruses

  • 1. Semi-covariance coefficient analysis of spike proteins from SARS-CoV-2 and other coronaviruses for viral evolution and characteristics associated with fatality By Jun Huang, Rebecca Spencer, Wandong Zhang December 25 2020 – January 25 2021 Data Processing Training for 401 Lab
  • 2. 3.36% Agenda 1. Introduction 2. Materials and Methods 3. Results and Conclusion 401 Lab 2020 Math Training
  • 3. 3.36% 1. Introduction 401 Lab 2020 Math Training • Complex modeling has received significant attention in recent years and is increasingly used to explain the statistical phenomenon with increasing and decreasing fluctuations such as the similarity or difference of spike protein charge patterns of coronaviruses. • Different from the existing covariance or correlation coefficient methods in traditional integer dimension construction, this study proposes a simplified novel fractional dimension derivation with the exact Excel tool algorithm. • It involves the fractional center moment extension to covariance, which ends up as a complex covariance coefficient that is better than the Pearson correlation coefficient, in the sense that the nonlinearity relationship can be further depicted.
  • 4. 3.36% Positive or Negative Charge 401 Lab 2020 Math Training • The spike protein sequences of coronaviruses were obtained from the GenBank and GISAID database, including the coronaviruses from pangolin, bat, canine, porcine (three variations), feline, tiger, SARS-CoV-1, MERS, and SARS-CoV- 2 (Wuhan, Beijing, New York, German and UK) were used as the representative examples in this study. • By examining the values above and below the average/mean based on the positive and negative charge patterns of the amino acid residues of the spike proteins from coronaviruses, the proposed algorithm provides deep insights into the nonlinear evolving trends of spike proteins for understanding the viral gene sequence evolution and identifying the protein characteristics associated with viral fatality. • SARS-CoV-1 is negative charged, SARS-CoV-2 is 10 times more positive charged. UK version 20% more charging!
  • 5. 3.36% Viability 401 Lab 2020 Math Training • The calculation results demonstrate that the complex covariance coefficient analyzed by this algorithm is capable of distinguishing the subtle nonlinear differences in the spike protein charge patterns with reference to the Wuhan SARS- CoV-2 for which the Pearson correlation coefficient may overlook. • Our analysis reveals the unique convergent (positive correlative) to divergent (negative correlative) domain center positions of each virus. • The convergent or conserved region may be critical to the viral evolution stability or viability; while the divergent region is highly variable between coronaviruses suggesting high frequency of mutations in this region.
  • 6. 3.36% Residues 401 Lab 2020 Math Training • The analysis shows that the conserved center region of SARS- CoV-1 spike protein is located at amino acid residues 900, but shifted to the amino acid residues 700 in MERS spike protein, and then to amino acid residues 600 in SARS-CoV-2 spike protein, indicating the evolvement of the coronaviruses. • Another important characteristic our study reveals that the distance between the divergent mean and the maximal divergent point in each of the viruses (MERS>SARS-CoV- 1>SARS-CoV-2) is proportional to viral fatality rate. • This algorithm may help to understand and analyze the evolving trends and critical characteristics of other coronaviral proteins and viruses.
  • 7. Number Matters. 2. Materials and Methods 401 Lab 2020 Math Training The coronavirus spike protein sequences used in this study were obtained from the NCBI GenBank and the GISAID database, including SARS- CoV-2 (the sequences isolated in Wuhan, Beijing Xinfadi wholesale market, Germany, New York, UK and New York Zoo tiger), SARS-CoV-1, Middle East respiratory syndrome (MERS), bat coronavirus (RaTG13), pangolin coronavirus, feline coronavirus, canine coronavirus, and swine coronaviruses [Swine Transmissible gastroenteritis virus (Swine-stomach), swine enteric coronavirus (Swine-Ent), and porcine respiratory coronavirus (Swine-Res)]. The sequence ID from the GenBank and GISAID database are listed in Table 1.
  • 8. Number Matters. Hypo, Hyper or Gauss Variances and Covariance 401 Lab 2020 Math Training
  • 9. 3.36% 3. Results and Conclusion • To compare and prove the usefulness of the simplified complex variances, we compare the correlation of SARS- CoV-2 viral spike protein sequence with other coronavirus spike protein sequences. • Since Excel is not capable of handling the imaginary number, we simplify the calculation with integer power, but separate the positive and negative covariance signs. • Because coronaviruses spike proteins have different electrical charge levels, we normalize the covariance by the variance respectively just as the Pearson calculation does. 401 Lab 2020 Math Training
  • 10. 3.36% Figures and Tables • Figures 1-6 are the calculation results from our algorithm of semi- covariance coefficient for spike protein Wuhan SARS-CoV-2 in comparison with spike proteins of other coronaviruses listed in Table 1. • Figure 7-9 are a combination of linear and nonlinear relationships baselined on Wuhan. • Figure 10-15 are nonlinear relationships baselined on Wuhan. • Figure 16-19 are linear relationships baselined on Wuhan. • Figure 20 is a combination of linear and nonlinear relationships again. Nonlinear relationship is piece wised linear, that means only partial proteins are related. • It is evident that the fatality rate caused by the virus is highly related to the distance between the divergent center (mean) and the maximal divergent point (Table 2). 401 Lab 2020 Math Training
  • 11. 3.36% 401 Lab 2020 Math Training
  • 12. 3.36% 401 Lab 2020 Math Training
  • 13. 3.36% 401 Lab 2020 Math Training
  • 14. 3.36% 401 Lab 2020 Math Training
  • 15. 3.36% 401 Lab 2020 Math Training
  • 16. 3.36% 401 Lab 2020 Math Training
  • 17. 3.36% Scatter Plot • A scatter graph (also called a scatter plot, scatter chart or scatter diagram) is a type of plot or mathematical diagram using Cartesian coordinates (with 4 quadrants) to display values for two variables for a set of data. • The data are displayed as a collection of points, each having the value of one variable (charge value from Wuhan sequence) determining the position on the horizontal axis (for Wuhan) and the value of the other variable (charge value from others like Pangolin etc) determining the position on the vertical axis (for other). • A scatter plot can suggest various kinds of correlations between variables with a certain linear or nonlinear pattern. Correlations may be positive (rising), negative (falling), or neither (uncorrelated). If the pattern of dots slopes from lower left to upper right, it indicates a positive correlation between the variables being studied. If the pattern of dots slopes from upper left to lower right, it indicates a negative correlation. 401 Lab 2020 Math Training
  • 18. 3.36% Linear vs Nonlinear • If the dots are continuously connected one after another, we have a simple linear relationship. If the dots form a few islands, we have the nonlinear pattern. If both patterns are there, we have the mixed of linear and nonlinear. • If within the islands, it is linear, we can call it local linear, globally nonlinear, or piece wised linear. It means only a particular charged piece of the entire sequence is linear correlated within that piece. • If we view the island as a super dot, and super dots forming a linear relationship, we call it global linear, locally nonlinear. It means the specially charged pieces of the entire sequence are linear correlated among the pieces. Each piece has its unique electro- biological functions. • The 1st and 3rd quadrants are pieces where Wuhan sequence have the same charge as the Pangolin's. The 2nd and 4th quadrants are pieces where Wuhan sequence have the opposite charge as the Pangolin's. 401 Lab 2020 Math Training
  • 19. 3.36% 401 Lab 2020 Math Training
  • 20. 3.36% 401 Lab 2020 Math Training
  • 21. 3.36% 401 Lab 2020 Math Training
  • 22. 3.36% 401 Lab 2020 Math Training
  • 23. 3.36% 401 Lab 2020 Math Training
  • 24. 3.36% 401 Lab 2020 Math Training
  • 25. 3.36% 401 Lab 2020 Math Training
  • 26. 3.36% 401 Lab 2020 Math Training
  • 27. 3.36% 401 Lab 2020 Math Training
  • 28. 3.36% 401 Lab 2020 Math Training
  • 29. 3.36% 401 Lab 2020 Math Training
  • 30. 3.36% 401 Lab 2020 Math Training
  • 31. 3.36% 401 Lab 2020 Math Training
  • 32. 3.36% 401 Lab 2020 Math Training
  • 33. 3.36% 401 Lab 2020 Math Training
  • 34. 3.36% 401 Lab 2020 Math Training
  • 35. 3.36% Conclusion • We have analyzed spike protein charge patterns of coronaviruses by using our algorithm of semi-covariance (nonlinear) coefficient as compared to Pearson (linear) correlation. • The analysis reveals additional performance index over Pearson analysis, such as both positive- and negative-correlative centers/regions in the spike proteins. • The analysis provides in-depth understanding for the nonlinear viral evolution pattern and identifies the protein characteristics associated with viral fatality. • The example code is available from the Excel file on the github server (https://github.com/steedhuang/covid-19-gene-convertor). • Our future work will pay more attention on the relationship between positive charges to infectivity. As UK version has 20% more positive charges! 401 Lab 2020 Math Training
  • 36. 3.36% Acknowledgement The work in Dr. Zhang’s lab is supported by a team grant on the Rapid Research Response to COVID-19 Outbreak awarded from the Canadian Institute of Health Research (CIHR) and by funding from the National Research Council of Canada. Thanks go to Lishen Wang from Jiangsu University for writing Python code to covert sequences into charges. Thanks also go to Mei Huang from Ottawa Hospital COVID-19 patient unit for proof reading and editing the final version. 401 Lab 2020 Math Training