“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
COVID 19
1. “COVID - 19 latest papers Review”
Subash Chandra Pakhrin
PhD Student
Wichita State University
Wichita, Kansas
1
2. Introduction
• alpha, beta, delta and gamma
• alpha and beta coronaviruses are known to infect humans
• Coronaviruses are spherical in shape with an average diameter of 80 –
120 nm
• Under the electron microscope, the virus are characterized by surface
projections (Spike Glycoprotein / S protein)
4. Objective
• Understand underlining mechanism of COVID-19 virus
• Understand different protein structure prediction of COVID-19 virus
• Role of Glycosylation Post Translation Modification at Spike Protein
• Understand how AI / ML is used to identify drug of COVID-19 virus
• Understand how Next Generation Sequencing is used to identify
intermediate host.
5. Angiotensin converting enzyme 2 (ACE2)
• Angiotensin converting enzyme 2
(ACE2) is an essential regulator
expressed in the heart, lung etc.
• Coronaviruses enter into human
lung cell through ACE2 [35].
• Molecules that can block S protein
binding to ACE2 may potentially
prevent the virus from entering
human cells and serve as an
effective antiviral drug.
Figure 2: ACE2 receptors interaction with the Spike protein
6. ACE2 receptors interaction with the Spike protein
• ACE2 receptors interacts with
the spike protein (S) of SARS-
COV-2 as confirmed by
structural analysis of SARS-COV-
2 [36] [37]
Figure 3: structural analysis of SARS-COV-2
7. Protein structure prediction
• Zhang lab developed three dimensional structural models and
function for all proteins of SARS-CoV-2
• The structure models are generated by the C-I-TASSER pipeline, which
utilizes deep convolutional neural-network based contact-map
predictions
• Based on this idea, authors designed a series of peptides that can
strongly bind to SARS-CoV-2 receptor binding domain (RBD) in
computational experiments [41]
10. Protein structure prediction
• The receptor binding domain (RBD)
located on the head of Spike protein,
bind with the cellular receptor
angiotensin-converting enzyme 2
(ACE2), initiating the membrane
fusion of the virus and host cell.
Figure 5: Spike glycoprotein (S)
11. Protein structure prediction
• Senior et al. [40] developed a model called AlphaFold
• AlphaFold is based on a deep learning architecture and it predicts
protein structure accurately when structures of similar proteins are
not available.
• AlphaFold predicted the structures of six proteins related to SARS-
CoV-2 (the membrane protein, protein 3a, nsp2, nsp4, nsp6, and
papain-like protease)
13. Spike Glycoprotein prediction
• B. Robson [14] worked to find a short section of viral protein
sequence suitable for preliminary design proposal for a peptide
synthetic vaccine, and to explore some design possibilities.
• Author used Q-UEL systems to access relevant and emerging
literature
• KRSFIEDLLFNKV motif was found to be well conserved and
corresponds to the region around one sites of the SARS virus that are
believed to be required for virus activation for cell entry.
• This motif and surrounding variations formed the basis for proposing
a specific synthetic vaccine epitope.
14. Virus Structure prediction
• Yan Gao, et al. [43] found Non Structural Protein (nsp) 12 or RdRp is
the central component of corona virus replication and appears to be a
primary target for the antiviral drug, remdesivir.
• Authors reported Cryogenic Electron Microscopy structure of nsp12 in
complex with cofactors nsp7 and nsp8 at 2.9-Å resolution.
• nsp12 possesses a newly identified β-hairpin domain at its N
terminus.
• The structure provides a basis for the design of new antiviral
therapeutics targeting viral protein nsp12.
15. Virus Structure prediction
• Ibrahim, et al. [23], found spike protein of coronavirus is the main
driving force for host cell recognition
• COVID-19 spike binding site to the cell-surface receptor (GRP78) is
predicted using combined molecular modeling docking and structural
bioinformatics.
• Authors, found that the binding is more favorable between regions III
(C391-C525) and IV (C480-C488) of the spike protein model and
GRP78.
• Region IV is the main driving force for GRP78 binding with the
predicted binding affinity of -9.8 kcal/mol.
16. Figure 7: (A) Part of the multiple sequence alignment for the spike protein of all of the currently reported human coronaviruses
strains (COVID-19, SARS, MERS, NL63, 229E, OC43, and HKU1). The red highlighted residues are identical, while yellow
highlighted residues are conserved among the seven HCoVs. Secondary structures are represented at the top of the MSA for
the COVID-19 spike, while the surface accessibility is shown at the bottom (blue, surface accessible, cyan, partially accessible,
and white for buried residues). The four regions of the spike protein are shaded with green, blue, magenta, and red for regions
I, II, III, and IV, respectively.
Virus Structure prediction
17. Figure 8: (B) Structural superposition of SARS spike structure (green cartoon) and
COVID-19 spike model (cyan cartoon).
Virus Structure prediction
18. Figure 9: (A) The structure of the spike protein model of COVID-19 in its homo - trimer state (colored
cartoon). Two chains, A (green) and B (cyan) are in the closed conformation, while chain C (magenta)
is the open configuration that makes it able to recognize the host cell receptor. Region IV of the spike
(C480-C488), which we suggest is the recognition site for cell-surface GRP78, is shown in the black
cartoon in the enlarged panel.
Virus Structure prediction
19. Figure 10: (B) Region III of spike (black cartoon)
Virus Structure prediction
20. Figure 11: The structure of the docking complexes of GRP78 (green cartoon) and COVID-19 spike (yellow
cartoon) regions I and II (A)
Virus Structure prediction
21. Figure 12: The structure of the docking complexes of GRP78 (green cartoon) and COVID-19 spike (yellow
cartoon) and regions III and IV (B)
Virus Structure prediction
22. Figure 13: (A) The proposed binding mode of the host cell GRP78 (cyan surface) and the COVID-19 spike model
(green surface) through region IV (C480-C488) (red surface). The amino acids from the GRP78 SBDβ that interact
with the spike protein region IV (red cartoon) are labeled and represented in yellow sticks in the enlarged panel.
Virus Structure prediction
23. Figure 14: (B) The proposed recognition mode of the COVID-19 spike (red surface) and cell-surface GRP78 (green
surface) through the spike protein region C480-C488.
Virus Structure prediction
24. Post Translation Modification
• Naveen Vankadari, et al. [32] predicted N - linked and O - linked
glycosylation occur at spike protein
• Many COVID - 19 proteins are modified by PTMs
• Palmitoylation occurs at envelope protein
• N - linked and O - linked glycosylation occurs at membrane protein
• Phosphorylation and ADP - ribosylation occurs at nucleo capsid
protein
25. Figure 15: Overall homo - trimer model structure of the COVID-19 spike glycoprotein (A) ligand unbound
conformation (B) ligand-bound conformation. The three protomers are colored pink, green and cyan. S1- and S2-
domains labelled. Receptor-binding induced hinge motion of S1 is distinguishable.
Post Translation Modification
26. Figure 16: (A and B) Ribbon and a surface diagram showing the docking interface of modelled COVID-19 (grey)
and human CD26 (orange)(PDB: 4QZV) complex. Predicted key residues involved in the interaction are shown in
sticks (CD26 residues are underlined) (C) Overall docking results showing the surface model of CD26 with COVID-
19 predicted homo - trimer structure (ligand-bound conformation).
Post Translation Modification
27. Figure 17: Predicted N-linked glycosylation sites for COVID-19(D) and SARS-CoV (E). Unique glycosylation sites
are colored in Blue, and shared sites are shaded in Red.
Post Translation Modification
28. Post Translation Modification, Glycosylation
• Oligosaccharides attaches via nitrogen atom on the side chain of
asparagine ( N-Linked)
• Oligosaccharides attaches via oxygen atom on the side chain of threonine
or serine (O-Linked)
Figure 18: N-linked glycosylation and O-linked glycosylation.
29. Post Translation Modification, Glycosylation
• Celia Henry Arnaud [46] reported that “Viruses use glycosylation to hide
their viral proteins”
• Glycosylation can act as camouflage because the sugars on viral proteins
come from the animal or person that has been infected.
• Viruses commandeer the enzymatic machinery that host cells use to add
sugars to their own proteins and get those enzymes to attach glycan's to
viral proteins.
• SARS-CoV-2 spike protein’s is covered in carbohydrates, but it’s slightly
lower than in HIV
• SARS-CoV-2’s sparser glycosylation means that the sugars are more
naturally processed than the ones in HIV
Note: Glycosylation helps to identify different types of blood groups
30. Camouflaging nature of SARS-CoV 2
• Author suggests that the coronavirus’s glycan shield may not be as effective as
that of HIV
• When a virus infects a human cell, the cell’s ribosome picks up viral RNA and
translates it into proteins inside a part of the cell called the endoplasmic
reticulum.
• Cellular enzymes start adding sugars to newly synthesized proteins as they exit
the ribosome.
• Once synthesized, the proteins travel through the Golgi apparatus to be secreted
into the cell.
• Along the way, those added glycan’s go through a maturation process in which
other enzymes iteratively trim sugar structures and decorate them with other
types of sugars to yield complex, branched structures.
• The more processed the sugars are, the more they look like the host’s own
sugars.
31. Next Generation sequencing application
• Lam et al. found several putative pangolin CoV sequences with 85.5%
to 92.4% similarity to 2019-nCoV, hence pangolins serves as a
potential intermediate host.
• Jeong-Min Kim, et al. [5] Sequence homology of SARS-CoV-2 with
SARS-CoV, and MERS-CoV was 77.5% and 50%, respectively.
33. AI / ML based approaches for Drug discover
• Zhang et al. [26] used dense fully connected neural network
architecture, trained to predict binding affinities on the PDBbind
database, in order to identify potential inhibitors of the 3C-like
protease.
• Ren Kong et al. [15] introduce COVID-19 Docking Server, that predicts
the binding modes between COVID-19 targets and the ligands
including small molecules, peptide and antibody.
• Anh-Tien Ton, et al. [18] a novel deep learning platform was used to
identify potential inhibitors of the SARS-CoV-2 main protease (Mpro)
by screening 1.3 billion compounds
34. AI / ML based approaches for Drug discover
• Muhammad, et al. [13] Authors analyzed the 3CLpro sequence,
constructed its 3D homology model, and screened it against a
medicinal plant library containing 32,297 potential anti-viral
phytochemicals / traditional Chinese medicinal compounds.
• Authors revealed top nine hits which might serve as potential anti-
SARS-CoV-2 lead molecules for drug development process to combat
COVID-19.
• Wang, et al. [16] screened some of the FDA approved anti-virus and
found that remdesivir and chloroquine could effectively inhibit the
virus
35. References
[1] Andersen, K.G., Rambaut, A., Lipkin, W.I. et al. The proximal origin of SARS-CoV-2. Nat Med 26, 450–452
(2020). https://doi.org/10.1038/s41591-020-0820-9
[2] Jiumeng Sun et al, COVID-19: Epidemiology, Evolution, and Cross-Disciplinary Perspectives
DOI: https://doi.org/10.1016/j.molmed.2020.02.008
[3] Swatantra Kumar et al, Structural, glycosylation and antigenic variation between 2019 novel coronavirus
(2019-nCoV) and SARS coronavirus (SARS-CoV) DOI: 10.1007/s13337-020-00571-5
[4] Emerging Wuhan (COVID-19) coronavirus: glycan shield and structure prediction of spike glycoprotein
and its interaction with human CD26, doi.org/10.1080/22221751.2020.1739565
[5] Identification of Coronavirus Isolated from a Patient in Korea with COVID-19
doi: 10.24171/j.phrp.2020.11.1.02
[6] Jiang, F., Deng, L., Zhang, L. et al. Review of the Clinical Characteristics of Coronavirus Disease 2019
(COVID-19). J GEN INTERN MED (2020). https://doi.org/10.1007/s11606-020-05762-w
[7] Zhou P, Yang XL, Wang XG, et al. A pneumonia outbreak associated with a new coronavirus of probable
bat origin. Nature. 2020; https://doi.org/10.1038/s41586-020-2012-7.
36. References
[8] Wan Y, Shang J, Graham R, Baric RS, Li F. Receptor recognition by novel coronavirus from Wuhan: An
analysis based on decade-long structural studies of SARS. J Virol. 2020; pii: JVI.00127-20.
https://doi.org/10.1128/JVI.00127-20.
[9] Holshue ML, DeBolt C, Lindquist S, et al. First case of 2019 novel coronavirus in the United States. N
Eng. J Med 2020; published online Jan 31. https://doi.org/10.1056/NEJMoa2001191.
[11] Huaxia. WHO says vaccines against novel coronavirus 18 months away, pushes global research. 2020.
Xinhuanet, Feb 12, 2020. http://www.xinhuanet.com/english/2020-02/12/c_138777886.htm.
[12] Mapping the landscape of Artificial Intelligence applications against COVID-19,
https://arxiv.org/pdf/2003.11336.pdf
[13] Muhammad Tahirul Qamar, Safar M.Alqahtani, Mubarak A.Alamri, Ling-Ling Chen, Structural basis of
SARS-CoV-2 3CLpro and anti-COVID-19 drug discovery from medicinal plants,
https://doi.org/10.1016/j.jpha.2020.03.009
[14] B. Robson, Computers and viral diseases. Preliminary bioinformatics studies on the design of a
synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV,
COVID-19) coronavirus, https://doi.org/10.1016/j.compbiomed.2020.103670
37. References
[15] Ren Kong1, Guangbo Yang1, et al. COVID-19 Docking Server: An interactive server for docking small
molecules, peptides and antibodies against potential targets of COVID-19
https://arxiv.org/ftp/arxiv/papers/2003/2003.00163.pdf
[16] Wang, M., Cao, R., et al. (2020) Remdesivir and chloroquine effectively inhibit the recently emerged
novel coronavirus (2019-nCoV) in vitro, Cell research.
[17] Yong Zhang, Wanjun Zhao, et al. Site-specific N-glycosylation Characterization of Recombinant SARS-
CoV-2 Spike Proteins using High-Resolution Mass Spectrometry, doi:
https://doi.org/10.1101/2020.03.28.013276
[18] Anh-Tien Ton et al. Rapid identification of potential inhibitors of SARS-CoV-2 main protease by deep
docking of 1.3 billion compounds. ChemRxiv. 2020 doi: 10.1002/minf.202000028.
[19] Choudhary S. Identification of SARS-CoV-2 cell entry inhibitors by drug repurposing using in silico
structure-based virtual screening approach. ChemRxiv. 2020 doi: 10.26434/chemrxiv.12005988.
[20] Mottini C. Computer-aided drug repurposing for cancer therapy: approaches and opportunities to
challenge anticancer targets. Semin. Cancer Biol. 2019 doi: 10.1016/j.semcancer.2019.09.023.
38. References
[21] Boosting the arsenal against COVID-19 through computational drug repurposing
doi: 10.1016/j.drudis.2020.04.005
[22] Gordon, D.E., Jang, et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing.
Nature (2020). https://doi.org/10.1038/s41586-020-2286-9
[23] Ibrahim M Ibrahim et al., COVID-19 Spike-Host Cell Receptor GRP78 Binding Site Prediction,
https://doi.org/10.1016/j.jinf.2020.02.026
[24] COVID-19 coronavirus vaccine design using reverse vaccinology and machine learning
doi: https://doi.org/10.1101/2020.03.20.000141
[25] Alex Zhavoronkov, et al. Potential 2019-nCoV 3C-like protease inhibitors designed using generative deep
learning approaches,
[26] Zhang H, Yang Y, Hossain MT, Li J, Ren X, et al. Deep Learning Based Drug Screening for Novel
Coronavirus 2019-nCov. Research Gate preprint 2020020061v1. 2020
[27] Ying Song, Shuangjia Zheng, Liang Li, et al. Deep learning Enables Accurate Diagnosis of Novel
Coronavirus (COVID-19) with CT images, doi: https://doi.org/10.1101/2020.02.23.20026930
39. References
[28] Shi, Weiya, Liu, et al. Deep Learning-Based Quantitative Computed Tomography Model in Predicting the
Severity of COVID-19: A Retrospective Study in 196 Patients (2/26/2020). Available at SSRN:
https://ssrn.com/abstract=3546089 or http://dx.doi.org/10.2139/ssrn.3546089
[29] Randy J Read, Flawed methods in “COVID-19: Attacks the 1-Beta Chain of Hemoglobin and Captures the
Porphyrin to Inhibit Human Heme Metabolism”, ChemRxiv, 2020
[30] Smith, Micholas; Smith, Jeremy C. (2020): Repurposing Therapeutics for COVID-19: Supercomputer-
Based Docking to the SARS-CoV-2 Viral Spike Protein and Viral Spike Protein-Human ACE2 Interface.
ChemRxiv. Preprint. https://doi.org/10.26434/chemrxiv.11871402.v3
[31] Redka, Dar'ya S.; MacKinnon, Stephen S. et al, (2020): PolypharmDB, a Deep Learning-Based Resource,
Quickly Identifies Repurposed Drug Candidates for COVID-19. ChemRxiv. Preprint.
https://doi.org/10.26434/chemrxiv.12071271.v1
[32] Naveen Vankadari, Jacqueline A. Wilce, Emerging Wuhan (COVID-19) coronavirus: glycan shield and
structure prediction of spike glycoprotein and its interaction with human CD26,
https://doi.org/10.1080/22221751.2020.1739565
[33] Ren Kong, Guangbo Yang, et al, COVID-19 Docking Server: An interactive server for docking small
molecules, peptides and antibodies against potential targets of COVID-19
[34] Yasunori Watanabe, Thomas A. Bowden, et al. Exploitation of glycosylation in enveloped virus
pathobiology, https://doi.org/10.1016/j.bbagen.2019.05.012
40. References
[35] Wan Y, Shang J, Graham R, Baric RS, Li F. Receptor recognition by novel coronavirus from Wuhan: An
analysis based on decade-long structural studies of SARS https://doi.org/10.1128/JVI.00127-20
[36] Lu R, Zhao X, Li J, et al. Genomic characterization and epidemiology of 2019 novel coronavirus:
implications for virus origins and receptor binding pdf icon[1.35 MB, 10 pages]. The Lancet. Published
online January 29, 2020. https://doi.org/10.1016/ S0140-6736(20)30251-8
[37] Jia HP, Look DC, Shi L, et al. ACE2 receptor expression and severe acute respiratory syndrome coronavirus
infection depend on differentiation of human airway epithelia. J Virol.
2005;https://doi.org/10.1128/JVI.79.23.14614-14621.2005
[38] To Sing Fung & Ding Xiang Liu, Post-translational modifications of coronavirus proteins: roles and
function, Future Virology, 21 May 2018, https://doi.org/10.2217/fvl-2018-0008
[39] Naveen Vankadari, Jacqueline A. Wilce, Emerging COVID-19 coronavirus: glycan shield and structure
prediction of spike glycoprotein and its interaction with human CD26,
https://doi.org/10.1080/22221751.2020.1739565
[40] Andrew Senior, John Jumper, et al. “Computational predictions of protein structures associated with
COVID-19”, Version 2, DeepMind website, 8 April 2020, https://deepmind.com/research/open-
source/computational-predictions-of-protein-structures-associated-with-COVID-19
41. References
[41] Xiaoqiang Huang, Yang Zhang, et al. Computational Design of Peptides to Block Binding of the SARS-
CoV-2 Spike Protein to Human ACE2. BioRxiv Preprint doi: https://doi.org/10.1101/2020.03.28.013607
[42] Chengxin Zhang, Wei Zheng, et al. (2020) Protein structure and sequence re-analysis of 2019-nCoV
genome refutes snakes as its intermediate host and the unique similarity between its spike protein
insertions and HIV-1. Journal of Proteome Research. 19: 1351-1360, 2020
[43] Yan Gao, et al. Structure of the RNA-dependent RNA polymerase from COVID-19 virus, Science,2020
DOI: 10.1126/science.abb7498
[44] Zhang L, Lin D, et al. Crystal structure of SARS-CoV-2 main protease provides a basis for design of
improved α-ketoamide inhibitors.
[45] Ibrahim IM, Abdelmalek DH, et al. COVID-19 spike-host cell receptor GRP78 binding site prediction,
PubMed, 2020, DOI: 10.1016/j.jinf.2020.02.026
[46] Celia Henry Arnaud, https://cen.acs.org/biological-chemistry/proteomics/Adding-missing-sugars-
coronavirus-protein/98/i16, CHEMICAL & ENGINEERING NEWS APRIL 22, 2020
42. List of Abbreviations
• receptor-binding motif (RBM)
• angiotensin-converting enzyme 2 (ACE2)
• N terminal domain (NTD) and a C terminal domain (CTD)
• Real-time reverse transcription polymerase chain reaction (RT-PCR)
• 3-chymotrypsin-like cysteine protease (3CLpro)
• Antagonist: a substance that interferes with or inhibits the physiological action of another.
• Lesion: a region in an organ or tissue which has suffered damage through injury or disease, such
as a wound, ulcer, abscess, or tumor.
• A least absolute shrinkage and selection operator (LASSO)
• Non Structural Protein (NSP)
• Glucose Regulated Protein 78 (GRP78)
• Substrate Binding Domain (SBD)
• RdRp: RNA-dependent RNA polymerase (RdRp also called nsp12)
• This scenario could occur in stressed cells when GRP78 us overexpressed and translocate from the
Endoplasmic Reticulum (ER) to the cell membrane (M).
• cluster of differentiation 26 (CD26)
• Solvent Accessible Surface Area (SASA)
Editor's Notes
nm = 1×10⁻⁹ m
Ligands are small molecules which bind with a protein to trigger a signal, which can be activation or inhibition.
A protein contact map represents the distance between all possible amino acid residue pairs of a three-dimensional protein structure using a binary two-dimensional matrix.
Knowing a protein’s structure provides an important resource for understanding how it functions, but experiments to determine the structure can take months or longer, and some prove to be intractable. For this reason, researchers have been developing computational methods to predict protein structure from the amino acid sequence.
In cases where the structure of a similar protein has already been experimentally determined, algorithms based on “template modelling” are able to provide accurate predictions of the protein structure.
Epitope: the part of an antigen molecule to which an antibody attaches itself
RdRp: RNA-dependent RNA polymerase (RdRp also called nsp12)
Region IV of the spike (C480: C488) seems suitable to be the binding site to the cell surface GRP78
Region IV is part of region III and it is the most surface exposed part of the spike receptor-binding domain.
Lower the binding affinity, the stronger the ligand bind to the protein
Lower the binding affinity, the stronger the ligand bind to the protein
This scenario could occur in stressed cells when GRP78 us overexpressed and translocate from the Endoplasmic Reticulum (ER) to the cell membrane (M).
cluster of differentiation 26 (CD26): associated with immune regulation, signal transduction, and apoptosis
Palmitoylation is the covalent attachment of fatty acids, such as palmitic acid, to cysteine (S-palmitoylation) and less frequently to serine and threonine (O-palmitoylation) residues of proteins, which are typically membrane proteins. The precise function of palmitoylation depends on the particular protein being considered. Palmitoylation enhances the hydrophobicity of proteins and contributes to their membrane association. Palmitoylation also appears to play a significant role in subcellular trafficking of proteins between membrane compartments, as well as in modulating protein–protein interactions.
our study also highlights the key finding that the S1 domain of COVID-19 spike glycoprotein potentially interacts with the human CD26, a key immunoregulatory factor for hijacking and virulence.
Solvent Accessible Surface Area (SASA)
N-Linked Glycosylation begins at the endoplasmic reticulum and is completed at the golgi apparatus.
O-Linked Glycosylation takes place entirely at the golgi complex.