1. Domain-specific analysis of collagen code
Mammalian collagen 1α1
Anne Kleinnijenhuis
anne.kleinnijenhuis@triskelion.nl
May 2017
2. Introduction
Collagen is the most abundant protein in mammals and an important
structural protein.
The amino acid sequence contains large stretches of repetitive Gxy triplets
(Gxy region) involved in the helical structure, where G represents glycine and
x and y represent another amino acid (often proline).
In Triskelion study 20959 a comprehensive investigation of collagen DNA and
protein code was performed. In the current presentation some of the findings
during the analysis of mammalian collagen 1α1 are highlighted.
The results are presented using a novel nomenclature for nucleotide changes
and a novel codon table.
6. Comparison to Homo sapiens collagen 1α1
The collagen 1α1 Gxy regions of
the following species were
compared to Homo sapiens:
(source: www.ebi.ac.uk/ena)
Pan troglodytes
Chimpanzee
Macaca mulatta
Rhesus monkey
Mus musculus
House mouse
16. Summary table
Summary of the changes between
human and chimpanzee / rhesus
monkey / murine collagen 1α1
17. Codon group involvement
Codon table with color ranges indicating, from white to red, the increasing
involvement of codon groups in amino acid changes in the Gxy region between
chimpanzee and murine collagen 1α1 and, from white to blue, the codon usage
in chimpanzee collagen 1α1. Especially the S2N change route is notable.
28. Conclusions
In the present study collagen sequences from the Gxy region were analyzed
on the nucleotide, codon group, amino acid and target peptide level.
To describe the changes between collagens a novel nomenclature and codon
table were proposed.
Several biases were observed in codon usage, codon usage related to Gxy
position and amino acid usage related to Gxy position.
The conserved position of positively charged amino acids K and R offers the
opportunity for selection of analogous tryptic target peptides.
29. SlideShare series Anne Kleinnijenhuis
Domain-specific analysis of collagen code (May 2017)
Exploring LC-MS peptide dynamic range (December 2016)
Strategies for bioanalysis of proteins using LC-MS (May 2016)
Proposal for the absolute quantification of modular molecules
(October 2015)