SlideShare a Scribd company logo
1 of 54
Download to read offline
Honours Research Project Report
BI4016
2015/2016
School of Biological Sciences
Surveying the small-spotted catshark (Scyliorhinus canicula)
tumour necrosis factor superfamily (TNFSF)
Fiona Bakke
Supervisor: Dr Helen Dooley
Work completed in partial fulfilment of the requirements for
a degree in MSci Biological Sciences
Word count: 8548
1
Abstract
The S. canicula transcriptome has recently become available and was used to identify TNFSF
ligand genes and compare these to TNFSF ligand genes in other vertebrates. As a cytokine
subgroup, the TNFSF enables immunomodulatory signalling between cells, being primarily
associated with cell survival, inflammation and apoptosis. Genetic polymorphisms in the
TNFSF have been implicated in the development of tumours and inflammatory and
autoimmune diseases in humans, and the molecular biology of the human TNFSF has been
investigated in order to develop therapeutics. These therapeutics are often associated with
undesirable side effects, creating a need for alternate methods of their development. One
such method is the investigation of the evolutionary history of the TNFSF, so that TNFSF
characteristics present in other vertebrates may be identified and exploited. As part of this
investigation, a phylogenetic tree of vertebrate TNFSF ligand genes was generated, to produce
the most complete representation currently available of the evolutionary history of these
genes. S. canicula did not appear to express 6 of the 18 human TNFSF genes (TNF-α, TNF-β,
CD27L, 4-1BBL, TWEAK and EDA) but, with the exception of the pyrogenic TNFSF genes TNF-α
and TNF-β, did appear to possess TNFSF genes fulfilling similar functions. Only one S. canicula
LT-β sequence appeared to be present, with its position on the phylogenetic tree suggesting
that it may have been an ancient precursor of TNF-α and TNF-β. The evolutionary
development of CD30L, CD40L and TRAIL require further investigation since S. canicula
sequences did not appear to share conserved regions with similar sequences from other
vertebrates. S. canicula APRIL BAFF, OX40L, GITRL, FasL, TL1A, RANKL and LIGHT did appear
to be present, with S. canicula also appearing to express BALM, a TNFSF gene currently
identified only in cartilaginous fish and teleosts. Target S. canicula TNFSF genes were then
amplified, cloned and sequenced, and the presence of the 3’ end of RANKL confirmed.
2
Keywords.
Scyliorhinus canicula; tumour necrosis factor superfamily; comparative immunology;
phylogenetic analysis.
1. Introduction
The tumour necrosis factor superfamily (TNFSF) is a subgroup of the larger cytokine family
(Dinarello, 2000; Opal and DePalo, 2000). Cytokines are small (up to 40 kDa) proteins and
glycoproteins, associated with both the humoral and innate immune systems, which regulate
and direct immune responses following their secretion (Astrakhantseva et al., 2014). Immune
cells which secrete cytokines include dendritic cells (DCs), macrophages, and B and T
lymphocytes, although other cells, such as epithelial and stromal cells, also possess this
capability (Dinarello, 2000; Opal and DePalo, 2000). Cytokine secretion enables
immunomodulatory signalling between cells through autocrine, paracrine and endocrine
means and, depending on the cytokines secreted and the nature of their signals, causes
targeted cells to respond to an immune insult through their activation, differentiation, growth
or apoptosis (Borish and Steinke, 2003; Commins et al., 2010; Testar, 2015). A cascade of
further cytokine secretion may also be generated until the immune response is complete
(Testar, 2015).
Within the larger cytokine family, the role of the TNFSF typically involves initiating
inflammation and apoptosis, but also the development of lymphocytes, and epithelial and
other tissues (Hehlgans and Pfeffer, 2005; Croft et al., 2013; Wiens and Glenney, 2011). The
TNFSF consists of structurally-related ligands which are capable of binding to one or more
3
transmembrane protein receptors; some receptors may also be able to bind to more than one
ligand (Bossen et al., 2006; Vince & Silke, 2006; Roosnek et al., 2009), resulting in a complex
network of interactions between ligands and receptors. Ligands may be either membrane
bound or secreted in soluble form following proteolytic processing, whereas receptors are
principally membrane bound (Glenney and Wiens, 2007; Vujanovic, 2011). In addition to the
complexity of ligand-receptor interaction and the ability of a wide variety of cells to express
both, the effects of binding can also vary, since the form in which a ligand is expressed
determines whether its stimulus is antagonistic or inhibitory (Aggarwal, 2003; Robert, 2015).
All these factors, together with TNFSF involvement in inflammation and apoptosis, can lead to
unwanted harmful effects such as tumour growth and allergic or autoimmune responses
(Figure 1) (Adipogen, 2013).
Figure 1. TNFSF members associated with human inflammatory and autoimmune diseases.
Abbreviations: GVHD: Graft-versus-host disease; MS: multiple sclerosis; IBD: irritable bowel disease.
Adapted from Croft et al., 2012.
4
In recent years, much work has been done to investigate the role of the TNFSF human disease
conditions in order to develop mitigatory and preventative therapeutic agents (Wang and
Yang-Xin, 2005; Roosnek et al., 2009; Croft et al., 2012; Siakavellas et al., 2015; Walczak, 2015;
Collette et al., 2003). This work has largely focused on the molecular pathways of the TNFSF
in humans, and some effective therapeutic agents have emerged (Adipogen, 2013; Grewal,
2009; Astrakhantseva et al., 2014; Sedger and McDermott, 2014). However, due to the
complexity of the TNFSF mechanisms, undesirable side effects to these agents frequently
occur (Tansey and Szymkowski, 2009). Alternate methods of developing therapeutic agents
are therefore necessary, and investigating the evolutionary history of the TNFSF provides one
such means, since it may identify TNFSF characteristics present in mammals which are not yet
known.
Human TNFSF genes (Table 1) are organised in clusters on major histocompatibility complex
(MHC) paralogous chromosomes 1, 6, 9 and 19, suggesting that the mammalian TNFSF may
have evolved from an earlier, more rudimentary form during the two rounds of whole genome
duplication (WGD) that occurred around 500-800 million years ago (mya) (Biswas et al, 2015;
Wiens and Glenney, 2011; Collette et al, 2003). Additional human TNFSF genes are located
on chromosomes X, 3, 13 and 17, possibly as a result of cis and/or trans duplications during
their further evolution (Glenney and Wiens, 2007). Teleosts underwent a further WGD
approximately 350-320 mya, with salmonids and cyprinids undergoing a fourth WGD
approximately 70-80mya (Macqueen and Johnston, 2014). The resultant genomic complexity
of teleosts, due to these further WGD events, makes direct comparison of the evolution of
their TNFSF with that of humans problematic. Exploration of the evolutionary relationships
between the TNFSF in humans and other vertebrates which did not undergo these further
5
WGDs may therefore provide a clearer representation of the ways in which the vertebrate
TNFSF evolved over time.
Table 1. Human TNFSF ligand gene names. Adapted from Gray et al., 2015.
Name Synonyms TNFSF Number
TNF-α TNF, tumour necrosis factor, DIF 1
TNF-β Lymphotoxin-α, LT 2
LT-β Lymphotoxin-β, p33 3
OX40L CD252, gp34, TXGP1 4
CD40L HIGM1, IMD3, TRAP, gp39, Hcd40l, CD154 5
FasL APT1LGI, CD178 6
CD27L CD70 7
CD30L CD153 8
4-1BBL 9
TRAIL Apo-2L, TL2, CD253 10
RANKL TRANCE, OPGL, ODF, CD254 11
TWEAK DR3LG, APO3L 12
APRIL CD256 13
BAFF THANK, BLYS, TALL-1, CD257 13b
LIGHT LTg, HVEM-L, CD258 14
TL1A TL1, VEGI, VEGI192A, MGC129934, MGC129935 15
GITRL AITRL, TL6, hGITRL 18
EDA Ectodysplasin A, EDA1, HED, XHED, ED1-A1, ED1-A2, EDA-A1, EDA-A2 Not allocated
Research into the TNFSF of other vertebrates however, remains limited (De Zoysa, Jung and
Lee, 2009; Biswas et al., 2015). Until now, no characterisation of the TNFSF in cartilaginous
fish has been performed, and similar information is only available for a small number of
amphibians, reptiles, birds and non-human mammals. In 2007, Glenney and Wiens identified
and characterised 71 TNFSF members in different teleosts, using the structure of membrane
bound TNFSF ligands in mammals as a basis. These ligands possess a relatively well conserved
(20-30%) TNF homology domain within their extracellular C terminus, which facilitates their
comparison with ligands of other species. In this project we used the information obtained
by Glenney and Wiens (2007) to detect possible TNFSF orthologs in two species of
chondrichthyes, Scyliorhinus canicula (small-spotted catshark, subclass Elasmobranchii) for
which in-house transcriptomic data are now available (Dooley et al, unpublished data), and
6
Callorhinchus milii (elephant shark, subclass Holocephali), the only cartilaginous fish for which
the genome has been sequenced. Bioinformatic searches of these protein sequences were
performed to identify orthologs in other species. This information was then used to generate
a phylogenetic tree, which provided an indication of possible evolutionary relationships
between TNFSF members in mammals, birds, reptiles, amphibians, teleosts and cartilaginous
fish. Alignment of TNFSF sequences allowed the identification of conserved domains among
individual TNFSF sequences for different species, enhancing the information provided by the
phylogenetic tree. Amplification, cloning and sequencing of S. canicula TNFSF sequences
which demonstrated both phylogenetic clustering and the sharing of conserved domains with
similar sequences for other species were performed to confirm the presence of these
sequences.
We hypothesised that, since cartilaginous fish were the first vertebrates to possess a canonical
adaptive immune system, S. canicula TNFSF gene orthologs would be found for TNFSF genes
in other vertebrates and that more recently evolved vertebrates would also possess TNFSF
genes not present in S. canicula that arose from gene duplication and rearrangement following
their divergence from their common ancestors. Given the chromosomal clustering of human
TNFSF genes, and the fact that the genes in these clusters often share some functional
similarities (Collette et al., 2003), we further hypothesised that S. canicula would possess an
ortholog for at least one human TNFSF gene from each chromosomal cluster.
2. Materials and methods
2.1 Bioinformatics
7
Accession numbers of TNFSF members previously identified in teleosts (Glenney and Wiens,
2007) were used to obtain protein sequences from the genomic databases of NCBI (http://w
ww.ncbi.nlm.nih.gov/protein/) and Ensembl version 82, September 2015 (http://www.ense
mbl.org/index.html). Nucleotide sequences were translated to amino acid sequences using
the Expasy Translate tool (http://web.expasy.org/translate/). The teleost TNFSF sequences
were used to BLAST search (Altschul et al., 1997) S. canicula transcriptomes available via the
catshark in-house transcriptome database, the Chondrichthyes Transcriptome Resource
(Jones and Blaxter, 2013) http://afterparty.bio.ed.ac.uk/study/show/6232601), SkateBase
(http://skatebase.org/skateBLAST) (Wyffels et al., 2014; Wang et al., 2012), and the SwissProt
(http://www.ebi.ac.uk/uniprot) database. All S. canicula sequences identified were then
BLASTed against the NCBI protein database to confirm the identity of the S. canicula hits as
TNFSF members.
2.2 Sequence alignment and phylogenetic analysis
Evolutionary relationships among these sequences were then investigated. S. canicula
sequences were combined with previously obtained TNFSF sequence information from S. milii
(Venkatesh et al., 2014), and with similar data for representative members of other vertebrate
classes (Homo sapiens [human], Bos taurus [domestic cow], Mus musculus [house mouse],
Gallus gallus domesticus [chicken], Python sebae sebae [African rock python], Anolis
carolinensis [Carolina anole lizard], Chrysemys picta [painted turtle], Xenopus tropicalis
[Western clawed frog], Danio rerio [zebrafish], Onchorhynchus mykiss [rainbow trout],
Latimeria chalumnae [African coelacanth]). All sequences were aligned using MAFFT (Katoh
et al., 2002). Highly variable regions were removed using BioEdit software (Hall, 1999). The
resulting data were then used to construct a phylogenetic tree in IQ-TREE (Nguyen et al.,
8
2015; Minh, Nguyen and von Haeseler, 2013) and FigTree (Morariu et al., 2009) phylogenetic
software. A maximum likelihood tree, using 164 TNFSF sequences containing 493 amino acid
sites, was produced, using 1000 bootstrap replications to obtain branch support values. The
S. canicula TNFSF member sequences which formed clades with similar sequences from other
vertebrate classes, suggesting that they emerged early in the development of the adaptive
immune system and also that they were maintained during the divergence of more recently
evolved vertebrates, were then considered individually. The TNFSF nucleotide sequences for
all vertebrates in such clusters were aligned once more, again using MAFFT and BioEdit
software, to identify regions of similarity and the presence of conserved TNFSF domains.
2.3 Primer design
TNFSF sequences which contained conserved TNFSF domains were used to design primers for
PCR amplification of the S. canicula sequences. RACE-PCR was performed for those sequences
which appeared similar to those for H. sapiens and M. musculus, but which were incomplete
at the 3’ and/or 5’ ends (Sambrook and Russel, 2001). Primers were designed to try to fulfil
the following conditions: to anneal within the evolutionarily most conserved region; to contain
conserved cysteine (C) and/or tryptophan (W) amino acids; to have an A or T at the 3’ end and
a G or C at the 5’ end; to be 16-24 base pairs (bp) long; to have an optimal melting temperature
of ~60oC; to have a difference in melting temperature for forward and reverse primers
of ≤ 4oC, and to have a GC content of 40-60%.
Oligonucleotide Properties Calculator (www.basic.northeastern.edu/biotools/oligocalc.html)
was used to confirm that these conditions had been fulfilled, and also that there were no self-
annealing sites or areas of potential hairpin formation, self-dimerization or complementarity
9
(Kibbe, 2007). Primers were purchased from Sigma Aldrich Co. LLC. One forward and one
reverse primer was designed for each target sequence to be amplified using PCR, with two
forward and two reverse primers being designed for those sequences being amplified using
RACE-PCR (Table 2).
Table 2. Primers used for amplification of FasL, TL1A, RANKL and LIGHT TNFSF ligands
Name Nucleotide Sequence (5’-3’)
ScFasL-F1 ACAAGTGTTTATGGTGGATGGC
ScFasL-R1 GTCAATGTGTCAGATTCAAGGCTA
ScTL1A-F1 ATGGCGTATGGCTATCTATTTGG
ScTL1A-R1 TTCGCCCTTCGTAAACGCAAG
ScTL1A-F2 AGTGTAAAGCAGGTGGATTTTACC
ScTL1A-R2 AAGTCCGTTCTTGTGTTCCCAC
ScRANKL-F1 TTGGTGCTCGGGCAATCATAAC
ScRANKL-R1 TAGGGTGTCAGCCATGAATTC
ScRANKL-F2 TCTGCTCAATGGCGACAAAATC
ScRANKL-R2 TGAATTCTTCCTCATGCGATTGG
ScLIGHT-F1 AATTCTTCTGCACTGGGGAGAG
ScLIGHT-R1 TCTACATACAGCACACATCACAC
ScLIGHT-F2 AATAGCTACCTGGGGGCAATC
ScLIGHT-R2 ATGTCTTCAGCAGGAACTCTAC
ScEF1a-F1 CGTCTTCCTTTATTGCACAGGTTATTATC
ScEF1a-R1 GGACAGCGAAACGACCAAGAG
UPM Long (0.4µM) CTAATACGACTCACTATAGGGCAAGCAGTGGTATCAACGCAGAGT
UPM Short (2µM) CTAATACGACTCACTATAGGGC
NUP (10µM) AAGCAGTGGTATCAACGCAGAGT
Redesigned Primers
ScFasL-F1a ATCCATATGTGCCTCCACCTGCC
ScFasL-R1a GGCTAGTTGACTTTGCTGAGTCCA
ScTL1A-F1a TCGGAAATTCTTCTGCACTGGGG
ScTL1A-R1a ACACATCACACATCAGGCAGCGC
ScTL1A-F2a AGTTTTGAGAATCTCCAACGGTGC
ScTL1A-R2a ACCCAGTTACATGAGCTGCAGGC
ScRANKL-F1a TTGGTGCTCGGGCAATCATAAC
ScRANKL-R1a TAGGGTGTCAGCCATGAATTC
ScRANKL-F2a TCTGCTCAATGGCGACAAAATC
ScRANKL-R2a TGAATTCTTCCTCATGCGATTGG
ScLIGHT-F1a TCGGAAATTCTTCTGCACTGGGG
ScLIGHT-R1a ACACATCACACATCAGGCAGCGC
ScLIGHT-F2a AGTTTTGAGAATCTCCAACGGTGC
ScLIGHT-R2a ACCCAGTTACATGAGCTGCAGGC
T7 Universal Sequencing Primer TAATACGACTCACTATAG
10
2.4 Amplification of target sequences from S. canicula spleen cDNA
PCR was used to amplify target sequences which appeared complete, using a PCR reaction of
10µl MyTaq Red Reaction Buffer (Bioline), 2.5µl of cDNA template, 1µl of both forward and
reverse primers, and 1µl MyTaq (Bioline), and 34.5µl PCR grade water, to achieve a final
volume of 50µl. S. canicula spleen oligo-dT primed cDNA was used for the template, since this
tissue is a secondary immune organ in sharks and therefore would be expected to highly
express TNFSF genes (Biswas et al., 2015; Schneider, Potter and Ware, 2004). To confirm
template integrity, and that the PCR reagents and PCR cycler were functioning correctly,
elongation factor 1 alpha (EF-1-alpha-1) was used as a positive control, as this gene is
expressed ubiquitously and has previously been assessed as suitable for this purpose (Infante
et al, 2008; Olsvik et al., 2005). The negative control, to confirm absence of contamination of
experimental samples, was PCR grade water in place of cDNA.
A standard PCR procedure was applied, using an initial cycle of 2 min at 95oC to denature the
template (subsequent denaturation cycles lasted 30 sec), 30 sec at a temperature determined
by primer requirements to allow annealing, and 1.5 min at 72oC for primer extension along
the cDNA template. PCR cycles were repeated 30 times to generate maximum DNA yield
(approximately 1 million copies of the target sequences) (Sambrook and Russel, 2001).
Touchdown PCR was also performed, across an annealing temperature range of 63OC-45oC, in
decreasing increments of 0.4OC, in order to ensure that optimal annealing temperature had
been applied. RACE PCR was used to obtain a complete open reading frame from target
sequences which appeared to lack 3’ and/or 5’ ends. RACE cDNA was prepared using the
SMARTer RACE cDNA kit according to the manufacturer’s protocol (Clontech). First round
RACE reactions used 5µl MyTaq Red Reaction Buffer (Bioline), 2.5µl cDNA template, 0.5µl
11
forward primer 1 or reverse primer 1, 0.5µl Universal Primer Mix (UPM, Clonetech), 0.5µl
MyTaq (Bioline), and 17.25µl PCR grade water to attain a final volume of 25µl. EF-1-alpha-1
was again used for the positive control, and PCR grade water as the negative control, with S.
canicula spleen RACE cDNA being used as the template. Second round RACE reactions used
10µl MyTaq Red Reaction Buffer (Bioline), 5µl round 1 RACE PCR product, 1µl nested universal
primer (NUP), 1µl MyTaq (Bioline) and 32µl PCR grade water, to make a final volume of 50µl.
2.5 Visualisation of PCR and RACE PCR amplified sequences
Agarose gel electrophoresis was performed following PCR/RACE PCR to enable separation and
visualisation of amplified products (Sambrook and Russel, 2001). Agarose gels were prepared
using 0.5g molecular grade powdered agarose (Bioline) and 50ml TAE (Tris-Acetate-EDTA 50x,
Severn Biotech, at a dilution of 200ml in 10L water). 1-4µL SYBR Safe DNA Gel Stain (Bioline)
was also added, which intercalates into any DNA present and fluoresces in the presence of
ultraviolet (UV) light, enabling visualisation of the DNA. The solidified gel was loaded into a
horizontal gel rig (Bio-Rad) and was submerged in TAE buffer. Samples were loaded into wells
with a molecular weight size marker (3µl 100bp hyperLadder, Bioline) being loaded into a
neighbouring well to provide a visual guide to DNA molecule size (base pairs [bp]). Gels were
run at 100v for 60 min then visualised on a UV transillumination system (BioDoc-It).
2.6 DNA cloning of amplified sequences
2.6.1 Extraction and purification of DNA from agarose gel
Products which had been amplified during PCR were prepared for cloning using the
manufacturer’s instructions for the QIAquick Gel Extraction Kit (Qiagen). Visualised bands
were removed from the gel with a clean scalpel and placed in 1.5ml Eppendorf tubes. Three
12
times the gel volume of QG buffer (Qiagen) was added to the gel and the tubes incubated in
a 50oC water bath for 10 minutes to re-solubilise the agarose gel. The gel and an equal volume
of isopropanol were pipetted into spin columns inserted into 2ml collection tubes and
inverted. The sample was then transferred to a Qiaspin column (Qiagen) and centrifuged at
13000rpm for 1 minute, after which the flow-through was discarded. A further 500µl of QG
buffer was added and the tubes centrifuged again at 13000rpm for 1 minute. After discarding
the flow-through, 750µl PE buffer (Qiagen) was added and the tubes were left to rest for 5
minutes. They were then centrifuged again at 13000rpm for 1 minute. After discarding the
flow-through, 30µl purified water was added and the flow-through collected for cloning.
2.6.2 DNA cloning using E. coli plasmids
X-Gal/IPTG/LB plates were prepared by mixing 10g tryptone, 5g yeast extract, 10g NaCl and
20g agar per litre deionised water and autoclaving. After cooling, X-Gal (25ug/ml), 0.1mM
IPTG (25µg/ml,) and ampicillin (100µg/ml) were added, before being poured into sterile petri
dishes and allowed to set. Ligation reactions were prepared by mixing 5µl 2X Rapid Ligation
Buffer, 1µl pGEM-T Easy Vector, 5µl PCR product and 1µl T4 DNA Ligase (all Promega) in 0.5ml
Eppendorf tubes. Reactions were incubated at 4oC overnight. 2µl of each ligation mix was
pipetted into separate 1.5ml Eppendorf tubes and 50 µl E. coli NEB 5-α heat shock-competent
bacteria added, before incubating on ice for 20 mins. The bacteria were then heat shocked at
42oC for 30 secs in a water bath. Cells were then returned to ice for a further 2 mins. 950µl
SOC medium (Sigma Aldrich) was added to each tube before incubating at 37oC for 1.5
hours. 100 µl of each culture was then plated onto the X-Gal/IPTG/LB plates and incubated
for 12 hours at 37oC. White colonies were then selected from the plates and put
13
into LB broth (as above but without agar) containing 100 µg/ml ampicillin & grown overnight
at 37oC with rotation.
2.6.3 Isolation of cloned DNA
Plasmid DNA was isolated from the overnight bacterial cultures by applying the
manufacturer’s instructions for the QIAprep Spin Miniprep Kit (Qiagen). Bacterial cells were
pelleted by centrifuging at 13000rpm for 3 min before being resuspended in 250µl Buffer P1
and pipetted into sterile microcentrifuge Eppendorf tubes. 250µl Buffer P2 was added and
the solution mixed by inverting until the mixture became blue. 350µl Buffer N3 was then
added and immediately mixed to produce a colourless solution. The tubes were then
centrifuged at 13000rpm for 10 minutes, after which the supernatant was pipetted into
QIAprep spin columns. These were centrifuged at 13000rpm for 60s and the flow-through
discarded. 500µl Buffer PB was added to each spin column, before centrifuging once more
and again discarding the flow-through. 750µl Buffer PE wash buffer was then added to each
spin column before again centrifuging and discarding flow-through. QIAprep spin columns
were then placed into 1.5ml Eppendorf collection tubes and centrifuged for a further 1 minute
to remove remaining wash buffer. Each QIAprep column was then placed in a new sterile
1.5ml microcentrifuge tube to which 50µl Buffer EB was added to the centre of the column
and allowed to stand for 1 minute to elute the DNA. The tubes were then centrifuged for 1
minute and the flow-through collected and pipetted into new sterile 1.5ml Eppendorf tubes.
2.6.4 Nucleic acid preparation for sequencing
A NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific) was used to calculate the
nucleic acid concentration (ng/µl) in each sample by measuring the absorbance of UV-visible
14
light by 1µl of each sample. 15µl of sample at 75ng/µl was prepared and 2µl T7 Universal
Sequencing primer at 10ng/µl added before sending for sequencing to Eurofins Genomics,
Ebersberg, Germany.
3. Results
3.1 Bioinformatics
Accession numbers published by Glenney & Wiens (2007) provided protein sequences for 53
of the 71 teleost TNFSF members identified in that study. Three sequences could not be
acquired from the discontinued TIGR Gene Indices database (www.tigr.org/tdb/tgi/), and a
further 15 had been retired from the Ensembl database without further successors being
provided. The supplementary data provided 12 of these outstanding sequences, leading to a
total of 65 sequences being identified. The 6 unavailable sequences included TNFSF members
TWEAK, TL1A, lymphotoxin α and β, and EDA, but similar sequences for these TNFSF members
had already been identified in other teleosts in that study.
BLASTing these 65 sequences with transcriptomic data for S. canicula resulted in the
identification of 30 potentially orthologous sequences (Appendix 1), representing 14 TNFSF
members (Table 3). Further BLASTing of these sequences in the NCBI protein database
resulted in the putative confirmation of 9 S. canicula TNFSF members (Table 3).
Table 3. S. canicula potential TNFSF sequences obtained through bioinformatics searches.
“+” indicates sequences which were obtained and “-“ indicates sequences which were not obtained.
TNFSF
member
BLAST 1
AfterParty/SkateBase
BLAST 2
(NCBI)
TNFSF
member
BLAST 1
AfterParty/SkateBase
BLAST 2
(NCBI)
TNF-α + - RANKL + +
TNF-β + - TWEAK - -
LT-β - - APRIL + +
OX40L + - BAFF + +
15
CD40L + - LIGHT + +
FasL + + TL1A + +
CD27L - - GITRL - +
CD30L - - EDA + -
4-1BBL - - BALM + +
TRAIL + +
3.2 Sequence alignment and phylogenetic analysis
MAFFT Clustal alignment of these 30 sequences with similar sequences from representative
members of other vertebrate classes (H. sapiens, B. taurus, M. musculus, G. gallus, P. sebae,
A. carolinensis, C. picta, X. tropicalis, D. rerio, O. mykiss, L. chalumnae), and subsequent
removal of highly variable regions, enabled production of an unrooted maximum likelihood
phylogenetic tree, consisting of 164 TNFSF sequences containing 493 amino acid sites, and
using 1000 replications to obtain bootstrap support values (Figure 2). This indicated that 10
TNFSF members were expressed by S. canicula.
16
17
Figure 2. Unrooted maximum likelihood phylogenetic tree of vertebrate TNFSF members. Coloured
boxes denote TNFSF members where S. canicula forms a clade with similar sequences to vertebrates
from other classes. Dotted boxes denote TNFSF members where S. canicula forms a clade only with
teleosts and other cartilaginous fish. Bootstrap values >70% are considered indicative of a clade.
Sequences include both cloned full-length amino acid sequences and predicted sequences obtained
during bioinformatics searches.  Denotes a BALM sequence misannotated as a BAFF sequence in
genomic and protein databases. Abbreviations: Almi: Alligator mississippiensis; Anca: Anolis
carolinensis; Bota: Bos Taurus; Cami: Callorhynchus milii; Chpi: Chrysemys picta; Chpl: Chiloscyllium
plagiosum: Dare: Danio rerio; Gaac: Gasterosteus aculeatus; Gaga: Gallus gallus; Hosa: Homo sapiens;
Lach: Latimeria chalumnae; Mumu: Mus musculus; Onmy: Onchorhynchus mykiss; Opha: Ophiophagus
hannah; Pyse: Python sebae; Sasa: Salmo salar; Sqac: Squalus acanthias; Taru: Takifugu rubripes; Xela:
Xenopus laevis; Xetr: Xenopus tropicalis.
TNFSF members which did not appear to be present in S. canicula included TNF-α, TNF-β,
CD27L, 4-1BBL, TWEAK and EDA. The 2 putative S. canicula CD30L sequences did not group
with the other CD30L sequences from H. sapiens, G. gallus, C. picta, A. carolinensis and X.
tropicalis, nor with any other TNFSF sequences, raising questions about the identity of these
2 sequences, which will require further investigation. Similarly, the one suggested S. canicula
sequence for LT-β appeared to be grouped together with a variety of different TNF sequences
from other vertebrates, including those for TNF (which is a commonly used designator of TNF-
α [Albarbar et al., in press]) and TNF-β, again rendering the identify of this S. canicula sequence
unclear and in need of further investigation. The S. canicula CD40L sequence appeared more
similar to the TRAIL and RANKL sequences for other vertebrates (TRAIL: H. sapiens, M.
musculus, A. carolinensis, C. milii and L. chalumnae; RANKL: L. chalumnae, G. gallus, C. picta,
M. musculus, H. sapiens, C. milii), than it did to CD40L sequences for other vertebrates (C. milii,
L. chalumnae, M. musculus, H. sapiens, A. carolinensis, G. gallus, X. tropicalis). TNFSF
sequences for APRIL appeared in two discrete clusters, with APRIL sequences for a
cartilaginous fish, a lobe-finned fish, 2 teleosts and an amphibian (L. chalumnae, O. mykiss, S.
salar, X. laevis and S. canicula) forming one group, and sequences for 2 mammals and a reptile
(M. musculus, H. sapiens and A. carolinensis) forming a different group. Bootstrap value
18
between these two groups was 47%, therefore they could not be considered a clade. TWEAK
sequences for 2 mammals, 2 reptiles and a lobe-finned fish (H. sapiens, M. musculus, C. picta,
A. mississippiensis and L. chalumnae) also appeared to share some similarities with both
groups of APRIL sequences. S. canicula TRAIL sequences did appear to group together with
TRAIL sequences for H. sapiens, M. musculus, A. carolinensis, C. mykiss and L. chalumnae, but
the S. canicula sequences did not cluster as closely together with the sequences for these 5
species as the sequences from these 5 species did with each other.
S. canicula TNFSF sequences which did appear to cluster with similar sequences for other
vertebrates in the phylogenetic tree included those for BALM, BAFF, OX40L, GITRL, FasL, TL1A,
RANKL and LIGHT. Three S. canicula BALM sequences clustered with BALM sequences for O.
mykiss, L. chalumnae, C. milii and a BAFF sequence for C. plagiosum, although the bootstrap
value of 56% for this grouping precludes these sequences being described as a clade (Figure
3). The C. plagiosum sequence had previously been mis-annotated and had recently been
identified as a BALM sequence (Li et al., 2015), although the genomic and protein databases
had not been amended, leading to this sequence being shown as BAFF in Figure 2. BALM is a
recently identified TNFSF member which has not yet been described in vertebrates other than
cartilaginous and bony fish, and this study also identified BALM sequences only in these 2
vertebrate classes. BAFF sequences for representatives from all vertebrate classes did form a
clade (bootstrap values > 70%) (Figure 3).
19
Figure 3. Phylogenetic grouping for TNFSF members BALM and BAFF.
OX40L sequences from S. canicula, M. musculus, H. sapiens, C. picta and G. gallus formed a
clade, with bootstrap values of 77% and above (Figure 4). However, examination of MAFFT
sequence alignment did not support this (Appendix 2), since it revealed no conserved domains
shared by S. canicula and other vertebrates. The 2 S. canicula GITRL sequences also formed a
clade with similar sequences for H. sapiens, M. musculus and C. picta, with bootstrap values
of 94% and above (Figure 4) but, again, MAFFT alignment did not reveal the presence of shared
conserved regions (Appendix 2).
20
Figure 4. Phylogenetic grouping of TNFSF members OX40L and GITRL
The S. canicula FasL sequence clustered with FasL sequences for another cartilaginous fish, C.
milii (bootstrap value 85%), and two mammals (H. sapiens and M. musculus; bootstrap value
66%), but no amphibian, reptile or avian FasL sequences were identified as being similar
(Figure 5). However, alignment did appear to indicate the presence of a number of conserved
domains among the species for which this sequence was identified (Appendix 2). Alignment
of S. canicula TL1A sequences with TL1A sequences for H. sapiens, M. musculus, C. picta, A.
carolinensis and O. hannah suggested that these species shared conserved domains (Appendix
2) and would therefore be expected to form a clade with similar sequences from these
sequences. However, although there was evidence of some grouping, the S. canicula
sequences did not form such a clade (Figure 5).
21
Figure 5. Phylogenetic grouping of TNFSF members FasL and TL1A
22
RANKL sequences for H. sapiens, M. musculus, C. picta, G. gallus, L. chalumnae and C. milii
formed a clade, with high bootstrap values (Figure 6), and alignment reinforced this through
the apparent presence of a number of conserved domains (Appendix 2). Alignment of 2 S.
canicula LIGHT sequences with that for 2 mammals (H. sapiens and M. musculus) and 2 reptiles
(C. picta and A. carolinensis) indicated much similarity across the sequences (Appendix 2), and
this was reflected in the subsequent clustering in the phylogenetic tree (Figure 6). Bootstrap
values of 100% were also suggestive of close evolutionary relationship.
Figure 6. Phylogenetic grouping of TNFSF members RANKL and LIGHT
3.3 Sequence completion and amplification
Time constraints precluded further investigation of all putative S. canicula TNFSF members.
Those that had not appeared to be present (TNF-α, TNF-β, CD27L, 4-1BBL, TWEAK, EDA) and
those whose identity was unclear (CD30L, LT-β, CD40L, TRAIL) were therefore not selected for
sequence completion, amplification and cloning, since further bioinformatic examination of
these TNFSF members was required prior to commencement of these procedures. OX40L and
GITRL also required further bioinformatic investigation due to alignment dissimilarities with
23
other vertebrate OX40L and GITRL sequences respectively. Since this project forms part of a
larger study of the TNFSF of cartilaginous fish, 3 TNFSF members (BAFF, BALM and APRIL) were
already being examined by colleagues in Dr Dooley’s team, and duplication of these efforts
was considered undesirable. The remaining 4 TNFSF members, FasL, TL1A, RANKL and LIGHT
were chosen for further investigation.
Examination of amino acid and nucleotide sequences for each of these 5 sequences provided
an indication of whether each sequence appeared to represent a complete or partial protein
sequence. Only one TNFSF sequence, FasL, appeared to contain a complete open reading
frame (Appendix 2). The remaining 4 sequences all appeared to lack both 5’ and 3’ ends
(Appendix 2). PCR was therefore chosen for amplification of FasL, and RACE-PCR for
completion and amplification of TL1A, RANKL and LIGHT sequences.
Initial FasL amplification using PCR did not reveal a predicted ~600bp band following
transillumination. PCR was repeated following changes to buffer and PCR grade water, but
the same result was obtained. Primer redesign to include at least 3 G/C nucleotides at the 5’
end was carried out and the new primers used for FasL PCR, but no amplification was
obtained. Annealing temperatures were adjusted through the use of Touchdown PCR, which
used annealing temperatures between 63oC-49oC, decreasing in increments of 0.4oC after
each cycle, to ensure that optimal annealing temperatures had been used, but no sequence
amplification was achieved.
RACE-PCR of TL1A was then attempted, after adopting the changes described above, but this
also did not result in completion or amplification of this TNFSF sequence (expected band
24
~700p). Since it was possible that the S. canicula spleen mRNA from which the template cDNA
was generated may have degraded, this was replaced with spleen cDNA from a different
mRNA sample and TL1A RACE-PCR repeated. No sequence amplification was obtained.
Annealing temperature was lowered from 58oC to 53oC, and cycle number increased from 30
to 34, and TL1A RACE-PCR again repeated, but no sequence amplification was achieved.
RACE-PCR of LIGHT and RANKL was then carried out, again incorporating previous procedural
amendments. No sequence completion or amplification of either of these TNFSF members
was achieved (Figure 7: 1 & 2 show results for 3’ ends for both RANKL and LIGHT). Different
S. canicula tissues were then substituted as cDNA templates. The use of S. canicula liver cDNA
template resulted in amplification of the 3’ end for both RANKL and LIGHT (Figure 7: 5 & 6),
with the RANKL product appearing to be separated into 2 distinct bands. No amplification of
the 5’ ends for either RANKL or LIGHT was achieved (Figure 7: 9 & 10). Similarly, the use of S.
canicula epigonal cDNA template also resulted in amplification of the 3’ end for both RANKL
and LIGHT (Figure 7: 12 & 13). The bands obtained for the 5’ ends of both RANKL and LIGHT
were unclear (Figure 7: 16 & 17) and it was therefore not possible to estimate band size nor
to be certain that these indicated amplification.
25
Figure 7. Scanned photo of electrophoresed gel under UV light showing amplification of 3’
ends of S. canicula TNFSF ligands LIGHT and RANKL, after RACE-PCR using S. canicula spleen,
liver and epigonal oligo dT-primed cDNA as template. 1-4: S. canicula spleen cDNA template:
1: RANKL 3’ product; 2: LIGHT 3’ product; 3: positive control using EF1a F1 primer; 4: negative
control using EF1A primer and H2O as template. 5-11: S. canicula liver cDNA template: 5: 3’
RANKL amplification; 6: 3’ LIGHT amplification; 7: positive control; 8: negative control; 9: 5’
RANKL product; 10: 5’ LIGHT product; 11: positive control. 12-18: S. canicula epigonal cDNA
template: 12: 3’ RANKL amplification; 13: 3’ LIGHT amplification; 14: positive control; 15:
negative control; 16: 5’ RANKL amplification; 17: 5’ LIGHT amplification; 18: positive control.
3.4 RACE-PCR product cloning and sequencing
Cloning and sequencing of the amplified 3’ RANKL and LIGHT RACE-PCR products enabled
comparison of these sequences with the previously obtained S. canicula nucleotide sequences
used for primer design (Table 2 and Appendix 2). The LIGHT 3’ and RANKL 3’ cloned and
sequenced RACE-PCR products obtained using S. canicula liver cDNA as a template shared no
regions of similarity with the nucleotide sequences used to design their primers, indicating
non-specific amplification, as was also the case for the LIGHT 3’ sequence with epigonal cDNA
as the template. However, comparison of the RANKL 3’ sequence, with epigonal cDNA as the
template, indicated completion of the 3’ end of the S. canicula RANKL transcript (Figure 8).
26
Figure 8. Comparison of RANKL primer design and sequenced 3’ end. Red text indicates nucleotide
sequence used for ScRANKL-RACE-F2 primer; yellow highlighted area indicates completed 3’ end,
including stop codon.
4. Discussion
This study provides the first evidence of the existence of a number of TNFSF members in the
cartilaginous fish, S. canicula. Bioinformatic searches suggested that some mammalian TNFSF
members do not appear to be present in S. canicula (TNF-α, TNF-β, CD27L, 4-1BBL, TWEAK,
EDA). It is possible that, although present in the animal, these were not represented in the in-
house multi-tissue transcriptome database due to the types of tissue chosen for sequencing,
the immunisation status of the animal whose tissues were used, and the fact that the
sequence information derived from RNA samples from single tissues at single time points.
Since variations in tissues, cell types and environmental conditions influence gene expression,
the resulting vertebrate transcriptome information is almost certainly likely to be incomplete.
However, this database contains 623430 S. canicula sequences (from the spleen, gills, blood,
brain, epigonal, Leydig’s organ, spiral valve, stomach and liver), and has a Core Eukaryotic
Genes Mapping Approach (CEGMA) completeness score of 100% partial and 96% complete,
27
suggesting a high degree of completeness and annotation reliability (Parra et al., 2007).
Possible limitations created by the use of transcriptomic data were also mitigated by pooling
the results of multiple different tissues and by normalising cDNA samples to maximise the
expression of genes expressed at lower levels, making it less likely that sequence information
for the above TNFSF members had been omitted from the databases. This is reinforced by the
fact that, with the exception of one apparent sequence for 4-1BBL, another cartilaginous fish,
C. milii, also does not appear to have the genes for TNF-α, TNF-β, CD27L, TWEAK or EDA in its
genome or transcriptome. Thus, while it is not possible to confirm the absence of these TNFSF
members in cartilaginous fish, and further work investigating their expression in other such
species, such as Leucoraja erinacea (little skate), for which some transcriptomic data is now
available, is required to corroborate this finding, the fact that two shark species demonstrate
similar results does provide a preliminary indication that these TNFSF members may not be
expressed by this group of animals. Other TNFSF sequences also require further investigation
to confirm their identities and possible relationships to TNFSF members in other vertebrates
(CD30L, LT-β, CD40L, APRIL, TRAIL, OX40L, GITRL). However, FasL, TL1A, LIGHT and RANKL
TNFSF sequences did appear to be present and to share some conserved domains with similar
sequences from other vertebrate classes.
It has been hypothesised that the evolution of the vertebrate TNFSF increased in complexity
during the development of the adaptive immune system, and involved WGD prior to the
emergence of vertebrates, followed by a series of localised duplications and reorganisations
of ancestral TNFSF genes (Glenney and Wiens, 2007). In humans, TNFSF genes tend to be
located in clusters on 8 different chromosomes (Table 4), with genes on the same
chromosome appearing to share some functional aspects (Collette et al., 2003). It is therefore
28
striking that, even though homologues for all mammalian TNFSF members did not appear to
be present in S. canicula, at least one TNFSF member from each human “chromosomal cluster”
was either putatively identified or demonstrated sequence similarities sufficient to warrant
further investigation. This appears to support the hypothesis that the 19 mammalian TNFSF
members evolved from a smaller number of ancestral TNFSF members and, given the
chromosomal clustering of human TNFSF members, that this occurred on these chromosomes
through localised gene duplication and reorganisation.
Table 4. TNFSF human chromosomal organisation
1: S. canicula TNFSF sequences sharing conserved domains and forming a clade with similar sequences
in other vertebrates; 2: S. canicula TNFSF sequences whose identity requires further investigation and
confirmation. S. canicula sequences not identified included: TNF-α, LT-α, CD27L, 4-1BBL, EDA, TWEAK.
S. canicula BALM is not included since this has not been identified in the human TNFSF.
Human
chromosomal
location
Human TNF ligands Putatively identified S. canicula TNFSF ligands
1 2
1 FasL, GITRL, OX40L FasL GITRL, OX40L
6 LT-β, TNF-α, LT-α LT-β
9 CD30L, TL1A TL1A CD30L
19 LIGHT, CD27L, 4-1BBL LIGHT
X EDA, CD40L CD40L
3 TRAIL TRAIL
13 BAFF, RANKL BAFF, RANKL
17 APRIL, TWEAK APRIL
In mammals, TNF-α, LT-α and LT-β are all involved in cell apoptosis, with TNF-α being a
pyrogen, particularly during the acute phase of infection, and LT-α being an inflammation-
mediator (Croft et al., 2012). LT-β also interacts with LT-α, promoting the function of the
latter. Only one S. canicula LT-β sequence was putatively identified, and this appeared to
group with mammalian, reptile and teleost sequences for TNF-α and LT-α. This sequences
requires further evaluation to confirm its identity, but it is possible that a gene similar to this
may have been present in the common ancestor of all vertebrates, acting as a precursor to
the 3 human genes in this TNFSF subgroup and explaining why this S. canicula sequence
29
cannot be assigned to any one of these groups. The apparent absence, apart from the one
possible LT-β sequence, of similar genes in S. canicula, and the fact that none of the other S.
canicula TNFSF members are pyrogens, also suggests that the adaptive immune system in
these animals may have a limited ability to mount an inflammatory response. It is possible
that the inflammatory response developed following the emergence of endothermic
vertebrates, and this is reinforced by the fact that teleosts, amphibians and reptiles appear
only to express TNF-α (Rollins-Smith & Woodhams, 2012; Roca et al., 2008). However, it is
also possible that such an inflammatory response is not beneficial to cartilaginous fish, since
it has not evolved in extant species. It may even lead to adverse energetic consequences and
fitness deficits, such as increased oxygen consumption and metabolic rate, dehydration or
damage to tissues (Rollins-Smith & Woodhams, 2012). Other ectothermic vertebrates often
respond to pathogen infection by raising overall body temperature behaviourally, for instance
by moving to warmer areas of water or land, and it is possible that cartilaginous fish respond
similarly (Rollins-Smith & Woodhams, 2012). However, other cytokines, such as interleukin-1
(IL-1) and interleukin-6 (IL-6), do act as pyrogens, and are expressed by cartilaginous fish,
teleosts, amphibians and reptiles, therefore it is possible that these cytokines, rather than
TNFSF members, perform inflammatory functions in these vertebrates (Zimmerman, Bowden
and Vogel, 2014). Lastly, the apparent deficiency of pyrogenic TNFSF members in S. canicula
may provide one explanation for the slow adaptive immune response of cartilaginous fish
relative to mammals (Dooley and Flajnik, 2005).
BALM sequences were only found in cartilaginous and bony fish, suggesting that this TNFSF
member may have been lost following the divergence of other vertebrate classes, or that it is
a cytokine specific to cartilaginous fish and teleosts. However, the fact that it has not yet been
30
found in other vertebrates, particularly in other ectotherms, may also be due to a lack of
research into the TNFSF in non-mammalian species, with much work remaining to be done in
this area. BALM appeared to derive from a common ancestor from which BAFF, APRIL and
TWEAK also derive, but appeared most similar to BAFF and APRIL. Glenney and Wiens (2007)
hypothesised that BALM may either have been an ancestral gene of BAFF and APRIL, or that it
developed during gene duplication in teleosts; its presence in cartilaginous fish appears to
support the former hypothesis. In mammals, BAFF and APRIL are primarily involved in B cell
development, proliferation and survival, as well as in immunoglobulin production and class
switching; BAFF also promotes plasma cell differentiation and stimulates activated T cells,
while APRIL also regulates the growth of tumour cells (Roosnek et al., 2009; Mackay et al.,
2003). Although TWEAK appeared to share an ancient common ancestor with BAFF, BALM
and APRIL, its functions in mammals differ, since it appears to promote endothelial cell
migration and angiogenesis, as well as inducing apoptosis (Vince and Silk, 2006). The apparent
presence of BAFF, BALM and APRIL sequences in S. canicula suggest that B cell and
immunoglobulin regulation are important immunological requirements for these animals, and
also that B cell developmental requirements might be different in terms of TNFSF ligands. This
is reinforced by recent immunisation studies (Pettinello and Dooley, 2014; Crouch et al.,
2013). In mammals, RANKL genes are located on the same chromosome as BAFF, suggesting
that they may have evolved following gene duplication. The fact that these TNFSF members
perform very different functions (RANKL triggers dendritic cells to stimulate naïve T cells)
(Walsh and Choi, 2014), and the fact that few conserved domains are shared between similar
sequences for S. canicula and other vertebrates, would point to a duplication event during a
very early stage in vertebrate evolution.
31
TL1A and CD30L are both found on human chromosome 9 (Table 2). Although S. canicula TL1A
did not appear to form a clade with TL1A sequences from other vertebrates, it did appear to
share some conserved domains with these. S. canicula CD30L did not appear to form a clade,
nor share conserved domains with CD30L sequences from other vertebrates, although this
remains to be confirmed. The apparent presence in S. canicula of TL1A, with less evidence of
the presence of CD30L, suggests that CD30L may have evolved in mammals from a duplication
and/or rearrangement of TL1A. This is supported by the fact that both are involved in
apoptosis, although TL1A is expressed by endothelial cells and CD30L by B and T cells. It is
also possible that endothelial cell expression of TL1A in cartilaginous fish also represents an
ancestral mode of expression, and that subsequent evolution of CD30L accompanied the
refinement of the adaptive immune system in mammals, leading to an alteration in cell
expression. In mammals, CD30L also triggers Ig class switching, and may have developed this
function subsequent to the emergence of this TNFSF member; this is reinforced by the fact
that Ig class switching does not occur in sharks due to the organisation of their IG genes in
clusters rather than in the translocon configuration found in mammals (Pettinello and Dooley,
2014).
In humans, LIGHT is located on chromosome 19, along with CD27L and 4-1BBL. S. canicula
LIGHT sequences demonstrated strong evidence of a close relationship with LIGHT sequences
in other vertebrates, suggesting that it may have emerged early in vertebrate development
and continued to play an important role in the vertebrate adaptive immune response. This
TNFSF member is generated by T cells, which it also stimulates, and may also induce cell
apoptosis, according to receptor expression. It is also responsible for promoting BAFF
production, which may be one reason why it has been maintained during vertebrate
32
evolution. Neither CD27L nor 4-1BBL sequences were identified in S. canicula, suggesting
that these genes emerged more recently, possibly through duplication and rearrangement of
LIGHT. This is reinforced by the fact that both CD27L and 4-1BBL perform similar functions to
LIGHT through activating and causing proliferation of T cells (Kober et al., 2010).
S. canicula sequences for CD40L and TRAIL did appear to be present. CD40L was also
characterised in a recent study of S. canicula (Li et al., 2015), but TRAIL requires further
investigation, since this did not demonstrate phylogenetic clustering nor conserved domains
with similar sequences from other vertebrates. In humans, these TNFSF members are located
on chromosomes X and 3, respectively, and perform differing functions, with CD40L
modulating Ig class switching, memory B cell development and the formation of germinal
centres, and TRAIL inducing tumour cell apoptosis.
Recently identified teleost TNFSF member sequences formed a useful basis from which to
initiate a study of the TNFSF in cartilaginous fish. Teleosts are more closely related in terms
of evolutionary time to cartilaginous fish, and the TNFSF genes of these two vertebrate classes
might therefore be expected to exhibit close evolutionary relationships. However, this was
not the case, due to the fact that teleosts underwent an additional one (two, in the case of
salmonids and cyprinids) WGD events, during which they appear to have developed many
variations in their TNFSF genes (Macqueen and Johnston, 2014; Glenney and Wiens, 2007;
Volff, 2005). A more accurate impression of the evolutionary history and development of the
TNFSF was therefore gained from a comparison of the TNFSF genes in cartilaginous fish and
other, non-teleost, vertebrates, since these animals had undergone the same number of WGD
events. The evolutionary development of the TNFSF among these species would therefore be
33
due solely to individual gene duplication and/or reorganisation as these classes diverged over
evolutionary time. Additional phylogenetic information was provided by a comparison of S.
canicula TNFSF genes with those of another cartilaginous fish, C. milii, since these 2 species
are both in the class Chondrichthyes and exhibit many physiological and genetic similarities
(Heinicke et al., 2009). However, C. milii is in the subclass Holocephali, and S. canicula in the
class Elasmobranchii, 2 classes that are believed to have diverged approximately 410mya
(Heinicke et al., 2009); both may therefore have had the potential to develop many genetic
variations. Any TNFSF gene similarities that have been maintained in these species would
therefore point towards these genes being of increased importance to these animals. The
presence of similar conserved genes in other vertebrates would also reinforce their
importance, and the development of novel TNFSF genes in other vertebrates would allow
further work to be carried out to gain a greater understanding of the potential fitness benefits
of these novel genes in more recently evolved vertebrate classes. Characterisation of the
TNFSF in cartilaginous fish may therefore clarify our understanding of the early nature and
function of the TNFSF, and comparisons to the TNFSF in non-teleost vertebrates may provide
insights into how these have been maintained, lost or changed over evolutionary time.
It is acknowledged that bioinformatic searches for TNFSF member sequences of all species
relied upon the quality of the data maintained in the various databases. Mis-annotation of
genes may have affected data accuracy; at least one sequence, Chiloscyllium plagiosum BAFF,
has recently been re-annotated as BALM (Li et al., 2015), as indicated on Figure 2. Similarly,
while BAFF sequences for representative mammals, birds, reptiles, amphibians and
cartilaginous fish (including S. canicula) clustered relatively closely on the phylogenetic tree,
two sequences in this grouping were annotated as APRIL, suggesting that they may be mis-
34
annotated and require further investigation. Other sequences for APRIL appeared to form
two separate phylogenetic groups. Since one group contained mammalian and reptilian
representatives, while the other contained cartilaginous fish, coelacanth, teleost and
amphibian representatives, it is possible that this group separation was due to sequence
changes accumulated since evolutionary divergence, and this might suggest different
functional roles. However, sequence mis-annotation may also apply here. Lastly, since
relatively few studies have considered the TNFSF in vertebrates other than mammals, there
was a corresponding lack of sequence information available, which limited our ability to
produce a more comprehensive representation of the evolutionary history of the TNFSF
among different vertebrates.
The use of a secondary lymphoid tissue (spleen) cDNA as a template (Criscitiello, 2014;
Rumfelt et al., 2002) may have influenced these results, and different tissues were used as a
template during the later stages of the study as TNFSF expression may have been greater in
these tissues than in the spleen. The epigonal is a primary immune organ in cartilaginous fish
and may therefore express TNFSF genes at higher levels than the spleen. This appeared to be
the case following positive amplification of the 3’ ends of both LIGHT and RANKL following
RACE-PCR. These results suggest that any investigation of immune gene expression in S.
canicula and other animals should use a variety of different tissues as cDNA templates. Failure
to do this may provide an incomplete representation of immune gene expression in these
species. The immune status of the animals whose tissues are used is also important: this study
used tissue samples from an unimmunised animal which therefore may not have been
expressing TNFSF genes at sufficiently high levels to detect. Since TNFSF members are part of
the cytokine family and are involved in the immune system’s signalling process, it is possible
35
that an immunological challenge would lead to increased cytokine signalling and hence to
higher levels of TNFSF gene expression in the S. canicula spleen. Further studies should
therefore be carried out on immunised animals, to determine whether this may be the case.
Time limitations precluded the amplification, cloning and sequencing of all putative TNFSF
members identified, and further work is required in order to synthesise a comprehensive
picture, not only of precisely which TNFSF members exist in cartilaginous fish, but also of their
development over evolutionary time in other vertebrates.
5. Conclusion
Recent advances in genome and transcriptome sequencing are now providing a wealth of
information that was previously unavailable, enabling investigation of the evolutionary
development of the TNFSF in all vertebrate classes. The results of this study support the
hypothesis that TNFSF genes in humans and other vertebrates evolved from a smaller number
of similar genes in cartilaginous fish and also that, with the exception of certain pyrogenic
capabilities, the TNFSF of cartilaginous fish fulfils similar functions to that of more “advanced”
vertebrates. Further investigation of the TNFSF in different vertebrate classes will increase
our understanding of the evolution of the TNFSF and may reveal novel roles for TNFSF genes
which may be utilised in the future development of therapeutics for both humans and other
vertebrates.
Acknowledgements
I would like to thank my supervisor, Dr Helen Dooley, for her unstinting advice and support
throughout this project, and for making the process both instructive and enjoyable. I would
36
also like to thank her team for all their help, in particular Anthony Redmond for his patience
and help with all things bioinformatic, and Rita Pettinello and Dr Kimberley Mackenzie for their
valuable assistance in the lab.
37
References
Adipogen, 2013. TNF Superfamily: linking autoimmune diseases, inflammation and cancer,
[online] Available at:
<http://www.adipogen.com/media/Catalogs/PDFs/AdipoGen_TNF%20Superfamily_Br
ochure_2013.pdf> [Accessed 10 October 2015].
Aggarwal, B.B. 2003. Signalling pathways of the TNF superfamily: a double-edged sword.
Nature Reviews Immunology, 3(9), 745-756.
Albarbar, B., Dunnill, C. & Georgopoulos, N.T. (in press) Regulation of cell fate by
lymphotoxin (LT) receptor signalling: functional differences and similarities of the LT
system to other TNF superfamily (TNFSF) members. Cytokine & Growth Factor
Reviews. (Accepted for publication May 2015).
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D.J. 1997.
Gapped BLAST and PSI-BLAST: A new generation of protein database search programs.
Nucleic Acids Research, 25(17), 3389-3402.
Astrakhantseva, I.V., Efimov, G.A., Drutskaya, M.S., Kruglov, A.A. & Nedospasov, S.A. 2014.
Modern anti-cytokine therapy of autoimmune diseases. Biochemistry (Moscow),
79(12), 1308-1321.
Biswas, G., Kinoshita, S., Kono, T., Hikima, J-I. & Sakai, M. 2015. Evolutionary evidence of
tumor necrosis factor super family members in the Japanese pufferfish (Takifugu
rubripes): comprehensive genomic identification and expression analysis. Marine
Genomics, 22, 25-36.
Borish, L.C & Steinke, J.W. 2003. Cytokines and chemokines. The Journal of Allergy and
Clinical Immunology, 111(suppl), S460-S475.
Bossen, C., Ingold, K., Tardivel, A., Bodmer, J-L., Gaide, O., Hertig, S., Ambrose, C., Tschopp, J.
& Schneider, P. 2006. Interactions of tumor necrosis factor (TNF) and TNF receptor
family members in the mouse and human. The Journal of Biological Chemistry,
281(20), 13964–13971.
Collette, Y., Gilles, A., Pontarotti, P. & Olive, D. 2003. A co-evolution perspective of the
TNFSF and TNFRSF families in the immune system. Trends in Immunology, 24(7), 387-
394.
Commins, S.P., Borish, L. & Steinke, J.W. 2010. Immunologic messenger molecules: cytokines,
interferons, and chemokines. Journal of Allergy and Clinical Immunology, 125(2), S53-
S72.
Croft, M., Duan, W., Choi, H., Eun, S-Y., Madireddi, S. & Mehta, A. 2012. TNF superfamily in
inflammatory disease: translating basic insights. Trends in Immunology, 33(3), 144-152.
38
Croft, M., Benedict, C.A. & Ware, C.F. 2013. Clinical targeting of the TNF and TNFR
superfamilies. Nature Reviews Drug Discovery, 12, 147-168.
Crouch, K., Smith, L.E., Williams, R., Cao, W., Lee, M., Jensen, A. & Dooley, H. 2013. Humoral
immune response of the small-spotted catshark, Scyliorhinus canicula. Fish and
Shellfish Immunology, 34(5), pp. 1158-1169.
De Zoysa, M., Jung,S. & Lee, J. 2009. First molluscan TNF-α homologue of the TNF superfamily
in disk abalone: molecular characterization and expression analysis. Fish and Shellfish
Immunology, 26(4), 625-631.
Dinarello, C.A. 2000. Proinflammatory cytokines. Chest Journal, 118(2), 503-508.
Glenney, G.W. & Wiens, G.D. 2007. Early diversification of the TNF superfamily in teleosts:
genomic characterisation and expression analysis. The Journal of Immunology, 178(2),
7955-7973.
Gray, K.A., Yates, B., Seal, R.L. Wright, M.W. & Bruford, E.A. Genenames.org: the HGNC
resources in 2015. Nucleic Acids Research, 43(D1), D1079-D1085.
Grewal, I.S. 2009. Overview of TNF superfamily: a chest full of potential therapeutic targets.
In: I.S. Grewal, ed. 2009. Therapeutic Targets of the TNF Superfamily (Vol. 647). New
York: Springer Science & Business Media.
Hall, T.A. 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis
program for Windows 95/98/NT. Nucleic Acids Symposium Series, 41, 95–98.
Hehlgans, T. & Pfeffer, K. 2005. The intriguing biology of the tumour necrosis factor/tumour
necrosis factor receptor superfamily: players, rules and the games. Immunology,
15(1), 1-20.
Heinicke, M.P., Naylor, G.J.P. & Hedges, S.B. 2009. Cartilaginous fishes (Chondrichthyes).
In: Eds S.B. Hedges & S. Kumar, eds, 2009. The Timetree of Life. Oxford: Oxford
University Press.
Infante, R.E. Wang, M.L., Radhakrishnan, A., Kwon, H.J., Brown, M.S. & Goldstein, J.L. 2008.
NPC2 facilitates bidirectional transfer of cholesterol between NPC1 and lipid bilayers, a
step in cholesterol egress from lysosomes. Proceedings of the National Academy of
Sciences, 105(40), 15287-15292.
Jones, M. & Blaxter, M. 2013. afterParty: turning raw transcriptomes into permanent
resources. BMC Bioinformatics, 14(1), 301.
Katoh, K., Misawa, K., Kuma, K. & Miyata, T. 2002. MAFFT: a novel method for rapid multiple
sequence alignment based on fast Fourier transform. Nucleic acids research, 30(14),
3059–3066.
39
Kibbe, W.A. 2007. OligoCalc: an online oligonucleotide properties calculator, Nucleic Acids
Research, 35(Suppl 2), W43-W46.
Li, R., Redmond, A.K., Wang, T., Bird, S., Dooley, H. & Secombes, C.J. 2015. Characterisation
of the TNF superfamily members CD40L and BAFF in the small-spotted catshark
(Syliorhinus canicula). Fish and Shellfish Immunology, 47(1), 381-389.
Mackay, F., Schneider, P., Rennert, P. & Browning, J. 2003. BAFF and APRIL: a tutorial on B
cell survival. Annual Review of Immunology, 21(1), 231-264.
Macqueen, D.J. & Johnston, I.A. 2014. A well-constrained estimate for the timing of the
salmonid whole genome duplication reveals major decoupling from species
diversification. Proceedings of the Royal Society of London B: Biological Sciences,
281(1778), 20132881.
Minh, B.Q., Nguyen, M.A.T., & von Haeseler, A. 2013. Ultrafast approximation for
phylogenetic bootstrap. Molecular Biology and Evolution, 30(1), 1188-1195.
Morariu, V.I., Srinivasan, B.V., Raykar, V.C., Duraiswami, R. & Davis, L.S. 2009. Automatic
online tuning for fast Gaussian summation. Advances in Neural Information Processing
Systems, 1113-1120.
Nguyen, L.T., Schmidt, H.A., von Haeseler, A. & Minh, B.Q. 2015. IQ-tree: a fast and
effective stochastic algorithm for estimating maximum-likelihood
phylogenies. Molecular Biology and Evolution, 32(1), 268-274.
Olsvik, P.A., Lie, K.K., Jordal, A.E.O., Nilsen, T.O. & Hordvik, I. 2005. Evaluation of potential
reference genes in real-time RT-PCR studies of Atlantic salmon. BMC Molecular
Biology, 6(1), 21.
Opal, S.M. & DePalo, V.A. 2000. Anti-inflammatory cytokines. Chest Journal, 117(4), 1162-
1172.
Parra, G., Bradnam, K. & Korf, I. 2007. CEGMA: a pipeline to accurately annotate core genes
in eukaryotic genomes. Bioinformatics, 23(9), 1061-1067.
Pettinello, R. & Dooley, H. 2014. The immunoglobulins of cold-blooded vertebrates.
Biomolecules, 4(4), pp. 1045-1069.
Robert, J. 2015. Textbook of Cell Signalling in Cancer: An Educational Approach. Paris:
Springer.
Roca, F.J., Mulero, I., López-Mũnoz, A. Sepulcre, M.P., Renshaw, S.A. Meseguer, J. & Mulero,
V. 2008. Evolution of the inflammatory response in vertebrates: fish TNF-α is a
powerful activator of endothelial cells but hardly activates phagocytes. The Journal of
Immunology, 181(7), 5071-5081.
40
Roosnek, E., Burjanadze, M., Dietrich, P.Y., Matthes, T., Passweg, J. & Huard, B. 2009. Tumors
that look for their springtime in APRIL. Critical Reviews in Oncology/Hematology, 72(2),
91-97.
Sambrook, J.S. & Russel, D.W.R. 2001. Molecular cloning: a laboratory manual. New York:
Cold Spring Harbor Press.
Schneider, K., Potter, K.G. & Ware, C.F. 2004. Lymphotoxin and LIGHT signalling pathways
and target genes. Immunological Reviews, 202(1), 49-66.
Siakavellas, S.I., Sfikakis, P.P. & Bamias, G. 2015. The TL1A/DR3/DcR3 pathway in
autoimmune rheumatic diseases. Seminars in Arthritis and Rheumatism, 45(1), 1-8.
Sedger, L.M. & McDermott, M.F. 2014. TNF and TNF-receptors: from mediators of cell death
and inflammation to therapeutic giants – past, present and future. Cytokine and Growth
Factor Reviews, 25(4), 453-472.
Tansey, M.G. & Szymkowski, D.E. 2009. The TNF superfamily in 2009: new pathways, new
indications, and new drugs. Drug Discovery Today, 14(23), 1082-1088.
Testar, J., 2015. Cytokines: Introduction. [online] Available at:
<http://bitesized.immunology.org/receptors-and-molecules/cytokines> [Accessed 27
September 2015].
Venkatesh, B., Lee, A.P., Ravi, V., Maurya, A.K., Lian, M.M., Swann, J.B., Ohta, Y., Flajnik, M.F.,
Sutoh, Y., Kasahara, M., Hoon, S., Gangu, V., Roy, S.W., Irimia, M., Korzh, V., Kondrychyn,
I., Lim, Z.W., Tay, B-H., Tohari, S., Kong, K.W., Ho, S., Lorente-Galdos, B., Quilez, J.,
Marques-Bonet, T., Raney, B.J., Ingham, P.W., Tay, A., Hillier, L.W., Minx, P., Boehm, T.,
Wilson, R.K., Brenner, S. & Warren, W.C. 2014. Elephant shark genome provides unique
insights into gnathostome evolution. Nature, 505(7482), 174-179.
Vince, J.E. & Silke, J. 2006. TWEAK shall inherit the earth. Cell Death and Differentiation,
13(11), 1842.
Vujanovic, N.L. 2011. Role of TNF superfamily ligands in innate immunity. Immunologic
Research, 50(2-3), 159-174.
Walczak, H. 2015. Death receptor-ligand systems in cancer, cell death, and inflammation.
Cold Spring Harbor Perspectives in Biology, 5(5), a008698.
Walsh, M.C. & Choi, Y. 2014. Biology of the RANKL-RANK-OPG system in immunity, bone, and
beyond. Frontiers in Immunology, 5, 1-11.
Wang, J. & Yang-Xin, F. 2005. Tumor necrosis factor family members and inflammatory
bowel disease. Immunological Reviews, 204(1), 144-155.
41
Wang, Q., Arighi, C.N., King, B.L., Polson, S.W., Vincent, J., Chen, C., Huang, H., Kingham, B.F.,
Page, S.T., Rendino, M.F., Thomas, W.K., Udwary, D.W., Wu, C.H. & The North East
Bioinformatics Collaborative Curation Team. 2012. Community annotation and
bioinformatics workforce development in concert – Little Skate Genome Annotation
Workshops and Jamborees. Database, 2012: bar064.
Wiens, G.D. & Glenney, G.W. 2011. Origin and evolution of TNF and TNF receptor
superfamilies. Developmental and Comparative Immunology, 35(12), 1324-1335.
Wyffels, J., King, B.L., Vincent, J., Chen, C.., Wu, C.H., & Polson, S.W. 2014. SkateBase, an
elasmobranch genome project and collection of molecular resources for
chondrichthyan fishes. F1000Research, 3.
42
Appendix 1. S. canicula TNFSF amino acid sequences demonstrating possible orthology with those
of other vertebrates
>lcl|comp149704_c0_seq1 CD40L/TNF10
TTVEEEGKKQRKENSSYRKCELIERLTLLLASNSKIDFLDSSCNQGKTPLVHLIQEGKVLRKKGRGVVCWYTSLHIQGR
DIFSMQEVWALKKCAVLIFPERCEMTRDMLRPGNCTVIEETVLTVSALVDNMNETSNKLTEPRKVCEHQPSVSVRV
LIYLITALLLGHMVTSALLYVYFSMKLDKVQEIVNADPEYTFLRSIKRCENKVEKSNIFCDGTRSELQHLIQKYLTNNTR
EQLHGAGVPTQEKQIGNHHTTITNSVDNPRRTAAHLIATNKAENIPDSAFPRKKGQPIKQWMAQGFPAFTHNINY
TSGKLVITQPGFYYVYCQVSFRLTNNVTDDTSSVPFLQYIYLQRIEEQTTLLMKASKTPVDKKQTASFNSVNQGGVFE
LKTGDKLFVTVTQADLLSYDRATYFGIFHLFLCSVMYGLLRNRFCSQLSLPTLNCPPPP
>lcl|comp167134_c2_seq5 TNF15/14/A
LRPRGFLLAGLSVYLLLQHVPAQKSEAQIGPEMIRLPVDTAEKPRAHLTAKVTSSDKKDGEYFALQWEHKNGLAFTK
GELNYQDRALTIPKGDYFVYSQVSFRGYASDMHENNSLIIHMITKLTPRYPKPMQLLSSTRSVNGEKSLWRMAIYLG
AVFHLEQGDRIIVNVSSVKQVDFTN
DHKTYSNDH
>lcl|comp174543_c0_seq4 TNF13B/A
SEFLKKIHHHREKPEEEKTKFRGRRSLPEQVQQSFVQLIATHDQRSFVKVNDCLKYMTGRSQPTCRVKRRPVSNQSS
KQETTVPWILSLKKGNALDKKDNKILVNEAGFFLVYSQVWYKDNTFTMGHFIKRIKASSVGSELRTVILFRCIQNMSA
CCPNDTCFTAGIAKLEVGDELELVIPRTQAQIALKGDGTFFWGNEAAVTVMAKAVEAKFLPFKEKRRLRKKSCQYRL
VSWLLMIAMWMENFSENQIKFVYLCANKFKGKKKEKT
>lcl|comp175589_c0_seq2 TNF13B/13/EDA/A
AITAAGSHLKIASSVMTSSVVKELYQSDLDKQNLFNIINDATQMLDITRIKEQMYLQRADWDYGYSLLGCFMLSVTA
CDFTAANKCLRSELAKVHKQLDVLAREETCGSSGTKRNGKSVKSKKNNNVKFQIKEQKTGVNEKNYDLRSISRKKRD
VRPCEKQRSFLQLIPNAKQSIILQDDSTFIPWIVASRRGEALEATSNKIVVKENGYYTVYSQVLYYDNCMAMGHVVQ
RRKANVVGDERRIMTLFRCIQNMPNNTASNTCFTAGIVSLTKRMSWNSCYHPDPMLTSPWTKIQHFLVPYNYYEA
TMEDDDCPRVGLFRQRHSSPNTSPLLSKISVAIEGISIVNEGGFTCLHRLRFGTKICSSHPLAKLLHNQFCFGCNLMQL
EALLSHMQVFCFHCLKNKLLLAF
>lcl|comp175589_c0_seq1 TNF13B/13/EDA
AITAAGSHLKIASSVMTSSVVKELYQSDLDKQNLFNIINDATQMLDITRIKEQMYLQRADWDYGYSLLGCFMLSVTA
CDFTAANKCLRSELAKVHKQLDVLAREETCGSSGTKRNGKSVKSKKNNNVKFQIKEQKTGVNEKNYDLRSISRKKRD
VRPCENSDHSSSFLMPSKALFCKVLYYDNCMAMGHVVQRRKANVVGDERRIMTLFRCIQNMPNNTASNTCFTAG
IVSLTKRMSWNSCYHPTQCHLHGQRFNIFWCHTITMKQLCERMMTAQEWDYFARDTVHLIRLPCSPRLVLQSKDI
SIVNEGGFTCLHRLRFGTKICSSHPLAKLLHNQFCFGCNLMQLEALLSHMQVFCFHCLKNKLLLAFINLKKKRK
>lcl|comp175589_c0_seq3 TNF13B/13/A
FSFFFVYESKQFIFQTVKTEDLSHVRQKGLKLHVASKAKLIVKLSKWVGTADLSPKPQPVKTSGKSTLLINNADVLRLQ
HSWRAGETYVNCVSGEIIPLLGSHHPLTLLHSNCMAPKNVESLSMEMLALGRDGNRSSNSSSWSSFTIPDCILCNSH
FLSQLFYWKLPPMPHPSLKPQSKERLNHLAECFAWHESAGGMIAAFTRSYIP
>lcl|comp175589_c0_seq4 TNF13B
RDVRPCESSDHSSSFLMPSKALFCKELSLTKRMSWNSCYHPTQCHLHGQRFNIFWCHTITMKQLCERMMTAQEW
DYFARDTVHLIRLPCSPRLVLQSKDISIVNEGGFTCLHRLRFGTKICSSHPLAKLLHNQFCFGCNLMQLEALLSHMQV
FCFHCLKNKLLLAFINLKKKRK
>lcl|comp167134_c2_seq3 TNF15/14/A
LRPRGFLLAGLSVYLLLQHVPAQKSEAQIGPEMIRLPVDTAEKPRAHLTAKVTSSDKKDGEYFALQWEHKNGLAFTK
GELNYQDRALTIPRKGDYFVYSQVSFRGYASDMHENNSLIIHMITKLTPRYPKPMQLLSSTRSVNGEKSLWRMAIYL
GAVFHLEQGDRIIVNVSSVKQVDFTNDHKTYFGAFLLWLSYIESATFINGSIDSWRATKDTLLEVLSISSNDVWVATH
43
HKFLQQLFSVDGLSFKLCYMGTKSTCELGIQRVAHVRWSFPTLQFELNGPFKVRDQKIQHWDAGVAALKMGRLEN
LTARKWECGCKTVYLYLSMGCGHRQLGLHLLFIPNDTCLEGIETSTLLSVWSHMA
>lcl|comp175049_c0_seq1 TNF13/13B/12
RCSTPIFGLQWSESGSNHKDDEVRVRVGEEKPEDGSKWFEEFLGSKIEMEDPGKGPQKSQRTLSYRRRTPEERCSG
ELHVQSNLHQSHERVPIHDNVSRITRNHHFYDVSFLKLVDTCTVTSIERLISRHILRASEQCLSLILPPLFPSLYHVTHDK
RPVLIQDVMRPFPRMAIVLLFCSKDSPVCKIFCQITVLSPCDLEDGTGGLMSFFFSVCACACPRSLFPRGVLAALVPR
CVRGLHVCSPETEVRATCSLGASCYPTRARPKGHRVVHLHFQLQVGRGVVVAVNLAVYVVSVFENDELPCLIVRQCI
FHLRLFQPLSDDPIIFNYIRPTFIIGHFEMSCKRLFVLCCRSFSPLSPYPRQPLALQQPLQELNLTPGLPGSRGRSGRML
SSNFGVSPLSESRAESGNW
>lcl|comp354964_c0_seq1 TNFA/B
PSTATKWPSCHRVIRRKSCSKPPSPPVTMATMEIPGTRLPTREPYSSSRRETRSSPGLTRMWWSMSTQLQERVSLE
YLLCKDSESLPVISDDGLRISIGIQQDCWTYCPIRTNSIPSD
>lcl|comp173264_c1_seq1 TNF10/CD40L/TNF11/14/15
HYGRGHFPACPTQRALLKIHTIGQTKCPHRRRITITRNWAGNPWPPRRENGASCRFWDLLFVEICVTCYFFCYFTDQ
IDQMQQKLNSDNYPGRCLQYMNSYIDHLGKGEVPVEDFLCDSWLKNVKFLINQRLQSDVRNLIYKELKAHNATFLG
ADKPAIHLIAQQRNKPPKNVPSYFTRVNGHDVLYWNDVKGFAVHQGAMKYHGGEITIPKDGIYYIYSHVYFWHVQ
KPDGEDQQYNLFLQYLYKMSMNYYKPLLLTKAIFTKCWSKESKFDVYTSFQGAAFELEQGDKIFIKVTNASAVNVNE
GATYFGAFMIFMEWSNLSLHQCNNSEEQLILNLLSKFRNQLWAVIFRPFPQSGYSCPAEGENRDTHEGNSLWGLG
SHTMLGTDLVCRVSLPAHTSRIHHNSNAITSWFKKTKIPQSLPGGFSQTRVQPLFQLCQSLALWFLLLNSQFKLIHFV
LSLPTIKTLISQLLFNVELSFVNNLDVWLNMLLVQKLLLESLIYFLLIDS
>lcl|comp208326_c0_seq1 TNF6/14/15
AGPWACLLCNCTMQQNYLYPQVFMVDGSQNLNPYVPPPAWALPTLKKKRKLDWRTVGTILLLNLVLLALAGLAL
GTVYLLELQKEIQEMKQDNGGKGPVAHKIIGAQNFTQPPKDLRIAAHLTGDSITAGSNPLVWDDQKGHAFTSGILY
KDRGLVINKAGIFFVYSKIYFRSYVCDRKNLEHIVFKRTDRYRKELILMEAKKTNYCSGNGWATNSYQAGIIHLLKGESI
YVNVSDSRLVDFAESKTFFGLYKLFTLLIYILLKSRKNSIKIK
>lcl|comp288857_c0_seq1 TNF10/11/6/15
PEELCCGTGRQVQCLIRVAQRGAFFPRGCLPGWCAHSNPQPCSQQPCPTLSPSWPPEAPRPPAHRTAGSRAVHQ
RPATGSHHRLHLLHPQPVAAKRCQWDLGGGFAPIVVLQWQQPLVRVGNPESLQVLFAATERTETTSPGGSWCCR
KTATQRFAQGIQSHEEEFMADTLRSPHYKKKTTRNSSDLETSFPGLKGKPIISWEEKVGLAHLREMRYSQGSLRIKHR
GLYYIYAQILFRYHADHPMVTREYQLIQYVYRKMSKYPKPTLILRGASTWCSGNHNLYSVYQGGVFSLLNGDKIFVTV
SNVSLVDVDEASSYLGAFKMDISVTCALFRFSSSTLRILEWVVNNSHRSATQFNHCNRIS
>lcl|comp170458_c0_seq2 TNF14/15/6
PLRPRDIKIGTTRNSAACVMCAVCRVPAEDMIFQDSTKPAAHVTGCNCSSSGYDRLQWEHSKGLAFLREIEYKEGAL
VIKRAGYYFIYSKILLGEIHCGDERNGFLKQTLFKVGEAYPQPIELMAVRKFFCTGESTDQWLDNSYLGAIFDFSEGDE
VFIKITEKKVLRISNGAETFFGAFMVCAYRTFYSLCFCQMSANMSVPALRMLMSKCATGVPNLYCCFLGTIYSGMW
AGILLLCRTYRPEGMRGVRGRGLQLLKAASCMLPPCPFHTHCFYKSLFITEKKKKK
>lcl|comp 165409_c1_seq1 TNF6/14/B
AITAAGLEDWEQLCVHVICPLKELVSLHKAYNAWHCETLLLIMAEVLMETLDISNPLTNAMPPNNKIAQPEKKEKLQ
KLFTAIALLLILVTLMLSTWNAVNISKLQLKQVNSRKQRTEMQNQRQALKNAAHLTGMNVINSSTALIWDSVKGLA
FVQGVSHSAEELIIIEAGFYLIYTKVSFGGQVCQSDMLIDHSVLKRSDRYPVAIEIMKTRNSNNCPGNNGQWSRDSF
QSGIFQLRKGDRIYVTVSNPAFVNFRQVSTFFGLHKLLICIFNACFHLLWVNDQALSLFIFDIRELYREKKLNDILSLVKP
VIETIEYILRSIRRKVRTILNKVYNGFNVTNWNLKKKKRK
>lcl|comp159900_c0_seq1 TNF15/A/14/10
44
RVAISGRQGLKVKSDLECSSTSYTDSRFIKPFGDSIQKSDSHQRVNSGVFKMLKPTSTDPEDLTCRNSKCTLSRKHSCS
SLLLSIFIILQWLAIAAMCSIIFFYINVLKEDIHTEVLAKMKNITISSFQTTGGYTFKSSVAKSEKPIAHFPALTAAADIQNT
SGHLLFEHKRGHAFKTSDMKYDEGGYLVIPTSGVYFIYAQVTFICSQDCTKLSQQFFAAILKKNNQYPYPEELLKSYAR
PQGDDFLKVSTYQAGAFQLYADDYIYAKVPENLYTHISEGEHETYFGAFLLDPATSEASCQFSYVVRECIHAYPHTST
WCFFGAICLVQQTVPNEDVQEIYATIAQYLFLSYTPMSDLSYHFVNCNAAIISCIGLGSHIIKIISIHVKKEK
>lcl|comp159900_c0_seq2 TNF15/A/14
LQHLFENNLLKYQDFFEALDVPFIRLVSRDCANVQCQNIYFSGFQTTGGYTFKSSVAKSEKPIAHFPALTAAADIQNT
SGHLLFEHKRGHAFKTSDMKYDEGGYLVIPTSGVYFIYAQVTFICSQDCTKLSQQFFAAILKKNNQYPYPEELLKSYAR
PQGDDFLKVSTYQAGAFQLYADDYIYAKVPENLYTHISEGEHETYFGAFLLDPATSEASCQFSYVVRECIHAYPHTST
WCFFGAICLVQQTVPNEDVQEIYATIAQYLFLSYTPMSDLSYHFVNCNAAIISCIGLGSHIIKIISIHVKKEK
>lcl|comp170458_c0_seq1 TNF14/6/15
TAAGEMMLPGDRHRSTMCVPLCFIPTSPESSFLYFFALRTPKMGDNYVYPSIFVVDGQSDTVPFATSTKRKKMTLA
HKFLLMLVLLALTGDGLGAFYLWRLQSDVQEFSSMFLAAKNTSLIVRSIQDMIFQDSTKPAAHVTGCNCSSSGYDRL
QWEHSKGLAFLREIEYKEGALVIKRAGYYFIYSKILLGEIHCGDERNGFLKQTLFKVGEAYPQPIELMAVRKFFCTGEST
DQWLDNSYLGAIFDFSEGDEVFIKITEKKFESPTVQKHFWCFHGLMCLSDVLFIVFLSVNVCHVVGSCFADVNVMC
HRSVITLILLLLTWNHLLRNVGWNLAAVPNVSTGGYEGGAGQRIAAIVKSCQLYVATIMPISYSLFLIFIYYKKKKE
>lcl|comp147674_c0_seq1 TNF10/14/6/A
LRPRGCCSTKNSAPQLKTFALNLQRASFRMGTSSNSITAQRLGFALSSVLLLQCVCMTLTFLYFHSELKQVQESFSKN
NMACLTEDILHLHAGPKGDEEFYVDPTGEKTERNEDLCWKFRAQIQQLIEKTISNQYEQDMSVIVKGEVSRLLPSLT
QSESGARLTVQKIAAHLTGNYKTRSTPLAGPASRKLQGRKSAPGSPREGWRSCTVWNTKMESSSRKLDSITSTARSY
FRYWEAA
>lcl|comp349734_c0_seq1 TNFA/15
SIAPQPEPEPLKPVTVFQGLKQVANSSSKLEWKDGSHRIIEKGNYFVYVQVTFKGECSGEKLFLKTTLCKKQVKIKEKE
LLVTSLSVCESQNHMSWTRTLSQAAIFYFEKGDNIHVNTTTMEYVDFNHSYNNIFGIYRFVTLGNTSVKIFYSLVSVLF
IVVFPFAIYRSLNDSIPFSYLSCCLPPYLSIYTSMKFLLSFPIMSLFFYVPLFENFTAAKYFIYAEKQKAMIHRCAIDFSVM
HATLR
>lcl|comp170479_c0_seq1 TNF4/A
ELPETPQFFISSMPWIFFHVHDASLSYQRDCISIIVPGTYVPVSGLQLGFMTFLRGKKYSHPKGWYLKPDDTLVSRVSL
HAKMIIHNFHIINNTDQHATNINFYYLFLGSFMFVIHAEADGISEKKVKLRSKDSAWVKNNAIVVPCDGLYLIYAHVS
SNRTPLMEIKKKEDSDSDADTDSWLLPKTKAYSPLITSMLRVNNTIRMLLDLTNDELKDDNSYLGIALLQIHQDINSC
QQIIQKSCKYEKHRDGLCLSTPTSGKYIGQLEWEKRGQNVHCLRGAIFNGLELNLLHIGWCGEKRRCVWGGKRML
HRGNWLLNCIFHYRIEDRGPNRNLAQIKKLNLLLKRVLEIFNHITILFSGEWGLEPDVCPKMGISSWCGRETSNQAW
TLDRVMLDGLSGLFFCSRFSWCPGLSLSLDQHHRASDLVVISLLAGSRWAVPTLQELHFEKYLIGRKALWDVQTLITL
LKSEVLMFCLLHVASLPHAVRWYFRMSLLRPAHSRLHIHSMGGGGRNVRFRVNIKFSFFLMYMFTTCSKIKYMMP
GILPQMRDWHHSESDQIGSVCSLTWFIAHTFLNC
TIECLAQFLKVKQVHLLSDQNFITEGDCAQFSKSCLCVEGGTSSFKKGIQPKIIFINILGLQYSRGGQTAGKEALECRLF
DLRIVFIYCMCSYLMGKKSFTRKLMFYCNLYFYMELCFVLFGNVHKKYK
>lcl|comp156182_c0_seq1 TNF6/14/A/B
KNAAHLTGYGLTENSKMLTWKGTTGTAFMRGVDYQDEGLVVHAPGLFFIYSRILFRDVNSVNDRFQEQNVFKKTK
QYHRDISLMRIIKHNYCAMNINAWSESTGIFMLSEGDYIYVSVSYPDFVGDEHSTYFELYWLFALPVFCILAHNSPAY
SYRRAYRNFYLVGMSKAGVLHEVIYLIFQFLLLLLRSSFVLIGQHGSIVV
>lcl|comp156182_c0_seq2 TNF6/A
GPCLCPRRYGLTENSKMLTWKGTTGTAFMRGVDYQDEGLVVHAPGLFFIYSRILFRDVNSVNDRFQEQNVFKKTK
QYHRDISLMRIIKHNYCAMNINAWSESTGIFMLSEGDYIYVSVSYPDFVGDEHSTYFELYWLFALPVFCILAHNSPAY
SYRRAYRNFYLVGMSKAGVLHEVIYLIFQFLLLLLRSSFVLIGQHGSIVV
45
>lcl|comp177788_c0_seq1 TNF6/14/15
HYGRGTVNNLQQTSGRARGSFNRKWTILCPYRNWSVMQRRNLYSLQMLDGENPPSCHPTTAWPVPLQELRIEKK
LGWLKICHFVIVSMLFIALAGLMFGTYYLHQLQQEVEGMKLKIDGTPQNRREKQIGSEERTDKPKSEVKMAAHLTG
KTTTDHPEDSGRFLWVMHFYKGFSTRMERSSMKLDSTLFTQKSISVEWNVRQSRCNKLSSAMSITYTTVQSTSTTA
VAQEHGGKAPPGRNIPSEGVQLFVSVSHPRMVSTDELMTFFGAYKLIRPVWTMKLQITMLRDNCDTSQFKRFRCV
HLLIVFLTRFNDHYKQKSISSLRSICGLRRMDLSICILAIKIYTSGEYIVHLHFPHFFLIFRTVEVQFNGHEPSKVSNIVIIFL
LVTVCIIKLFSPEVGGIIFEPCSLLYPSHLSSVTFNIYAQKRINDSWTFSTANVTAAFWRRDFQIFGIFCTFRFYRTVQFC
GYSPFFWSVPSEDIVSLYQPVKSSVSLLYPLNTPGNASLIAFPCWYHSGEYALHPLHCQYILPEWCPELNAVYKVRSN
EACNNVSSIHLTNLLIKTNIPLTFIIQFCICPLPFSDSTWTPKSLYSSVVLNFLYRVRKCFRKKVNWLGDTKVDNLCATM
CHSVVKNVDSKAMQVTPCDCPIVLQPQGPYFLWSVLFSDFYSTLLTALRNDYTIDLENTDPERVVSFTWLKLNDYLL
ASKDMIGSLASISAIDFYDLEVGNFTNSKPVKDSMICKSNNKNPSLLKCLTFRFILKDYVILLLSYLSGLCWTISTFPRQG
GNQPPNISQLPKGVLYHGNRAFFCLPAWDSRHSLTDSCVNKEPGMALHQCATPLYCLWTDRTIRNCCHPGESWEL
PHITFLLNVSHGDNASSTQIITPQLNAEMGLGLISILSFQGGRLNRFSSLHMSVKLHYRNGFIFCLCSLLYCRKQYSSNL
LHVLRSLCNRKKCICNFKIKLLQKKKEKT
>lcl|comp165144_c0_seq4 TNF15
HYGRGRQWKAAKKLGLLSFTDTGLKKNGRRFCRRVGSVVNRTTLRQFASGGMARDICVYKVGAPHPLQSSCTPAL
LNIQVSLQIPGEMPMEVTAPLRELEAPSGNRRKCPRALFYLSLIMMVTMAQVLAASGLSFYFLRKQQQPVATPNRE
KCITHWAAASKQRQGENQALAWETTMGDVRTGNCTWYNENKNSLEIQSPGDYFIYIQVMLLDTWKNVTFSVDE
RKTYHTRNSTGSSNTLYFGTTCKVRAGAAVSVILNPGHVLNGETETFLILFKLTISKLVARPRANVQFSYYCIVPTKVTF
SSLCITDNTENWLYSLTKIVVISGSLQSSLFVTVNIFCKKLRIKYFKDNAKKKKRK
>lcl|comp165144_c0_seq1 TNF15
FSFFFFSVIFKIFNSKLLFTENIYSYKRDDRLPLITTIFVSESQFSVLSVIHYNEEKVTLVGTIQLNWTFARGLATSCWSITT
ESEMFQSRHSGHDQGVPRRLPLLALCKWFQSIRCWNSLWSFGCDKSSAHRRMSHSSMCLTTSPGCRSNRQGFVF
PVSFYFHCTRCSYRCEHRPSWSPKRVPDSRPVSVWTQLPNELYISLCWVCSPVLPLQNDIVTQPRIHSQTAPPDGRV
SAASAIGHICLPARNKKAWPRLAQSRLRQQELRQGVEGKQQANSSRVGFLY
>lcl|comp165144_c0_seq3 TNF15/9/CD97
TFYFKVTLTVRYSASPSNPFWLCFSQQTPNREKCITHWAAASKQRQGENQALAWETTMGDVRTGNCTWYNENK
NSLEIQSPGDYFIYIQVMLLDTWKNVTFSVDERKTYHTRNSTGSSNTLYFGTTCKVRAGAAVSVILNPGHVLNGETET
FLILFKLTISKLVARPRANVQFSYYCIVPTKVTFSSLCITDNTENWLYSLTKIVVISGSLQSSLFVTVNIFCKKLRIKYFKDN
AKKKKRK
>lcl|comp173450_c0_seq2 TNF10/11/CD40/15/A
KGFFFVGYFLAGAQMIVLFTPRDAMAGQYSRSSSSESSVPMIRTAAPANSTLIVGAVFGAVALLLSIVSTAGMFVYF
TREISKVKNESVSQELWCLNSPQRFEQLISTETEDLSDQIKKDPCWRLANSAKLLATKIADKTNKKHIIQGVMDNIPV
DVLKSAAEFNDKPAAVPLLISLYGSLLHKMCLTITSRAVTLLNGALERGYPTFRTHSIMENCEPVKRADTTFMLRPISDI
STRQEMIMMTTLQTRMSSSLCSAYIRQLPMHHQSSSKVWEQNVRQRMLSMVIPSTKEDFLNGWNICHHFHKVS
GCGILLWVLTGYVKSFNCRKRSLFRRKSLHKTITIRSVILACFLFKALLNQICRSFDLLTLSADKAVKWNGLTLLMCTVR
ESVSFLSSSWYTYFAHVLIIEARTATSLLCDCTMIFNGTDRNKSGQSSSQRHLLFLWEGTAPSGNEDRGIPKVGKREV
VFWKLKVSATTGRFWRSGQKEECLIPKLRHPQECPIPKPSPRSLDMSNSKTAKMGKDKLSLQNLSDKCFKRNEGTLY
FQGSAIIFSSILFNMRENLACMHYNRTKQVHLNQTQYSLNEGKKKFDAPPLFSAWVMIESTFLVNLSMVDHTSGQL
GMGIVPQQGWVCTGLLPAADWAKNPLSNSKNRKNIIIFVSNRYKRSALA
>lcl|comp147674_c0_seq4 TNF10
LQKGFNNKKALKNIHFSKIISSPLSPHLIHFSGGTSLTKLLVTCPRTEFDNRLMKPLGTIFNLKGTMQMQVNAVDVPF
LSAAGSVLGISSRDPHSFPFSPRWDRRRIPHLPWGQREDAECPPSGRPCCSWRRTPVPVLTRCESTRRSGSCTHTVA
GGQSSGQSPISGQSNCCWSPSMTLAVSLRQMFAEERSFLLSSSPAAV
>lcl|comp147674_c0_seq2 TNF10
46
LRPRGCCSTKNSAPQLKTFALNLQRASFRMGTSSNSITAQRLGFALSSVLLLQCVCMTLTFLYFHSELKQVQESFSKN
NMACLTEDILHLHAGPKGDEEFYVDPTGEKTERNEDLCWKFRAQIQQLIEKTISNQYEQDMSVIVKGKYILTVWIDIF
SKKCFRFDRFACVLMGEFESRFYPSRTEAPVLVRVREEVRSQESIRYSRGGQTTAR
47
Appendix 2. Clustal format alignments (MAFFT)
Red, highlighted text indicates putative start and stop codons
OX40L
Mumu_OX40 ----------------------------------------------------MEGGVQPL
Hosa_OX40 ----------------------------------------------------MR--VQPL
Chpi_OX40 ----------------------------------MGML-------------EME--AQRL
Gaga_OX40 --------------------MVCASALLVALLVEMGQPDTELKKHTDQKDCEKE--MRSD
Scca_OX40_proba ELPETPQFFISSMPWIFFHLSYCISIIVVVSL-QLGFMTF------------LRGKYSHL
Scca_OX40_proba F-----SFF------------------FVYEIFQTKTEDL-----------HVR------
Scca_OX40_proba F-----SFF-------FFSLLFLITTIFVSS--QFSVLSV----------------IHYN
Mumu_OX40 DENLENGSRRFKWKKTLLVVSGCLQLSS--SPAKDPPIQRLRGCEDGQLISYKYQTMEVQ
Hosa_OX40 EENVGNAARRFERNKLLLVASVCLHFSALQVSHRYPRIQSIKVYKKEKGITSQDEIMKVQ
Chpi_OX40 EEQLELERRQVRRNRLQLVSAV---------------LQWIEVYVPKEKIISWRGTMNVR
Gaga_OX40 DEWRGWQKGQAKRNTLYLVSAA---------------LKWTYIQSIAGVMLSAFTSIPVI
Scca_OX40_proba DD--TLVSRSLHAKIIHFI------------------INNTDQHAHAEADSEKKVLAWVK
Scca_OX40_proba -----------QKLKLHVASKAKL-------------IKWVGTKTSGKSLINNVLRLQHV
Scca_OX40_proba EEKVTLLARGLATSCWSITTESE--------------MFQSRHSGHDQGVPRRLPLLALQ
. :
Mumu_OX40 NNSVVIKCDGLYII---------YLKGSFFQ------------------------EVKI-
Hosa_OX40 NNSVIINCDGFYLI---------SLKGYFSQ------------------------EVNI-
Chpi_OX40 NLSIPIYCDGFYLV---------SLKGTIRA--------TQRNA-----------SLSL-
Gaga_OX40 NGSIMIPCDGLYVV---------SLKGVLSP--------DEKSS----------LKLMM-
Scca_OX40_proba NNAIVVPCDGLYLIYA-------HVSSNRTPLMEIKKKEDSKAYSPLITSMLRTIRMLLT
Scca_OX40_proba SGEI-IPLLGSHLLHSNCMAPKNSLSMEMLA-LGRSSNSSSSSFTIPDCFLSQLWKLPPM
Scca_OX40_proba SILWSFGCD--------------KSSARMSHSMCLTTRSNRQGFVFPVSFYFHCTRCSYR
. . .
Mumu_OX40 -------------------DLHFREDIS---MLNDGRRIVFTVVASLAFKDKVYLTVNAP
Hosa_OX40 -------------------SLHYQKDLF---QLKKVRSVNSLMVASLTYKDKVYLNVTTD
Chpi_OX40 -------------------KVYKRSDLC---VKVTEEVVDSVTVSEMRFKDEVYLEANRD
Gaga_OX40 -------------------KNTESKNLWERDVQNSSNAVDLITMLYLFAQNNIILSTSSN
Scca_OX40_proba NDN----------------SIHQDINSCYI-GQLLELNLGKRMLVNTSNQDRVMLDGLSG
Scca_OX40_proba SLKPQSER----------------------------------------------LNHLAE
Scca_OX40_proba CEHRPSKRVPSRPVSVWTNELYISLCVC---SPVLPLQNDIVTISVSAASGHICLPARNK
*
Mumu_OX40 TLLIVVQL
Hosa_OX40 TSLILIHQ
Chpi_OX40 NYLGLFLL
Gaga_OX40 TIFSLVLL
Scca_OX40_proba LFFGLFLL
Scca_OX40_proba CFWGMF--
Scca_OX40_proba AWLGVFLY
:.
48
GITRL
Mumu_GITRL -------------EEMPLRESSPQRAERCK-KSWLLCIA-LLLMLLSLGTLIYTSLK-PT
Hosa_GITRL -FLFSTALISPKMENMPLSHSRTQGAQRSSWKLWLFC-S-VMLLFLSFSWLIFIFLQLET
Chpi_GITRL NFLEN--------GNIPKEA---------------LC-T-LALLILVVPLLIVCSCLHSQ
Scca_GITRL_prob -FCRRVG--SVVNRTAPLRELEAPSGNRRKCPRALFYLSLIMMVTMVAASLSFYFLRKQQ
Scca_GITRL_prob ------------------------------------YFK-VTLTVRYASPNPFWLCFSQQ
: :
Mumu_GITRL AI------ESCMVKFLSSS----------KWHMTS---PHC---VNTSDGKLKILQSGTY
Hosa_GITRL AK------EPCMAKFPLPS----------KWQMAS---PPC---VNKSDWKLEILQNGLY
Chpi_GITRL SP------EICWAHTSPSDDH----GHYIKWNMT-----HCSSFVRNSNQKLEILQSGMY
Scca_GITRL_prob QPVATPNREKCITHWAAASKQRQGENQALAWETTMGDVGNCTWYNKNS---LEIQSPGDY
Scca_GITRL_prob ----TPNREKCITHWAAASKQRQGENQALAWETTMGDVGNCTWYNKNS---LEIQSPGDY
* * .: .. *. : * ..* *:* . * *
Mumu_GITRL LIYGQVIPVDKYIKDNAPFVVQIYKKNDVL-QLM-----NDFQILPIGGVYELHAGDNIY
Hosa_GITRL LIYGQVAPN--NYNDVAPFEVRLYKNKDMI-QLT-----NKSKIQNVGGTYELHVGDTID
Chpi_GITRL FIYAQVT----RIKEESYFTMMLYKDENLLNQMTGTNTGEKTASINFGRPYFLSKGEKLF
Scca_GITRL_prob FIYIQVM----LLDTNVTFSV----DE----RKTTRNSTGSSNTLYFGTTCKVRAGAAVS
Scca_GITRL_prob FIYIQVM----LLDTNVTFSV----DE----RKTTRNSTGSSNTLYFGTTCKVRAGAAVS
:** ** . * : .: : . .* : * :
Mumu_GITRL LKFNSKDHIQKTN--TYWGIILM
Hosa_GITRL LIFNSEHQVLKNN--TYWGIILL
Chpi_GITRL CTINSGGHILKDN-QTYWGLYKM
Scca_GITRL_prob VL---NGHVLNGETETFLILFKL
Scca_GITRL_prob VL---NGHVLNGETETFLILFKL
:: : : *: : :
49
FasL
Hosa_FasL ---------MQQP-----------FNYPYPQIYWVDSSASSPWAPPGTVLPCPT------
Mumu_FasL ---------MQQP-----------MNYPCPQIFWVDSSATSSWAPPGSVFPCPS------
Cami_FASL ---------MEQ-------------SYKTPHVYSVDGNPSFS------------------
Scca_FasL_proba ---------MQQ-------------NYLYPQVFMVDGSQNLN------------------
Scca_FasL_proba AITAAGLEDWEQLC----------VHVICPLKELVSLHKAYN------AWHCETLLLIMA
Scca_FasL_proba AITAAGLEDWEQLC----------VHVICPLKELVSLHKAYN------AWHCETLLLIMA
Scca_FasL_proba ------------------------GPCLCPRRY---------------------------
Scca_FasL_proba HYGRGTVNNLQQTSGRARGSFNRKWTILCPYRNWSVMQRRNLYS----------------
*
Hosa_FasL ---------SVPRRPGQRRPPPPPPP-PPLPPPPPPPPLPPLPLPPLKKRGN-HSTGLCL
Mumu_FasL ---------CGPRGPDQRRPPPPPPPVSPLPPPSQPLPLP--PLTPLKKK-D-HNTNLWL
Cami_FASL --------------------QYPIRPSCPV------------LVPTLKKQNT-DRIKACV
Scca_FasL_proba --------------------PYVPPPAWAL--------------PTLKKKRKLDWRTVGT
Scca_FasL_proba EVLMETLDISNP-------LTNAMPPNNKI--------------AQPEKKEKLQKLFTAI
Scca_FasL_proba EVLMETLDISNP-------LTNAMPPNNKI--------------AQPEKKEKLQKLFTAI
Scca_FasL_proba ------------------------------------------------------------
Scca_FasL_proba ---LQMLDGENP-------PSCHPTTAWPV------------PLQELRIEKKLGWLKICH
Hosa_FasL LVMFFMVLVALVGLGLGMFQLFHLQKELAELRESTSQMHTASSLEKQIGHPSPP-PEKKE
Mumu_FasL PVVFFMVLVALVGMGLGMYQLFHLQKELAELREFTNQSLKVSSFEKQIANPSTP-SEKKE
Cami_FASL ILILILALIILVLLACGIFFIFNLRSQIEKITQKQIKEAEGQH-AKFTGHKQPI-KPSEN
Scca_FasL_proba ILLLNLVLLALAGLALGTVYLLELQKEIQEMKQDNGGKGPVAH--KIIGAQNFT-QPPKD
Scca_FasL_proba ALLLILVTLMLSTWNA---------VNISKLQLKQVNSRKQRT-EMQ--------NQRQA
Scca_FasL_proba ALLLILVTLMLSTWNA---------VNISKLQLKQVNSRKQRT-EMQ--------NQRQA
Scca_FasL_proba ------------------------------------------------------------
Scca_FasL_proba FVIVSMLFIALAGLMFGTYYLHQLQQEVEGMKLKIDGTPQNRR-EKQIGSEERTDKPKSE
Hosa_FasL LRKVAHLTGKS-N---SRSMPLEWEDTYGIVLLSGVKYKKGGLVINETGLYFVYSKVYFR
Mumu_FasL PRSVAHLTGNP-H---SRSIPLEWEDTYGTALISGVKYKKGGLVINETGLYFVYSKVYFR
Cami_FASL IREAAHLTGKS-----SAANTLLWDDTKDIAFTRGVRYKDGGLIINETGLFFIYSKIYFR
Scca_FasL_proba LRIAAHLTGDSIT---AGSNPLVWDDQKGHAFTSGILYKDRGLVINKAGIFFVYSKIYFR
Scca_FasL_proba LKNAAHLTGMNVI---NSSTALIWDSVKGLAFVQGVSHSAEELIIIEAGFYLIYTKVSFG
Scca_FasL_proba LKNAAHLTGMNVI---NSSTALIWDSVKGLAFVQGVSHSAEELIIIEAGFYLIYTKVSFG
Scca_FasL_proba --------GLT-----ENSKMLTWKGTTGTAFMRGVDYQDEGLVVHAPGLFFIYSRILFR
Scca_FasL_proba VKMAAHLTGKTTTDHPEDSGRFLW----VMHFYKGFSTRMERSSMKLDSTLFTQKSISVE
* : : * : *. : . : . : .
Hosa_FasL GQSCNNL-PLSHKVYMRNSKYPQDLVMMEGKMMSYCT-TGQMWARSSYLGAVFNLTSADH
Mumu_FasL GQSCNNQ-PLNHKVYMRNSKYPEDLVLMEEKRLNYCT-TGQIWAHSSHLGAVFNLTSADH
Cami_FASL NLICVKDLELEQTLFVRTSRYPKDMILMETKKNKYCVVKDTVWYINSYQAGIFKLHNGDH
Scca_FasL_proba SYVCDRK-NLEHIVFKRTDRYRKELILMEAKKTNYC--SGNGWATNSYQAGIIHLLKGES
Scca_FasL_proba GQVCQSDMLIDHSVLKRSDRYPVAIEIMKTRNSNNCPGNNGQWSRDSFQSGIFQLRKGDR
Scca_FasL_proba GQVCQSDMLIDHSVLKRSDRYPVAIEIMKTRNSNNCPGNNGQWSRDSFQSGIFQLRKGDR
50
Scca_FasL_proba DVNSVNDRFQEQNVFKKTKQYHRDISLMRIIKHNYCAMNINAWSEST---GIFMLSEGDY
Scca_FasL_proba WNVRQS----------RCNKLSSAMSITYTTVQSTSTTAVAQEHGGKAPPGRNIPSEGVQ
: .: : : . . .. . ..
Hosa_FasL LYVNVSELSLVNFEESQTFFGLYKL-----------------------------------
Mumu_FasL LYVNISQLSLINFEESKTFFGLYKL-----------------------------------
Cami_FASL IYVNVSNTNLVSFDQTKTFFGLYKL-----------------------------------
Scca_FasL_proba IYVNVSDSRLVDFAESKTFFGLYKL-----------------------------------
Scca_FasL_proba IYVTVSNPAFVNFRQVSTFFGLHKLLICIFNACFHLLWVNDQA-------------LSLF
Scca_FasL_proba IYVTVSNPAFVNFRQVSTFFGLHKLLICIFNACFHLLWVNDQA-------------LSLF
Scca_FasL_proba IYVSVSYPDFVG-DEHSTYFELYWLFAL---PVFCILAHNSPA-----------------
Scca_FasL_proba LFVSVSHPRMVSTDELMTFFGAYKLIRPVWTMKLQITMLRDNCDTSQFKRFRCVHLLIVF
::*.:* ::. : *:* : *
51
TL1A
Mumu_TL1A EERSEP--S--PQQVYSPPR----------GKPRAHLTIKKQTPAHLKNQLSALHWEHDL
Hosa_TL1A --SHTKHIS--PFVTDAPLRA-------DGDKPRAHLTVVRQTPTHFKNQFPALHWEHEL
Chpi_TL1A DESMSRFLK--QQASPISLVS-------SAGKPRAHLTVKKQDISPMGNQLPVLQWEHER
Scca_TL1A_proba -------------MIRLP-VD-------TAEKPRAHLT-AKVTSSKKDGEYFALQWEHKN
Scca_TL1A_proba AQKSEAQIG--PEMIRLP-VD-------TAEKPRAHLT-AKVTSSKKDGEYFALQWEHKN
Anca_TL1A TTDGEVPTGKMPEAQRLDAHDHNCCNSTSIRKARTHLA-GAQKQSCMPGSLPILYWRND-
Opha_TL1A RTRWTRDVAKIPQQEPL-------------MRPRK------------EGDATEIWWTP--
:.* .. : *
Mumu_TL1A GMAFTKNGMKYINKSLVIPESGDYFIYSQITFRGTTSVCPDSITMVITKVADSY-PEPAR
Hosa_TL1A GLAFTKNRMNYTNKFLLIPESGDYFIYSQVTFRGMTSECPDSITVVITKVTDSY-PEPTQ
Chpi_TL1A GLAFTKNNMNYTNKSLMIPKAGDYYIYSQVTFRGPTPNHPVSITQTITKVTDSY-PEPTQ
Scca_TL1A_proba GLAFTKGELNYQDRALTIPRKGDYFVYSQVSFRGYASDMNSLIIHMITKLTPRY-PKPMQ
Scca_TL1A_proba GLAFTKGELNYQDRALTIPRKGDYFVYSQVSFRGYASDMNSLIIHMITKLTPRY-PKPMQ
Anca_TL1A -RTHEEDTITYSDGYLTVPETGDYFVYTQVSYRCPDGKCKTIITQIISKISSSYGSEPKQ
Opha_TL1A -FLQQGRALEPSGQEVVVRHTGLYFVYSQVLFHDPTF----TMGQVLRRVAL---GRPDQ
: . : : . * *::*:*: :: : : ::: .* :
Mumu_TL1A LLTGSKSVCEISNNWFQSLYLGATFSLEEGDRLMVNVSDISLVDYTKEDKTFFGAFL---
Hosa_TL1A LLMGTKSVCEVGSNWFQPIYLGAMFSLQEGDKLMVNVSDISLVDYTKEDKTFFGAFL---
Chpi_TL1A LLTATKTLCEMGNNWFQPIYLGAVLFMEEGDRLMVNVSEIQWVDYTKEDKTFFGAFL---
Scca_TL1A_proba LLSSTRSVNGEKSLWRMAIYLGAVFHLEQGDRIIVNVSSVKQVDFTNDHKTY--------
Scca_TL1A_proba LLSSTRSVNGEKSLWRMAIYLGAVFHLEQGDRIIVNVSSVKQVDFTNDHKTYFGAFL---
Anca_TL1A QLITSTSIGE-KEMWKKTLYLGGILELRERDKLMVNVSNPELVDIALPAMTFFGAFLRAN
Opha_TL1A ILFRC---------------IQRVFHLQQGDRLSLSIPRFASFDISTH-GTFLGVLR---
* : : :.: *:: :.:. .* : *:
52
RANKL
Cami_RANKL Q--RHTGHI-TES--TESSQSHHPQRT-PCAHLTMKRIKDSPKTKFIGLKGKPITSWEEK
Scca_RANKL_prob -----------------------MADTLRSPHYKKKTTRNSSETSFPGLKGKPIISWEEK
Scca_RANKL_prob K--TATQRF-AQG--IQSHEEEFMADTLRSPHYKKKTTRNSSETSFPGLKGKPIISWEEK
Hosa_RANKL QHIRAEKAM-VDGSWLDLAKRSKLEAQ-PFAHLTINA------TDIPSSHKVSLSSWYHD
Mumu_RANKL QRFSGAPAM-MEGSWLDVAQRGKPEAQ-PFAHLTINA------ASIPSSHKVTLSSWYHD
Chpi_RANKL EPPRSGKAI-MEAPWFDLSKRNKPEKQ-PFAHLIIDD------KNIPATRKVNLTSWHHD
Gaga_RANKL ESPRPEKAA-MEAIGMELYRRNKPEKQ-PFAHLIIDD------KNILTTRKVNLTSWHHN
Lach_RANKL GYSKTENVIRINPPFMDPAKKSEPERR-PTSHLTIKR------SKYSYSTLQLLTSWEDN
.* . . : ** ..
Cami_RANKL IGIAHTIEMSHNQGLLKIKHHGLYYIYAQILFRYHE----YQLVQYVYWKTANYRKPHLI
Scca_RANKL_prob VGLAHLREMRYSQGSLRIKHRGLYYIYAQILFRYHE----YQLIQYVYRKMSKYPKPTLI
Scca_RANKL_prob VGLAHLREMRYSQGSLRIKHRGLYYIYAQILFRYHE----YQLIQYVYRKMSKYPKPTLI
Hosa_RANKL RGWAKISNMTFSNGKLIVNQDGFYYLYANICFRHHETSGYLQLMVYVTKTSIKIPSSHTL
Mumu_RANKL RGWAKISNMTLSNGKLRVNQDGFYYLYANICFRHHETSGYLQLMVYVVKTSIKIPSSHNL
Chpi_RANKL KGQANLSNMTFRDGKLIVNQDGFYYLYANICFRHHETSGGLQLMVYMSKTNLKIRRSDVL
Gaga_RANKL KGQANLSNMTFSDGKLIVNQDGFYYLYANICFRHHETSGGLQLMVYMTKTNLKIRRSDVL
Lach_RANKL LGLAHASNMTYKDGKLKIHQDGFYYVYTNICFRHHETSG-IQLMLYMYHQKVTGSKPRYL
* *: :* :* * ::: *:**:*::* **:** **: *: . . :
Cami_RANKL LKGAGT-WCSGN—HNLYSVYLGGVFNLHRGDEIFVTVSNISLVDADETSSYLGAFKLN--
Scca_RANKL_prob LRGAST-WCSGN—HNLYSVYQGGVFSLLNGDKIFVTVSNVSLVDVDEASSYLGAFKMD--
Scca_RANKL_prob LRGAST-WCSGN—HNLYSVYQGGVFSLLNGDKIFVTVSNVSLVDVDEASSYLGAFKMD--
Hosa_RANKL MKGGSTKYWSGNSEFHFYSINVGGFFKLRSGEEISIEVSNPSLLDPDQDATYFGAFKVRDI
Mumu_RANKL MKGGSTKNWSGNSEFHFYSINVGGFFKLRAGEEISIQVSNPSLLDPDQDATYFGAFKVQDI
Chpi_RANKL MKGGSTKYWSGNSEFHFYSVNIGGFFKLKSGEVISIQVSNPSLLDSSQEATYFGAFKVRDL
Gaga_RANKL MKGGSTKYWSGNSEFHFYSVNIGGFLKLKTGDMISIQVSNPLLLDSSQEATYFGAFKVRDL
Lach_RANKL MKGGSTKYWSSHTEHNFYCIYQGGIFKLKSGDEIFLKISYASLLDQSQEASYFGAFKLRDL
::*..* *.: .::*.: **.:.* *: * : :* *:* .: ::*:****:
53
LIGHT
Mumu_LIGHT GDIVAHLPDGGKGSWEKLIQDQRSH-ANPAAHLTGANASLIGIGGPLLWETRLGLAFLR
Hosa_LIGHT --------DGPAGSWEQLIQERRSH-VNPAAHLTGANSSLTGSGGPLLWETQLGLAFLR
Chpi_LIGHT EKATSQVGEEAGAISEKSIQDRKLH-AEKPAAHVLGAATT--GNGSLQWEHSKDWAFLH
Anca_LIGHT TKATTQGGAEA----EKIVQGERPP-EKPAAHLTLGALINTAENGALLWEDKKGLAFLQ
Scca_LIGHT_prob -------------------EDMIFQDSTKPAAHVTGCNSS--GYDRLQWEHSKGLAFLR
Scca_LIGHT_prob QEFSSMF-LAAKNTSLRSIQDMIFQDSTKPAAHVTGCNSS--GYDRLQWEHSKGLAFLR
: :****: . . * ** . ***:
Mumu_LIGHT GLTYHDGALVTMEPGYYYVYSKVQLSGVGCPQLP-THGLYKRTSRYPKELELLVSRRSP
Hosa_LIGHT GLSYHDGALVVTKAGYYYIYSKVQLGGVGCP-LT-THGLYKRTPRYPEELELLVSQQSP
Chpi_LIGHT MRYQGGSLVCIKPGYYYIYSKIHLMAQSCSQNSVIYHSIYKKTPRYVEEMKLVENLILF
Anca_LIGHT MGYKDGVLIFNQSGHYYIYSKLLLGDSRCPSSQMVIHEICKRTSRYPREIVLLTNKILT
Scca_LIGHT_prob EIEYKEGALVIKRAGYYFIYSKILLGEIHCGDNGFLKQTLFKVGEAYPQPIELMAVRKF
Scca_LIGHT_prob IEYKEGALVIKRAGYYFIYSKILLGEIHCGDNGFLKQTLFKVGEAYPQPIELMAVRKFF
: *: * *: ..*:*::***: * * : : : * * . : *:
Mumu_LIGHT CGR-NSSRVWWDSSFLGGVVHLEAGEEVVVRVPGNRLVRPRDGTRSYFGAFMV-
Hosa_LIGHT CGRTSSSRVWWDSSFLGGVVHLEAGEKVVVRVLDERLVRLRDGTRSYFGAFMV-
Chpi_LIGHT CDG-KDNRFWRQNSFLGGIFHLEEEEQIYVKVSQRQMLQVRDGTRSYFGTFMI-
Anca_LIGHT CNL-QNTPGWRHNSFLAGMVHLEENEEVFVKVSHKQMVRVGDDTRSYFGTFMI-
Scca_LIGHT_prob CTG-ESTDQWLDNSYLGAIFDFSEGDEVFIKITEKKVLRISNGAETFFGAFMV-
Scca_LIGHT_prob CTG-ESTDQWLDNSYLGAIFDFSEGDEVFIKITEKKFE—SPTVQKHFWCFLM--
* ... * ..*:*..:..:. ::: ::: .:. ....* *::

More Related Content

What's hot

Eugene Koonin for Knowledge Stream
Eugene Koonin for Knowledge StreamEugene Koonin for Knowledge Stream
Eugene Koonin for Knowledge Streamspasibokep
 
Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2
Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2
Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2Victor Lobanenkov
 
thesis_AndreiaReis2014
thesis_AndreiaReis2014thesis_AndreiaReis2014
thesis_AndreiaReis2014Andreia Reis
 
J. Clin. Microbiol.-2014-Davidson-JCM.01144-14
J. Clin. Microbiol.-2014-Davidson-JCM.01144-14J. Clin. Microbiol.-2014-Davidson-JCM.01144-14
J. Clin. Microbiol.-2014-Davidson-JCM.01144-14PreveenRamamoorthy
 
ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...
ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...
ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...Rubin Baskir, Ph.D.
 
Mol Cell Proteomics-2013-Nakayasu-3297-309
Mol Cell Proteomics-2013-Nakayasu-3297-309Mol Cell Proteomics-2013-Nakayasu-3297-309
Mol Cell Proteomics-2013-Nakayasu-3297-309Rebecca Tempel
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18Jonathan Eisen
 
EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17Jonathan Eisen
 
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEMMonica Pava-Ripoll
 
A Review on the role of Wnt-4 in Stem Cell Biology
A Review on the role of Wnt-4 in Stem Cell BiologyA Review on the role of Wnt-4 in Stem Cell Biology
A Review on the role of Wnt-4 in Stem Cell BiologyGayathri Vijayakumar
 
Gray et al. 2004 Science copy
Gray et al. 2004 Science copyGray et al. 2004 Science copy
Gray et al. 2004 Science copyPaul Gray
 
Whats Beyond The Finished Human Genome Sequence
Whats Beyond The Finished Human Genome SequenceWhats Beyond The Finished Human Genome Sequence
Whats Beyond The Finished Human Genome SequenceJohn Morgan
 
Transposable elements of Agavoideae
Transposable elements of AgavoideaeTransposable elements of Agavoideae
Transposable elements of AgavoideaeKate Hertweck
 

What's hot (20)

Eugene Koonin for Knowledge Stream
Eugene Koonin for Knowledge StreamEugene Koonin for Knowledge Stream
Eugene Koonin for Knowledge Stream
 
Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2
Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2
Pugacheva et al. COMPLETE GB_16.1_p.161_publ.online_08_14_2015 2
 
thesis_AndreiaReis2014
thesis_AndreiaReis2014thesis_AndreiaReis2014
thesis_AndreiaReis2014
 
1-s2.0-S0923250811001872-main
1-s2.0-S0923250811001872-main1-s2.0-S0923250811001872-main
1-s2.0-S0923250811001872-main
 
2000 JME (51)278-285
2000 JME (51)278-2852000 JME (51)278-285
2000 JME (51)278-285
 
Space flight alters bacteria
Space flight alters bacteriaSpace flight alters bacteria
Space flight alters bacteria
 
J. Clin. Microbiol.-2014-Davidson-JCM.01144-14
J. Clin. Microbiol.-2014-Davidson-JCM.01144-14J. Clin. Microbiol.-2014-Davidson-JCM.01144-14
J. Clin. Microbiol.-2014-Davidson-JCM.01144-14
 
4
44
4
 
ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...
ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...
ABCG2pos lung mesenchymal stem cells are a novel pericyte subpopulation that ...
 
Austin Andrology
Austin AndrologyAustin Andrology
Austin Andrology
 
Mol Cell Proteomics-2013-Nakayasu-3297-309
Mol Cell Proteomics-2013-Nakayasu-3297-309Mol Cell Proteomics-2013-Nakayasu-3297-309
Mol Cell Proteomics-2013-Nakayasu-3297-309
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
 
EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17
 
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
2013_CarterEtal_MultiplexPCR-Cronobacter_ AEM
 
Janse_2013_EID_pla
Janse_2013_EID_plaJanse_2013_EID_pla
Janse_2013_EID_pla
 
A Review on the role of Wnt-4 in Stem Cell Biology
A Review on the role of Wnt-4 in Stem Cell BiologyA Review on the role of Wnt-4 in Stem Cell Biology
A Review on the role of Wnt-4 in Stem Cell Biology
 
Gray et al. 2004 Science copy
Gray et al. 2004 Science copyGray et al. 2004 Science copy
Gray et al. 2004 Science copy
 
Whats Beyond The Finished Human Genome Sequence
Whats Beyond The Finished Human Genome SequenceWhats Beyond The Finished Human Genome Sequence
Whats Beyond The Finished Human Genome Sequence
 
malaria_paper
malaria_papermalaria_paper
malaria_paper
 
Transposable elements of Agavoideae
Transposable elements of AgavoideaeTransposable elements of Agavoideae
Transposable elements of Agavoideae
 

Similar to Surveying the small-spotted catshark (Scyliorhinus canicula) tumour necrosis factor superfamily (TNFSF)

Molecular markers of innate immunity
Molecular markers of innate immunityMolecular markers of innate immunity
Molecular markers of innate immunitymahdi zarei
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsProf. Wim Van Criekinge
 
Fernando de la Cruz - Simposio Microbiología: Transmisión
Fernando de la Cruz - Simposio Microbiología: TransmisiónFernando de la Cruz - Simposio Microbiología: Transmisión
Fernando de la Cruz - Simposio Microbiología: TransmisiónFundación Ramón Areces
 
Proinflammatory Consequences of FasL Expression in the Heart
Proinflammatory Consequences of FasL Expression in the HeartProinflammatory Consequences of FasL Expression in the Heart
Proinflammatory Consequences of FasL Expression in the HeartElizabeth Berry
 
The genomes of four tapeworm species reveal adaptations to parasitism
The genomes of four tapeworm species reveal adaptations to parasitismThe genomes of four tapeworm species reveal adaptations to parasitism
The genomes of four tapeworm species reveal adaptations to parasitismJoão Soares
 
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Kate Hertweck
 
TIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription Elongation
TIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription ElongationTIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription Elongation
TIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription ElongationJoe Lee
 
Symptoms And Treatment Of Trypanosoma Brucei
Symptoms And Treatment Of Trypanosoma BruceiSymptoms And Treatment Of Trypanosoma Brucei
Symptoms And Treatment Of Trypanosoma BruceiJennifer Baker
 
J Neurosci 2006
J Neurosci 2006J Neurosci 2006
J Neurosci 2006Raul Pardo
 
Opinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in GeneticsOpinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in GeneticsSurekha Tippisetty
 
Gerstner et al J Neurosci 2012
Gerstner et al J Neurosci 2012Gerstner et al J Neurosci 2012
Gerstner et al J Neurosci 2012jrgerstn
 

Similar to Surveying the small-spotted catshark (Scyliorhinus canicula) tumour necrosis factor superfamily (TNFSF) (20)

H2A Case Study
H2A Case StudyH2A Case Study
H2A Case Study
 
Molecular markers of innate immunity
Molecular markers of innate immunityMolecular markers of innate immunity
Molecular markers of innate immunity
 
TNF
TNFTNF
TNF
 
Biochemistry Poster
Biochemistry PosterBiochemistry Poster
Biochemistry Poster
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogenetics
 
Fernando de la Cruz - Simposio Microbiología: Transmisión
Fernando de la Cruz - Simposio Microbiología: TransmisiónFernando de la Cruz - Simposio Microbiología: Transmisión
Fernando de la Cruz - Simposio Microbiología: Transmisión
 
Proinflammatory Consequences of FasL Expression in the Heart
Proinflammatory Consequences of FasL Expression in the HeartProinflammatory Consequences of FasL Expression in the Heart
Proinflammatory Consequences of FasL Expression in the Heart
 
Role of Wnt Signaling in Cancer
Role of Wnt Signaling in CancerRole of Wnt Signaling in Cancer
Role of Wnt Signaling in Cancer
 
The genomes of four tapeworm species reveal adaptations to parasitism
The genomes of four tapeworm species reveal adaptations to parasitismThe genomes of four tapeworm species reveal adaptations to parasitism
The genomes of four tapeworm species reveal adaptations to parasitism
 
440
440440
440
 
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
 
TIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription Elongation
TIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription ElongationTIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription Elongation
TIF1-gamma Controls Erythroid Cell Fate by Regulating Transcription Elongation
 
Symptoms And Treatment Of Trypanosoma Brucei
Symptoms And Treatment Of Trypanosoma BruceiSymptoms And Treatment Of Trypanosoma Brucei
Symptoms And Treatment Of Trypanosoma Brucei
 
Sandlund
SandlundSandlund
Sandlund
 
mmmmm
mmmmmmmmmm
mmmmm
 
J Neurosci 2006
J Neurosci 2006J Neurosci 2006
J Neurosci 2006
 
Opinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in GeneticsOpinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in Genetics
 
d
dd
d
 
PhD thesis_Opt
PhD thesis_OptPhD thesis_Opt
PhD thesis_Opt
 
Gerstner et al J Neurosci 2012
Gerstner et al J Neurosci 2012Gerstner et al J Neurosci 2012
Gerstner et al J Neurosci 2012
 

Surveying the small-spotted catshark (Scyliorhinus canicula) tumour necrosis factor superfamily (TNFSF)

  • 1. Honours Research Project Report BI4016 2015/2016 School of Biological Sciences Surveying the small-spotted catshark (Scyliorhinus canicula) tumour necrosis factor superfamily (TNFSF) Fiona Bakke Supervisor: Dr Helen Dooley Work completed in partial fulfilment of the requirements for a degree in MSci Biological Sciences Word count: 8548
  • 2. 1 Abstract The S. canicula transcriptome has recently become available and was used to identify TNFSF ligand genes and compare these to TNFSF ligand genes in other vertebrates. As a cytokine subgroup, the TNFSF enables immunomodulatory signalling between cells, being primarily associated with cell survival, inflammation and apoptosis. Genetic polymorphisms in the TNFSF have been implicated in the development of tumours and inflammatory and autoimmune diseases in humans, and the molecular biology of the human TNFSF has been investigated in order to develop therapeutics. These therapeutics are often associated with undesirable side effects, creating a need for alternate methods of their development. One such method is the investigation of the evolutionary history of the TNFSF, so that TNFSF characteristics present in other vertebrates may be identified and exploited. As part of this investigation, a phylogenetic tree of vertebrate TNFSF ligand genes was generated, to produce the most complete representation currently available of the evolutionary history of these genes. S. canicula did not appear to express 6 of the 18 human TNFSF genes (TNF-α, TNF-β, CD27L, 4-1BBL, TWEAK and EDA) but, with the exception of the pyrogenic TNFSF genes TNF-α and TNF-β, did appear to possess TNFSF genes fulfilling similar functions. Only one S. canicula LT-β sequence appeared to be present, with its position on the phylogenetic tree suggesting that it may have been an ancient precursor of TNF-α and TNF-β. The evolutionary development of CD30L, CD40L and TRAIL require further investigation since S. canicula sequences did not appear to share conserved regions with similar sequences from other vertebrates. S. canicula APRIL BAFF, OX40L, GITRL, FasL, TL1A, RANKL and LIGHT did appear to be present, with S. canicula also appearing to express BALM, a TNFSF gene currently identified only in cartilaginous fish and teleosts. Target S. canicula TNFSF genes were then amplified, cloned and sequenced, and the presence of the 3’ end of RANKL confirmed.
  • 3. 2 Keywords. Scyliorhinus canicula; tumour necrosis factor superfamily; comparative immunology; phylogenetic analysis. 1. Introduction The tumour necrosis factor superfamily (TNFSF) is a subgroup of the larger cytokine family (Dinarello, 2000; Opal and DePalo, 2000). Cytokines are small (up to 40 kDa) proteins and glycoproteins, associated with both the humoral and innate immune systems, which regulate and direct immune responses following their secretion (Astrakhantseva et al., 2014). Immune cells which secrete cytokines include dendritic cells (DCs), macrophages, and B and T lymphocytes, although other cells, such as epithelial and stromal cells, also possess this capability (Dinarello, 2000; Opal and DePalo, 2000). Cytokine secretion enables immunomodulatory signalling between cells through autocrine, paracrine and endocrine means and, depending on the cytokines secreted and the nature of their signals, causes targeted cells to respond to an immune insult through their activation, differentiation, growth or apoptosis (Borish and Steinke, 2003; Commins et al., 2010; Testar, 2015). A cascade of further cytokine secretion may also be generated until the immune response is complete (Testar, 2015). Within the larger cytokine family, the role of the TNFSF typically involves initiating inflammation and apoptosis, but also the development of lymphocytes, and epithelial and other tissues (Hehlgans and Pfeffer, 2005; Croft et al., 2013; Wiens and Glenney, 2011). The TNFSF consists of structurally-related ligands which are capable of binding to one or more
  • 4. 3 transmembrane protein receptors; some receptors may also be able to bind to more than one ligand (Bossen et al., 2006; Vince & Silke, 2006; Roosnek et al., 2009), resulting in a complex network of interactions between ligands and receptors. Ligands may be either membrane bound or secreted in soluble form following proteolytic processing, whereas receptors are principally membrane bound (Glenney and Wiens, 2007; Vujanovic, 2011). In addition to the complexity of ligand-receptor interaction and the ability of a wide variety of cells to express both, the effects of binding can also vary, since the form in which a ligand is expressed determines whether its stimulus is antagonistic or inhibitory (Aggarwal, 2003; Robert, 2015). All these factors, together with TNFSF involvement in inflammation and apoptosis, can lead to unwanted harmful effects such as tumour growth and allergic or autoimmune responses (Figure 1) (Adipogen, 2013). Figure 1. TNFSF members associated with human inflammatory and autoimmune diseases. Abbreviations: GVHD: Graft-versus-host disease; MS: multiple sclerosis; IBD: irritable bowel disease. Adapted from Croft et al., 2012.
  • 5. 4 In recent years, much work has been done to investigate the role of the TNFSF human disease conditions in order to develop mitigatory and preventative therapeutic agents (Wang and Yang-Xin, 2005; Roosnek et al., 2009; Croft et al., 2012; Siakavellas et al., 2015; Walczak, 2015; Collette et al., 2003). This work has largely focused on the molecular pathways of the TNFSF in humans, and some effective therapeutic agents have emerged (Adipogen, 2013; Grewal, 2009; Astrakhantseva et al., 2014; Sedger and McDermott, 2014). However, due to the complexity of the TNFSF mechanisms, undesirable side effects to these agents frequently occur (Tansey and Szymkowski, 2009). Alternate methods of developing therapeutic agents are therefore necessary, and investigating the evolutionary history of the TNFSF provides one such means, since it may identify TNFSF characteristics present in mammals which are not yet known. Human TNFSF genes (Table 1) are organised in clusters on major histocompatibility complex (MHC) paralogous chromosomes 1, 6, 9 and 19, suggesting that the mammalian TNFSF may have evolved from an earlier, more rudimentary form during the two rounds of whole genome duplication (WGD) that occurred around 500-800 million years ago (mya) (Biswas et al, 2015; Wiens and Glenney, 2011; Collette et al, 2003). Additional human TNFSF genes are located on chromosomes X, 3, 13 and 17, possibly as a result of cis and/or trans duplications during their further evolution (Glenney and Wiens, 2007). Teleosts underwent a further WGD approximately 350-320 mya, with salmonids and cyprinids undergoing a fourth WGD approximately 70-80mya (Macqueen and Johnston, 2014). The resultant genomic complexity of teleosts, due to these further WGD events, makes direct comparison of the evolution of their TNFSF with that of humans problematic. Exploration of the evolutionary relationships between the TNFSF in humans and other vertebrates which did not undergo these further
  • 6. 5 WGDs may therefore provide a clearer representation of the ways in which the vertebrate TNFSF evolved over time. Table 1. Human TNFSF ligand gene names. Adapted from Gray et al., 2015. Name Synonyms TNFSF Number TNF-α TNF, tumour necrosis factor, DIF 1 TNF-β Lymphotoxin-α, LT 2 LT-β Lymphotoxin-β, p33 3 OX40L CD252, gp34, TXGP1 4 CD40L HIGM1, IMD3, TRAP, gp39, Hcd40l, CD154 5 FasL APT1LGI, CD178 6 CD27L CD70 7 CD30L CD153 8 4-1BBL 9 TRAIL Apo-2L, TL2, CD253 10 RANKL TRANCE, OPGL, ODF, CD254 11 TWEAK DR3LG, APO3L 12 APRIL CD256 13 BAFF THANK, BLYS, TALL-1, CD257 13b LIGHT LTg, HVEM-L, CD258 14 TL1A TL1, VEGI, VEGI192A, MGC129934, MGC129935 15 GITRL AITRL, TL6, hGITRL 18 EDA Ectodysplasin A, EDA1, HED, XHED, ED1-A1, ED1-A2, EDA-A1, EDA-A2 Not allocated Research into the TNFSF of other vertebrates however, remains limited (De Zoysa, Jung and Lee, 2009; Biswas et al., 2015). Until now, no characterisation of the TNFSF in cartilaginous fish has been performed, and similar information is only available for a small number of amphibians, reptiles, birds and non-human mammals. In 2007, Glenney and Wiens identified and characterised 71 TNFSF members in different teleosts, using the structure of membrane bound TNFSF ligands in mammals as a basis. These ligands possess a relatively well conserved (20-30%) TNF homology domain within their extracellular C terminus, which facilitates their comparison with ligands of other species. In this project we used the information obtained by Glenney and Wiens (2007) to detect possible TNFSF orthologs in two species of chondrichthyes, Scyliorhinus canicula (small-spotted catshark, subclass Elasmobranchii) for which in-house transcriptomic data are now available (Dooley et al, unpublished data), and
  • 7. 6 Callorhinchus milii (elephant shark, subclass Holocephali), the only cartilaginous fish for which the genome has been sequenced. Bioinformatic searches of these protein sequences were performed to identify orthologs in other species. This information was then used to generate a phylogenetic tree, which provided an indication of possible evolutionary relationships between TNFSF members in mammals, birds, reptiles, amphibians, teleosts and cartilaginous fish. Alignment of TNFSF sequences allowed the identification of conserved domains among individual TNFSF sequences for different species, enhancing the information provided by the phylogenetic tree. Amplification, cloning and sequencing of S. canicula TNFSF sequences which demonstrated both phylogenetic clustering and the sharing of conserved domains with similar sequences for other species were performed to confirm the presence of these sequences. We hypothesised that, since cartilaginous fish were the first vertebrates to possess a canonical adaptive immune system, S. canicula TNFSF gene orthologs would be found for TNFSF genes in other vertebrates and that more recently evolved vertebrates would also possess TNFSF genes not present in S. canicula that arose from gene duplication and rearrangement following their divergence from their common ancestors. Given the chromosomal clustering of human TNFSF genes, and the fact that the genes in these clusters often share some functional similarities (Collette et al., 2003), we further hypothesised that S. canicula would possess an ortholog for at least one human TNFSF gene from each chromosomal cluster. 2. Materials and methods 2.1 Bioinformatics
  • 8. 7 Accession numbers of TNFSF members previously identified in teleosts (Glenney and Wiens, 2007) were used to obtain protein sequences from the genomic databases of NCBI (http://w ww.ncbi.nlm.nih.gov/protein/) and Ensembl version 82, September 2015 (http://www.ense mbl.org/index.html). Nucleotide sequences were translated to amino acid sequences using the Expasy Translate tool (http://web.expasy.org/translate/). The teleost TNFSF sequences were used to BLAST search (Altschul et al., 1997) S. canicula transcriptomes available via the catshark in-house transcriptome database, the Chondrichthyes Transcriptome Resource (Jones and Blaxter, 2013) http://afterparty.bio.ed.ac.uk/study/show/6232601), SkateBase (http://skatebase.org/skateBLAST) (Wyffels et al., 2014; Wang et al., 2012), and the SwissProt (http://www.ebi.ac.uk/uniprot) database. All S. canicula sequences identified were then BLASTed against the NCBI protein database to confirm the identity of the S. canicula hits as TNFSF members. 2.2 Sequence alignment and phylogenetic analysis Evolutionary relationships among these sequences were then investigated. S. canicula sequences were combined with previously obtained TNFSF sequence information from S. milii (Venkatesh et al., 2014), and with similar data for representative members of other vertebrate classes (Homo sapiens [human], Bos taurus [domestic cow], Mus musculus [house mouse], Gallus gallus domesticus [chicken], Python sebae sebae [African rock python], Anolis carolinensis [Carolina anole lizard], Chrysemys picta [painted turtle], Xenopus tropicalis [Western clawed frog], Danio rerio [zebrafish], Onchorhynchus mykiss [rainbow trout], Latimeria chalumnae [African coelacanth]). All sequences were aligned using MAFFT (Katoh et al., 2002). Highly variable regions were removed using BioEdit software (Hall, 1999). The resulting data were then used to construct a phylogenetic tree in IQ-TREE (Nguyen et al.,
  • 9. 8 2015; Minh, Nguyen and von Haeseler, 2013) and FigTree (Morariu et al., 2009) phylogenetic software. A maximum likelihood tree, using 164 TNFSF sequences containing 493 amino acid sites, was produced, using 1000 bootstrap replications to obtain branch support values. The S. canicula TNFSF member sequences which formed clades with similar sequences from other vertebrate classes, suggesting that they emerged early in the development of the adaptive immune system and also that they were maintained during the divergence of more recently evolved vertebrates, were then considered individually. The TNFSF nucleotide sequences for all vertebrates in such clusters were aligned once more, again using MAFFT and BioEdit software, to identify regions of similarity and the presence of conserved TNFSF domains. 2.3 Primer design TNFSF sequences which contained conserved TNFSF domains were used to design primers for PCR amplification of the S. canicula sequences. RACE-PCR was performed for those sequences which appeared similar to those for H. sapiens and M. musculus, but which were incomplete at the 3’ and/or 5’ ends (Sambrook and Russel, 2001). Primers were designed to try to fulfil the following conditions: to anneal within the evolutionarily most conserved region; to contain conserved cysteine (C) and/or tryptophan (W) amino acids; to have an A or T at the 3’ end and a G or C at the 5’ end; to be 16-24 base pairs (bp) long; to have an optimal melting temperature of ~60oC; to have a difference in melting temperature for forward and reverse primers of ≤ 4oC, and to have a GC content of 40-60%. Oligonucleotide Properties Calculator (www.basic.northeastern.edu/biotools/oligocalc.html) was used to confirm that these conditions had been fulfilled, and also that there were no self- annealing sites or areas of potential hairpin formation, self-dimerization or complementarity
  • 10. 9 (Kibbe, 2007). Primers were purchased from Sigma Aldrich Co. LLC. One forward and one reverse primer was designed for each target sequence to be amplified using PCR, with two forward and two reverse primers being designed for those sequences being amplified using RACE-PCR (Table 2). Table 2. Primers used for amplification of FasL, TL1A, RANKL and LIGHT TNFSF ligands Name Nucleotide Sequence (5’-3’) ScFasL-F1 ACAAGTGTTTATGGTGGATGGC ScFasL-R1 GTCAATGTGTCAGATTCAAGGCTA ScTL1A-F1 ATGGCGTATGGCTATCTATTTGG ScTL1A-R1 TTCGCCCTTCGTAAACGCAAG ScTL1A-F2 AGTGTAAAGCAGGTGGATTTTACC ScTL1A-R2 AAGTCCGTTCTTGTGTTCCCAC ScRANKL-F1 TTGGTGCTCGGGCAATCATAAC ScRANKL-R1 TAGGGTGTCAGCCATGAATTC ScRANKL-F2 TCTGCTCAATGGCGACAAAATC ScRANKL-R2 TGAATTCTTCCTCATGCGATTGG ScLIGHT-F1 AATTCTTCTGCACTGGGGAGAG ScLIGHT-R1 TCTACATACAGCACACATCACAC ScLIGHT-F2 AATAGCTACCTGGGGGCAATC ScLIGHT-R2 ATGTCTTCAGCAGGAACTCTAC ScEF1a-F1 CGTCTTCCTTTATTGCACAGGTTATTATC ScEF1a-R1 GGACAGCGAAACGACCAAGAG UPM Long (0.4µM) CTAATACGACTCACTATAGGGCAAGCAGTGGTATCAACGCAGAGT UPM Short (2µM) CTAATACGACTCACTATAGGGC NUP (10µM) AAGCAGTGGTATCAACGCAGAGT Redesigned Primers ScFasL-F1a ATCCATATGTGCCTCCACCTGCC ScFasL-R1a GGCTAGTTGACTTTGCTGAGTCCA ScTL1A-F1a TCGGAAATTCTTCTGCACTGGGG ScTL1A-R1a ACACATCACACATCAGGCAGCGC ScTL1A-F2a AGTTTTGAGAATCTCCAACGGTGC ScTL1A-R2a ACCCAGTTACATGAGCTGCAGGC ScRANKL-F1a TTGGTGCTCGGGCAATCATAAC ScRANKL-R1a TAGGGTGTCAGCCATGAATTC ScRANKL-F2a TCTGCTCAATGGCGACAAAATC ScRANKL-R2a TGAATTCTTCCTCATGCGATTGG ScLIGHT-F1a TCGGAAATTCTTCTGCACTGGGG ScLIGHT-R1a ACACATCACACATCAGGCAGCGC ScLIGHT-F2a AGTTTTGAGAATCTCCAACGGTGC ScLIGHT-R2a ACCCAGTTACATGAGCTGCAGGC T7 Universal Sequencing Primer TAATACGACTCACTATAG
  • 11. 10 2.4 Amplification of target sequences from S. canicula spleen cDNA PCR was used to amplify target sequences which appeared complete, using a PCR reaction of 10µl MyTaq Red Reaction Buffer (Bioline), 2.5µl of cDNA template, 1µl of both forward and reverse primers, and 1µl MyTaq (Bioline), and 34.5µl PCR grade water, to achieve a final volume of 50µl. S. canicula spleen oligo-dT primed cDNA was used for the template, since this tissue is a secondary immune organ in sharks and therefore would be expected to highly express TNFSF genes (Biswas et al., 2015; Schneider, Potter and Ware, 2004). To confirm template integrity, and that the PCR reagents and PCR cycler were functioning correctly, elongation factor 1 alpha (EF-1-alpha-1) was used as a positive control, as this gene is expressed ubiquitously and has previously been assessed as suitable for this purpose (Infante et al, 2008; Olsvik et al., 2005). The negative control, to confirm absence of contamination of experimental samples, was PCR grade water in place of cDNA. A standard PCR procedure was applied, using an initial cycle of 2 min at 95oC to denature the template (subsequent denaturation cycles lasted 30 sec), 30 sec at a temperature determined by primer requirements to allow annealing, and 1.5 min at 72oC for primer extension along the cDNA template. PCR cycles were repeated 30 times to generate maximum DNA yield (approximately 1 million copies of the target sequences) (Sambrook and Russel, 2001). Touchdown PCR was also performed, across an annealing temperature range of 63OC-45oC, in decreasing increments of 0.4OC, in order to ensure that optimal annealing temperature had been applied. RACE PCR was used to obtain a complete open reading frame from target sequences which appeared to lack 3’ and/or 5’ ends. RACE cDNA was prepared using the SMARTer RACE cDNA kit according to the manufacturer’s protocol (Clontech). First round RACE reactions used 5µl MyTaq Red Reaction Buffer (Bioline), 2.5µl cDNA template, 0.5µl
  • 12. 11 forward primer 1 or reverse primer 1, 0.5µl Universal Primer Mix (UPM, Clonetech), 0.5µl MyTaq (Bioline), and 17.25µl PCR grade water to attain a final volume of 25µl. EF-1-alpha-1 was again used for the positive control, and PCR grade water as the negative control, with S. canicula spleen RACE cDNA being used as the template. Second round RACE reactions used 10µl MyTaq Red Reaction Buffer (Bioline), 5µl round 1 RACE PCR product, 1µl nested universal primer (NUP), 1µl MyTaq (Bioline) and 32µl PCR grade water, to make a final volume of 50µl. 2.5 Visualisation of PCR and RACE PCR amplified sequences Agarose gel electrophoresis was performed following PCR/RACE PCR to enable separation and visualisation of amplified products (Sambrook and Russel, 2001). Agarose gels were prepared using 0.5g molecular grade powdered agarose (Bioline) and 50ml TAE (Tris-Acetate-EDTA 50x, Severn Biotech, at a dilution of 200ml in 10L water). 1-4µL SYBR Safe DNA Gel Stain (Bioline) was also added, which intercalates into any DNA present and fluoresces in the presence of ultraviolet (UV) light, enabling visualisation of the DNA. The solidified gel was loaded into a horizontal gel rig (Bio-Rad) and was submerged in TAE buffer. Samples were loaded into wells with a molecular weight size marker (3µl 100bp hyperLadder, Bioline) being loaded into a neighbouring well to provide a visual guide to DNA molecule size (base pairs [bp]). Gels were run at 100v for 60 min then visualised on a UV transillumination system (BioDoc-It). 2.6 DNA cloning of amplified sequences 2.6.1 Extraction and purification of DNA from agarose gel Products which had been amplified during PCR were prepared for cloning using the manufacturer’s instructions for the QIAquick Gel Extraction Kit (Qiagen). Visualised bands were removed from the gel with a clean scalpel and placed in 1.5ml Eppendorf tubes. Three
  • 13. 12 times the gel volume of QG buffer (Qiagen) was added to the gel and the tubes incubated in a 50oC water bath for 10 minutes to re-solubilise the agarose gel. The gel and an equal volume of isopropanol were pipetted into spin columns inserted into 2ml collection tubes and inverted. The sample was then transferred to a Qiaspin column (Qiagen) and centrifuged at 13000rpm for 1 minute, after which the flow-through was discarded. A further 500µl of QG buffer was added and the tubes centrifuged again at 13000rpm for 1 minute. After discarding the flow-through, 750µl PE buffer (Qiagen) was added and the tubes were left to rest for 5 minutes. They were then centrifuged again at 13000rpm for 1 minute. After discarding the flow-through, 30µl purified water was added and the flow-through collected for cloning. 2.6.2 DNA cloning using E. coli plasmids X-Gal/IPTG/LB plates were prepared by mixing 10g tryptone, 5g yeast extract, 10g NaCl and 20g agar per litre deionised water and autoclaving. After cooling, X-Gal (25ug/ml), 0.1mM IPTG (25µg/ml,) and ampicillin (100µg/ml) were added, before being poured into sterile petri dishes and allowed to set. Ligation reactions were prepared by mixing 5µl 2X Rapid Ligation Buffer, 1µl pGEM-T Easy Vector, 5µl PCR product and 1µl T4 DNA Ligase (all Promega) in 0.5ml Eppendorf tubes. Reactions were incubated at 4oC overnight. 2µl of each ligation mix was pipetted into separate 1.5ml Eppendorf tubes and 50 µl E. coli NEB 5-α heat shock-competent bacteria added, before incubating on ice for 20 mins. The bacteria were then heat shocked at 42oC for 30 secs in a water bath. Cells were then returned to ice for a further 2 mins. 950µl SOC medium (Sigma Aldrich) was added to each tube before incubating at 37oC for 1.5 hours. 100 µl of each culture was then plated onto the X-Gal/IPTG/LB plates and incubated for 12 hours at 37oC. White colonies were then selected from the plates and put
  • 14. 13 into LB broth (as above but without agar) containing 100 µg/ml ampicillin & grown overnight at 37oC with rotation. 2.6.3 Isolation of cloned DNA Plasmid DNA was isolated from the overnight bacterial cultures by applying the manufacturer’s instructions for the QIAprep Spin Miniprep Kit (Qiagen). Bacterial cells were pelleted by centrifuging at 13000rpm for 3 min before being resuspended in 250µl Buffer P1 and pipetted into sterile microcentrifuge Eppendorf tubes. 250µl Buffer P2 was added and the solution mixed by inverting until the mixture became blue. 350µl Buffer N3 was then added and immediately mixed to produce a colourless solution. The tubes were then centrifuged at 13000rpm for 10 minutes, after which the supernatant was pipetted into QIAprep spin columns. These were centrifuged at 13000rpm for 60s and the flow-through discarded. 500µl Buffer PB was added to each spin column, before centrifuging once more and again discarding the flow-through. 750µl Buffer PE wash buffer was then added to each spin column before again centrifuging and discarding flow-through. QIAprep spin columns were then placed into 1.5ml Eppendorf collection tubes and centrifuged for a further 1 minute to remove remaining wash buffer. Each QIAprep column was then placed in a new sterile 1.5ml microcentrifuge tube to which 50µl Buffer EB was added to the centre of the column and allowed to stand for 1 minute to elute the DNA. The tubes were then centrifuged for 1 minute and the flow-through collected and pipetted into new sterile 1.5ml Eppendorf tubes. 2.6.4 Nucleic acid preparation for sequencing A NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific) was used to calculate the nucleic acid concentration (ng/µl) in each sample by measuring the absorbance of UV-visible
  • 15. 14 light by 1µl of each sample. 15µl of sample at 75ng/µl was prepared and 2µl T7 Universal Sequencing primer at 10ng/µl added before sending for sequencing to Eurofins Genomics, Ebersberg, Germany. 3. Results 3.1 Bioinformatics Accession numbers published by Glenney & Wiens (2007) provided protein sequences for 53 of the 71 teleost TNFSF members identified in that study. Three sequences could not be acquired from the discontinued TIGR Gene Indices database (www.tigr.org/tdb/tgi/), and a further 15 had been retired from the Ensembl database without further successors being provided. The supplementary data provided 12 of these outstanding sequences, leading to a total of 65 sequences being identified. The 6 unavailable sequences included TNFSF members TWEAK, TL1A, lymphotoxin α and β, and EDA, but similar sequences for these TNFSF members had already been identified in other teleosts in that study. BLASTing these 65 sequences with transcriptomic data for S. canicula resulted in the identification of 30 potentially orthologous sequences (Appendix 1), representing 14 TNFSF members (Table 3). Further BLASTing of these sequences in the NCBI protein database resulted in the putative confirmation of 9 S. canicula TNFSF members (Table 3). Table 3. S. canicula potential TNFSF sequences obtained through bioinformatics searches. “+” indicates sequences which were obtained and “-“ indicates sequences which were not obtained. TNFSF member BLAST 1 AfterParty/SkateBase BLAST 2 (NCBI) TNFSF member BLAST 1 AfterParty/SkateBase BLAST 2 (NCBI) TNF-α + - RANKL + + TNF-β + - TWEAK - - LT-β - - APRIL + + OX40L + - BAFF + +
  • 16. 15 CD40L + - LIGHT + + FasL + + TL1A + + CD27L - - GITRL - + CD30L - - EDA + - 4-1BBL - - BALM + + TRAIL + + 3.2 Sequence alignment and phylogenetic analysis MAFFT Clustal alignment of these 30 sequences with similar sequences from representative members of other vertebrate classes (H. sapiens, B. taurus, M. musculus, G. gallus, P. sebae, A. carolinensis, C. picta, X. tropicalis, D. rerio, O. mykiss, L. chalumnae), and subsequent removal of highly variable regions, enabled production of an unrooted maximum likelihood phylogenetic tree, consisting of 164 TNFSF sequences containing 493 amino acid sites, and using 1000 replications to obtain bootstrap support values (Figure 2). This indicated that 10 TNFSF members were expressed by S. canicula.
  • 17. 16
  • 18. 17 Figure 2. Unrooted maximum likelihood phylogenetic tree of vertebrate TNFSF members. Coloured boxes denote TNFSF members where S. canicula forms a clade with similar sequences to vertebrates from other classes. Dotted boxes denote TNFSF members where S. canicula forms a clade only with teleosts and other cartilaginous fish. Bootstrap values >70% are considered indicative of a clade. Sequences include both cloned full-length amino acid sequences and predicted sequences obtained during bioinformatics searches.  Denotes a BALM sequence misannotated as a BAFF sequence in genomic and protein databases. Abbreviations: Almi: Alligator mississippiensis; Anca: Anolis carolinensis; Bota: Bos Taurus; Cami: Callorhynchus milii; Chpi: Chrysemys picta; Chpl: Chiloscyllium plagiosum: Dare: Danio rerio; Gaac: Gasterosteus aculeatus; Gaga: Gallus gallus; Hosa: Homo sapiens; Lach: Latimeria chalumnae; Mumu: Mus musculus; Onmy: Onchorhynchus mykiss; Opha: Ophiophagus hannah; Pyse: Python sebae; Sasa: Salmo salar; Sqac: Squalus acanthias; Taru: Takifugu rubripes; Xela: Xenopus laevis; Xetr: Xenopus tropicalis. TNFSF members which did not appear to be present in S. canicula included TNF-α, TNF-β, CD27L, 4-1BBL, TWEAK and EDA. The 2 putative S. canicula CD30L sequences did not group with the other CD30L sequences from H. sapiens, G. gallus, C. picta, A. carolinensis and X. tropicalis, nor with any other TNFSF sequences, raising questions about the identity of these 2 sequences, which will require further investigation. Similarly, the one suggested S. canicula sequence for LT-β appeared to be grouped together with a variety of different TNF sequences from other vertebrates, including those for TNF (which is a commonly used designator of TNF- α [Albarbar et al., in press]) and TNF-β, again rendering the identify of this S. canicula sequence unclear and in need of further investigation. The S. canicula CD40L sequence appeared more similar to the TRAIL and RANKL sequences for other vertebrates (TRAIL: H. sapiens, M. musculus, A. carolinensis, C. milii and L. chalumnae; RANKL: L. chalumnae, G. gallus, C. picta, M. musculus, H. sapiens, C. milii), than it did to CD40L sequences for other vertebrates (C. milii, L. chalumnae, M. musculus, H. sapiens, A. carolinensis, G. gallus, X. tropicalis). TNFSF sequences for APRIL appeared in two discrete clusters, with APRIL sequences for a cartilaginous fish, a lobe-finned fish, 2 teleosts and an amphibian (L. chalumnae, O. mykiss, S. salar, X. laevis and S. canicula) forming one group, and sequences for 2 mammals and a reptile (M. musculus, H. sapiens and A. carolinensis) forming a different group. Bootstrap value
  • 19. 18 between these two groups was 47%, therefore they could not be considered a clade. TWEAK sequences for 2 mammals, 2 reptiles and a lobe-finned fish (H. sapiens, M. musculus, C. picta, A. mississippiensis and L. chalumnae) also appeared to share some similarities with both groups of APRIL sequences. S. canicula TRAIL sequences did appear to group together with TRAIL sequences for H. sapiens, M. musculus, A. carolinensis, C. mykiss and L. chalumnae, but the S. canicula sequences did not cluster as closely together with the sequences for these 5 species as the sequences from these 5 species did with each other. S. canicula TNFSF sequences which did appear to cluster with similar sequences for other vertebrates in the phylogenetic tree included those for BALM, BAFF, OX40L, GITRL, FasL, TL1A, RANKL and LIGHT. Three S. canicula BALM sequences clustered with BALM sequences for O. mykiss, L. chalumnae, C. milii and a BAFF sequence for C. plagiosum, although the bootstrap value of 56% for this grouping precludes these sequences being described as a clade (Figure 3). The C. plagiosum sequence had previously been mis-annotated and had recently been identified as a BALM sequence (Li et al., 2015), although the genomic and protein databases had not been amended, leading to this sequence being shown as BAFF in Figure 2. BALM is a recently identified TNFSF member which has not yet been described in vertebrates other than cartilaginous and bony fish, and this study also identified BALM sequences only in these 2 vertebrate classes. BAFF sequences for representatives from all vertebrate classes did form a clade (bootstrap values > 70%) (Figure 3).
  • 20. 19 Figure 3. Phylogenetic grouping for TNFSF members BALM and BAFF. OX40L sequences from S. canicula, M. musculus, H. sapiens, C. picta and G. gallus formed a clade, with bootstrap values of 77% and above (Figure 4). However, examination of MAFFT sequence alignment did not support this (Appendix 2), since it revealed no conserved domains shared by S. canicula and other vertebrates. The 2 S. canicula GITRL sequences also formed a clade with similar sequences for H. sapiens, M. musculus and C. picta, with bootstrap values of 94% and above (Figure 4) but, again, MAFFT alignment did not reveal the presence of shared conserved regions (Appendix 2).
  • 21. 20 Figure 4. Phylogenetic grouping of TNFSF members OX40L and GITRL The S. canicula FasL sequence clustered with FasL sequences for another cartilaginous fish, C. milii (bootstrap value 85%), and two mammals (H. sapiens and M. musculus; bootstrap value 66%), but no amphibian, reptile or avian FasL sequences were identified as being similar (Figure 5). However, alignment did appear to indicate the presence of a number of conserved domains among the species for which this sequence was identified (Appendix 2). Alignment of S. canicula TL1A sequences with TL1A sequences for H. sapiens, M. musculus, C. picta, A. carolinensis and O. hannah suggested that these species shared conserved domains (Appendix 2) and would therefore be expected to form a clade with similar sequences from these sequences. However, although there was evidence of some grouping, the S. canicula sequences did not form such a clade (Figure 5).
  • 22. 21 Figure 5. Phylogenetic grouping of TNFSF members FasL and TL1A
  • 23. 22 RANKL sequences for H. sapiens, M. musculus, C. picta, G. gallus, L. chalumnae and C. milii formed a clade, with high bootstrap values (Figure 6), and alignment reinforced this through the apparent presence of a number of conserved domains (Appendix 2). Alignment of 2 S. canicula LIGHT sequences with that for 2 mammals (H. sapiens and M. musculus) and 2 reptiles (C. picta and A. carolinensis) indicated much similarity across the sequences (Appendix 2), and this was reflected in the subsequent clustering in the phylogenetic tree (Figure 6). Bootstrap values of 100% were also suggestive of close evolutionary relationship. Figure 6. Phylogenetic grouping of TNFSF members RANKL and LIGHT 3.3 Sequence completion and amplification Time constraints precluded further investigation of all putative S. canicula TNFSF members. Those that had not appeared to be present (TNF-α, TNF-β, CD27L, 4-1BBL, TWEAK, EDA) and those whose identity was unclear (CD30L, LT-β, CD40L, TRAIL) were therefore not selected for sequence completion, amplification and cloning, since further bioinformatic examination of these TNFSF members was required prior to commencement of these procedures. OX40L and GITRL also required further bioinformatic investigation due to alignment dissimilarities with
  • 24. 23 other vertebrate OX40L and GITRL sequences respectively. Since this project forms part of a larger study of the TNFSF of cartilaginous fish, 3 TNFSF members (BAFF, BALM and APRIL) were already being examined by colleagues in Dr Dooley’s team, and duplication of these efforts was considered undesirable. The remaining 4 TNFSF members, FasL, TL1A, RANKL and LIGHT were chosen for further investigation. Examination of amino acid and nucleotide sequences for each of these 5 sequences provided an indication of whether each sequence appeared to represent a complete or partial protein sequence. Only one TNFSF sequence, FasL, appeared to contain a complete open reading frame (Appendix 2). The remaining 4 sequences all appeared to lack both 5’ and 3’ ends (Appendix 2). PCR was therefore chosen for amplification of FasL, and RACE-PCR for completion and amplification of TL1A, RANKL and LIGHT sequences. Initial FasL amplification using PCR did not reveal a predicted ~600bp band following transillumination. PCR was repeated following changes to buffer and PCR grade water, but the same result was obtained. Primer redesign to include at least 3 G/C nucleotides at the 5’ end was carried out and the new primers used for FasL PCR, but no amplification was obtained. Annealing temperatures were adjusted through the use of Touchdown PCR, which used annealing temperatures between 63oC-49oC, decreasing in increments of 0.4oC after each cycle, to ensure that optimal annealing temperatures had been used, but no sequence amplification was achieved. RACE-PCR of TL1A was then attempted, after adopting the changes described above, but this also did not result in completion or amplification of this TNFSF sequence (expected band
  • 25. 24 ~700p). Since it was possible that the S. canicula spleen mRNA from which the template cDNA was generated may have degraded, this was replaced with spleen cDNA from a different mRNA sample and TL1A RACE-PCR repeated. No sequence amplification was obtained. Annealing temperature was lowered from 58oC to 53oC, and cycle number increased from 30 to 34, and TL1A RACE-PCR again repeated, but no sequence amplification was achieved. RACE-PCR of LIGHT and RANKL was then carried out, again incorporating previous procedural amendments. No sequence completion or amplification of either of these TNFSF members was achieved (Figure 7: 1 & 2 show results for 3’ ends for both RANKL and LIGHT). Different S. canicula tissues were then substituted as cDNA templates. The use of S. canicula liver cDNA template resulted in amplification of the 3’ end for both RANKL and LIGHT (Figure 7: 5 & 6), with the RANKL product appearing to be separated into 2 distinct bands. No amplification of the 5’ ends for either RANKL or LIGHT was achieved (Figure 7: 9 & 10). Similarly, the use of S. canicula epigonal cDNA template also resulted in amplification of the 3’ end for both RANKL and LIGHT (Figure 7: 12 & 13). The bands obtained for the 5’ ends of both RANKL and LIGHT were unclear (Figure 7: 16 & 17) and it was therefore not possible to estimate band size nor to be certain that these indicated amplification.
  • 26. 25 Figure 7. Scanned photo of electrophoresed gel under UV light showing amplification of 3’ ends of S. canicula TNFSF ligands LIGHT and RANKL, after RACE-PCR using S. canicula spleen, liver and epigonal oligo dT-primed cDNA as template. 1-4: S. canicula spleen cDNA template: 1: RANKL 3’ product; 2: LIGHT 3’ product; 3: positive control using EF1a F1 primer; 4: negative control using EF1A primer and H2O as template. 5-11: S. canicula liver cDNA template: 5: 3’ RANKL amplification; 6: 3’ LIGHT amplification; 7: positive control; 8: negative control; 9: 5’ RANKL product; 10: 5’ LIGHT product; 11: positive control. 12-18: S. canicula epigonal cDNA template: 12: 3’ RANKL amplification; 13: 3’ LIGHT amplification; 14: positive control; 15: negative control; 16: 5’ RANKL amplification; 17: 5’ LIGHT amplification; 18: positive control. 3.4 RACE-PCR product cloning and sequencing Cloning and sequencing of the amplified 3’ RANKL and LIGHT RACE-PCR products enabled comparison of these sequences with the previously obtained S. canicula nucleotide sequences used for primer design (Table 2 and Appendix 2). The LIGHT 3’ and RANKL 3’ cloned and sequenced RACE-PCR products obtained using S. canicula liver cDNA as a template shared no regions of similarity with the nucleotide sequences used to design their primers, indicating non-specific amplification, as was also the case for the LIGHT 3’ sequence with epigonal cDNA as the template. However, comparison of the RANKL 3’ sequence, with epigonal cDNA as the template, indicated completion of the 3’ end of the S. canicula RANKL transcript (Figure 8).
  • 27. 26 Figure 8. Comparison of RANKL primer design and sequenced 3’ end. Red text indicates nucleotide sequence used for ScRANKL-RACE-F2 primer; yellow highlighted area indicates completed 3’ end, including stop codon. 4. Discussion This study provides the first evidence of the existence of a number of TNFSF members in the cartilaginous fish, S. canicula. Bioinformatic searches suggested that some mammalian TNFSF members do not appear to be present in S. canicula (TNF-α, TNF-β, CD27L, 4-1BBL, TWEAK, EDA). It is possible that, although present in the animal, these were not represented in the in- house multi-tissue transcriptome database due to the types of tissue chosen for sequencing, the immunisation status of the animal whose tissues were used, and the fact that the sequence information derived from RNA samples from single tissues at single time points. Since variations in tissues, cell types and environmental conditions influence gene expression, the resulting vertebrate transcriptome information is almost certainly likely to be incomplete. However, this database contains 623430 S. canicula sequences (from the spleen, gills, blood, brain, epigonal, Leydig’s organ, spiral valve, stomach and liver), and has a Core Eukaryotic Genes Mapping Approach (CEGMA) completeness score of 100% partial and 96% complete,
  • 28. 27 suggesting a high degree of completeness and annotation reliability (Parra et al., 2007). Possible limitations created by the use of transcriptomic data were also mitigated by pooling the results of multiple different tissues and by normalising cDNA samples to maximise the expression of genes expressed at lower levels, making it less likely that sequence information for the above TNFSF members had been omitted from the databases. This is reinforced by the fact that, with the exception of one apparent sequence for 4-1BBL, another cartilaginous fish, C. milii, also does not appear to have the genes for TNF-α, TNF-β, CD27L, TWEAK or EDA in its genome or transcriptome. Thus, while it is not possible to confirm the absence of these TNFSF members in cartilaginous fish, and further work investigating their expression in other such species, such as Leucoraja erinacea (little skate), for which some transcriptomic data is now available, is required to corroborate this finding, the fact that two shark species demonstrate similar results does provide a preliminary indication that these TNFSF members may not be expressed by this group of animals. Other TNFSF sequences also require further investigation to confirm their identities and possible relationships to TNFSF members in other vertebrates (CD30L, LT-β, CD40L, APRIL, TRAIL, OX40L, GITRL). However, FasL, TL1A, LIGHT and RANKL TNFSF sequences did appear to be present and to share some conserved domains with similar sequences from other vertebrate classes. It has been hypothesised that the evolution of the vertebrate TNFSF increased in complexity during the development of the adaptive immune system, and involved WGD prior to the emergence of vertebrates, followed by a series of localised duplications and reorganisations of ancestral TNFSF genes (Glenney and Wiens, 2007). In humans, TNFSF genes tend to be located in clusters on 8 different chromosomes (Table 4), with genes on the same chromosome appearing to share some functional aspects (Collette et al., 2003). It is therefore
  • 29. 28 striking that, even though homologues for all mammalian TNFSF members did not appear to be present in S. canicula, at least one TNFSF member from each human “chromosomal cluster” was either putatively identified or demonstrated sequence similarities sufficient to warrant further investigation. This appears to support the hypothesis that the 19 mammalian TNFSF members evolved from a smaller number of ancestral TNFSF members and, given the chromosomal clustering of human TNFSF members, that this occurred on these chromosomes through localised gene duplication and reorganisation. Table 4. TNFSF human chromosomal organisation 1: S. canicula TNFSF sequences sharing conserved domains and forming a clade with similar sequences in other vertebrates; 2: S. canicula TNFSF sequences whose identity requires further investigation and confirmation. S. canicula sequences not identified included: TNF-α, LT-α, CD27L, 4-1BBL, EDA, TWEAK. S. canicula BALM is not included since this has not been identified in the human TNFSF. Human chromosomal location Human TNF ligands Putatively identified S. canicula TNFSF ligands 1 2 1 FasL, GITRL, OX40L FasL GITRL, OX40L 6 LT-β, TNF-α, LT-α LT-β 9 CD30L, TL1A TL1A CD30L 19 LIGHT, CD27L, 4-1BBL LIGHT X EDA, CD40L CD40L 3 TRAIL TRAIL 13 BAFF, RANKL BAFF, RANKL 17 APRIL, TWEAK APRIL In mammals, TNF-α, LT-α and LT-β are all involved in cell apoptosis, with TNF-α being a pyrogen, particularly during the acute phase of infection, and LT-α being an inflammation- mediator (Croft et al., 2012). LT-β also interacts with LT-α, promoting the function of the latter. Only one S. canicula LT-β sequence was putatively identified, and this appeared to group with mammalian, reptile and teleost sequences for TNF-α and LT-α. This sequences requires further evaluation to confirm its identity, but it is possible that a gene similar to this may have been present in the common ancestor of all vertebrates, acting as a precursor to the 3 human genes in this TNFSF subgroup and explaining why this S. canicula sequence
  • 30. 29 cannot be assigned to any one of these groups. The apparent absence, apart from the one possible LT-β sequence, of similar genes in S. canicula, and the fact that none of the other S. canicula TNFSF members are pyrogens, also suggests that the adaptive immune system in these animals may have a limited ability to mount an inflammatory response. It is possible that the inflammatory response developed following the emergence of endothermic vertebrates, and this is reinforced by the fact that teleosts, amphibians and reptiles appear only to express TNF-α (Rollins-Smith & Woodhams, 2012; Roca et al., 2008). However, it is also possible that such an inflammatory response is not beneficial to cartilaginous fish, since it has not evolved in extant species. It may even lead to adverse energetic consequences and fitness deficits, such as increased oxygen consumption and metabolic rate, dehydration or damage to tissues (Rollins-Smith & Woodhams, 2012). Other ectothermic vertebrates often respond to pathogen infection by raising overall body temperature behaviourally, for instance by moving to warmer areas of water or land, and it is possible that cartilaginous fish respond similarly (Rollins-Smith & Woodhams, 2012). However, other cytokines, such as interleukin-1 (IL-1) and interleukin-6 (IL-6), do act as pyrogens, and are expressed by cartilaginous fish, teleosts, amphibians and reptiles, therefore it is possible that these cytokines, rather than TNFSF members, perform inflammatory functions in these vertebrates (Zimmerman, Bowden and Vogel, 2014). Lastly, the apparent deficiency of pyrogenic TNFSF members in S. canicula may provide one explanation for the slow adaptive immune response of cartilaginous fish relative to mammals (Dooley and Flajnik, 2005). BALM sequences were only found in cartilaginous and bony fish, suggesting that this TNFSF member may have been lost following the divergence of other vertebrate classes, or that it is a cytokine specific to cartilaginous fish and teleosts. However, the fact that it has not yet been
  • 31. 30 found in other vertebrates, particularly in other ectotherms, may also be due to a lack of research into the TNFSF in non-mammalian species, with much work remaining to be done in this area. BALM appeared to derive from a common ancestor from which BAFF, APRIL and TWEAK also derive, but appeared most similar to BAFF and APRIL. Glenney and Wiens (2007) hypothesised that BALM may either have been an ancestral gene of BAFF and APRIL, or that it developed during gene duplication in teleosts; its presence in cartilaginous fish appears to support the former hypothesis. In mammals, BAFF and APRIL are primarily involved in B cell development, proliferation and survival, as well as in immunoglobulin production and class switching; BAFF also promotes plasma cell differentiation and stimulates activated T cells, while APRIL also regulates the growth of tumour cells (Roosnek et al., 2009; Mackay et al., 2003). Although TWEAK appeared to share an ancient common ancestor with BAFF, BALM and APRIL, its functions in mammals differ, since it appears to promote endothelial cell migration and angiogenesis, as well as inducing apoptosis (Vince and Silk, 2006). The apparent presence of BAFF, BALM and APRIL sequences in S. canicula suggest that B cell and immunoglobulin regulation are important immunological requirements for these animals, and also that B cell developmental requirements might be different in terms of TNFSF ligands. This is reinforced by recent immunisation studies (Pettinello and Dooley, 2014; Crouch et al., 2013). In mammals, RANKL genes are located on the same chromosome as BAFF, suggesting that they may have evolved following gene duplication. The fact that these TNFSF members perform very different functions (RANKL triggers dendritic cells to stimulate naïve T cells) (Walsh and Choi, 2014), and the fact that few conserved domains are shared between similar sequences for S. canicula and other vertebrates, would point to a duplication event during a very early stage in vertebrate evolution.
  • 32. 31 TL1A and CD30L are both found on human chromosome 9 (Table 2). Although S. canicula TL1A did not appear to form a clade with TL1A sequences from other vertebrates, it did appear to share some conserved domains with these. S. canicula CD30L did not appear to form a clade, nor share conserved domains with CD30L sequences from other vertebrates, although this remains to be confirmed. The apparent presence in S. canicula of TL1A, with less evidence of the presence of CD30L, suggests that CD30L may have evolved in mammals from a duplication and/or rearrangement of TL1A. This is supported by the fact that both are involved in apoptosis, although TL1A is expressed by endothelial cells and CD30L by B and T cells. It is also possible that endothelial cell expression of TL1A in cartilaginous fish also represents an ancestral mode of expression, and that subsequent evolution of CD30L accompanied the refinement of the adaptive immune system in mammals, leading to an alteration in cell expression. In mammals, CD30L also triggers Ig class switching, and may have developed this function subsequent to the emergence of this TNFSF member; this is reinforced by the fact that Ig class switching does not occur in sharks due to the organisation of their IG genes in clusters rather than in the translocon configuration found in mammals (Pettinello and Dooley, 2014). In humans, LIGHT is located on chromosome 19, along with CD27L and 4-1BBL. S. canicula LIGHT sequences demonstrated strong evidence of a close relationship with LIGHT sequences in other vertebrates, suggesting that it may have emerged early in vertebrate development and continued to play an important role in the vertebrate adaptive immune response. This TNFSF member is generated by T cells, which it also stimulates, and may also induce cell apoptosis, according to receptor expression. It is also responsible for promoting BAFF production, which may be one reason why it has been maintained during vertebrate
  • 33. 32 evolution. Neither CD27L nor 4-1BBL sequences were identified in S. canicula, suggesting that these genes emerged more recently, possibly through duplication and rearrangement of LIGHT. This is reinforced by the fact that both CD27L and 4-1BBL perform similar functions to LIGHT through activating and causing proliferation of T cells (Kober et al., 2010). S. canicula sequences for CD40L and TRAIL did appear to be present. CD40L was also characterised in a recent study of S. canicula (Li et al., 2015), but TRAIL requires further investigation, since this did not demonstrate phylogenetic clustering nor conserved domains with similar sequences from other vertebrates. In humans, these TNFSF members are located on chromosomes X and 3, respectively, and perform differing functions, with CD40L modulating Ig class switching, memory B cell development and the formation of germinal centres, and TRAIL inducing tumour cell apoptosis. Recently identified teleost TNFSF member sequences formed a useful basis from which to initiate a study of the TNFSF in cartilaginous fish. Teleosts are more closely related in terms of evolutionary time to cartilaginous fish, and the TNFSF genes of these two vertebrate classes might therefore be expected to exhibit close evolutionary relationships. However, this was not the case, due to the fact that teleosts underwent an additional one (two, in the case of salmonids and cyprinids) WGD events, during which they appear to have developed many variations in their TNFSF genes (Macqueen and Johnston, 2014; Glenney and Wiens, 2007; Volff, 2005). A more accurate impression of the evolutionary history and development of the TNFSF was therefore gained from a comparison of the TNFSF genes in cartilaginous fish and other, non-teleost, vertebrates, since these animals had undergone the same number of WGD events. The evolutionary development of the TNFSF among these species would therefore be
  • 34. 33 due solely to individual gene duplication and/or reorganisation as these classes diverged over evolutionary time. Additional phylogenetic information was provided by a comparison of S. canicula TNFSF genes with those of another cartilaginous fish, C. milii, since these 2 species are both in the class Chondrichthyes and exhibit many physiological and genetic similarities (Heinicke et al., 2009). However, C. milii is in the subclass Holocephali, and S. canicula in the class Elasmobranchii, 2 classes that are believed to have diverged approximately 410mya (Heinicke et al., 2009); both may therefore have had the potential to develop many genetic variations. Any TNFSF gene similarities that have been maintained in these species would therefore point towards these genes being of increased importance to these animals. The presence of similar conserved genes in other vertebrates would also reinforce their importance, and the development of novel TNFSF genes in other vertebrates would allow further work to be carried out to gain a greater understanding of the potential fitness benefits of these novel genes in more recently evolved vertebrate classes. Characterisation of the TNFSF in cartilaginous fish may therefore clarify our understanding of the early nature and function of the TNFSF, and comparisons to the TNFSF in non-teleost vertebrates may provide insights into how these have been maintained, lost or changed over evolutionary time. It is acknowledged that bioinformatic searches for TNFSF member sequences of all species relied upon the quality of the data maintained in the various databases. Mis-annotation of genes may have affected data accuracy; at least one sequence, Chiloscyllium plagiosum BAFF, has recently been re-annotated as BALM (Li et al., 2015), as indicated on Figure 2. Similarly, while BAFF sequences for representative mammals, birds, reptiles, amphibians and cartilaginous fish (including S. canicula) clustered relatively closely on the phylogenetic tree, two sequences in this grouping were annotated as APRIL, suggesting that they may be mis-
  • 35. 34 annotated and require further investigation. Other sequences for APRIL appeared to form two separate phylogenetic groups. Since one group contained mammalian and reptilian representatives, while the other contained cartilaginous fish, coelacanth, teleost and amphibian representatives, it is possible that this group separation was due to sequence changes accumulated since evolutionary divergence, and this might suggest different functional roles. However, sequence mis-annotation may also apply here. Lastly, since relatively few studies have considered the TNFSF in vertebrates other than mammals, there was a corresponding lack of sequence information available, which limited our ability to produce a more comprehensive representation of the evolutionary history of the TNFSF among different vertebrates. The use of a secondary lymphoid tissue (spleen) cDNA as a template (Criscitiello, 2014; Rumfelt et al., 2002) may have influenced these results, and different tissues were used as a template during the later stages of the study as TNFSF expression may have been greater in these tissues than in the spleen. The epigonal is a primary immune organ in cartilaginous fish and may therefore express TNFSF genes at higher levels than the spleen. This appeared to be the case following positive amplification of the 3’ ends of both LIGHT and RANKL following RACE-PCR. These results suggest that any investigation of immune gene expression in S. canicula and other animals should use a variety of different tissues as cDNA templates. Failure to do this may provide an incomplete representation of immune gene expression in these species. The immune status of the animals whose tissues are used is also important: this study used tissue samples from an unimmunised animal which therefore may not have been expressing TNFSF genes at sufficiently high levels to detect. Since TNFSF members are part of the cytokine family and are involved in the immune system’s signalling process, it is possible
  • 36. 35 that an immunological challenge would lead to increased cytokine signalling and hence to higher levels of TNFSF gene expression in the S. canicula spleen. Further studies should therefore be carried out on immunised animals, to determine whether this may be the case. Time limitations precluded the amplification, cloning and sequencing of all putative TNFSF members identified, and further work is required in order to synthesise a comprehensive picture, not only of precisely which TNFSF members exist in cartilaginous fish, but also of their development over evolutionary time in other vertebrates. 5. Conclusion Recent advances in genome and transcriptome sequencing are now providing a wealth of information that was previously unavailable, enabling investigation of the evolutionary development of the TNFSF in all vertebrate classes. The results of this study support the hypothesis that TNFSF genes in humans and other vertebrates evolved from a smaller number of similar genes in cartilaginous fish and also that, with the exception of certain pyrogenic capabilities, the TNFSF of cartilaginous fish fulfils similar functions to that of more “advanced” vertebrates. Further investigation of the TNFSF in different vertebrate classes will increase our understanding of the evolution of the TNFSF and may reveal novel roles for TNFSF genes which may be utilised in the future development of therapeutics for both humans and other vertebrates. Acknowledgements I would like to thank my supervisor, Dr Helen Dooley, for her unstinting advice and support throughout this project, and for making the process both instructive and enjoyable. I would
  • 37. 36 also like to thank her team for all their help, in particular Anthony Redmond for his patience and help with all things bioinformatic, and Rita Pettinello and Dr Kimberley Mackenzie for their valuable assistance in the lab.
  • 38. 37 References Adipogen, 2013. TNF Superfamily: linking autoimmune diseases, inflammation and cancer, [online] Available at: <http://www.adipogen.com/media/Catalogs/PDFs/AdipoGen_TNF%20Superfamily_Br ochure_2013.pdf> [Accessed 10 October 2015]. Aggarwal, B.B. 2003. Signalling pathways of the TNF superfamily: a double-edged sword. Nature Reviews Immunology, 3(9), 745-756. Albarbar, B., Dunnill, C. & Georgopoulos, N.T. (in press) Regulation of cell fate by lymphotoxin (LT) receptor signalling: functional differences and similarities of the LT system to other TNF superfamily (TNFSF) members. Cytokine & Growth Factor Reviews. (Accepted for publication May 2015). Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Research, 25(17), 3389-3402. Astrakhantseva, I.V., Efimov, G.A., Drutskaya, M.S., Kruglov, A.A. & Nedospasov, S.A. 2014. Modern anti-cytokine therapy of autoimmune diseases. Biochemistry (Moscow), 79(12), 1308-1321. Biswas, G., Kinoshita, S., Kono, T., Hikima, J-I. & Sakai, M. 2015. Evolutionary evidence of tumor necrosis factor super family members in the Japanese pufferfish (Takifugu rubripes): comprehensive genomic identification and expression analysis. Marine Genomics, 22, 25-36. Borish, L.C & Steinke, J.W. 2003. Cytokines and chemokines. The Journal of Allergy and Clinical Immunology, 111(suppl), S460-S475. Bossen, C., Ingold, K., Tardivel, A., Bodmer, J-L., Gaide, O., Hertig, S., Ambrose, C., Tschopp, J. & Schneider, P. 2006. Interactions of tumor necrosis factor (TNF) and TNF receptor family members in the mouse and human. The Journal of Biological Chemistry, 281(20), 13964–13971. Collette, Y., Gilles, A., Pontarotti, P. & Olive, D. 2003. A co-evolution perspective of the TNFSF and TNFRSF families in the immune system. Trends in Immunology, 24(7), 387- 394. Commins, S.P., Borish, L. & Steinke, J.W. 2010. Immunologic messenger molecules: cytokines, interferons, and chemokines. Journal of Allergy and Clinical Immunology, 125(2), S53- S72. Croft, M., Duan, W., Choi, H., Eun, S-Y., Madireddi, S. & Mehta, A. 2012. TNF superfamily in inflammatory disease: translating basic insights. Trends in Immunology, 33(3), 144-152.
  • 39. 38 Croft, M., Benedict, C.A. & Ware, C.F. 2013. Clinical targeting of the TNF and TNFR superfamilies. Nature Reviews Drug Discovery, 12, 147-168. Crouch, K., Smith, L.E., Williams, R., Cao, W., Lee, M., Jensen, A. & Dooley, H. 2013. Humoral immune response of the small-spotted catshark, Scyliorhinus canicula. Fish and Shellfish Immunology, 34(5), pp. 1158-1169. De Zoysa, M., Jung,S. & Lee, J. 2009. First molluscan TNF-α homologue of the TNF superfamily in disk abalone: molecular characterization and expression analysis. Fish and Shellfish Immunology, 26(4), 625-631. Dinarello, C.A. 2000. Proinflammatory cytokines. Chest Journal, 118(2), 503-508. Glenney, G.W. & Wiens, G.D. 2007. Early diversification of the TNF superfamily in teleosts: genomic characterisation and expression analysis. The Journal of Immunology, 178(2), 7955-7973. Gray, K.A., Yates, B., Seal, R.L. Wright, M.W. & Bruford, E.A. Genenames.org: the HGNC resources in 2015. Nucleic Acids Research, 43(D1), D1079-D1085. Grewal, I.S. 2009. Overview of TNF superfamily: a chest full of potential therapeutic targets. In: I.S. Grewal, ed. 2009. Therapeutic Targets of the TNF Superfamily (Vol. 647). New York: Springer Science & Business Media. Hall, T.A. 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series, 41, 95–98. Hehlgans, T. & Pfeffer, K. 2005. The intriguing biology of the tumour necrosis factor/tumour necrosis factor receptor superfamily: players, rules and the games. Immunology, 15(1), 1-20. Heinicke, M.P., Naylor, G.J.P. & Hedges, S.B. 2009. Cartilaginous fishes (Chondrichthyes). In: Eds S.B. Hedges & S. Kumar, eds, 2009. The Timetree of Life. Oxford: Oxford University Press. Infante, R.E. Wang, M.L., Radhakrishnan, A., Kwon, H.J., Brown, M.S. & Goldstein, J.L. 2008. NPC2 facilitates bidirectional transfer of cholesterol between NPC1 and lipid bilayers, a step in cholesterol egress from lysosomes. Proceedings of the National Academy of Sciences, 105(40), 15287-15292. Jones, M. & Blaxter, M. 2013. afterParty: turning raw transcriptomes into permanent resources. BMC Bioinformatics, 14(1), 301. Katoh, K., Misawa, K., Kuma, K. & Miyata, T. 2002. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic acids research, 30(14), 3059–3066.
  • 40. 39 Kibbe, W.A. 2007. OligoCalc: an online oligonucleotide properties calculator, Nucleic Acids Research, 35(Suppl 2), W43-W46. Li, R., Redmond, A.K., Wang, T., Bird, S., Dooley, H. & Secombes, C.J. 2015. Characterisation of the TNF superfamily members CD40L and BAFF in the small-spotted catshark (Syliorhinus canicula). Fish and Shellfish Immunology, 47(1), 381-389. Mackay, F., Schneider, P., Rennert, P. & Browning, J. 2003. BAFF and APRIL: a tutorial on B cell survival. Annual Review of Immunology, 21(1), 231-264. Macqueen, D.J. & Johnston, I.A. 2014. A well-constrained estimate for the timing of the salmonid whole genome duplication reveals major decoupling from species diversification. Proceedings of the Royal Society of London B: Biological Sciences, 281(1778), 20132881. Minh, B.Q., Nguyen, M.A.T., & von Haeseler, A. 2013. Ultrafast approximation for phylogenetic bootstrap. Molecular Biology and Evolution, 30(1), 1188-1195. Morariu, V.I., Srinivasan, B.V., Raykar, V.C., Duraiswami, R. & Davis, L.S. 2009. Automatic online tuning for fast Gaussian summation. Advances in Neural Information Processing Systems, 1113-1120. Nguyen, L.T., Schmidt, H.A., von Haeseler, A. & Minh, B.Q. 2015. IQ-tree: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution, 32(1), 268-274. Olsvik, P.A., Lie, K.K., Jordal, A.E.O., Nilsen, T.O. & Hordvik, I. 2005. Evaluation of potential reference genes in real-time RT-PCR studies of Atlantic salmon. BMC Molecular Biology, 6(1), 21. Opal, S.M. & DePalo, V.A. 2000. Anti-inflammatory cytokines. Chest Journal, 117(4), 1162- 1172. Parra, G., Bradnam, K. & Korf, I. 2007. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics, 23(9), 1061-1067. Pettinello, R. & Dooley, H. 2014. The immunoglobulins of cold-blooded vertebrates. Biomolecules, 4(4), pp. 1045-1069. Robert, J. 2015. Textbook of Cell Signalling in Cancer: An Educational Approach. Paris: Springer. Roca, F.J., Mulero, I., López-Mũnoz, A. Sepulcre, M.P., Renshaw, S.A. Meseguer, J. & Mulero, V. 2008. Evolution of the inflammatory response in vertebrates: fish TNF-α is a powerful activator of endothelial cells but hardly activates phagocytes. The Journal of Immunology, 181(7), 5071-5081.
  • 41. 40 Roosnek, E., Burjanadze, M., Dietrich, P.Y., Matthes, T., Passweg, J. & Huard, B. 2009. Tumors that look for their springtime in APRIL. Critical Reviews in Oncology/Hematology, 72(2), 91-97. Sambrook, J.S. & Russel, D.W.R. 2001. Molecular cloning: a laboratory manual. New York: Cold Spring Harbor Press. Schneider, K., Potter, K.G. & Ware, C.F. 2004. Lymphotoxin and LIGHT signalling pathways and target genes. Immunological Reviews, 202(1), 49-66. Siakavellas, S.I., Sfikakis, P.P. & Bamias, G. 2015. The TL1A/DR3/DcR3 pathway in autoimmune rheumatic diseases. Seminars in Arthritis and Rheumatism, 45(1), 1-8. Sedger, L.M. & McDermott, M.F. 2014. TNF and TNF-receptors: from mediators of cell death and inflammation to therapeutic giants – past, present and future. Cytokine and Growth Factor Reviews, 25(4), 453-472. Tansey, M.G. & Szymkowski, D.E. 2009. The TNF superfamily in 2009: new pathways, new indications, and new drugs. Drug Discovery Today, 14(23), 1082-1088. Testar, J., 2015. Cytokines: Introduction. [online] Available at: <http://bitesized.immunology.org/receptors-and-molecules/cytokines> [Accessed 27 September 2015]. Venkatesh, B., Lee, A.P., Ravi, V., Maurya, A.K., Lian, M.M., Swann, J.B., Ohta, Y., Flajnik, M.F., Sutoh, Y., Kasahara, M., Hoon, S., Gangu, V., Roy, S.W., Irimia, M., Korzh, V., Kondrychyn, I., Lim, Z.W., Tay, B-H., Tohari, S., Kong, K.W., Ho, S., Lorente-Galdos, B., Quilez, J., Marques-Bonet, T., Raney, B.J., Ingham, P.W., Tay, A., Hillier, L.W., Minx, P., Boehm, T., Wilson, R.K., Brenner, S. & Warren, W.C. 2014. Elephant shark genome provides unique insights into gnathostome evolution. Nature, 505(7482), 174-179. Vince, J.E. & Silke, J. 2006. TWEAK shall inherit the earth. Cell Death and Differentiation, 13(11), 1842. Vujanovic, N.L. 2011. Role of TNF superfamily ligands in innate immunity. Immunologic Research, 50(2-3), 159-174. Walczak, H. 2015. Death receptor-ligand systems in cancer, cell death, and inflammation. Cold Spring Harbor Perspectives in Biology, 5(5), a008698. Walsh, M.C. & Choi, Y. 2014. Biology of the RANKL-RANK-OPG system in immunity, bone, and beyond. Frontiers in Immunology, 5, 1-11. Wang, J. & Yang-Xin, F. 2005. Tumor necrosis factor family members and inflammatory bowel disease. Immunological Reviews, 204(1), 144-155.
  • 42. 41 Wang, Q., Arighi, C.N., King, B.L., Polson, S.W., Vincent, J., Chen, C., Huang, H., Kingham, B.F., Page, S.T., Rendino, M.F., Thomas, W.K., Udwary, D.W., Wu, C.H. & The North East Bioinformatics Collaborative Curation Team. 2012. Community annotation and bioinformatics workforce development in concert – Little Skate Genome Annotation Workshops and Jamborees. Database, 2012: bar064. Wiens, G.D. & Glenney, G.W. 2011. Origin and evolution of TNF and TNF receptor superfamilies. Developmental and Comparative Immunology, 35(12), 1324-1335. Wyffels, J., King, B.L., Vincent, J., Chen, C.., Wu, C.H., & Polson, S.W. 2014. SkateBase, an elasmobranch genome project and collection of molecular resources for chondrichthyan fishes. F1000Research, 3.
  • 43. 42 Appendix 1. S. canicula TNFSF amino acid sequences demonstrating possible orthology with those of other vertebrates >lcl|comp149704_c0_seq1 CD40L/TNF10 TTVEEEGKKQRKENSSYRKCELIERLTLLLASNSKIDFLDSSCNQGKTPLVHLIQEGKVLRKKGRGVVCWYTSLHIQGR DIFSMQEVWALKKCAVLIFPERCEMTRDMLRPGNCTVIEETVLTVSALVDNMNETSNKLTEPRKVCEHQPSVSVRV LIYLITALLLGHMVTSALLYVYFSMKLDKVQEIVNADPEYTFLRSIKRCENKVEKSNIFCDGTRSELQHLIQKYLTNNTR EQLHGAGVPTQEKQIGNHHTTITNSVDNPRRTAAHLIATNKAENIPDSAFPRKKGQPIKQWMAQGFPAFTHNINY TSGKLVITQPGFYYVYCQVSFRLTNNVTDDTSSVPFLQYIYLQRIEEQTTLLMKASKTPVDKKQTASFNSVNQGGVFE LKTGDKLFVTVTQADLLSYDRATYFGIFHLFLCSVMYGLLRNRFCSQLSLPTLNCPPPP >lcl|comp167134_c2_seq5 TNF15/14/A LRPRGFLLAGLSVYLLLQHVPAQKSEAQIGPEMIRLPVDTAEKPRAHLTAKVTSSDKKDGEYFALQWEHKNGLAFTK GELNYQDRALTIPKGDYFVYSQVSFRGYASDMHENNSLIIHMITKLTPRYPKPMQLLSSTRSVNGEKSLWRMAIYLG AVFHLEQGDRIIVNVSSVKQVDFTN DHKTYSNDH >lcl|comp174543_c0_seq4 TNF13B/A SEFLKKIHHHREKPEEEKTKFRGRRSLPEQVQQSFVQLIATHDQRSFVKVNDCLKYMTGRSQPTCRVKRRPVSNQSS KQETTVPWILSLKKGNALDKKDNKILVNEAGFFLVYSQVWYKDNTFTMGHFIKRIKASSVGSELRTVILFRCIQNMSA CCPNDTCFTAGIAKLEVGDELELVIPRTQAQIALKGDGTFFWGNEAAVTVMAKAVEAKFLPFKEKRRLRKKSCQYRL VSWLLMIAMWMENFSENQIKFVYLCANKFKGKKKEKT >lcl|comp175589_c0_seq2 TNF13B/13/EDA/A AITAAGSHLKIASSVMTSSVVKELYQSDLDKQNLFNIINDATQMLDITRIKEQMYLQRADWDYGYSLLGCFMLSVTA CDFTAANKCLRSELAKVHKQLDVLAREETCGSSGTKRNGKSVKSKKNNNVKFQIKEQKTGVNEKNYDLRSISRKKRD VRPCEKQRSFLQLIPNAKQSIILQDDSTFIPWIVASRRGEALEATSNKIVVKENGYYTVYSQVLYYDNCMAMGHVVQ RRKANVVGDERRIMTLFRCIQNMPNNTASNTCFTAGIVSLTKRMSWNSCYHPDPMLTSPWTKIQHFLVPYNYYEA TMEDDDCPRVGLFRQRHSSPNTSPLLSKISVAIEGISIVNEGGFTCLHRLRFGTKICSSHPLAKLLHNQFCFGCNLMQL EALLSHMQVFCFHCLKNKLLLAF >lcl|comp175589_c0_seq1 TNF13B/13/EDA AITAAGSHLKIASSVMTSSVVKELYQSDLDKQNLFNIINDATQMLDITRIKEQMYLQRADWDYGYSLLGCFMLSVTA CDFTAANKCLRSELAKVHKQLDVLAREETCGSSGTKRNGKSVKSKKNNNVKFQIKEQKTGVNEKNYDLRSISRKKRD VRPCENSDHSSSFLMPSKALFCKVLYYDNCMAMGHVVQRRKANVVGDERRIMTLFRCIQNMPNNTASNTCFTAG IVSLTKRMSWNSCYHPTQCHLHGQRFNIFWCHTITMKQLCERMMTAQEWDYFARDTVHLIRLPCSPRLVLQSKDI SIVNEGGFTCLHRLRFGTKICSSHPLAKLLHNQFCFGCNLMQLEALLSHMQVFCFHCLKNKLLLAFINLKKKRK >lcl|comp175589_c0_seq3 TNF13B/13/A FSFFFVYESKQFIFQTVKTEDLSHVRQKGLKLHVASKAKLIVKLSKWVGTADLSPKPQPVKTSGKSTLLINNADVLRLQ HSWRAGETYVNCVSGEIIPLLGSHHPLTLLHSNCMAPKNVESLSMEMLALGRDGNRSSNSSSWSSFTIPDCILCNSH FLSQLFYWKLPPMPHPSLKPQSKERLNHLAECFAWHESAGGMIAAFTRSYIP >lcl|comp175589_c0_seq4 TNF13B RDVRPCESSDHSSSFLMPSKALFCKELSLTKRMSWNSCYHPTQCHLHGQRFNIFWCHTITMKQLCERMMTAQEW DYFARDTVHLIRLPCSPRLVLQSKDISIVNEGGFTCLHRLRFGTKICSSHPLAKLLHNQFCFGCNLMQLEALLSHMQV FCFHCLKNKLLLAFINLKKKRK >lcl|comp167134_c2_seq3 TNF15/14/A LRPRGFLLAGLSVYLLLQHVPAQKSEAQIGPEMIRLPVDTAEKPRAHLTAKVTSSDKKDGEYFALQWEHKNGLAFTK GELNYQDRALTIPRKGDYFVYSQVSFRGYASDMHENNSLIIHMITKLTPRYPKPMQLLSSTRSVNGEKSLWRMAIYL GAVFHLEQGDRIIVNVSSVKQVDFTNDHKTYFGAFLLWLSYIESATFINGSIDSWRATKDTLLEVLSISSNDVWVATH
  • 44. 43 HKFLQQLFSVDGLSFKLCYMGTKSTCELGIQRVAHVRWSFPTLQFELNGPFKVRDQKIQHWDAGVAALKMGRLEN LTARKWECGCKTVYLYLSMGCGHRQLGLHLLFIPNDTCLEGIETSTLLSVWSHMA >lcl|comp175049_c0_seq1 TNF13/13B/12 RCSTPIFGLQWSESGSNHKDDEVRVRVGEEKPEDGSKWFEEFLGSKIEMEDPGKGPQKSQRTLSYRRRTPEERCSG ELHVQSNLHQSHERVPIHDNVSRITRNHHFYDVSFLKLVDTCTVTSIERLISRHILRASEQCLSLILPPLFPSLYHVTHDK RPVLIQDVMRPFPRMAIVLLFCSKDSPVCKIFCQITVLSPCDLEDGTGGLMSFFFSVCACACPRSLFPRGVLAALVPR CVRGLHVCSPETEVRATCSLGASCYPTRARPKGHRVVHLHFQLQVGRGVVVAVNLAVYVVSVFENDELPCLIVRQCI FHLRLFQPLSDDPIIFNYIRPTFIIGHFEMSCKRLFVLCCRSFSPLSPYPRQPLALQQPLQELNLTPGLPGSRGRSGRML SSNFGVSPLSESRAESGNW >lcl|comp354964_c0_seq1 TNFA/B PSTATKWPSCHRVIRRKSCSKPPSPPVTMATMEIPGTRLPTREPYSSSRRETRSSPGLTRMWWSMSTQLQERVSLE YLLCKDSESLPVISDDGLRISIGIQQDCWTYCPIRTNSIPSD >lcl|comp173264_c1_seq1 TNF10/CD40L/TNF11/14/15 HYGRGHFPACPTQRALLKIHTIGQTKCPHRRRITITRNWAGNPWPPRRENGASCRFWDLLFVEICVTCYFFCYFTDQ IDQMQQKLNSDNYPGRCLQYMNSYIDHLGKGEVPVEDFLCDSWLKNVKFLINQRLQSDVRNLIYKELKAHNATFLG ADKPAIHLIAQQRNKPPKNVPSYFTRVNGHDVLYWNDVKGFAVHQGAMKYHGGEITIPKDGIYYIYSHVYFWHVQ KPDGEDQQYNLFLQYLYKMSMNYYKPLLLTKAIFTKCWSKESKFDVYTSFQGAAFELEQGDKIFIKVTNASAVNVNE GATYFGAFMIFMEWSNLSLHQCNNSEEQLILNLLSKFRNQLWAVIFRPFPQSGYSCPAEGENRDTHEGNSLWGLG SHTMLGTDLVCRVSLPAHTSRIHHNSNAITSWFKKTKIPQSLPGGFSQTRVQPLFQLCQSLALWFLLLNSQFKLIHFV LSLPTIKTLISQLLFNVELSFVNNLDVWLNMLLVQKLLLESLIYFLLIDS >lcl|comp208326_c0_seq1 TNF6/14/15 AGPWACLLCNCTMQQNYLYPQVFMVDGSQNLNPYVPPPAWALPTLKKKRKLDWRTVGTILLLNLVLLALAGLAL GTVYLLELQKEIQEMKQDNGGKGPVAHKIIGAQNFTQPPKDLRIAAHLTGDSITAGSNPLVWDDQKGHAFTSGILY KDRGLVINKAGIFFVYSKIYFRSYVCDRKNLEHIVFKRTDRYRKELILMEAKKTNYCSGNGWATNSYQAGIIHLLKGESI YVNVSDSRLVDFAESKTFFGLYKLFTLLIYILLKSRKNSIKIK >lcl|comp288857_c0_seq1 TNF10/11/6/15 PEELCCGTGRQVQCLIRVAQRGAFFPRGCLPGWCAHSNPQPCSQQPCPTLSPSWPPEAPRPPAHRTAGSRAVHQ RPATGSHHRLHLLHPQPVAAKRCQWDLGGGFAPIVVLQWQQPLVRVGNPESLQVLFAATERTETTSPGGSWCCR KTATQRFAQGIQSHEEEFMADTLRSPHYKKKTTRNSSDLETSFPGLKGKPIISWEEKVGLAHLREMRYSQGSLRIKHR GLYYIYAQILFRYHADHPMVTREYQLIQYVYRKMSKYPKPTLILRGASTWCSGNHNLYSVYQGGVFSLLNGDKIFVTV SNVSLVDVDEASSYLGAFKMDISVTCALFRFSSSTLRILEWVVNNSHRSATQFNHCNRIS >lcl|comp170458_c0_seq2 TNF14/15/6 PLRPRDIKIGTTRNSAACVMCAVCRVPAEDMIFQDSTKPAAHVTGCNCSSSGYDRLQWEHSKGLAFLREIEYKEGAL VIKRAGYYFIYSKILLGEIHCGDERNGFLKQTLFKVGEAYPQPIELMAVRKFFCTGESTDQWLDNSYLGAIFDFSEGDE VFIKITEKKVLRISNGAETFFGAFMVCAYRTFYSLCFCQMSANMSVPALRMLMSKCATGVPNLYCCFLGTIYSGMW AGILLLCRTYRPEGMRGVRGRGLQLLKAASCMLPPCPFHTHCFYKSLFITEKKKKK >lcl|comp 165409_c1_seq1 TNF6/14/B AITAAGLEDWEQLCVHVICPLKELVSLHKAYNAWHCETLLLIMAEVLMETLDISNPLTNAMPPNNKIAQPEKKEKLQ KLFTAIALLLILVTLMLSTWNAVNISKLQLKQVNSRKQRTEMQNQRQALKNAAHLTGMNVINSSTALIWDSVKGLA FVQGVSHSAEELIIIEAGFYLIYTKVSFGGQVCQSDMLIDHSVLKRSDRYPVAIEIMKTRNSNNCPGNNGQWSRDSF QSGIFQLRKGDRIYVTVSNPAFVNFRQVSTFFGLHKLLICIFNACFHLLWVNDQALSLFIFDIRELYREKKLNDILSLVKP VIETIEYILRSIRRKVRTILNKVYNGFNVTNWNLKKKKRK >lcl|comp159900_c0_seq1 TNF15/A/14/10
  • 45. 44 RVAISGRQGLKVKSDLECSSTSYTDSRFIKPFGDSIQKSDSHQRVNSGVFKMLKPTSTDPEDLTCRNSKCTLSRKHSCS SLLLSIFIILQWLAIAAMCSIIFFYINVLKEDIHTEVLAKMKNITISSFQTTGGYTFKSSVAKSEKPIAHFPALTAAADIQNT SGHLLFEHKRGHAFKTSDMKYDEGGYLVIPTSGVYFIYAQVTFICSQDCTKLSQQFFAAILKKNNQYPYPEELLKSYAR PQGDDFLKVSTYQAGAFQLYADDYIYAKVPENLYTHISEGEHETYFGAFLLDPATSEASCQFSYVVRECIHAYPHTST WCFFGAICLVQQTVPNEDVQEIYATIAQYLFLSYTPMSDLSYHFVNCNAAIISCIGLGSHIIKIISIHVKKEK >lcl|comp159900_c0_seq2 TNF15/A/14 LQHLFENNLLKYQDFFEALDVPFIRLVSRDCANVQCQNIYFSGFQTTGGYTFKSSVAKSEKPIAHFPALTAAADIQNT SGHLLFEHKRGHAFKTSDMKYDEGGYLVIPTSGVYFIYAQVTFICSQDCTKLSQQFFAAILKKNNQYPYPEELLKSYAR PQGDDFLKVSTYQAGAFQLYADDYIYAKVPENLYTHISEGEHETYFGAFLLDPATSEASCQFSYVVRECIHAYPHTST WCFFGAICLVQQTVPNEDVQEIYATIAQYLFLSYTPMSDLSYHFVNCNAAIISCIGLGSHIIKIISIHVKKEK >lcl|comp170458_c0_seq1 TNF14/6/15 TAAGEMMLPGDRHRSTMCVPLCFIPTSPESSFLYFFALRTPKMGDNYVYPSIFVVDGQSDTVPFATSTKRKKMTLA HKFLLMLVLLALTGDGLGAFYLWRLQSDVQEFSSMFLAAKNTSLIVRSIQDMIFQDSTKPAAHVTGCNCSSSGYDRL QWEHSKGLAFLREIEYKEGALVIKRAGYYFIYSKILLGEIHCGDERNGFLKQTLFKVGEAYPQPIELMAVRKFFCTGEST DQWLDNSYLGAIFDFSEGDEVFIKITEKKFESPTVQKHFWCFHGLMCLSDVLFIVFLSVNVCHVVGSCFADVNVMC HRSVITLILLLLTWNHLLRNVGWNLAAVPNVSTGGYEGGAGQRIAAIVKSCQLYVATIMPISYSLFLIFIYYKKKKE >lcl|comp147674_c0_seq1 TNF10/14/6/A LRPRGCCSTKNSAPQLKTFALNLQRASFRMGTSSNSITAQRLGFALSSVLLLQCVCMTLTFLYFHSELKQVQESFSKN NMACLTEDILHLHAGPKGDEEFYVDPTGEKTERNEDLCWKFRAQIQQLIEKTISNQYEQDMSVIVKGEVSRLLPSLT QSESGARLTVQKIAAHLTGNYKTRSTPLAGPASRKLQGRKSAPGSPREGWRSCTVWNTKMESSSRKLDSITSTARSY FRYWEAA >lcl|comp349734_c0_seq1 TNFA/15 SIAPQPEPEPLKPVTVFQGLKQVANSSSKLEWKDGSHRIIEKGNYFVYVQVTFKGECSGEKLFLKTTLCKKQVKIKEKE LLVTSLSVCESQNHMSWTRTLSQAAIFYFEKGDNIHVNTTTMEYVDFNHSYNNIFGIYRFVTLGNTSVKIFYSLVSVLF IVVFPFAIYRSLNDSIPFSYLSCCLPPYLSIYTSMKFLLSFPIMSLFFYVPLFENFTAAKYFIYAEKQKAMIHRCAIDFSVM HATLR >lcl|comp170479_c0_seq1 TNF4/A ELPETPQFFISSMPWIFFHVHDASLSYQRDCISIIVPGTYVPVSGLQLGFMTFLRGKKYSHPKGWYLKPDDTLVSRVSL HAKMIIHNFHIINNTDQHATNINFYYLFLGSFMFVIHAEADGISEKKVKLRSKDSAWVKNNAIVVPCDGLYLIYAHVS SNRTPLMEIKKKEDSDSDADTDSWLLPKTKAYSPLITSMLRVNNTIRMLLDLTNDELKDDNSYLGIALLQIHQDINSC QQIIQKSCKYEKHRDGLCLSTPTSGKYIGQLEWEKRGQNVHCLRGAIFNGLELNLLHIGWCGEKRRCVWGGKRML HRGNWLLNCIFHYRIEDRGPNRNLAQIKKLNLLLKRVLEIFNHITILFSGEWGLEPDVCPKMGISSWCGRETSNQAW TLDRVMLDGLSGLFFCSRFSWCPGLSLSLDQHHRASDLVVISLLAGSRWAVPTLQELHFEKYLIGRKALWDVQTLITL LKSEVLMFCLLHVASLPHAVRWYFRMSLLRPAHSRLHIHSMGGGGRNVRFRVNIKFSFFLMYMFTTCSKIKYMMP GILPQMRDWHHSESDQIGSVCSLTWFIAHTFLNC TIECLAQFLKVKQVHLLSDQNFITEGDCAQFSKSCLCVEGGTSSFKKGIQPKIIFINILGLQYSRGGQTAGKEALECRLF DLRIVFIYCMCSYLMGKKSFTRKLMFYCNLYFYMELCFVLFGNVHKKYK >lcl|comp156182_c0_seq1 TNF6/14/A/B KNAAHLTGYGLTENSKMLTWKGTTGTAFMRGVDYQDEGLVVHAPGLFFIYSRILFRDVNSVNDRFQEQNVFKKTK QYHRDISLMRIIKHNYCAMNINAWSESTGIFMLSEGDYIYVSVSYPDFVGDEHSTYFELYWLFALPVFCILAHNSPAY SYRRAYRNFYLVGMSKAGVLHEVIYLIFQFLLLLLRSSFVLIGQHGSIVV >lcl|comp156182_c0_seq2 TNF6/A GPCLCPRRYGLTENSKMLTWKGTTGTAFMRGVDYQDEGLVVHAPGLFFIYSRILFRDVNSVNDRFQEQNVFKKTK QYHRDISLMRIIKHNYCAMNINAWSESTGIFMLSEGDYIYVSVSYPDFVGDEHSTYFELYWLFALPVFCILAHNSPAY SYRRAYRNFYLVGMSKAGVLHEVIYLIFQFLLLLLRSSFVLIGQHGSIVV
  • 46. 45 >lcl|comp177788_c0_seq1 TNF6/14/15 HYGRGTVNNLQQTSGRARGSFNRKWTILCPYRNWSVMQRRNLYSLQMLDGENPPSCHPTTAWPVPLQELRIEKK LGWLKICHFVIVSMLFIALAGLMFGTYYLHQLQQEVEGMKLKIDGTPQNRREKQIGSEERTDKPKSEVKMAAHLTG KTTTDHPEDSGRFLWVMHFYKGFSTRMERSSMKLDSTLFTQKSISVEWNVRQSRCNKLSSAMSITYTTVQSTSTTA VAQEHGGKAPPGRNIPSEGVQLFVSVSHPRMVSTDELMTFFGAYKLIRPVWTMKLQITMLRDNCDTSQFKRFRCV HLLIVFLTRFNDHYKQKSISSLRSICGLRRMDLSICILAIKIYTSGEYIVHLHFPHFFLIFRTVEVQFNGHEPSKVSNIVIIFL LVTVCIIKLFSPEVGGIIFEPCSLLYPSHLSSVTFNIYAQKRINDSWTFSTANVTAAFWRRDFQIFGIFCTFRFYRTVQFC GYSPFFWSVPSEDIVSLYQPVKSSVSLLYPLNTPGNASLIAFPCWYHSGEYALHPLHCQYILPEWCPELNAVYKVRSN EACNNVSSIHLTNLLIKTNIPLTFIIQFCICPLPFSDSTWTPKSLYSSVVLNFLYRVRKCFRKKVNWLGDTKVDNLCATM CHSVVKNVDSKAMQVTPCDCPIVLQPQGPYFLWSVLFSDFYSTLLTALRNDYTIDLENTDPERVVSFTWLKLNDYLL ASKDMIGSLASISAIDFYDLEVGNFTNSKPVKDSMICKSNNKNPSLLKCLTFRFILKDYVILLLSYLSGLCWTISTFPRQG GNQPPNISQLPKGVLYHGNRAFFCLPAWDSRHSLTDSCVNKEPGMALHQCATPLYCLWTDRTIRNCCHPGESWEL PHITFLLNVSHGDNASSTQIITPQLNAEMGLGLISILSFQGGRLNRFSSLHMSVKLHYRNGFIFCLCSLLYCRKQYSSNL LHVLRSLCNRKKCICNFKIKLLQKKKEKT >lcl|comp165144_c0_seq4 TNF15 HYGRGRQWKAAKKLGLLSFTDTGLKKNGRRFCRRVGSVVNRTTLRQFASGGMARDICVYKVGAPHPLQSSCTPAL LNIQVSLQIPGEMPMEVTAPLRELEAPSGNRRKCPRALFYLSLIMMVTMAQVLAASGLSFYFLRKQQQPVATPNRE KCITHWAAASKQRQGENQALAWETTMGDVRTGNCTWYNENKNSLEIQSPGDYFIYIQVMLLDTWKNVTFSVDE RKTYHTRNSTGSSNTLYFGTTCKVRAGAAVSVILNPGHVLNGETETFLILFKLTISKLVARPRANVQFSYYCIVPTKVTF SSLCITDNTENWLYSLTKIVVISGSLQSSLFVTVNIFCKKLRIKYFKDNAKKKKRK >lcl|comp165144_c0_seq1 TNF15 FSFFFFSVIFKIFNSKLLFTENIYSYKRDDRLPLITTIFVSESQFSVLSVIHYNEEKVTLVGTIQLNWTFARGLATSCWSITT ESEMFQSRHSGHDQGVPRRLPLLALCKWFQSIRCWNSLWSFGCDKSSAHRRMSHSSMCLTTSPGCRSNRQGFVF PVSFYFHCTRCSYRCEHRPSWSPKRVPDSRPVSVWTQLPNELYISLCWVCSPVLPLQNDIVTQPRIHSQTAPPDGRV SAASAIGHICLPARNKKAWPRLAQSRLRQQELRQGVEGKQQANSSRVGFLY >lcl|comp165144_c0_seq3 TNF15/9/CD97 TFYFKVTLTVRYSASPSNPFWLCFSQQTPNREKCITHWAAASKQRQGENQALAWETTMGDVRTGNCTWYNENK NSLEIQSPGDYFIYIQVMLLDTWKNVTFSVDERKTYHTRNSTGSSNTLYFGTTCKVRAGAAVSVILNPGHVLNGETET FLILFKLTISKLVARPRANVQFSYYCIVPTKVTFSSLCITDNTENWLYSLTKIVVISGSLQSSLFVTVNIFCKKLRIKYFKDN AKKKKRK >lcl|comp173450_c0_seq2 TNF10/11/CD40/15/A KGFFFVGYFLAGAQMIVLFTPRDAMAGQYSRSSSSESSVPMIRTAAPANSTLIVGAVFGAVALLLSIVSTAGMFVYF TREISKVKNESVSQELWCLNSPQRFEQLISTETEDLSDQIKKDPCWRLANSAKLLATKIADKTNKKHIIQGVMDNIPV DVLKSAAEFNDKPAAVPLLISLYGSLLHKMCLTITSRAVTLLNGALERGYPTFRTHSIMENCEPVKRADTTFMLRPISDI STRQEMIMMTTLQTRMSSSLCSAYIRQLPMHHQSSSKVWEQNVRQRMLSMVIPSTKEDFLNGWNICHHFHKVS GCGILLWVLTGYVKSFNCRKRSLFRRKSLHKTITIRSVILACFLFKALLNQICRSFDLLTLSADKAVKWNGLTLLMCTVR ESVSFLSSSWYTYFAHVLIIEARTATSLLCDCTMIFNGTDRNKSGQSSSQRHLLFLWEGTAPSGNEDRGIPKVGKREV VFWKLKVSATTGRFWRSGQKEECLIPKLRHPQECPIPKPSPRSLDMSNSKTAKMGKDKLSLQNLSDKCFKRNEGTLY FQGSAIIFSSILFNMRENLACMHYNRTKQVHLNQTQYSLNEGKKKFDAPPLFSAWVMIESTFLVNLSMVDHTSGQL GMGIVPQQGWVCTGLLPAADWAKNPLSNSKNRKNIIIFVSNRYKRSALA >lcl|comp147674_c0_seq4 TNF10 LQKGFNNKKALKNIHFSKIISSPLSPHLIHFSGGTSLTKLLVTCPRTEFDNRLMKPLGTIFNLKGTMQMQVNAVDVPF LSAAGSVLGISSRDPHSFPFSPRWDRRRIPHLPWGQREDAECPPSGRPCCSWRRTPVPVLTRCESTRRSGSCTHTVA GGQSSGQSPISGQSNCCWSPSMTLAVSLRQMFAEERSFLLSSSPAAV >lcl|comp147674_c0_seq2 TNF10
  • 48. 47 Appendix 2. Clustal format alignments (MAFFT) Red, highlighted text indicates putative start and stop codons OX40L Mumu_OX40 ----------------------------------------------------MEGGVQPL Hosa_OX40 ----------------------------------------------------MR--VQPL Chpi_OX40 ----------------------------------MGML-------------EME--AQRL Gaga_OX40 --------------------MVCASALLVALLVEMGQPDTELKKHTDQKDCEKE--MRSD Scca_OX40_proba ELPETPQFFISSMPWIFFHLSYCISIIVVVSL-QLGFMTF------------LRGKYSHL Scca_OX40_proba F-----SFF------------------FVYEIFQTKTEDL-----------HVR------ Scca_OX40_proba F-----SFF-------FFSLLFLITTIFVSS--QFSVLSV----------------IHYN Mumu_OX40 DENLENGSRRFKWKKTLLVVSGCLQLSS--SPAKDPPIQRLRGCEDGQLISYKYQTMEVQ Hosa_OX40 EENVGNAARRFERNKLLLVASVCLHFSALQVSHRYPRIQSIKVYKKEKGITSQDEIMKVQ Chpi_OX40 EEQLELERRQVRRNRLQLVSAV---------------LQWIEVYVPKEKIISWRGTMNVR Gaga_OX40 DEWRGWQKGQAKRNTLYLVSAA---------------LKWTYIQSIAGVMLSAFTSIPVI Scca_OX40_proba DD--TLVSRSLHAKIIHFI------------------INNTDQHAHAEADSEKKVLAWVK Scca_OX40_proba -----------QKLKLHVASKAKL-------------IKWVGTKTSGKSLINNVLRLQHV Scca_OX40_proba EEKVTLLARGLATSCWSITTESE--------------MFQSRHSGHDQGVPRRLPLLALQ . : Mumu_OX40 NNSVVIKCDGLYII---------YLKGSFFQ------------------------EVKI- Hosa_OX40 NNSVIINCDGFYLI---------SLKGYFSQ------------------------EVNI- Chpi_OX40 NLSIPIYCDGFYLV---------SLKGTIRA--------TQRNA-----------SLSL- Gaga_OX40 NGSIMIPCDGLYVV---------SLKGVLSP--------DEKSS----------LKLMM- Scca_OX40_proba NNAIVVPCDGLYLIYA-------HVSSNRTPLMEIKKKEDSKAYSPLITSMLRTIRMLLT Scca_OX40_proba SGEI-IPLLGSHLLHSNCMAPKNSLSMEMLA-LGRSSNSSSSSFTIPDCFLSQLWKLPPM Scca_OX40_proba SILWSFGCD--------------KSSARMSHSMCLTTRSNRQGFVFPVSFYFHCTRCSYR . . . Mumu_OX40 -------------------DLHFREDIS---MLNDGRRIVFTVVASLAFKDKVYLTVNAP Hosa_OX40 -------------------SLHYQKDLF---QLKKVRSVNSLMVASLTYKDKVYLNVTTD Chpi_OX40 -------------------KVYKRSDLC---VKVTEEVVDSVTVSEMRFKDEVYLEANRD Gaga_OX40 -------------------KNTESKNLWERDVQNSSNAVDLITMLYLFAQNNIILSTSSN Scca_OX40_proba NDN----------------SIHQDINSCYI-GQLLELNLGKRMLVNTSNQDRVMLDGLSG Scca_OX40_proba SLKPQSER----------------------------------------------LNHLAE Scca_OX40_proba CEHRPSKRVPSRPVSVWTNELYISLCVC---SPVLPLQNDIVTISVSAASGHICLPARNK * Mumu_OX40 TLLIVVQL Hosa_OX40 TSLILIHQ Chpi_OX40 NYLGLFLL Gaga_OX40 TIFSLVLL Scca_OX40_proba LFFGLFLL Scca_OX40_proba CFWGMF-- Scca_OX40_proba AWLGVFLY :.
  • 49. 48 GITRL Mumu_GITRL -------------EEMPLRESSPQRAERCK-KSWLLCIA-LLLMLLSLGTLIYTSLK-PT Hosa_GITRL -FLFSTALISPKMENMPLSHSRTQGAQRSSWKLWLFC-S-VMLLFLSFSWLIFIFLQLET Chpi_GITRL NFLEN--------GNIPKEA---------------LC-T-LALLILVVPLLIVCSCLHSQ Scca_GITRL_prob -FCRRVG--SVVNRTAPLRELEAPSGNRRKCPRALFYLSLIMMVTMVAASLSFYFLRKQQ Scca_GITRL_prob ------------------------------------YFK-VTLTVRYASPNPFWLCFSQQ : : Mumu_GITRL AI------ESCMVKFLSSS----------KWHMTS---PHC---VNTSDGKLKILQSGTY Hosa_GITRL AK------EPCMAKFPLPS----------KWQMAS---PPC---VNKSDWKLEILQNGLY Chpi_GITRL SP------EICWAHTSPSDDH----GHYIKWNMT-----HCSSFVRNSNQKLEILQSGMY Scca_GITRL_prob QPVATPNREKCITHWAAASKQRQGENQALAWETTMGDVGNCTWYNKNS---LEIQSPGDY Scca_GITRL_prob ----TPNREKCITHWAAASKQRQGENQALAWETTMGDVGNCTWYNKNS---LEIQSPGDY * * .: .. *. : * ..* *:* . * * Mumu_GITRL LIYGQVIPVDKYIKDNAPFVVQIYKKNDVL-QLM-----NDFQILPIGGVYELHAGDNIY Hosa_GITRL LIYGQVAPN--NYNDVAPFEVRLYKNKDMI-QLT-----NKSKIQNVGGTYELHVGDTID Chpi_GITRL FIYAQVT----RIKEESYFTMMLYKDENLLNQMTGTNTGEKTASINFGRPYFLSKGEKLF Scca_GITRL_prob FIYIQVM----LLDTNVTFSV----DE----RKTTRNSTGSSNTLYFGTTCKVRAGAAVS Scca_GITRL_prob FIYIQVM----LLDTNVTFSV----DE----RKTTRNSTGSSNTLYFGTTCKVRAGAAVS :** ** . * : .: : . .* : * : Mumu_GITRL LKFNSKDHIQKTN--TYWGIILM Hosa_GITRL LIFNSEHQVLKNN--TYWGIILL Chpi_GITRL CTINSGGHILKDN-QTYWGLYKM Scca_GITRL_prob VL---NGHVLNGETETFLILFKL Scca_GITRL_prob VL---NGHVLNGETETFLILFKL :: : : *: : :
  • 50. 49 FasL Hosa_FasL ---------MQQP-----------FNYPYPQIYWVDSSASSPWAPPGTVLPCPT------ Mumu_FasL ---------MQQP-----------MNYPCPQIFWVDSSATSSWAPPGSVFPCPS------ Cami_FASL ---------MEQ-------------SYKTPHVYSVDGNPSFS------------------ Scca_FasL_proba ---------MQQ-------------NYLYPQVFMVDGSQNLN------------------ Scca_FasL_proba AITAAGLEDWEQLC----------VHVICPLKELVSLHKAYN------AWHCETLLLIMA Scca_FasL_proba AITAAGLEDWEQLC----------VHVICPLKELVSLHKAYN------AWHCETLLLIMA Scca_FasL_proba ------------------------GPCLCPRRY--------------------------- Scca_FasL_proba HYGRGTVNNLQQTSGRARGSFNRKWTILCPYRNWSVMQRRNLYS---------------- * Hosa_FasL ---------SVPRRPGQRRPPPPPPP-PPLPPPPPPPPLPPLPLPPLKKRGN-HSTGLCL Mumu_FasL ---------CGPRGPDQRRPPPPPPPVSPLPPPSQPLPLP--PLTPLKKK-D-HNTNLWL Cami_FASL --------------------QYPIRPSCPV------------LVPTLKKQNT-DRIKACV Scca_FasL_proba --------------------PYVPPPAWAL--------------PTLKKKRKLDWRTVGT Scca_FasL_proba EVLMETLDISNP-------LTNAMPPNNKI--------------AQPEKKEKLQKLFTAI Scca_FasL_proba EVLMETLDISNP-------LTNAMPPNNKI--------------AQPEKKEKLQKLFTAI Scca_FasL_proba ------------------------------------------------------------ Scca_FasL_proba ---LQMLDGENP-------PSCHPTTAWPV------------PLQELRIEKKLGWLKICH Hosa_FasL LVMFFMVLVALVGLGLGMFQLFHLQKELAELRESTSQMHTASSLEKQIGHPSPP-PEKKE Mumu_FasL PVVFFMVLVALVGMGLGMYQLFHLQKELAELREFTNQSLKVSSFEKQIANPSTP-SEKKE Cami_FASL ILILILALIILVLLACGIFFIFNLRSQIEKITQKQIKEAEGQH-AKFTGHKQPI-KPSEN Scca_FasL_proba ILLLNLVLLALAGLALGTVYLLELQKEIQEMKQDNGGKGPVAH--KIIGAQNFT-QPPKD Scca_FasL_proba ALLLILVTLMLSTWNA---------VNISKLQLKQVNSRKQRT-EMQ--------NQRQA Scca_FasL_proba ALLLILVTLMLSTWNA---------VNISKLQLKQVNSRKQRT-EMQ--------NQRQA Scca_FasL_proba ------------------------------------------------------------ Scca_FasL_proba FVIVSMLFIALAGLMFGTYYLHQLQQEVEGMKLKIDGTPQNRR-EKQIGSEERTDKPKSE Hosa_FasL LRKVAHLTGKS-N---SRSMPLEWEDTYGIVLLSGVKYKKGGLVINETGLYFVYSKVYFR Mumu_FasL PRSVAHLTGNP-H---SRSIPLEWEDTYGTALISGVKYKKGGLVINETGLYFVYSKVYFR Cami_FASL IREAAHLTGKS-----SAANTLLWDDTKDIAFTRGVRYKDGGLIINETGLFFIYSKIYFR Scca_FasL_proba LRIAAHLTGDSIT---AGSNPLVWDDQKGHAFTSGILYKDRGLVINKAGIFFVYSKIYFR Scca_FasL_proba LKNAAHLTGMNVI---NSSTALIWDSVKGLAFVQGVSHSAEELIIIEAGFYLIYTKVSFG Scca_FasL_proba LKNAAHLTGMNVI---NSSTALIWDSVKGLAFVQGVSHSAEELIIIEAGFYLIYTKVSFG Scca_FasL_proba --------GLT-----ENSKMLTWKGTTGTAFMRGVDYQDEGLVVHAPGLFFIYSRILFR Scca_FasL_proba VKMAAHLTGKTTTDHPEDSGRFLW----VMHFYKGFSTRMERSSMKLDSTLFTQKSISVE * : : * : *. : . : . : . Hosa_FasL GQSCNNL-PLSHKVYMRNSKYPQDLVMMEGKMMSYCT-TGQMWARSSYLGAVFNLTSADH Mumu_FasL GQSCNNQ-PLNHKVYMRNSKYPEDLVLMEEKRLNYCT-TGQIWAHSSHLGAVFNLTSADH Cami_FASL NLICVKDLELEQTLFVRTSRYPKDMILMETKKNKYCVVKDTVWYINSYQAGIFKLHNGDH Scca_FasL_proba SYVCDRK-NLEHIVFKRTDRYRKELILMEAKKTNYC--SGNGWATNSYQAGIIHLLKGES Scca_FasL_proba GQVCQSDMLIDHSVLKRSDRYPVAIEIMKTRNSNNCPGNNGQWSRDSFQSGIFQLRKGDR Scca_FasL_proba GQVCQSDMLIDHSVLKRSDRYPVAIEIMKTRNSNNCPGNNGQWSRDSFQSGIFQLRKGDR
  • 51. 50 Scca_FasL_proba DVNSVNDRFQEQNVFKKTKQYHRDISLMRIIKHNYCAMNINAWSEST---GIFMLSEGDY Scca_FasL_proba WNVRQS----------RCNKLSSAMSITYTTVQSTSTTAVAQEHGGKAPPGRNIPSEGVQ : .: : : . . .. . .. Hosa_FasL LYVNVSELSLVNFEESQTFFGLYKL----------------------------------- Mumu_FasL LYVNISQLSLINFEESKTFFGLYKL----------------------------------- Cami_FASL IYVNVSNTNLVSFDQTKTFFGLYKL----------------------------------- Scca_FasL_proba IYVNVSDSRLVDFAESKTFFGLYKL----------------------------------- Scca_FasL_proba IYVTVSNPAFVNFRQVSTFFGLHKLLICIFNACFHLLWVNDQA-------------LSLF Scca_FasL_proba IYVTVSNPAFVNFRQVSTFFGLHKLLICIFNACFHLLWVNDQA-------------LSLF Scca_FasL_proba IYVSVSYPDFVG-DEHSTYFELYWLFAL---PVFCILAHNSPA----------------- Scca_FasL_proba LFVSVSHPRMVSTDELMTFFGAYKLIRPVWTMKLQITMLRDNCDTSQFKRFRCVHLLIVF ::*.:* ::. : *:* : *
  • 52. 51 TL1A Mumu_TL1A EERSEP--S--PQQVYSPPR----------GKPRAHLTIKKQTPAHLKNQLSALHWEHDL Hosa_TL1A --SHTKHIS--PFVTDAPLRA-------DGDKPRAHLTVVRQTPTHFKNQFPALHWEHEL Chpi_TL1A DESMSRFLK--QQASPISLVS-------SAGKPRAHLTVKKQDISPMGNQLPVLQWEHER Scca_TL1A_proba -------------MIRLP-VD-------TAEKPRAHLT-AKVTSSKKDGEYFALQWEHKN Scca_TL1A_proba AQKSEAQIG--PEMIRLP-VD-------TAEKPRAHLT-AKVTSSKKDGEYFALQWEHKN Anca_TL1A TTDGEVPTGKMPEAQRLDAHDHNCCNSTSIRKARTHLA-GAQKQSCMPGSLPILYWRND- Opha_TL1A RTRWTRDVAKIPQQEPL-------------MRPRK------------EGDATEIWWTP-- :.* .. : * Mumu_TL1A GMAFTKNGMKYINKSLVIPESGDYFIYSQITFRGTTSVCPDSITMVITKVADSY-PEPAR Hosa_TL1A GLAFTKNRMNYTNKFLLIPESGDYFIYSQVTFRGMTSECPDSITVVITKVTDSY-PEPTQ Chpi_TL1A GLAFTKNNMNYTNKSLMIPKAGDYYIYSQVTFRGPTPNHPVSITQTITKVTDSY-PEPTQ Scca_TL1A_proba GLAFTKGELNYQDRALTIPRKGDYFVYSQVSFRGYASDMNSLIIHMITKLTPRY-PKPMQ Scca_TL1A_proba GLAFTKGELNYQDRALTIPRKGDYFVYSQVSFRGYASDMNSLIIHMITKLTPRY-PKPMQ Anca_TL1A -RTHEEDTITYSDGYLTVPETGDYFVYTQVSYRCPDGKCKTIITQIISKISSSYGSEPKQ Opha_TL1A -FLQQGRALEPSGQEVVVRHTGLYFVYSQVLFHDPTF----TMGQVLRRVAL---GRPDQ : . : : . * *::*:*: :: : : ::: .* : Mumu_TL1A LLTGSKSVCEISNNWFQSLYLGATFSLEEGDRLMVNVSDISLVDYTKEDKTFFGAFL--- Hosa_TL1A LLMGTKSVCEVGSNWFQPIYLGAMFSLQEGDKLMVNVSDISLVDYTKEDKTFFGAFL--- Chpi_TL1A LLTATKTLCEMGNNWFQPIYLGAVLFMEEGDRLMVNVSEIQWVDYTKEDKTFFGAFL--- Scca_TL1A_proba LLSSTRSVNGEKSLWRMAIYLGAVFHLEQGDRIIVNVSSVKQVDFTNDHKTY-------- Scca_TL1A_proba LLSSTRSVNGEKSLWRMAIYLGAVFHLEQGDRIIVNVSSVKQVDFTNDHKTYFGAFL--- Anca_TL1A QLITSTSIGE-KEMWKKTLYLGGILELRERDKLMVNVSNPELVDIALPAMTFFGAFLRAN Opha_TL1A ILFRC---------------IQRVFHLQQGDRLSLSIPRFASFDISTH-GTFLGVLR--- * : : :.: *:: :.:. .* : *:
  • 53. 52 RANKL Cami_RANKL Q--RHTGHI-TES--TESSQSHHPQRT-PCAHLTMKRIKDSPKTKFIGLKGKPITSWEEK Scca_RANKL_prob -----------------------MADTLRSPHYKKKTTRNSSETSFPGLKGKPIISWEEK Scca_RANKL_prob K--TATQRF-AQG--IQSHEEEFMADTLRSPHYKKKTTRNSSETSFPGLKGKPIISWEEK Hosa_RANKL QHIRAEKAM-VDGSWLDLAKRSKLEAQ-PFAHLTINA------TDIPSSHKVSLSSWYHD Mumu_RANKL QRFSGAPAM-MEGSWLDVAQRGKPEAQ-PFAHLTINA------ASIPSSHKVTLSSWYHD Chpi_RANKL EPPRSGKAI-MEAPWFDLSKRNKPEKQ-PFAHLIIDD------KNIPATRKVNLTSWHHD Gaga_RANKL ESPRPEKAA-MEAIGMELYRRNKPEKQ-PFAHLIIDD------KNILTTRKVNLTSWHHN Lach_RANKL GYSKTENVIRINPPFMDPAKKSEPERR-PTSHLTIKR------SKYSYSTLQLLTSWEDN .* . . : ** .. Cami_RANKL IGIAHTIEMSHNQGLLKIKHHGLYYIYAQILFRYHE----YQLVQYVYWKTANYRKPHLI Scca_RANKL_prob VGLAHLREMRYSQGSLRIKHRGLYYIYAQILFRYHE----YQLIQYVYRKMSKYPKPTLI Scca_RANKL_prob VGLAHLREMRYSQGSLRIKHRGLYYIYAQILFRYHE----YQLIQYVYRKMSKYPKPTLI Hosa_RANKL RGWAKISNMTFSNGKLIVNQDGFYYLYANICFRHHETSGYLQLMVYVTKTSIKIPSSHTL Mumu_RANKL RGWAKISNMTLSNGKLRVNQDGFYYLYANICFRHHETSGYLQLMVYVVKTSIKIPSSHNL Chpi_RANKL KGQANLSNMTFRDGKLIVNQDGFYYLYANICFRHHETSGGLQLMVYMSKTNLKIRRSDVL Gaga_RANKL KGQANLSNMTFSDGKLIVNQDGFYYLYANICFRHHETSGGLQLMVYMTKTNLKIRRSDVL Lach_RANKL LGLAHASNMTYKDGKLKIHQDGFYYVYTNICFRHHETSG-IQLMLYMYHQKVTGSKPRYL * *: :* :* * ::: *:**:*::* **:** **: *: . . : Cami_RANKL LKGAGT-WCSGN—HNLYSVYLGGVFNLHRGDEIFVTVSNISLVDADETSSYLGAFKLN-- Scca_RANKL_prob LRGAST-WCSGN—HNLYSVYQGGVFSLLNGDKIFVTVSNVSLVDVDEASSYLGAFKMD-- Scca_RANKL_prob LRGAST-WCSGN—HNLYSVYQGGVFSLLNGDKIFVTVSNVSLVDVDEASSYLGAFKMD-- Hosa_RANKL MKGGSTKYWSGNSEFHFYSINVGGFFKLRSGEEISIEVSNPSLLDPDQDATYFGAFKVRDI Mumu_RANKL MKGGSTKNWSGNSEFHFYSINVGGFFKLRAGEEISIQVSNPSLLDPDQDATYFGAFKVQDI Chpi_RANKL MKGGSTKYWSGNSEFHFYSVNIGGFFKLKSGEVISIQVSNPSLLDSSQEATYFGAFKVRDL Gaga_RANKL MKGGSTKYWSGNSEFHFYSVNIGGFLKLKTGDMISIQVSNPLLLDSSQEATYFGAFKVRDL Lach_RANKL MKGGSTKYWSSHTEHNFYCIYQGGIFKLKSGDEIFLKISYASLLDQSQEASYFGAFKLRDL ::*..* *.: .::*.: **.:.* *: * : :* *:* .: ::*:****:
  • 54. 53 LIGHT Mumu_LIGHT GDIVAHLPDGGKGSWEKLIQDQRSH-ANPAAHLTGANASLIGIGGPLLWETRLGLAFLR Hosa_LIGHT --------DGPAGSWEQLIQERRSH-VNPAAHLTGANSSLTGSGGPLLWETQLGLAFLR Chpi_LIGHT EKATSQVGEEAGAISEKSIQDRKLH-AEKPAAHVLGAATT--GNGSLQWEHSKDWAFLH Anca_LIGHT TKATTQGGAEA----EKIVQGERPP-EKPAAHLTLGALINTAENGALLWEDKKGLAFLQ Scca_LIGHT_prob -------------------EDMIFQDSTKPAAHVTGCNSS--GYDRLQWEHSKGLAFLR Scca_LIGHT_prob QEFSSMF-LAAKNTSLRSIQDMIFQDSTKPAAHVTGCNSS--GYDRLQWEHSKGLAFLR : :****: . . * ** . ***: Mumu_LIGHT GLTYHDGALVTMEPGYYYVYSKVQLSGVGCPQLP-THGLYKRTSRYPKELELLVSRRSP Hosa_LIGHT GLSYHDGALVVTKAGYYYIYSKVQLGGVGCP-LT-THGLYKRTPRYPEELELLVSQQSP Chpi_LIGHT MRYQGGSLVCIKPGYYYIYSKIHLMAQSCSQNSVIYHSIYKKTPRYVEEMKLVENLILF Anca_LIGHT MGYKDGVLIFNQSGHYYIYSKLLLGDSRCPSSQMVIHEICKRTSRYPREIVLLTNKILT Scca_LIGHT_prob EIEYKEGALVIKRAGYYFIYSKILLGEIHCGDNGFLKQTLFKVGEAYPQPIELMAVRKF Scca_LIGHT_prob IEYKEGALVIKRAGYYFIYSKILLGEIHCGDNGFLKQTLFKVGEAYPQPIELMAVRKFF : *: * *: ..*:*::***: * * : : : * * . : *: Mumu_LIGHT CGR-NSSRVWWDSSFLGGVVHLEAGEEVVVRVPGNRLVRPRDGTRSYFGAFMV- Hosa_LIGHT CGRTSSSRVWWDSSFLGGVVHLEAGEKVVVRVLDERLVRLRDGTRSYFGAFMV- Chpi_LIGHT CDG-KDNRFWRQNSFLGGIFHLEEEEQIYVKVSQRQMLQVRDGTRSYFGTFMI- Anca_LIGHT CNL-QNTPGWRHNSFLAGMVHLEENEEVFVKVSHKQMVRVGDDTRSYFGTFMI- Scca_LIGHT_prob CTG-ESTDQWLDNSYLGAIFDFSEGDEVFIKITEKKVLRISNGAETFFGAFMV- Scca_LIGHT_prob CTG-ESTDQWLDNSYLGAIFDFSEGDEVFIKITEKKFE—SPTVQKHFWCFLM-- * ... * ..*:*..:..:. ::: ::: .:. ....* *::