SlideShare a Scribd company logo
1 of 114
Download to read offline
Identification of a Carboxysomal γ-Carbonic Anhydrase in the
Mesophilic Cyanobacterium Anabaena sp. PCC7120
by
Dewan Shamsul Arefeen
A thesis submitted in conformity with the requirements
for the degree of Master of Science
Department of Cell and Systems Biology
University of Toronto
© Copyright by Dewan Shamsul Arefeen (2010)
ii
Identification of a Carboxysomal γ -Carbonic Anhydrase in the
Mesophilic Cyanobacterium Anabaena sp. PCC7120
Dewan Shamsul Arefeen
Master of Science 2010
Department of Cell and Systems Biology, University of Toronto
Abstract
Analysis of the genome of Anabaena sp. PCC7120 reveals that it lacks the gene,
ccaA, which encodes the bonafide carboxysomal, β-class carbonic anhydrase (CA) CcaA.
However, the carboxysome enriched fraction of Anabaena PCC7120 exhibits CA
activity. Bioinformatic analysis reveals that the N-terminal region of the carboxysome
protein CcmM has high sequence and structural similarity to the γ-class CA of
Methanosarcina thermophila. Recombinantly expressed CcmM is found to be inactive in
in-vitro CA assays. E. coli cell extracts containing an overexpressed form of CcmM
comprised of the N-terminal 209 amino acids (CcmM209) are also inactive. However,
CcmM209 displays CA activity after incubation with the thiol oxidizing agent diamide or
when bound to an affinity matrix. It appears that CcmM is indeed a functional γ-CA
which is active under oxidizing condition. It is hypothesized that the C-terminal RbcS
like domain in CcmM may regulate activity by allowing CcmM activation only when
sequestered within the carboxysome.
iii
Acknowledgements
I am greatly thankful to my supervisor, Dr. George Espie, for giving me the
opportunity to pursue my Masters. I would like to thank him for providing me with
proper guidance, support and constructive criticisms throughout my graduate studies. I
particularly respect George for being so understanding and accommodating during
personal emergencies.
I would like to thank my committee members, Dr. Steven Short and Dr. Tim
Westwood, for their criticisms, ideas and advice throughout my thesis.
I wish to extend my gratitude to all the members of Espie lab for being
instrumental in my academic and personal growth. I would especially like to thank Dr.
Yohannes Tadesse for teaching me a multitude of techniques, discipline and patience.
Yohannes has always gone beyond my expectations to help me, regardless of the nature
of the problem. I would like thank Charlotte de Araujo for helping me with the mass
spectrometer. I would like to thank Dr. Anthony So for providing me with recombinant
full length CcmM and many of the protocols that I have used in my research project.
I am grateful to Ian Buglass who made all the administrative details clear and for
his rapid response to any queries. I am thankful to Sarah Gonsalves and Anja Lowrance
for making my teaching assistantship in microbiology labs a pleasurable experience.
I could not have come this far without the loving support of my family members,
Mom, Dad, Ron and Noel. I am thankful to my mother and father-in-law as well as
Mumu and Priya. I am thankful to my wife, Armana, who gave me support and
encouragement to accomplish everything. Finally I would like to thank God for
everything.
iv
TABLE OF CONTENTS
Page
ABSTRACT ii
ACKNOWLEDGEMENTS iii
TABLE OF CONTENTS iv
LIST OF ABBREVIATIONS vii
LIST OF TABLES x
LIST OF FIGURES xi
LIST OF APPENDICES xiii
INTRODUCTION 1
Cyanobacteria 1
Importance of CO2 concentrating mechanism (CCM) in cyanobacterial
photosynthesis and growth
2
Components of the CCM 3
Types of carbonic anhydrases 5
Carboxysomal genes 8
Carboxysomal proteins and their known functions 12
Present model of carboxysome function 16
Research objectives 19
MATERIALS AND METHODS 20
Bioinformatics study 20
Organisms and growth conditions 21
Isolation of carboxysomes 22
v
Polyacrylamide gel electrophoresis 24
Western blot analysis 24
Recombinant protein expression 25
Mass spectrometric carbonic anhydrase assay 26
CA assay of CcmM 27
Cellulose acetate assay 27
T7 Tag affinity purification and pull down assays 28
Sufonamide column binding assay 29
Protein sequencing 30
CcmM209 and isolation on His.Bind®
affinity column 30
RESULTS 32
In Silico Search for a Carboxysomal Carbonic Anhydrase 32
Mass spectrometric CA assays 45
CA Activity in Synechococcus PCC7942 carboxysomes 48
CA Activity in Anabaena PCC7120 carboxysomes 51
Activity of full length CcmM 54
Attempts to activate CcmM 59
Cellulose acetate assay 63
T7 tag - affinity pull down experiments 63
Sulfonamide pull down assays 66
Analysis of Anabaena PCC7120 CcmM209 69
Results summary 74
75
vi
DISCUSSION
Putative carboxysomal carbonic anhydrase in Anabaena PCC7120 75
The most likely candidate 78
Activity of Anabaena PCC7120 carboxysome enriched fraction 79
CcmM Expression and Catalytic activity 81
Identification of putative CA using affinity pull down assays 85
CcmM is an active γ-CA in mesophilic cyanobacteria 86
Proposed model of non-CcaA containing β-carboxysome 87
Conclusion and future directions 91
APPENDIX 100
Calculation of CO2 concentration in mass spectrometer 100
Truncated CcmM209 primers and sequence 101
REFERENCES 92
vii
LIST OF ABBREVIATIONS
°C degree(s) Celsius
µ micro (10-6
)
Amp ampicillin
AP alkaline phosphatase
Arg arginine
Asp aspartic acid
BCIP 5-bromo-4-chloro-3-indolylphosphate
CA carbonic anhydrase
cbx carboxysome
CCM carbon dioxide concentrating mechanism
Ci dissolved inorganic carbon
CO2 carbon dioxide
Cys cysteine
dH2O distilled water
DNA deoxyribonucleic acid
DTT dithiothreitol
EDTA ethylenediaminetetraacetic acid
EPPS N-(2-hydroxyethyl)piperazine-N’-(3-propanesulfonic acid)
g gram(s)
Gln glutamine
HCO3
-
bicarbonate ion
HCR high Ci requiring phenotype
HEPES 4-(2-Hydroxyethyl)piperazine-1-ethanesulfonic acid
His histidine
IgG immunoglobulin G
IMAC immobilized metal affinity chromatography
IPTG isopropyl-β-D-thiogalactoside
kbp kilobase pairs
kDa kilodaltons
viii
Km Michaelis-Menton constant
L liter(s)
LB Luria-Bertani
LC liquid chromatography
M moles per liter
mg Milligram
Mg2+
magnesium ion
min minute (s)
mL milliliter
mM millimolar
mol mole (s)
MS mass spectrometer
NaOH sodium hydroxide
NBT nitroblue tetrazolium chloride
NCBI National Center for Biotechnology Information
OD optical density
PAMBS p-aminobenzyl sulfonamide
PCC Pasteur Culture Collection
PDB protein data bank
PPFD photosynthetic photon flux density
psi pounds per square inch
RMS root mean square
rpm revolutions per minute
RuBisCO Ribulose-1,5-bisphosphate caboxylase/oxygenase
s second(s)
SDS sodium dodecyl sulfate
SDS-PAGE SDS-polyacrylamide gel electrophoresis
Sec secretory
Tat twin arginine translocation
TBS Tris-buffered saline
Tes N-[tris(hydroxymethyl)methyl]-2-aminoethanesulfonic acid
ix
Triton X-100 t-octylphenoxypolyethoxyethanol
Tween-20 polyoxyethylene sorbitan monolaurate
UTCC University of Toronto Culture Collection
V volt(s)
v/v volume by volume
W watt(s)
w/v weight by volume
x g acceleration due to gravity
Zn2+
zinc ion
x
LIST OF TABLES
Page
Table 1: List of proteins present in β-carboxysome and α-carboxysome. 15
Table 2: List of query sequences used to search the proteome of Anabaena
PCC7120 to identify a putative carbonic anhydrase.
21
Table 3: Factors changed in attempts to activate CcmM. 62
xi
LIST OF FIGURES
Page
Figure 1: Arrangements of β-carboxysomal genes… 11
Figure 2: Structure of a β-carboxysome… 14
Figure 3: The present model of the carboxysome… 18
Figure 4: Representation of the location of conserved β-CA and α-CA domains 40
Figure 5: Representation of the location of conserved γ-CA domains 41
Figure 6: Amino acid sequence alignment output from ClustalX2 for Cam of
Mehanosarcina thermophila and CcmM of selected species of cyanobacteria.
42
Figure 7: 3D structure of Cam of Methanosarcina thermophila and CcmM of
Anabaena PCC7120.
44
Figure 8: (a) Mass spectrometric assay for CA activity of E. coli lysate in which
CcaA expression was induced. (b) 18
O atom fraction in m/z 49 with respect to
time.
47
Figure 9: (a) Mass spectrometric assay for CA activity of Synechococcus
PCC7942 carboxysome extract. (b) Log 18
O atom fraction in m/z 49 with respect
to time.
50
Figure 10: (a) Mass spectrometric CA assay of carboxysome enriched fraction
from Anabaena PCC7120. (b) Log 18
O atom fraction in m/z 49 with respect to
time.
53
Figure 11: Western Blot using CcmM antibody for E. coli lysate overexpressing
CcmM and Anabaena PCC7120 lysate.
56
Figure 12: (a) Mass spectrometric assay for CA activity of E. coli lysate in 57
xii
which CcmM expression was induced. (b) Log 18
O atom fraction in m/z 49 with
respect to time.
Figure 13: (a) Mass spectrometric assay for CA activity of E. coli lysate
containing empty pET21 vector. (b) Log 18
O atom fraction in m/z 49 with
respect to time.
58
Figure 14: Western blot using CcmM antibody for varying temperature and
IPTG treatments during CcmM overexpression.
61
Figure 15: Western blot using CcmM antibody to show binding in anti-T7
agarose.
65
Figure 16: Western blot using CcaA antibody to show CcaA binding in
sulfonamide resin.
67
Figure 17: Mass spectrometric analysis of proteins isolated from the
sulfonamide column.
68
Figure 18: (a) Mass spectrometric assay for CA activity of E. coli lysate
containing CcmM209 treated with 20 mM diamide. (b) Log % 18
O in 13
CO2 with
respect to time.
71
Figure 19: (a) Mass spectrometric assay for CA activity of His.Bind resin in
which CcmM209 is bound. (b) Log % 18
O in 13
CO2 with respect to time.
72
Figure 20: (a) Mass spectrometric assay for CA activity of His.Bind resin in
which E. coli lysate containing empty vector is flowed through. (b) Log % 18
O in
13
CO2 with respect to time.
73
Figure 21: Proposed model for the function of β-carboxysomes lacking the
carboxysomal carbonic anhydrase, CcaA.
90
xiii
LIST OF APPENDICES
Calculation of CO2 concentration in mass spectrometer 100
Truncated CcmM209 primers and sequence 101
1
Introduction
Cyanobacteria
Cyanobacteria are a group of gram-negative, photoautotrophic bacteria that carry
out oxygenic photosynthesis. They are found in a wide variety of habitats including
freshwater, marine and soil surfaces. The cyanobacteria are responsible for as much as
one third of the global photosynthetic carbon dioxide fixation and hence are significant
contributors to the biogeochemical cycling of carbon and primary productivity on earth.
Beyond fixing carbon dioxide, cyanobacteria play a diverse role in nutrient
cycling in the ecosystem. For example, cyanobacteria are the only group of organisms
that are able to fix nitrogen under aerobic conditions. The enzyme involved in fixing
dinitrogen is nitrogenase and it is irreversibly inactivated in the presence of oxygen
(Gallon, 1992). Some strains of cyanobacteria, including Anabaena sp. PCC7120, have
evolved specialized cells called heterocyst which enable them to carry out nitrogen
fixation in spite of high ambient extracellular O2 concentrations. The ability of
cyanobacteria to fix atmospheric nitrogen makes them an important source of ammonia,
nitrates and nitrites in diverse habitats from the open oceans to the rice paddy fields of
Asia.
Recent studies have shown that Anabaena PCC7120 is able to detoxify
organophosphorous pesticides, used in agriculture, to harmless organic components
(Barton et al., 2004). The ability of this cyanobacterium to detoxify pesticides may play
an important role in the future bioremediation of farmland, given the large-scale and
increasing use of pesticides in modern intensive agriculture. Presently, with the ever-
2
increasing demand for energy and the diminishing fossil fuel, cyanobacteria are also
being looked at as a potential source of biofuels (Hu et al., 2008). The fact that many
cyanobacteria have a high efficiency of photosynthesis, the ability to grow non-
fastidiously and to fix nitrogen makes them very attractive and potentially lucrative
organisms for biofuel generation. To this end, an understanding of the biochemical and
genetic mechanisms that underlie their efficient growth and photosynthesis will be vital
in the metabolic engineering of stains for industrial scale biofuel production (Ragauskas
et al., 2006).
Importance of the CO2 concentrating mechanism (CCM) in cyanobacterial
photosynthesis and growth
Cyanobacteria are extremely primitive organisms dating back 2.7 billion years
(Buick, 1992). Over the course of this time photosynthesis has lowered the level of
atmospheric carbon dioxide while increasing the level of oxygen. Thus, photosynthesis
by cyanobacteria, which requires carbon dioxide, had to adapt to this globally changing
environment. Photosynthetic CO2 fixation is catalyzed by the enzyme, ribulose
bisphosphate carboxylase/oxygenase (RuBisCO; EC 4.1.1.39), and involves the
carboxylation of ribulose-1,5-bisphosphate (RuBP) to produce two molecules of 3-
phosphoglycerate (PGA). The catalytic activity of cyanobacterial RuBisCO is relatively
slow with a turnover number of 3 s-1
(Schneider et al., 1992) and requires high substrate
concentrations for efficient carboxylation (Km(CO2) > 240 µM) (Andrews and Abel,
1981). Not only is RuBisCO slow at atmospheric CO2 concentration but it also has an
oxygenase activity which adds O2 instead of CO2 to RuBP, yielding a molecule of PGA
3
and a molecule of phosphoglycolate (Schneider et al., 1992). Phosphoglycolate is
metabolized in the glycolate pathway and leads to loss of energy and CO2 by
photorespiration. Typically, cyanobacteria live in environments where the O2 / CO2 ratio
is about 25:1. Given the kinetic parameters of cyanobacterial RuBisCO (Km = 240 μM
CO2, Kcat = 3 s-1
and Km = 1000 μM O2), it would be expected to promote oxygenation
over carboxylation (Andrews and Abel, 1981; Jensen and Bahr, 1977; Schneider et al.,
1992). However, under normal environmental conditions, direct physiological
measurements show that photorespiration is almost nonexistent and that photosynthesis
proceeds with remarkable efficiency (Aizawa and Miyachi, 1986)! One widely supported
explanation for this discrepancy is that cyanobacteria have evolved mechanisms to
increase the level of CO2 around RuBisCO by a collective process called the carbon
dioxide concentrating mechanism (CCM) (Badger and Price, 2003), thereby decreasing
the intracellular O2 / CO2 ratio to a level that promotes photosynthesis.
Components of the CCM
The CCM is composed of two primary components, which together can increase
the concentration of CO2 around RuBisCO up to 1000 fold (Badger and Price, 2003;
Miller and Colman, 1980). First, cyanobacteria use membrane-associated active
transporters for bicarbonate, active transporters of carbon dioxide, and bicarbonate/Na+
symporters to create a large internal pool of bicarbonate ions (Badger and Price, 2003).
However, RuBisCO cannot use bicarbonate as a substrate and bicarbonate needs to be
converted to CO2 before use. Conversion of bicarbonate to CO2 is catalyzed by the
enzyme carbonic anhydrase (CA; EC 4.2.1.1). Carbonic anhydrase, along with RuBisCO,
4
in cyanobacteria is localized to polyhedral proteinaceous bodies called the carboxysomes,
the second primary component of the CCM (Mckay et al., 1993; Price et al., 1992).
Carboxysomes were first characterized in the chemoautotrophic bacterium
Halothiobacillus neapolitanus (Shively et al., 1973). Carboxysomes are about 120 nm in
diameter. Cross-sections of the carboxysomes of most species show that they are regular
hexagons surrounded by a 3 to 4 nm thick protein shell (Cannon et al., 2001). Sodium
dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) reveals that the shell
proteins comprise of about 17% of the total protein content of the carboxysomes (Cannon
et al., 2001; Cannon and Shively, 1983). No lipid component has ever been found to be
associated with the carboxysome shell or core. The core of the carboxysome is packed
with RuBisCO and the name carboxysome is derived from the carboxylase and
oxygenase activity of RuBisCO (Pierce et al., 1989; Price and Badger, 1989a). Studies
have shown that the packaging of RuBisCO within the carboxysomes enhance the
catalytic properties of the enzymes (Cannon et al., 2001). Carboxysomes are classified
into two phylogenetic groups based on the form of RuBisCO they contain.
Carboxysomes containing Form 1A RuBisCO are classified as α-carboxysomes and
carboxysomes containing Form 1B RuBisCO are called β-carboxysomes (Badger and
Price, 2003; Cannon et al., 2002). It has been suggested that two primary groups of
cyanobacteria can be classified based on their RuBisCO/carboxysome phylogeny.
Cyanobacteria with Form 1A RuBisCO are termed α-cyanobacteria and those with Form
1B RuBisCO are termed β-cyanobacteria (Badger and Price, 2003). Carboxysome
structure and function have been studied in both α and β-carboxysomes. α-carboxysomes
from the chemolithoautotrophic bacterium, Halothiobacillus neapolitanus, have been
5
most widely studied and best characterized (Cannon et al., 2001). β-carboxysome
structure has been studied using the model laboratory cyanobacterial species
Synechocystis sp. PCC6803 and Synechococcus sp. PCC7942 (Badger and Price, 2003).
The presence of carbonic anhydrase in close proximity to RuBisCO in the
carboxysomes allows for high levels of CO2 to be produced around the RuBisCO active
site, thus facilitating carboxylation over oxygenation (Price and Badger, 1989a). The
over-expression of recombinant human CA II in the cytosol of Synechococcus PCC7942
results in the conversion of the bicarbonate pool to carbon dioxide, which then readily
diffuses out of the cell (Price and Badger, 1989a). Thus, a cytosol-localized CA is
deleterious causing a “short circuit” in the CCM. This “short-circuit” prevents
bicarbonate accumulation and the localized formation of CO2 in the vicinity of RuBisCO,
thereby drastically reducing the substrate concentration available for CO2 fixation by
RuBisCO. Under ambient conditions, this situation ultimately leads to a reduction in
photosynthetic efficiency and to a high CO2 requiring phenotype. Mutants with defective
carbonic anhydrase also require high CO2 concentrations in their environment to be
viable. It is apparent that the localization of carbonic anhydrase and RuBisCO within
carboxysomes is vital for efficient photosynthesis in cyanobacteria.
Types of carbonic anhydrases
The importance of carbonic anhydrase in living organisms is implied by the
widespread distribution of this enzyme from mammals to Archaea (Smith and Ferry,
2000). So far, three distinct evolutionary lineages (α, β and γ) of carbonic anhydrases
have been characterized. As well, two additional classes (δ and ε) have been proposed
6
based on the occurrence of unique, non-homologous proteins that display CA activity (So
et al., 2004; So and Espie, 2005). The α-class was initially identified from animals but
examples can be found in plants and eubacteria. α-CAs typically exist as protein
monomers that are mostly composed of antiparallel β-strands (Liljas et al., 1972). This
single protein has all the necessary structural elements to generate a functional, active site
(So and Espie, 2005). The active site of α-CA is located in a 15-Å deep crevice
dominated by hydrophobic amino acid side chains at the base of which is a Zn2+
ion
invariably coordinated by three histidine residues (Liljas et al., 1972).
β-CA’s were initially identified in chloroplast of plants but are now known to be
present in various subcellular compartments of many organisms. Interestingly, β-CAs
have no amino acid sequence similarity with α-CAs, suggesting that they evolved
independently. Unlike α-CA’s, β-CA’s are only functional when oligomerized
(Mitsuhashi et al., 2000; Strop et al., 2001). A homodimer is the catalytic core of a β-CA,
which may exist as a dimer, tetramer or octamer depending on the species of origin
(Mitsuhashi et al., 2000). Dimerization enables formation of the hydrophobic pocket
required for CO2 binding and forms the active site at the interface. Zn2+
co-ordination is
mediated by a combination of His, Cys and Glu residues depending on the species
(Mitsuhashi et al., 2000; Strop et al., 2001). In contrast to α-CAs which is mostly
composed of β-sheets, β-CAs contain a number of α-helices.
The third distinct class of CA is the γ-CA which was first isolated from the
methanogenic archaeon Methanosarcina thermophila (Alber and Ferry, 1994). The γ-CA,
called Cam, from M. thermophila is catalytically active when trimerized (Kisker et al.,
1996). Cross-sectional profiles of the γ-CA trimer reveal that each monomer resembles
7
an equilateral triangle. Cam has an unusual left-handed β-helix structure containing seven
complete turns with an α-helix at the terminal portion forming αβ-helix (Kisker et al.,
1996). The active sites are located at the interfaces between two β-helices. The interface
is stabilized by H bonds, salt bridges and hydrophobic interactions. The side chain of
Arg59 is important since it forms salt bridge with Asp61 of the same monomer and
Asp76 of the adjacent monomer. The trimer contains 3 active sties and each monomer
contributes His residues located on the surface to coordinate Zn2+
(Kisker et al., 1996).
His81 and His122 of one monomer extend from equivalent positions of adjacent turns of
the β-helix along with His117 from a second monomer coordinate the zinc ion (Kisker et
al., 1996). A water molecule is the fourth Zn2+
ligand and is within hydrogen bonding
distance of Gln75. Cam is the only γ-CA which has been shown to have CA activity.
Numerous Cam homologs from both plants and Bacteria have been tested for catalytic
activity, including CcmM from the cyanobacteria Synechocystis PCC6803 and
Synechococcus PCC7942. All were found to lack CA activity, prompting the suggestion
that these homologs have evolved a different function and that Cam is a relic.
A fourth class of CA named δ-class has been isolated from the marine diatom
Thalassiosira weissflogii (Roberts et al., 1997). X-ray absorption spectroscopy of the δ-
CA, T. weissflogii CA1 (TWCA1), has shown that it indeed does contain a Zn2+
ion
bound by histidine residues. Presently, there are only 4 other proteins that display amino
acid sequence similarity to TWCA1 and, thus, its distribution may be restricted to only a
small number of diatom species (So and Espie, 2005).
A fifth class of CA tentatively named ε-class has been characterized from the
chemolithoautotrophic bacterium Halothiobacillus neapolitanus and several marine
8
cyanobacteria (So et al., 2004; So and Espie, 2005). BLAST search results show that this
protein is widely distributed among marine cyanobacterial strains but has no amino acid
sequence similarity to the α, β, γ and δ CAs. Recent X-ray crystal structure studies on the
ε-CA of H. neapolitanus, CsoS3, indicate that it is structurally related to β-CA (Sawaya
et al., 2006) in spite of the absence of any primary sequence similarity. The suggestion
that CsoS3 is a subclass of β-CA comes from the striking structural similarity of the Zn2+
-containing active site and from the fact that both need to form dimers in order to be
active (Sawaya et al., 2006). Thus, CsoS3 is an example of divergent evolution. Proteins
with high degree of amino acid sequence homology to CsoS3 have now been identified in
a range of chemolithoautotrophic bacteria, hydrogen bacteria and many strains of marine
cyanobacteria (So et al., 2004; So and Espie, 2005). In all examples to date, CsoS3 is
encoded within the cso operon which encodes all the components for the α-carboxysome.
Carboxysomal genes
α-carboxysomes are also characterized by the presence of shell proteins encoded
by the cso gene cluster (Badger and Price, 2003; Cannon et al., 2002; So et al., 2004). β-
carboxysomes are encapsulated by proteins encoded by the ccmKLMN gene cluster
(Badger and Price, 2003; Cannon et al., 2002). To date, all cyanobacterial species
characterized possess either α-carboxysome or β-carboxysomes, but not both. It has been
suggested that two primary groups of cyanobacteria can be classified based on their
carboxysome phylogeny. Cyanobacteria with α-carboxysomes are termed α-
cyanobacteria and those with β-carboxysomes are termed β-cyanobacteria (Badger and
Price, 2003).
9
Studies show that the arrangement of carboxysomal genes varies depending on
the type of carboxysomes. The α-carboxysomal genes are arranged into one operon
whereas the β-carboxysomal genes are distributed among multiple operons. The genes of
the putative operon of α-carboxysomes, in their transcriptional order, are cbbL, cbbS,
csoS2, csoS3, csoS4A, csoS4B, csoS1C, csoS1A, csoS1B (Cannon et al., 2003; Shively et
al., 1998). The cbbL and cbbS code for the large and small subunit of RuBisCO form 1A;
the genes csoS2, csoS3, csoS1C, csoS1A and csoS1B code for carboxysomal shell
proteins CsoS2, CsoS3, CsoS1C, CsoS1A and CsoS1B respectively (Cannon et al.,
2003). CsoS3 has been identified as a shell associated active carbonic anhydrase while
CsoS1 is the major structural protein of the shell (So et al., 2004). Purified CsoS1
spontaneously organizes into hexamers that form sheet-like structures. The hexamers are
thought to form the majority of the flat faces of the carboxysome icosahedron. It has
recently been established that the genes csoS4A, csoS4B (previously called orfA and orfB)
code for proteins CsoS4A and CsoS4B that assemble as pentamers (Tanaka et al., 2008).
The construction of large icosahedral structure typically requires a combination of
hexameric and pentameric proteins. It is conjectured that the pentamers provide the
curvature required at the vertices to form the icosahedron while the hexamers form the
flat faces of the icosahedron. It has been postulated that CsoS4A and CsoS4B form the
vertices of the carboxysome shell (Tanaka et al., 2008).
The β-carboxysomal genes are found scattered throughout the genome in 3 or 4
different clusters as shown by the gene diagram in Figure 1. Typically, these clusters
include ccmK2K1LMN and ccmK3K4 that contribute the structural components of the
carboxysome. The RbcLXS operon encodes the large and small subunits of RuBisCO
10
(Cannon et al., 2001) and RbcX required for RuBisCO assembly (Saschenbrecker et al.,
2007). Depending on the species, the genes ccmO and ccaA may or may not be present
and even if they are present then they are found at separate loci than the ccmKLMN and
rbcL, rbcS gene cluster. The gene cluster ccmK, ccmL, ccmM and ccmN codes for the
polypeptides CcmK, CcmL, CcmM and CcmN respectively. The gene ccmO codes for
the polypeptide CcmO and the gene ccaA codes for the carboxysomal carbonic
anhydrase, CcaA, of β-carboxysomes. Anabaena PCC7120 possesses β-carboxysomes
and the proteins involved in β-carboxysomes will be the focus in the following pages.
11
Anabaena sp. PCC7120
Synechocystis sp. PCC6803
Gloeobacter violaceus PCC7421
Synechococcus sp. PCC7942
= 1 kb
rbcccmccm
LM K1N K2O
997kb
XL
1786 kb
K3K4
361kb
ccaA?
all0863
all0866
all0867
all0868
all0318
all0865
all0864
all0317
alr1524
alr1525
alr1526
S
slr0436
OXL
219 kb
AK4K3LMN K2K1
2478 kb1742 kb956 kb 2551 kb
slr1347
S
slr0009
slr0010
slr0012
sll1028
sll1029
slr1838
slr1839
sll1032
sll1031
sll1030
ccm ccm ccmcca rbc
L S A X
1595 kb
rbcccm
NM OLK
1475 kb281 kb
KK
ccm
1500 kb
synpcc7942_0284
synpcc7942_0285
synpcc7942_1425
synpcc7942_1421
synpcc7942_1422
synpcc7942_1423
synpcc7942_1424
synpcc7942_1447
synpcc7942_1426
synpcc7942_1427
synpcc7942_1535
rbccca
rbcccm
LM K1N K2O
2249 kb
XL S
2307 kb
ccaA?
gll2093
gll2094
gll2091
gll2092
gll2095
gll2096
glr2156
glr2157
glr2158
Figure 1: Arrangements of β-carboxysomal genes from Anabaena sp. PCC7120, Synechocystis sp. PCC6803, Synechococcus sp.
PCC7942 and Gleobacter violaceus PCC7421. The ccm genes are depicted by red boxes, the Form 1B RuBisCO genes are depicted by
the green boxes and carboxysomal carbonic anhydrase genes are depicted by blue boxes. Putative, uncharacterized carboxysomal
carbonic anhydrase is denoted by the yellow boxes. The cyanobase ID for the corresponding gene is denoted below each box.
12
Carboxysomal proteins and their known functions
RuBisCO
Studies show that the CO2 fixing enzyme RuBisCO is mainly localized to the
carboxysomes regardless of the amount of dissolved inorganic carbon or the stage of
growth at which the cells are harvested (Mckay et al., 1993). The Form 1B RuBisCO is
present as a hexadecameric enzyme composed of eight large and small subunits of
RuBisCO encoded by rbcL and rbcS genes. It has been seen that the expression of a
simple dimeric form of RuBisCO from Rhodospirillum rubrum in Synechocystis
PCC6803 results in the loss of carboxysomes and produces high Ci requiring (HCR)
phenotype (Pierce et al., 1989; Price and Badger, 1989a). Mutants lacking structurally
intact carboxysomes require high CO2 to survive. This suggests that the localization of
RuBisCO and other carboxysomal proteins in the carboxysomes is essential for normal
growth.
CcaA
Many β-carboxysomes possess a carboxysomal carbonic anhydrase, CcaA. Amino
acid sequence characterization has shown that CcaA is a β-type CA. Biochemical
analysis has shown that up to 97% of the immunologically reactive CA polypeptide is
associated with the carboxysomes (So and Espie, 1998). Electron microscopy and
immunogold labeling of RuBisCO in Synechocystis PCC6803 mutant lacking CcaA
reveal that the carboxysome number, size and shape are similar to wild type cells
indicating that CcaA is not essential for maintaining carboxysomal structure or the
13
deposition of RuBisCO within the carboxysomes (So et al., 2002). However, the mutants
of Synechocystis PCC6803 that lack CcaA require high CO2 (5% v/v) to survive,
indicating that CcaA is vital for normal functioning of the cell (So et al., 2002). Protein-
protein interaction studies and protein capture studies have recently shown that CcaA is
localized to the carboxysome shell in complex with CcmM and CcmN, where it likely
serves to catalyze HCO3
-
dehydration in the vicinity of the CO2 fixing enzyme RuBisCO
facilitating efficient CO2 fixation and normal growth (Cot et al., 2008).
CcmK, CcmL and CcmO
CcmK, CcmL and CcmO all play a structural role in carboxysomes as mutants of
CcmK, CcmL or CcmO produce malformed carboxysomes and result in a HCR
phenotype (Cannon et al., 2002; Price et al., 1993). CcmK and CcmL appear to be present
in all β-carboxysomes so far identified while CcmO may or may not be present. CcmK
and CcmO have high amino acid sequence similarity to CsoS1 of α-carboxysomes while
CcmL has high amino acid sequence similarity to CsoS4A and CsoS4B polypeptides
present in α-carboxysomes (Table 1) (Cannon et al., 2002). Four variants (paralogs) of
CcmK (CcmK1-K4) are found in Anabaena PCC7120 as well as other species such as
Synechocystis PCC6803. The X-ray crystal structure of CcmK2 and CcmK4 from
Synechocystis PCC6803 has been elucidated. The crystal structure reveals that the CcmK
family of proteins forms hexamer plates with a central pore and, therefore, most likely
forms the flat facets of the polyhedral carboxysomal body (Kerfeld et al., 2005). The
central pore in the CcmK hexamers consists of positively charged amino acid residues
and this may serve a functional role in the carboxysome such as maintaining the
14
metabolite flux between the carboxysomes and the cytosol (Kerfeld et al., 2005).
Analysis of the crystal structure of CcmL has shown that CcmL forms pentamers (Tanaka
et al., 2008). Hypothetically, these pentameric proteins could reside at the vertices of
icosohedral carboxysomes, joining the 20 planar triangular faces that form the surface of
the carboxysome (Figure 2) (Tanaka et al., 2008). The deletion of the ccmL gene results
in the formation of elongated carboxysomes which further supports the function of CcmL
as a structural protein (Price et al., 1993; Tanaka et al., 2008).
Pentamers
at vertices
(CcmL)
Hexamers
on flat faces
(CcmK)
Figure 2: A proposed structure of a β-carboxysome showing the localization of
pentameric protein, CcmL, and hexameric protein, CcmK forming the carboxysome shell.
(Adapted from Tanaka et al., 2008)
15
Table 1: Protein components of β-carboxysomes and α-carboxysomes.
β-carboxysome proteins
α-carboxysome proteins
with β-carboxysome
homologs
α-carboxysome proteins
unique to α-cyanobacteria
CcmK1
CcmK2
CcmK3
CcmK4
CsoS1A
CsoS1B
CsoS1C
CcmL CsoS4A, CsoS4B
CcmM None CsoS2
CcmN None
CcmO
CcaA (variable) (deeply divergent) CsoS3
RbcL Form IB RbcL Form IA
RbcS RbcS
RbcX (Rbc assembly) None
16
CcmM and CcmN
CcmM and CcmN are unique to β-carboxysomes as are CsoS2 and CsoS3 to α-
carboxysomes (Table 1). The CcmM and CcmN proteins share considerable amino acid
sequence similarity among species. The definitive function of CcmM and CcmN has not
been determined, but mutants of CcmM and CcmN have HCR phenotype which shows
that they are essential for normal carbon assimilation (Ludwig et al., 2000; Price et al.,
1993). Mutation in CcmM yields cells that require high CO2 for growth. These cells are
able to concentrate Ci internally but are unable to effectively utilize the Ci pool in CO2
fixation (Ludwig et al., 2000). Ultrastructural examination shows the mutants lacked
carboxysomes (Ludwig et al., 2000). Yeast two-hybrid analysis and in-vitro pull down
experiments show that CcmM N-terminal region can interact and form a complex with
both CcmN and CcaA (Cot et al., 2008). Both the C-terminal and N-terminal region of
CcmM interact with CcmK and CcmL (Cot et al., 2008). The aforementioned evidence
leads to the conclusion that CcmM has a key structural organizational role in the
formation of carboxysomes and the bicarbonate dehydration complex.
Present model of carboxysome function
Research indicates that both α and β carboxysomes contain RuBisCO packed
within the core and surrounded by a protein shell. Current models of carboxysome
function suggest that HCO3
-
and RuBP diffuse from the cytosol to the carboxysome core,
possibily via the pores in CcmK hexamers, where HCO3
-
is dehydrated to CO2. RuBP
binds to RuBisCO and is subsequently carboxylated using the recently formed CO2 as a
substrate. In α-carboxysome, HCO3
-
dehydration is catalyzed by CsoS3. In some β-
17
cyanobacteria CcaA catalyzes dehydration of HCO3
-
. In these strains (Figure 3), it is
thought that cytosolic bicarbonate binds to CcmM and is vectorially channelled to CcaA
which lies on the interior region of the carboxysome shell, resulting in the dehydration of
the bicarbonate within the carboxysome interior (Cot et al., 2008). The localized
generation of a high concentration of CO2 near the active site of RuBisCO promotes CO2
fixation and reduces RuBP oxidation by outcompeting O2 for binding sites. Mutants
lacking CcaA or CcmM require high CO2 to survive (Ludwig et al., 2000; So et al.,
2002). Therefore, the presence of CcmM and CcaA is vital for the dehydration of
bicarbonate and to the overall efficiency of the CCM in promoting photosynthetic
fixation of CO2. However, recent findings suggest that this model for bicarbonate
dehydration may not be applicable to all β-cyanobacteria as genome analysis has revealed
that a number of β-cyanobacteria appear to lack a CcaA homolog (So and Espie, 2005)!
18
HCO3
-
Rbc
1B
Rbc
1B
Rbc
1B
AA
M M KL N Ncarboxysome
shell
cytosol
carboxysome
interior
CO2
RuBPRuBP
Key
L= CcmL
N=CcmN
M=CcmM
K=CcmK
A=CcaA
Rbc=Rubisco
Figure 3: The present model of the carboxysome shows that CcmM as the central organizing protein that binds to other
carbosysomal shell protein and channels bicarbonate to the CcaA. The CcaA dehydrates the bicarbonate to CO2. The localized
generation of a high concentration of CO2 near the active site of RuBisCO promotes CO2 fixation and reduces RuBP oxidation
by outcompeting O2 for binding sites. (Adapted from Cot et al., 2008)
CO2CO2
19
Research objectives
Amino acid sequence similarity searches of the deduced proteome of the β-
cyanobacterium Anabaena PCC7120 and a number of other strains have failed to identify
homologs of CcaA. Based on this observation, it can be hypothesized that Anabaena
PCC7120 does not require CA activity to form a functional carboxysome. Alternatively,
Anabaena PCC7120 carboxysomes may contain a CA protein with a novel amino acid
sequence that is not identifiable by homology searches. A third possibility is that an
existing previously identified protein may substitute the function of CA. My initial goal is
to perform a bioinformatics analysis of the proteome to assess the presence or absence of
a bicarbonate dehydration complex in Anabaena PCC7120 and to search for a potential /
alternate candidate for CcaA in Anabaena PCC7120. The main experimental objectives
of my thesis is to determine if carboxysomes of Anabaena PCC7120 possess CA activity
and, if so, explore the CA activity of potential candidate protein(s) identified using
bioinformatic analysis and determine if the activity is due to a previously identified
protein or a novel protein.
Materials and methods
Bioinformatics study
An in silico search of the deduced proteome of Anabaena PCC7120 was
conducted to identify potential carboxysomal carbonic anhydrases candidates using the
databases at NCBI and Cyanobase. Protein basic local alignment search tool (BLASTP
http://blast.ncbi.nlm.nih.gov/Blast.cgi) was used to search the proteome of Anabaena
PCC7120 using amino acid query sequences from bonafide carbonic anhydrases of
different classes (Table 2). The amino acid sequences in the proteome of Anabaena
PCC7120 which had the highest similarity to the query sequences were analyzed using
the web based algorithm SignalP 3.0 ( http://www.cbs.dtu.dk/services/SignalP/) using a
gram negative bacterial model for signal peptides and peptide cleavage sites. The
identified sequences were also analyzed using default parameters using the TatP 1.0
algorithm (http://www.cbs.dtu.dk/services/TatP/) for a specific twin arginine signal
sequence with RRNFL motif. Sequence alignments were done using ClustalX2
(http://www.clustal.org/) using default parameters. Predicted 3D structural model of
CcmM was created using the online protein modeling algorithm in Phyre
(http://www.sbg.bio.ic.ac.uk/phyre/) using default parameters. Superpose
(http://wishart.biology.ualberta.ca/SuperPose/) was used to superimpose the predicted
structure of CcmM on Cam. The superimposed structures were viewed in Swiss-Pdb
Viewer (http://spdbv.vital-it.ch/). The RMS values considering the spatial differences in
the α-carbons and all the backbone atoms of the amino acids for the superimposed
structures were calculated using iterative magic fit function in Swiss-Pdb Viewer.
20
21
Table 2: List of query sequences used to search the proteome of Anabaena PCC7120 to
identify a putative carbonic anhydrase.
Query sequence accession number in:Class of carbonic
anhydrase
Query sequence
species Cyanobase NCBI
β-class Synechocystis
PCC6803
slr1347 AAC46375
Halothibacillus
neapolitanus
EEG96215ε-class
Prochlorococcus
marinus MED4
NP_892671
α-class Human AAH27890
γ-class Methanosarcina
thermophila
1THJ-A
Organisms and growth conditions
The filamentous cyanobacterium Anabaena sp. UTCC387 was obtained from the
University of Toronto Culture Collection, Toronto, Canada. The culture record indicates
that this strain was deposited as Anabaena sp. PCC7120 from the Pasteur Culture
Collection, Paris, France. Thus, the strain UTCC387 and PCC7120 are equivalent.
Cultures of Anabaena PCC7120 were grown in BG11 medium at 30ºC (Rippka, 1979).
Small scale cell cultures were grown in 100 mL of unbuffered BG11 medium in 250 mL
Erlenmeyer flasks with constant shaking at 120 rpm. The cultures were continuously
illuminated using Cool White and Gro-lux fluorescent lamps with an average
22
photosynthetic photon flux density (PPFD) of 25 umol m-2
s-1
. Large scale cultures were
grown in 10 L carboys (Nalgene) using 6 L of BG11 buffered with 25 mM HEPES at pH
8. Typically, these cultures were grown using 5% CO2 (v/v) as the carbon source to
enhance growth rate and biomass. The gas stream was then changed to normal air
(0.035% CO2 [v/v]) 72 h prior to experiments, in order to enhance additional
carboxysome biosynthesis (Mckay et al., 1993).
Cultures of the heterotrophic bacterium Escherichia coli strain BL21 (Novagen)
were maintained in Luria-Bertani (LB) media (1% [w/v] tryptone, 0.5% [w/v] Bacto
yeast extract and 171 mM NaCl) containing 1.4% w/v agar. Transformed E. coli BL21
strains containing pET protein expression vectors were grown in LB media supplemented
with 100 μg mL-1
ampicillin. Liquid cultures were grown in 250 mL of LB media in 1 L
Belco flasks with shaking at 250 rpm at 37°C for 12 h.
Isolation of carboxysomes
Prior to experiments, illuminated Anabaena PCC7120 cells were grown with 5%
CO2 for 8 days followed by 3 additional days with air bubbling. Cells were collected by
centrifugation at 5000 x g for 10 min in 250 mL centrifugation flasks (Nalgene) at room
temperature. Cell pellets were pooled by resuspension in a small amount of retained
media, the bottles were washed with 5 mL of lysozyme buffer (0.6 M sucrose and 20 mM
Tes-NaOH at pH 7.5) and the combined solution was again centrifuged at 5000 x g for 10
min at room temperature. The cell pellet was then resuspended in 40 mL of lysozyme
buffer containing 2 mg mL-1
of egg white lysozyme and incubated at 37°C for 2 h in dark
with occasional mixing. Following lysozyme treatment, the cell suspension was
23
centrifuged at 5000 x g for 10 min at room temperature and the supernatant was
discarded. The precipitated cells were resuspended in 4 mL of lysozyme buffer plus 21
mL of “breaking” buffer (20 mM Tes-NaOH and 5 mM EDTA at pH 7.0) and 1/100 (v/v)
dilution of a protease inhibitor cocktail (Sigma). The resuspended cells were kept on ice
for 5 min. To disrupt the cells, the cell suspension was passed through an ice-cold French
Pressure Cell (Aminco) at 8,000 psi. The lysate was collected, centrifuged at 12,000 x g
for 10 minutes at 4°C and the supernatant containing the soluble fraction was retained.
To obtain a carboxysome-enriched fraction from the soluble portion of the crude
extract, the soluble fraction was diluted with 3 volumes of 1X EM buffer (40 mM EPPS-
NaOH and 27 mM MgSO4 at pH 8.0) containing 20% (v/v) Percoll (Amersham
Biosciences) and 0.133% Triton X-100. The mixture was allowed to incubate on ice for
30 min to allow for magnesium mediated aggregation of the carboxysome with the
Percoll beads. The aggregated carboxysomes were collected by centrifugation at 12,000 x
g for 20 min at 4°C. The carboxysomal aggregates were washed twice with 15 mL 0.75X
EM buffer containing 1% Triton X-100 and centrifuged at 12,000 x g for 20 min at 4°C
to remove thylakoid membranes. A final wash with 15 mL of 0.75X EM buffer was
carried out before resuspending the precipitate in 1 mL of 0.75X EM buffer containing
20% glycerol. The partially purified carboxysomes were separated in 100 μL aliquot in
microfuge tubes. The carboxysome isolation protocol is based on that developed by So et
al and Long et al (Long et al., 2007; So et al., 2002).
24
Polyacrylamide gel electrophoresis
Proteins from lysates or carboxysome preparations were separated according to
size by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) using
Bio-Rad Mini-PROTEAN II electrophoresis system. Polyacrylamide gels (8%, 10% or
12% [w/v]) were prepared as described by Ausuble et al. (1993). Electrophoresis was
performed at 75 V for 10 min followed by 150 V for 60-100 min. Following
electrophoresis, the gels were either stained for protein or the separated protein was
transferred to nitrocellulose membranes for immunoblot analysis. Staining was carried
out by immersing the gels in Coomassie Brilliant Blue R-250 (Bio-Rad), 50% (v/v)
methanol and 10% (v/v) acetic acid for 1-2 hours. Gels were destained using 50% (v/v)
methanol, 10% (v/v) acetic acid for 2-4 hours and dried in cellophane membranes to
preserve them.
Western blot analysis
Polyacrylamide gels from SDS-PAGE were washed with transfer buffer (25 mM
Tris-HCl pH 8.3, 190 mM glycine and 20% [v/v] methanol) to remove residual SDS-
PAGE buffer. Proteins were transferred onto BioTrace NT pure nitrocellulose
membranes (Pall Gelman) using the Mini Trans-Blot system (Bio-Rad) at a constant
voltage of 100 V for 2 h. The apparatus was cooled using ice packs. Following transfer,
nictrocellulose membranes were incubated overnight in a blocking solution of TBST (20
mM Tris-HCl pH 7.6, 140 mM NaCl, 0.1% [v/v] Tween-20 [Bio-Rad]) and 5% (w/v)
skim milk powder (Carnation). The blot was subsequently incubated with primary
antibody in TBST and 2% (w/v) gelatin (Bio-Rad) for 1 h at room temperature followed
25
by three washes of five min each with TBST. The antibody was used at a dilution of
1:2000 – 1:3000 depending on the antibody. For detection of the primary antibody, which
was generated in rabbits, the blot was treated with goat anti-rabbit IgG conjugated to
alkaline phosphatase (AP). The secondary antibody was diluted to 1:2000 in TBST
containing 2% (w/v) gelatin (Bio-Rad) and incubated for 45 min at room temperature.
Following incubation with the secondary antibody, the blots were washed three times
with TBST and treated with AP visualization substrates (40 μL BCIP and 40 μL NBT
[Amersham Biosciences]) diluted in 10 mL of AP buffer. The AP visualization substrate
and blot were allowed to incubate until formation of purple precipitate after which the
blot was washed with distilled water to stop further development. The blot was dried and
digitized using a scanner.
Recombinant protein expression
E. coli cells containing recombinant, full length CcmM were generously provided
by Dr. A K C So.
The activity of CcmM was investigated using recombinant CcmM expression in
E. coli cells (Novagen BL-21). Initially, small scale protein extracts were prepared to
confirm the overexpression of CcmM. A single colony from a plate culture was used to
inoculate 5 mL of LB media containing 100 µg mL-1
ampicillin at 37°C with shaking at
250 rpm for 5 h. After 5 h, the cells were induced with 1 mM isopropyl-β-D-
thiogalactoside (IPTG) at 30°C for 4 hours. After induction, the cells were centrifuged at
8,000 x g for 1 min at 4°C and the pellet collected. The cell pellet was then sonicated 5
times with 5 s pulses at 10 W on ice. The sonicated cells were then centrifuged at 10,000
26
x g and the supernatant was collected. Samples of the supernatant were used to check for
the presence of CcmM using PAGE followed by Western Blotting with CcmM antibody.
Once the presence of CcmM was confirmed, large-scale cultures of transformed
E. coli were grown. Overnight stock cultures of transformed E. coli were grown in 4
tubes containing 20 mL of LB/amp media at 37°C with shaking (250 rpm) for 14 h. The
overnight cultures were used to inoculate 4 flasks containing 250 mL of LB/amp at 37°C
with shaking (150 rpm) for 3-4 h until an OD600 reading of 0.8. Protein overexpression in
the cultures was induced with 1 mM IPTG and the cells were allowed to grow for another
5 hours. After induction, the cultures were centrifuged at 5,000 rpm for 10 min at 4°C to
collect the cells. The ice-cold cell pellets were subjected to 4 rounds of sonication at 5 W
for 10 s each. The lysed cells were centrifuged and the supernatant was collected.
Mass spectrometric carbonic anhydrase assay
CA activity was measured using mass spectrometric assay by following the
irreversible exchange of 18
O from 13
C18
O2 to H2O (Miller et al., 1997). The 18
O enriched
carbonate-bicarbonate mixture was prepared by mixing 20 mg K2
13
C16
O3 (95%) with
0.936 mL of H2
18
O (99%). CA catalyses the following partial reaction:
13
C18
O2 + H2
16
O ↔ H+
+ H13
C16
O18
O18
O ↔ 13
C16
O18
O + H2
18
O
Repeated cycles of hydration/dehydration cause the irreversible loss of 18
O from
13
C18
O2 and result in the formation of 13
C16
O2 with an m/z of 45. Hence, CA activity can
be assayed by measuring the rate of formation of mass 45 13
C16
O2 following the supply of
13
C18
O2 (m/z = 49). The measurement of the formation of m/z 45 13
C16
O2 from m/z 49
27
13
C18
O2 was carried out using an aqueous inlet, magnetic sector mass spectrometer
(model MM 14-80SC; VG Gas Analysis) (So et al., 2002).
CA assay of CcmM
Recombinantly expressed CcmM was assayed for CA activity. E. coli cells
overexpressing CcmM were grown at temperatures ranging from 16°C to 37°C using 1
mM or 2 mM of the inducer IPTG. CcmM overexpressed at different temperature and
inducer concentration was assayed for CA activity. Zinc (0.05 - 2 mM), dithiothreitol
(0.05, 0.1 and 2 mM), bicarbonate (0.6 mM and 20 mM) and RuBisCO (1 mg) was added
to the assay buffer in independent CA assay to see if CcmM was activated by the
aforementioned small molecules. pH of the assay buffer was changed from 7.2 to 8.5 to
see if CcmM was activated by changes in pH. The CcmM lysate was concentrated 10X to
ensure that enough protein was being assayed to exhibit CA activity. Oxygen was
bubbled for 15 min to see if an oxidative environment activates CcmM. A combination of
1 mM zinc, 0.1 mM DTT and 20 mM bicarbonate at pH 8 was assayed to see if a
combination of external factors activates CcmM (Table 3).
Cellulose acetate assay
In 1.0 L of running buffer (15.74 g TRIS, 0.82 g EDTA, 0.24 g NaCl and 4.41 g
boric acid in 1.0 L of ddH2O adjusted to pH 8.9 with NaOH) Titan III Zip Zone cellulose
acetate plates (Helena Laboratories, Mississauga) (76 X 76 mm) were soaked in ice-cold
running buffer for 1 h. Using Whatman No. 1 chromatography paper, the plates were
blotted dry. The cell extract was loaded onto the cellulose acetate plates quickly to
28
prevent the plate from drying using a twelve-lane Super Z applicator at a point
approximately 1 cm away from the plate edge. For positive control bovine CA (Sigma C-
4831) was used. The plate was plated upside down as a bridge between two
chromatography paper wicks (63 mm apart), dipped in isolated chambers containing the
running buffer. Electrophoresis was performed at 200 V (about 5 mA per plate) for 40 to
60 min at 4°C. The electrophoresed plates were soaked in 20 mM Na-barbital buffer (pH
8.30) containing 0.1% (w/v) phenol red dye for 1 min.
The cellulose acetate membrane was then placed face up on an ice-cooled
aluminum plate and blotted to remove excess buffer from the cellulose acetate
membrane.
Using an inverted funnel, CO2 was blown on top of the cellulose acetate membrane.
Positive CA activity appeared as yellow bands on a red background within 1-2 minutes.
To increase the sensitivity, the plate was rinsed in running dH20 for about 15 s followed
by the re-addition of phenol barbital buffer and CO2 treatment. The plate was meant to be
photographed using a digital camera at the point of maximum contrast or when minor
bands appear just before the background becomes yellow. However, the plate could not
be photographed due to inadequate camera setup.
T7 Tag affinity purification and pull down assays
T7 Tag Antibody Agarose purification kit (Novagen 69025-3) was used in the
pull down assays. The column was mounted on an appropriate support and then primed
with several ml of Bind/Wash Buffer to eliminate air from the bottom frit area. The T7-
Tag Antibody Agarose was left to equilibrate to room temperature. The 50% slurry of
29
T7-Tag Antibody Agarose was resuspended and 1 ml was transferred into the
chromatography column. The storage buffer was allowed to flow to waste. The resin was
equilibrated by washing with ten column volumes (10 mL) of 1X T7•Tag Bind/Wash
Buffer at room temperature. The cell extract was brought to room temperature and loaded
onto the column and incubated overnight at 4°C on orbital rotator. The flow through was
collected in a 15 mL tube. The column was then washed with ten column volumes (10
mL) of 1X T7•Tag Bind/Wash Buffer. The flow through was saved for Western Blot
analysis. Then, the Anabaena PCC7120 lysate was added and incubated overnight at 4°C
on orbital rotator. The column was then washed with ten column volumes (10 mL) of 1X
T7•Tag Bind/Wash Buffer. The flow through was saved for activity and Western Blot
analysis. The resin which had been incubated with CcmM and Anabaena PCC7120
extract was resuspended in EPPS-NaOH pH 8 and checked for activity.
Sulfonamide column binding assay
Carboxysome preparations from previous extractions were pooled. A total volume
of 1 mL of carboxysome extract was diluted with 2 mL of binding buffer (100 mM
EPPS/NaOH, pH 8 +100 mM MgSO4). 0.5 mL of p-aminobenzyl sulfonamide (PAMBS)
agarose was prepared for binding by washing 3 times with 1 mL of binding buffer. The
washed PAMBS was placed into a Bio Rad Poly-Prep chromatography column and
washed further with 2 mL of binding buffer until nearly dry. Then 3 mL of the
carboxysome sample was added to the column and incubated for 1 hour on an orbital
rotator. After incubation, the buffer was allowed to flow out of the column and drops 50
to 60 were collected as the first flow through. Then 6 mL of binding buffer was added
30
and drops 50 to 60 collected as second flow through. This was repeated until fourth flow
through was collected. After the fourth flow through 50 μL of PAMBS resin was saved
for analysis. As a positive control to see if PAMBS resin binds to bonafide carboxysomal
CA, CcaA, the above mentioned procedure was carried out and the resin analyzed using
Western blot.
Protein sequencing
The bound protein in the sulfonamide affinity column from Anabaena PCC7120
carboxysomal extract was removed by boiling in 10 mM Tris-HCL buffer for 5 min. The
protein that was bound was extracted and sent to the Advance Protein Technology Center
at Sick Kids Hospital for processing and sequencing using Liquid Chromatography
MS/MS. The protein sequence was analyzed using Scaffold 2 software.
CcmM209 and isolation on His.Bind®
affinity column
A segment of the ccmM gene encoding the N-terminal 209 amino acids was
cloned into the pET28A expression vector using PCR based methods by Dr. Y Tadesse in
our lab. This region encompasses the entire γ-CA domain of CcmM as predicted by the
homology model. The primers used for PCR amplification and the DNA sequence of the
insert used to express CcmM209 are in the Appendix.
E. coli cells expressing recombinant 6XHis tagged CcmM209 were grown using
the same protocol as described previously. Cells from 750 mL of culture were centrifuged
and the pellet lysed using BugBuster®
reagent from Novagen following the protocol
provided with the kit. Cellular debris was removed by centrifuging at 16,000 x g for 20
31
min and the supernatant was collected. Binding of 6XHis-tagged protein was
accomplished by passing the supernatant through a His-bind immobilized metal affinity
chromatography (IMAC) column at a rate of 6 times the bed volume (1.5 mL) per hour.
400 uL of washed resin was retained for further analysis. Bound protein was eluted from
the remaining resin using the previously described method.
32
Results
In Silico Search for a Carboxysomal Carbonic Anhydrase.
To identify potential carboxysomal carbonic anhydrase candidates within the
proteome of the cyanobacterium Anabaena PCC7120, in silico searches of the NCBI
database (http://www.ncbi.nlm.nih.gov/) and Cyanobase
(http://genome.kazusa.or.jp/cyanobase/) were performed using reference sequences
(NC_003272). The initial search query was a 274 amino acid protein (AAC46375,
slr1347) from the cyanobacterium Synechocystis PCC6803 which is a bonafide
carboxysomal carbonic anhydrase belonging to the β-class of CAs (Cot et al., 2008; So et
al., 2002; So and Espie, 1998; So and Espie, 2005). The BLASTP search identified a
single, high-probability ortholog within the Anabaena PCC7120 proteome having NCBI
accession number NP_486950 and cyanobase ID all2910. The ortholog had 29.9%
sequence identity to the bonafide β-CA, slr1347, of Synechocystis PCC6803. Reciprocal
BLASTP analysis also revealed that all2910 had greater than 40% amino acid sequence
similarity to an additional β-CA within the Synechocystis proteome, namely slr0051, a
putative periplasmic protein. Proteins from Synechocystis, the carboxysomal carbonic
anhydrase (CcaA) slr1347 and the periplasmic CA slr0051, along with all2910 contained
the β-CA superfamily conserved domain motif (Fig. 4). This included amino acid
residues involved in the formation of the zinc binding sites, dimer interfaces and active
site clefts, all of which were highly conserved in both all2910 and CcaA. However, the β-
CA superfamily conserved domain in all2910 and slr0051 was positioned towards the C-
terminal end of the protein whereas the conserved domain in CcaA was positioned
towards the N-terminal region of the protein (Fig. 4). Analysis of the amino acid
32
33
sequence of all2910 using the web-based algorithm SignalP 3.0
(http://www.cbs.dtu.dk/services/SignalP/) within a gram negative bacterial model
revealed that the first 30 N-terminal amino acids of all2910 is a signal peptide with a
probability prediction of 1.0. A signal peptidase cleavage site was also predicted between
amino acid positions 30 and 31. Analysis of the same amino acid sequence using the TatP
1.0 algorithm (http://www.cbs.dtu.dk/services/TatP/) revealed that the amino acid
sequence of all2910 contained a specific twin arginine signal sequence with an RRNFL
motif starting at amino acid position 10. Unlike all2910, analysis of CcaA using the
SignalP 3.0 server and TatP 1.0 server showed that it did not possess a generic signal
peptide or a specific twin arginine signal sequence.
A second bonafide carboxysomal carbonic anhydrase is encoded by csoS3 (So et
al 2004, So & Espie, 2005) found within the cso operon of α-cyanobacteria and a variety
of chemolithoautotrophic bacteria (Badger et al., 2002). Consequently, BLASTP
searches of the Anabaena PCC7120 proteome using the ε-type CA, CsoS3, from
Halothiobacillus neapolitanus (EEG96215) and Prochlorococcus marinus MED4
(NP_892671) as the query sequences. The searches did not identify proteins with
significant amino acid sequence similarity. This was not unexpected as Anabaena
PCC7120 does not contain α-carboxysomes or the cso operon. However, the search was
necessary to exclude the possibility that Anabaena may have acquired csoS3 independent
of the cso operon.
T. weissflogii CA1 (TWCA1) is a rare δ-CA which has been discovered in a few
species of marine diatoms (Roberts et al., 1997). The Anabaena PCC7120 proteome was
34
searched using TWCA1 (AAX08632) protein sequence as a template. The search did not
identify any proteins with significant similarity.
Although α-class CA has not been identified as a carboxysome constituent, α-CA
homologs have been identified in bacteria. The most relevant example is EcaA from
Anabaena PCC7120 and Synechococcus PCC7942 (So et al., 1998; Soltes-Rak et al.,
1997). EcaA was identified as a CA homolog prior to the complete sequencing of the
Anabaena PCC7120 genome and, consequently, other unidentified homologs may exist.
The Anabaena PCC7120 genome database was searched using human CA1
(AAH27890), an α-CA, as the query sequence. The BLASTP search yielded a single
polypeptide with 35% amino acid sequence similarity to the query sequence. This protein
is identical to the previously identified EcaA and is identified in Cyanobase as all2929.
Comparison of the amino acid residues within the α-CA conserved domain of all2929
with human CA1 (Fig 4) revealed a high degree of conservation within the active site
residues as well as the zinc – binding residues. Previous study on EcaA by Soltes-Rak et
al. (1997) indicated that EcaA cross-reacts with antibody directed against chicken α-CAII
(Soltes-Rak et al., 1997). Immunogold labeling of thin sections of Anabaena PCC7120
and Synechococcus PCC7942 using chicken α-CAII (Soltes-Rak et al., 1997) antibody
showed that EcaA appeared to reside on the surface of the cells, possibly in the
periplasmic space (So et al., 1998; Soltes-Rak et al., 1997). Analysis of the EcaA amino
acid sequence using the SignalP 3.0 and TatP 1.0 servers revealed that the first 33 N-
terminal amino acids comprised a signal peptide with an estimated probability of 1.0.
These results are consistent with the results of Soltes-Rak et al (1997). Again, a twin
arginine motif, RRQLL, was identified as the specific signal sequence beginning at
35
residue 6. A signal peptidase cleavage site between amino acid residues 33 and 34 was
also predicted. Thus, it appears that the putative periplasmic carbonic anhydrases of
Anabaena PCC7120 are exported via the Tat, rather than the Sec, export machinery.
Overexpressed EcaA and EcaB from Synechococcus PCC7942 and Synechocystis
PCC6803 showed that these putative carbonic anhydrases were inactive (So et al., 1998).
Finally, Anabaena PCC7120 proteome was searched to find proteins similar to the
γ-type CA, using Cam from Methanosarcina thermophila (Protein Data Bank ID
1THJ_A) as the query. The BLASTP search identified the N-terminal 192 amino acids of
the 555 amino acid polypeptide CcmM as possessing a left-hand parallel beta helix
gamma CA conserved domain (Fig. 5). This region of CcmM had 40 % sequence identity
and 65 % sequence similarity with an e value of 8e-39
. Cam is a homotrimeric protein
that coordinates three separate zinc atoms at dimer interfaces (Kisker et al., 1996).
Dissecting CcmM further revealed that amino acid residues responsible for the formation
of the homotrimer interfaces and active site zinc binding residues, as denoted by the
small pyramids in Figure 5, are conserved. This suggests that CcmM may have CA
catalytic activity. However, there are distinct differences between Cam and CcmM. First,
CcmM is a much larger protein having a C-terminal domain of about 355 amino acids
that is absent in Cam. Within the C-terminal domain, CcmM has three RbcS-like motifs
that are unlikely to play a role in CA activity. Previous work has suggested that the RbcS-
like motifs may be involved in binding RuBisCO within the mature carboxysome
structure. Interestingly, the amino acid sequence of Cam, deduced from the gene,
indicates that Cam has a signal peptide typical of an extracellular enzyme, and in fact
Cam appears to be a periplasmic enzyme in Methanosarcina (Kisker et al., 1996).
36
Analysis of the deduced amino acid sequence of CcmM using the SignalP 3.0 and TatP
1.0 servers shows that it is unlikely to possess a signal peptide with a calculated
probability of 0.019. CcmM is therefore unlikely to be exported to the periplasm.
Deduced amino acid sequences of Anabaena PCC7120 CcmM (all0865),
Synechococcus elongatus PCC6301 CcmM (syc0133_c), Synechococcus elongatus
PCC7942 CcmM, Thermosynechococcus eolongatus BP-1 CcmM (tll0944) and
Metahnosarcina thermophila Cam (1THJ_A) were aligned using ClustalX2
(http://www.clustal.org/ ) (Figure 6). The N-terminal 210 residues of CcmM proteins
were selected and aligned with the Cam. Only the N-terminal residues were chosen so as
to truncate the Rbcs like repeats on the C-terminal end which is not present in Cam. The
sequence alignment showed that the histidine residues involved in Zn2+
binding (His81,
His117 and His122 in Cam) were conserved in all the aligned amino acid sequences
among all the species (Kisker et al., 1996). The amino acid residues involved in trimer
formation in Cam are Arg59, Asp61 and Asp76 (Kisker et al., 1996). The amino acids,
Arg59 and Asp61, of Cam are conserved in all of the aligned amino acid sequences of
CcmM. Asp76 of Cam are conserved in Anabaena and Thermosynechococcus CcmM but
not by the CcmM of Synechococcus CcmM. In Syncechococcus CcmM the aspartic acid
is substituted by a glutamic acid residue. This substitution is unlikely to have major
functional impact as both aspartic acid and glutamic acid are negatively charged acidic
amino acids. The glutamine residue, Gln75, is thought to form hydrogen bonds to H2O in
the active site. It is observed that Gln75 is conserved among all the aligned CcmM and
Cam (Kisker et al., 1996). Finally, cysteine residues (Cys194 and Cys200) in Anabaena
PCC7120 CcmM and Thermosynechococcus elongatus BP-1 CcmM are conserved but
37
are not conserved in the CcmM of Synechococcus elongatus PCC6301 or Synechococcus
elongatus PCC7942.
The 3-dimensional crystal structure of Cam has been determined by X-ray
diffraction and the spatial coordinates for the amino acids can be accessed in Protein Data
Bank (http://www.pdb.org/) using accession number 1THJ (Figure 7a, b Cam). One of
the unique features of the structure of Cam is that it contains seven complete turns of a
left-handed parallel β-helix. The β-helix is topped by a short α-helix and a second α-helix
is formed by the C-terminal portion of the protein. A short segment of the polypeptide
chain at the N-terminus is in extended conformation. Cross-section of the β-helix of Cam
resembles an equilateral triangle as each section of the β-helix contains three parallel
sheets of almost equal length (Kisker et al., 1996). Each of the β-helix contains two β-
turns.
The X-ray crystal structure of CcmM from Anabaena PCC7120 or any other
organism has not yet been determined. In order to compare the structural similarity of
CcmM to Cam, which have high amino acid sequence similarity, a 3D structure of CcmM
is required. Advances in protein structure determination and computational analysis has
led to the development of a suite of programs capable of predicting 3 dimensional protein
structure based on primary amino acid sequence. Chief among these are programs that
utilize template-based homology modeling, fold-recognition and profile–profile matching
algorithms (Kelley and Sternberg, 2009). A 3-D structural model was created for CcmM
using the Phyre server (http://www.sbg.bio.ic.ac.uk/phyre/). The entire 555 amino acids
of Anabaena PCC7120 CcmM could not be resolved by a single model. The best match
model was created using 192 amino acids of CcmM starting from the 4th
N-terminal
38
residue having an e-value of 4.57e-14
(Figure 7a,b CcmM). The predicted 3D structure of
CcmM is strikingly similar to that of Cam. The left handed parallel β-helix observed in
Cam is also observed in CccM. The cross section along the β-helix of the CcmM
resembles an equilateral triangle just like that of Cam. The number, location and
orientation of α-helices are the same in both Cam and CcmM. However, CcmM appears
to have seven complete turns of the left handed parallel β-helix along with an additional
β-strand in the region where Cam has the extended conformation. CcmM appears to have
one β-turn compared to two β-turns in Cam.
Visual inspection revealed that Cam and CcmM have similar structural properties.
To assess the goodness of fit between the known crystal structure of Cam and the model
for CcmM a superimposition program (Superpose,
http://wishart.biology.ualberta.ca/SuperPose/ and SwissPdb, http://spdbv.vital-it.ch/ )
were used to compare mainchain spatial positioning (Figure 7a,b Cam + CcmM
superimposed). The RMS (root mean square deviation) of the crystal structure after
iterative magic fit function in SwissPdb Viewer using 185 α-carbons was 0.41 Å. The
RMS calculated using all 740 atoms of the backbone was 0.52Å. Overall and side chain
RMS values could not be obtained as Cam and CcmM do not contain the same number of
atoms in the side chains. The Cam structure was determined originally at a resolution of
2.8 Å, therefore, RMS values of 0.41 Å and 0.52Å for α-carbons and backbone atoms
respectively shows that the two structures match within the resolution of the available
data. The active site Zn2+
-binding residues (His81, His117 and His122 in Cam) are in the
same position and orientation in CcmM (His75, His102 and His107). The amino acid
residues involved in trimer formation (Arg59, Asp61 and Asp76 in Cam) also appear to
39
be in the same position and orientation in CcmM (Arg57, Asp55 and Asp70 in CcmM).
Finally, spatial orientation of the residue responsible for binding to H2O (Gln75 in Cam)
is similar in Cam and CcmM (Gln69 in CcmM).
The in silico search for a carboxysomal carbonic anhydrase within the proteome
of Anabaena PCC7120 revealed that CcmM has sequence similarity within the N-
terminal region to γ-CA, Cam, of Methanosarcina thermophila. Structural modeling and
analysis revealed that CcmM has high structural similarity to Cam and is therefore a
likely candidate CA in Anabaena PCC7120 which lacks a previously characterized
carboxysomal CA. The other candidate CAs appears to be periplasmic carbonic
anhydrases based on the presence of export leader sequences and other data.
40
Anabaena PCC7120 - all2910
Synechocystis PCC6803 - CcaA
Anabaena PCC7120 EcaA - all2929
NP_486969
Human CA1 – AAH27890
Figure 4: Representation of the location of conserved β-CA and α-CA domains in selected proteins from Anabaena PCC7120,
Synechocystis PCC6803 and humans obtained from the Conserved Domains Database from NCBI server.
41
Anabaena PCC7120 CcmM – all0865
Methanosarcina thermophila Cam - 1THJ_A
Figure 5: Representation of the location of conserved γ-CA domains in selected proteins from Anabaena PCC7120 and
Methanosarcina thermophila obtained from the Conserved Domains Database from NCBI server (figures not to scale).
42
Figure 6: Amino acid sequence alignment output from ClustalX2 for Cam of Mehanosarcina thermophila and CcmM of selected
species of cyanobacteria. Abbreviation used for species are: Ana for Anabaena PCC7120, Thermo for Thermosynechococcus
elongatus BP-1, Methano for Mehanosarcina thermophila, Syn6301 for Synechococcus elongatus PCC6301 and Syn7942 for
Synechococcus elongatus PCC7942. Amino acid residues in red boxes denote the Zn2+
binding residues, those in blue boxes are amino
acid residues involved in the formation of the trimer and the amino acid residues in the yellow box are those that are involved in
hydrogen bonding to water. The green ovals show conserved cysteine residues among Anabaena PCC7120 CcmM and
Thermosynechococcus elongatus BP-1 CcmM.
43
a)
44
Figure 7: 3D structure of Cam of Methanosarcina thermophila and CcmM of Anabaena
PCC7120. Crystal structure of Cam obtained from x-ray crystallography of γ-CA, Chain
A of Cam, is on the top left (PDB ID: 1THJ). On the top right is the computer modeled
3D structure of CcmM of Anabaena PCC7120 using Phyre server. The bottom 3D
structure denotes the superposition of Cam and CcmM using Superpose followed by an
iterative fit in Swiss-Pdb Viewer. Blue represents regions of β-sheets, red represents
regions of α-helices and green represents regions of extended conformation. All images
are from the same vantage point. a) is the side view and b) is a cross section through the
β-barrel.
b)
45
Mass spectrometric CA assays
In order to establish a positive control, a bonafide carboxysomal CA, CcaA, from
Synechocystis PCC6803, was recombinantly expressed in E. coli and assayed for activity
(So and Espie, 2005). CA activity was determined mass spectrometrically as acceleration
in the rate of loss of 18
O from 13
C18
O labeled CO2 (Figure 8a). At time 0, 2.5 μL of 13
C
18
O labeled bicarbonate was added to the reaction vial as a substrate for CA. Addition of
13
C 18
O labeled bicarbonate to the cuvette initially resulted in the appearance of three
separate CO2 species having a mass/charge (m/z) ratio of 49, 47 and 45, corresponding to
13
C18
O18
O (green line), 13
C18
O16
O (red line) and 13
C16
O16
O (blue line). The mass
spectrometer does not detect charged carbonate species as these do not cross the
membrane of the inlet system. With time, a steady state reaction rate is achieved which
corresponds to the uncatalyzed rate of 18
O loss from CO2. This is reflected in a decrease
in m/z =49 and an increase in m/z = 45. Addition of 200 μL of E. coli lysate in which
CcaA overexpression had been induced resulted in a rapid loss of the substrate, m/z =49 ,
and intermediate, m/z =47, along with a rapid increase in the rate of production of the
product, m/z =45. The rapid increase in the formation of the product followed by the
rapid decrease in the substrate indicates that CA activity was present in the E. coli lysate.
The level of m/z=45 reached a plateau as all of the substrate was converted, indicated by
near zero levels of m/z=49 and m/z=47.
In order to calculate the fold increase in activity from the catalyzed reaction
compared to the uncatalyzed reaction, the atom fraction of 18
O in m/z 49 compared to the
total 18
O was calculated and plotted on a logarithmic scale (Figure 8b). Trend lines were
plotted for the uncatalyzed section of the plot and the catalyzed section of the plot. The
46
equation of the trend line for the uncatalyzed reaction was y = -0.0346x + 1.7404 with a
R2
for the equation to be 0.9948. The equation of the trend line for the catalyzed reaction
was y = -1.7482x + 8.8172 with an R2
value of 0.9329. The R2
value being close to 1
indicates that the linear trend line is close to the actual data. Comparing the slopes of the
uncatalyzed and catalyzed trendline shows a 50 fold increase in the rate of 18
O loss from
m/z 49.
As a negative control, equal amounts of E. coli lysate harboring the pET15b
plasmid alone were added to the reaction vial after the substrate reached steady state
reaction. Addition of the negative control lysate did not increase the rate of production of
m/z =45 indicating that CA activity was absent.
47
Time (min)
13CO2species(uM)
0.0
2.0
4.0
6.0
8.0
10.0
12.0
14.0
0.0 2.0 4.0 6.0 8.0 10.0
CO2-45, uM CO2-47, uM CO2-49, uM
200 uL of E. coli lysate with
overexpressed CcaA added
(a)
Time (min)
Log18O2AtomFraction
(b)
Uncatalyzed
y = -0.0346x + 1.7404
R
2
= 0.9948
With lysate
y = -1.7482x + 8.8172
R
2
= 0.9329
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
0.0 2.0 4.0 6.0 8.0
18O Atom Fraction Uncatalyzed trend Trend with lysate
Linear (Uncatalyzed trend) Linear (Trend with lysate)
50X
Figure 8: (a) Mass spectrometric assay for CA activity of E. coli lysate in which CcaA
expression was induced. (b) 18
O atom fraction in m/z 49 with respect to time. The
trendlines represent the uncatalyzed reaction and reaction upon addition of lysate.
48
CA Activity in Synechococcus PCC7942 carboxysomes
Mass spectrometric assays of concentrated intact cells of Anabaena PCC 7120
and of concentrated whole cell lysates of Anabaena PCC7120 failed to show any
detectable CA activity (data not shown). This is consistent with past work that has shown
that CA levels in many cyanobacteria are quite low (Ingle and Colman, 1975; So and
Espie, 2005). What is clear is that the levels of both periplasmic CA and cytosolic CA are
below the level of detection even in concentrated extracts and are therefore unlikely to be
confounding factors in subsequent analysis. Similarly, if present, the level of
carboxysomal CA in Anabaena carboxysomes is also likely to be low and difficult to
detect.
To determine the level of CA activity anticipated from carboxysomes containing a
bonafide CA, CcaA, a carboxysome enriched fraction from Synechococcus PCC7942
lysate was obtained using standard procedures. Synechococcus PCC7942 is known to
contain CcaA and was used to provide a positive control for carboxysome enriched
fraction exhibiting CA activity. In these mass spectrometric assays the steady state
concentration of CO2 (m/z = 49, 47, 45) was reversed to that observed in the positive
control using overexpressed CcaA (Fig. 9a). This was due to the slow deterioration over
time in the content of 18
O in the 13
C labeled CO2. Trace amounts of atmospheric
moisture containing H2
16
O were inevitably introduced into the vial containing substrate
13
C 18
O. As a result there is less and less of m/z = 49 and increasing amounts of m/z = 47
and m/z = 45.
Addition of 200 μL of Synechococcus PCC7942 carboxysome enriched fraction to
the reaction cuvette resulted in an increase in the rate of formation of m/z = 45 with the
49
subsequent drop in m/z = 49 and m/z = 47 (Figure 9a). However, the rate of increase of
m/z = 45 was not as rapid as in the case of overexpressed CcaA and did not plateau after
3 min. The rate of 18
O loss from m/z 49 was plotted on a logarithmic scale (Figure 9b).
The equation of the uncatalyzed steady state reaction was y = - 0.0788x + 1.2838 with an
R2
value of 0.9679 and with the carboxysome extract was y = - 0.1968x + 1.6522 with an
R2
value of 0.9954. Again, the high R2
value of the trend lines indicates that data and the
trend lines have high goodness of fit. The rate with the carboxysome extract was 2.5 fold
faster than the uncatalyzed rate in the loss of 18
O from m/z 49 thus indicating 2.5 fold
increase in the rate of reaction.
The catalysis rate was much lower than of E. coli lysate with overexpressed
CcaA. This was not unexpected as it is known that CcaA levels in cyanobacteria are
much lower than the amount of CcaA produced by an overexpression vector in E. coli.
Moreover, the substrate concentration m/z = 49 in which CA can act on was lower than
the CA assay of overexpressed CcaA.
50
Time (min)
13CO2species(uM)
(a)
0.0
2.0
4.0
6.0
8.0
10.0
12.0
0.0 2.0 4.0 6.0 8.0
CO2-45, uM CO2-47, uM CO2-49, uM
200 uL of Synechococcus PCC7942
carboxysome extract added
Time (min)
Log18O2AtomFraction
(b)
Uncatalyzed
y = -0.0788x + 1.2838
R
2
= 0.9679
With Cbx extract
y = -0.1968x + 1.6522
R
2
= 0.9954
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
0.0 2.0 4.0 6.0 8.0
18O Atom Fraction Uncatalyzed trend
Trend with Cbx extract Linear (Uncatalyzed trend)
Linear (Trend with Cbx extract)
2.5X
Figure 9: (a) Mass spectrometric assay for CA activity of Synechococcus PCC7942
carboxysome extract (b) Log 18
O atom fraction in m/z 49 with respect to time. The
trendlines represent the uncatalyzed reaction and reaction upon addition of carboxysome
extract.
51
CA Activity in Anabaena PCC7120 carboxysomes
Addition of 200 μL of a carboxysome enriched fraction from Anabaena PCC7120
resulted in an increase in the rate of formation of m/z = 45 indicating the presence of CA
activity (Figure 10a). On a protein basis, this increase was similar to that observed for the
Synechococcus PCC 7942 carboxysome enriched fraction. Growth, enrichment and
activity assays were repeated independently seven different times confirming the
presence of CA activity in the carboxysomal extract of Anabaena PCC7120. Addition of
20 μM of ethoxyzolamide, a classical inhibitor of CA, to the carboxysome enriched
fractions and assay buffer eliminated CA activity; so did boiling (indicated by dotted line
in Figure 10a). The rate of 18
O loss from m/z 49 was plotted on a logarithmic scale
(Figure 10b). The equation of the trend line for the uncatalyzed steady state reaction was
y = - 0.0636x + 1.6571 with an R2
value of 0.9978 and with the carboxysome extract was
y = - 0.1645x + 1.6522 with an R2
value of 0.9954. The R2
value of the trend lines being
close to 1 indicates that data and the trend lines have high goodness of fit. The rate with
carboxysome extract was 2.6 fold faster than the uncatalyzed rate in the loss of 18
O from
m/z 49 thus indicating 2.6 fold increase in the rate of reaction.
These results demonstrate a low, but consistently present level of CA activity
concentrated with the carboxysome enriched fraction of Anabaena PCC7120. The
combined results also show that periplasmic CA or cytosollic CA activity contributed
minimally to the activity detected in the carboxysome fraction. Moreover, this activity
was eliminated by the addition of a classical CA inhibitor, ethoxyzolamide. Therefore,
either there is a novel CA in the carboxysomes or one of the previously identified
carboxysomal shell proteins may contribute CA activity. Bioinformatics analysis and in
52
silico search for a CA in Anabaena PCC7120 points towards CcmM as a likely candidate
CA.
53
Time (min)
13CO2species(uM)
(a)
0.0
2.0
4.0
6.0
8.0
10.0
12.0
0.0 2.0 4.0 6.0 8.0 10.0
CO2-45, uM CO2-47, uM CO2-49, uM
200 uL of Anabaena PCC7120
carboxysome extract added
Time (min)
Log18O2AtomFraction
(b)
Uncatalyzed
y = -0.0636x + 1.2887
R
2
= 0.9922
With Cbx extract
y = -0.1645x + 1.6571
R
2
= 0.9978
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
0.0 2.0 4.0 6.0 8.0 10.0 12.0
18O Atom Fraction Uncatalyzed trend
Trend with cbx extract Linear (Uncatalyzed trend)
Linear (Trend with cbx extract)
2.6X
Figure 10: (a) Mass spectrometric CA assay of carboxysome enriched fraction from
Anabaena PCC7120. The dotted line represents the assay with 20 μM of CA inhibitor,
etoxyzolamide. (b) Log 18
O atom fraction in m/z 49 with respect to time. The trendlines
represent the uncatalyzed reaction and reaction upon addition of carboxysome extract.
54
Activity of full length CcmM
CcmM protein was recombinantly expressed in E. coli using the overexpression
vector pET21. Western blot analysis was performed using a polyclonal antibody made
against Synechococcus PCC7942 CcmM to detect the presence of CcmM in Anabaena
PCC7120. Antibody staining and development showed bands at about 50 and 58 KDa in
lanes A and C containing 10X and 100X dilution of E. coli lysate in which CcmM was
induced respectively (Figure 11). The 50 KDa band was very faint in the 100X dilution
and was not detected by the scanner. The estimated size of CcmM in Anabaena PCC7120
is 58 KDa, which is same as the detected band in the western blot. Western blot analysis
of Anabaena carboxysome extract in lane D also produced bands at about 50 and 58 KDa
with CcmM antibody indicative of the presence of CcmM in Anabaena PCC7120. Lane
C which contained the negative control sample, E. coli lysate with induced empty vector,
did not cross-react with CcmM antibody.
Lysates of E. coli in which CcmM has been induced were assayed for activity.
Addition of the lysate did not increase the steady state reaction as indicated by no
increase in the rate of formation of m/z = 45 (Figure 12a). The rate of 18
O loss from m/z
49 was plotted on a logarithmic scale (Figure 12b). The equation of the trend line for the
uncatalyzed steady state reaction was y = - 0.0915x + 1.433 with an R2
value of 0.9982
and with lysate was y = - 0.1004x + 1.4792 with an R2
value of 0.9986. The R2
value of
the trend lines being close to 1 indicates that data and the trend lines have high goodness
of fit. Comparing the slopes of the catalyzed and uncatalyzed reaction trend lines reveals
that addition of the E. coli lysate with CcmM overexpressed resulted in minimal
enhancement (1.1 fold increase) of the reaction. Growth, induction and CA activity assay
55
was repeated independently five times with minimal enhancement of CA activity. The
total protein assayed to detect CcmM activity was comparable to the total protein used in
CcaA assay. The CA activity enhancement using recombinantly expressed CcmM was
minimal, even when compared to CA activity from carboxysomal extracts of Anabaena
PCC7120 or Synechococcus PCC7942. E. coli lysate proteins without CcmM
overexpressed also resulted in the minimal increase (1.1 fold increase) in 18
O loss from
m/z 49 (Figure 13ab). Therefore, it may be concluded that E. coli lysate with
overexpressed CcmM does not show enhanced CA activity as it has no increase in CA
activity as compared to the control E. coli lysate with empty pET21 vector.
56
M = Size marker
A = 10X dilution
B = -ve control
C = 100X dilution
D = Anabaena PCC7120 lysate
75
50
100
37
25
20
M A B C D
Figure 11: Western Blot using CcmM antibody for E. coli lysate overexpressing CcmM
and Anabaena PCC7120 lysate. Lane M is the marker lane, lane A is the 10X dilution of
E. coli lysate overexpressing CcmM, lane B is E. coli lysate containing no CcmM vector,
lane C is 100X dilution of E. coli lysate overexpressing CcmM and lane D is Anabaena
PCC7120 lysate. It is seen that CcmM corresponding to 58 kDa is expressed in the E. coli
as well as in the Anabaena PCC7120 lysate.
57
Time (min)
13CO2species(uM)
(a)
0.0
2.0
4.0
6.0
8.0
10.0
0.0 2.0 4.0 6.0 8.0 10.0
CO2-45, uM CO2-47, uM CO2-49, uM
200 uL of E. coli lysate with
overexpressed CcmM added
Time (min)
Log18O2AtomFraction
(b)
Uncatalyzed
y = -0.0915x + 1.433
R
2
= 0.9982
With lysate
y = -0.1004x + 1.4792
R
2
= 0.9986
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
0.0 2.0 4.0 6.0 8.0 10.0 12.0
18O Atom Fraction Uncatalyzed trend Trend with lysate
Linear (Uncatalyzed trend) Linear (Trend with lysate)
1.1X
Figure 12: (a) Mass spectrometric assay for CA activity of E. coli lysate in which CcmM
expression was induced. (b) Log 18
O atom fraction in m/z 49 with respect to time. The
trend lines represent the uncatalyzed reaction and reaction upon addition of E. coli lysate
in which CcmM expression was induced.
58
Time (min)
13CO2species(uM)
(a)
0.0
2.0
4.0
6.0
8.0
10.0
0.0 2.0 4.0 6.0 8.0 10.0
CO2-45, uM CO2-47, uM CO2-49, uM
200 uL of E. coli lysate
with empty pET21 vector
Time (min)
Log18O2AtomFraction
(b)
Uncatalyzed
y = -0.0829x + 1.4884
R
2
= 0.9954
With lysate
y = -0.0907x + 1.5243
R
2
= 0.9982
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
0.0 2.0 4.0 6.0 8.0 10.0
18O Atom Fraction Uncatalyzed trend Trend with lysate
Linear (Uncatalyzed trend) Linear (Trend with lysate)
1.1X
Figure 13: (a) Mass spectrometric assay for CA activity of E. coli lysate containing
empty pET21 vector. (b) Log 18
O atom fraction in m/z 49 with respect to time. The trend
lines represent the uncatalyzed reaction and reaction upon addition of E. coli lysate
containing empty pET21 vector.
59
Attempts to activate CcmM
Various factors were altered in attempts to activate CcmM as summarized in
Table 3. Induction temperature was varied from 16°C to 37°C and the inducer, IPTG,
concentration was alternated between 1 or 2 mM to determine if induction temperature
and IPTG affected CcmM expression and activity. Equal quantity of total protein from E.
coli induced at different temperatures and inducer concentrations were loaded into wells
and electrophoresed. Western blot was carried out using CcmM antibody. The Western
blot showed that at 37°C using 2 mM IPTG was the optimal temperature for the
overexpression of CcmM (Figure 14). CcmM expression levels did not have any effect on
CcmM having CA activity. Zinc is a candidate cofactor for activation of CcmM. E. coli
lysate with overexpressed CcmM was incubated for 1 h with 0.05, 0.5, 1 and 2 mM Zn2+
and assayed for CA activity. Addition of Zn2+
at any of the tested concentration did not
appear to have made CcmM active. Redox active reagent dithiothreitol (DTT) at
concentrations of 0.05, 0.10 and 2 mM was added to the overexpressed CcmM extract
and in the assay buffer to see if DTT can activate CcmM. DTT did not appear to have
activated CcmM. Attempts were made to activate CcmM with excess bicarbonate.
Addition of 20 mM bicarbonate did not activate CcmM. As CcmM has RbcS like C-
terminal repeats, therefore RuBisCO may interact with CcmM C- terminal end and
activate it in the process. 1 mg of purified RuBisCO from spinach (Sigma R8000-1UN)
was added to 200 uL of E. coli lysate in which CcmM had been induced and incubated
for 15 min. Addition of RuBisCO did not make CcmM CA active. Assaying at a pH
range of 7.2 to 8.5 had no effect in activating CcmM. The CcmM lysate from E. coli was
concentrated to 10X but that did not result in the activation of the CcmM overexpressed
60
protein. Bubbling of oxygen for 15 min did not activate CcmM. A combination of 1 mM
Zinc, 0.1 mM DTT and 20 mM bicarbonate at pH 8 did not activate the CcmM.
61
1 2 3 4 5 6 7 8
250
150
100
75
50
37
25
20
Figure 14: Western blot using CcmM antibody for varying temperature and IPTG
treatments during CcmM overexpression. Lane 1 contains –ve control containing E. coli
lysate with empty expression vector, lane 2 contains standard protein marker with the size
indicated on the left side of the blot, lane 3 contains overexpressed CcmM at 16°C
induced with 2 mM IPTG, lane 4 contains overexpressed CcmM at 37°C with 2 mM
IPTG for induction. Lanes 5, 6, 7, 8 and 9 contains overexpressed CcmM at 25, 30, 37
and 16°C respectively induced with 1 mM IPTG.
62
Table 3: Factors changed in attempts to activate CcmM. CcmM was overexpressed in E.
coli and cell lysate was assayed for CA activity after manipulation of the factors listed
below.
Factors tested Probable effect on CcmM CA Activity
Induction temperature (16 -37°C) Protein Folding No
Zinc (0.05 – 2.0 mM) Part of active site No
Dithiothreitol (DTT) (0.05, 0.1 & 2
mM)
Thioredoxin association with
CcmM
No
Bicarbonate (HCO3
-
) (20 mM) Non-active site regulator No
RuBisCO (1 mg in 200 μL of E. coli
lysate in which CcmM has been
induced)
C-terminal repeats of RbcS-like
sequence. Possible binding to
RbcL or RbcS.
No
pH (7.2 – 8.5) Enzyme requires optimum pH No
10X concentration Ensure enough protein No
Bubbling oxygen for 15 min Oxidative activation No
1 mM Zinc, 0.1 mM DTT and 20
mM bicarbonate at pH 8
Combinatorial activation No
63
Cellulose acetate assay
Cellulose acetate assays to separate CA active protein from carboxysome were
performed. 0.01 mg, 0.001 mg, 0.0001 mg and 0.00001 mg of bovine CA was loaded as
positive control and 0.01 mg of boiled bovine CA was loaded as negative control.
Positive control using bovine CA produced transient yellow bands in a red background
after passing CO2 over the cellulose acetate plates which disappeared immediately after
removing the CO2 stream. This assay could detect a faint band up to the concentration of
0.0001 mg. Considering bovine CA has a molecular mass of 30 KDa, the limit of
detection was determined to be 3.3 x 10-12
moles. As bovine CA has one active site per
molecule this corresponds to 2 x 1012
active sites. Loading and electrophoresing 0.1 mg
of carboxysomes extract yielded no distinct band. Very faint yellowing was observed
around the carboxysome loading spot (picture could not be taken due to the lack of
appropriate camera).
T7 tag - affinity pull down experiments
Attempts were made to pull down and concentrate the CA active agent in the
Anabaena PCC7120 lysate using T7 tagged CcmM bound to anti-T7 tag resin. Western
blot analysis using CcmM antibody showed that CcmM did bind to the anti-T7 resin as
indicated by strong CcmM cross-reactivity in lane 3 of Figure 15. The T7-tagged resin
with CcmM bound to it did not have CA activity. A 2 L culture of Anabaena PCC7120
was centrifuged and the precipitated cells were lysed in a French press to a volume of 10
mL. The Anabaena PCC7120 lysate was incubated with CcmM bound to the anti-T7
resin. After incubation the lysate was allowed to flow out of the column and the column
64
washed. Incubation of the Anabaena PCC7120 lysate with CcmM bound to anti-T7 resin
was done with the aim to pull out proteins in the Anabaena PCC7120 lysate which has
affinity to CcmM thus activating it or concentrating the active CA. The resin was then
assayed for CA activity. However, no activity could be detected in the resin.
65
75
50
100
150
250
1 2 3 4 5 6
Figure 15: Western blot using CcmM antibody to show binding in anti-T7 agarose. Lane
1 contains the standard protein marker, lane 2 contains Anabaena PCC7120 lysate, lane 3
contains anti-T7 agarose which has been incubated with E. coli lysate expressing CcmM
and washed to remove non specific proteins, lane 4 contains anti-T7 agarose negative
control, lane 5 contains E. coli lysate expressing CcmM and lane 6 contains E. coli lysate
with empty expression vector.
66
Sulfonamide pull down assays
Sulfonamide inhibits CA by binding to the active site metal ion (Coleman, 1967).
Since sulfonamides bind to the active site of CA, immobilized sulfonamide column can
be used to isolate CA. As a positive control E. coli lysate with His tagged CcaA
overexpressed was incubated with activated sulfonamide resin and then washed to
remove proteins weakly bound. Western blot using CcaA antibody produced bands at 31
kDa and 38 kDa for the resin and eluate as seen in lane 1 and 2 of Figure 16. Very faint
bands were observed in similar location in the wash buffer eluate in lane 4. No such
bands were observed in the negative control containing E. coli lysate with no insert.
Carboxysome extract of Anabaena PCC7120 was mixed with the sulfonamide binding
buffer and incubated for an hour with activated sulfonamide resin. The sulfonamide resin
was washed to remove proteins that were weakly bound. The sample was sent for protein
sequencing to see what proteins bound to the sulfonamide column and if they correspond
to any previously characterized protein from Anabaena PCC7120. 62 individual protein
sequences were detected by the mass spectrometer. The identified protein sequences were
compared to the NCBI proteome database for Anabaena PCC7120. The only identifiable
protein related to the CCM was CcmM (Figure 17). The probability of match to CcmM
was 100%. The rest of the identified proteins were membrane transporters or other
membrane related proteins. Nineteen hypothetical proteins were also identified. The
hypothetical proteins were modeled using PHYRE to identify if any of the hypothetical
proteins had structural similarity to any previously characterized proteins involved in the
CCM. However, all of the models of the hypothetical proteins produced had close
similarity to membrane transporters and porins.
67
1 2 3 4 5
250
150
100
75
50
37
25
20
Figure 16: Western blot using CcaA antibody to show CcaA binding in sulfonamide
resin. Lane 1 contains E. coli lysate with overexpressed CcaA flow through, lane 2
contains E. coli lysate with overexpressed CcaA resin bound, lane 3 contains the protein
marker and lane 4 contains the wash buffer flow through. Lane five is negative control
containing E. coli lysate with an empty expression vector.
68
gi|17228360 (100%), 59,464.3 Da
carbon dioxide concentrating m echanis m protein [Nostoc sp. PCC 7120], gi|17130210|dbj|BAB72822.1| carbon dioxide concentrating m echanism protein [No
4 unique peptide s, 4 unique spe ctra, 4 total spectra, 57/555 am ino acids (10% coverage)
M A V R S T A A P P T P W S R S L A E A Q I H E S A F V H P F S N I I G D V H I G A N V I I A P G T S I R A D E G T P F
H I G E N T N I Q D G V V I H G L E Q G R V V G D D N K E Y S V W V G S S A S L T H M A L I H G P A Y V G D N S F I G F
R S T V F N A K V G A G C I V M M H A L I K D V E V P P G K Y V P S G A I I T N Q K Q A D R L P D V Q P Q D R D F A H H
V I G I N Q A L R A G Y L C A A D S K C I A P L R N D Q V K S Y T S T T V I G L E R S S E V A S N S L G A E T I E Q V R
Y L L E Q G Y K I G S E H V D Q R R F R T G S W T S C Q P I E A R S V G D A L A A L E A C L A D H S G E Y V R L F G I D
P K G K R R V L E T I I Q R P D G V V A G S T S F K A P A S N T N G N G S Y H S N G N G N G Y S N G A T S G K V S A E T
V D Q I R Q L L A G G Y K I G T E H V D E R R F R T G S W N S C K P I E A T S A G E V V A A L E E C I D S H Q G E Y I R
L I G I D P K A K R R V L E S I I Q R P N G Q V A P S S S P R T V V S A S S A S S G T A T A T A T R L S T E V V D Q V R
Q I L G G G Y K L S I E H V D Q R R F R T G S W S S T G A I S A T S E R E A I A V I E A S L S E F A G E Y V R L I G I D
P K A K R R V L E T I I Q R P
gi|17228360 (100%), 59,464.3 Da
carbon dioxide concentrating m echanis m protein [Nostoc sp. PCC 7120], gi|17130210|dbj|BAB72822.1| carbon dioxide concentrating m echanism protein [No
3 unique peptide s, 3 unique spe ctra, 3 total spectra, 47/555 am ino acids (8% coverage)
M A V R S T A A P P T P W S R S L A E A Q I H E S A F V H P F S N I I G D V H I G A N V I I A P G T S I R A D E G T P F
H I G E N T N I Q D G V V I H G L E Q G R V V G D D N K E Y S V W V G S S A S L T H M A L I H G P A Y V G D N S F I G F
R S T V F N A K V G A G C I V M M H A L I K D V E V P P G K Y V P S G A I I T N Q K Q A D R L P D V Q P Q D R D F A H H
V I G I N Q A L R A G Y L C A A D S K C I A P L R N D Q V K S Y T S T T V I G L E R S S E V A S N S L G A E T I E Q V R
Y L L E Q G Y K I G S E H V D Q R R F R T G S W T S C Q P I E A R S V G D A L A A L E A C L A D H S G E Y V R L F G I D
P K G K R R V L E T I I Q R P D G V V A G S T S F K A P A S N T N G N G S Y H S N G N G N G Y S N G A T S G K V S A E T
V D Q I R Q L L A G G Y K I G T E H V D E R R F R T G S W N S C K P I E A T S A G E V V A A L E E C I D S H Q G E Y I R
L I G I D P K A K R R V L E S I I Q R P N G Q V A P S S S P R T V V S A S S A S S G T A T A T A T R L S T E V V D Q V R
Q I L G G G Y K L S I E H V D Q R R F R T G S W S S T G A I S A T S E R E A I A V I E A S L S E F A G E Y V R L I G I D
P K A K R R V L E T I I Q R P
Figure 17: Mass spectrometric analysis of proteins isolated from the sulfonamide column. Mass spectrometric analysis identified
stretches of amino acid sequences highlighted in yellow. BLASTP search of the NCBI database using the identified amino acid
sequences identified the protein to be CcmM of Anabaena PCC7120. b) Second mass spectrometric analysis of the same sample.
CcmM of Anabaena PCC7120 was again identified.
a)
b)
69
Analysis of Anabaena PCC7120 CcmM209
Results of the sulfonamide pull down assay again lead to CcmM as the
carboxysomal CA in Anabaena PCC7120, but its lack of CA activity is problematic.
Potentially, the long C-terminal region of CcmM, when outside the carboxysome
environment, may cause torsional strain on the protein leading to a distorted active site in
the N-terminal γ-CA domain resulting in the lack of activity. In order to circumvent the
influence of the C-terminal end, a His tagged truncated version of CcmM containing only
the N-terminal 209 amino acids (CcmM209) was constructed. CA activity assays of
freshly prepared E. coli lysate over-expressing CcmM209 proved to be negative (not
shown). However, lysate incubated with the thiol oxidizing agent diamide (20 mM) for 1
hour, exhibited CA activity (Figure 18a). The equation of the trend line for the
uncatalyzed steady state reaction was y = -0.0617x + 1.7954 with a R2
value of 0.9994
and the equation of the trend line for diamide treated E. coli lysate expressing CcmM209
was y = -0.1668x + 2.0197 with an R2
value of 0.9986 (Figure 18b). Comparing the
slopes of the uncatalyzed reaction to the reaction with diamide treated E. coli lysate
expressing CcmM209 showed a 2.7 fold increase from the steady state uncatalyzed rate.
CcmM209 was concentrated by binding to the His.Bind affinity resin. After
binding, the resin from the His.Bind affinity column was also assayed for CA activity.
CcmM209 bound to the resin also exhibited CA activity (Figure 19a). The equation of the
trend line for the uncatalyzed steady state reaction was y = -0.0678x + 1.8769 with a R2
for the equation to be 0.9987 and the equation of the trend line for CcmM209 bound to
His.Bind affinity column was y = -0.2508x + 2.2235 with an R2
value of 0.9954 (Figure
19b). The resin bound CcmM209 showed a 3.7 fold increase in activity compared to the
70
steady state uncatalyzed rate (Figure 19b). Freshly prepared E. coli lysate in which
CcmM209 was not expressed displayed no CA activity nor did the resin through which
the lysate was passed (Figure 20ab).
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis
Arefeen_Dewan_S_201006_MSc_thesis

More Related Content

Viewers also liked

Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...
Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...
Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...
Deusto Business School
 
Shooting Schedule
Shooting ScheduleShooting Schedule
Shooting Schedule
Alex Hooks
 
Chemical structure of water by Group 1
Chemical structure of water by Group 1Chemical structure of water by Group 1
Chemical structure of water by Group 1
e-twinning
 

Viewers also liked (7)

Examen 1º (1)
Examen 1º (1)Examen 1º (1)
Examen 1º (1)
 
Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...
Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...
Los side projects, la rentabilidad de probar nuevas ideas expansión (i ortega...
 
Boas praticas EE Dr. "Nestor Sampaio Bittencuort"
Boas praticas   EE Dr. "Nestor Sampaio Bittencuort"Boas praticas   EE Dr. "Nestor Sampaio Bittencuort"
Boas praticas EE Dr. "Nestor Sampaio Bittencuort"
 
Shooting Schedule
Shooting ScheduleShooting Schedule
Shooting Schedule
 
18 arı hastalık ve zararlıları
18 arı hastalık ve zararlıları18 arı hastalık ve zararlıları
18 arı hastalık ve zararlıları
 
Chemical structure of water by Group 1
Chemical structure of water by Group 1Chemical structure of water by Group 1
Chemical structure of water by Group 1
 
Artigo cloud computing pdf
Artigo cloud computing pdfArtigo cloud computing pdf
Artigo cloud computing pdf
 

Similar to Arefeen_Dewan_S_201006_MSc_thesis

System response of metabolic networks in Chlamydomonas reinhardtii during Nit...
System response of metabolic networks in Chlamydomonas reinhardtii during Nit...System response of metabolic networks in Chlamydomonas reinhardtii during Nit...
System response of metabolic networks in Chlamydomonas reinhardtii during Nit...
Nishikant Wase
 
eugenia-pechkova-university-of-genova-italy.ppt
eugenia-pechkova-university-of-genova-italy.ppteugenia-pechkova-university-of-genova-italy.ppt
eugenia-pechkova-university-of-genova-italy.ppt
amith49
 
Cox1998-Automated_RNA_selection.
Cox1998-Automated_RNA_selection.Cox1998-Automated_RNA_selection.
Cox1998-Automated_RNA_selection.
J. Colin Cox
 
Nucleoside libray e-conference VRX-Harry
Nucleoside libray e-conference VRX-HarryNucleoside libray e-conference VRX-Harry
Nucleoside libray e-conference VRX-Harry
Harry An
 
CHM462 Poster Presentation By Alexander Ward (1)
CHM462 Poster Presentation By Alexander Ward (1)CHM462 Poster Presentation By Alexander Ward (1)
CHM462 Poster Presentation By Alexander Ward (1)
Alexander Ward
 
Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...
Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...
Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...
Keiji Takamoto
 

Similar to Arefeen_Dewan_S_201006_MSc_thesis (20)

System response of metabolic networks in Chlamydomonas reinhardtii during Nit...
System response of metabolic networks in Chlamydomonas reinhardtii during Nit...System response of metabolic networks in Chlamydomonas reinhardtii during Nit...
System response of metabolic networks in Chlamydomonas reinhardtii during Nit...
 
eugenia-pechkova-university-of-genova-italy.ppt
eugenia-pechkova-university-of-genova-italy.ppteugenia-pechkova-university-of-genova-italy.ppt
eugenia-pechkova-university-of-genova-italy.ppt
 
Green Synthesis of Calcium Oxide Nanoparticles and Its Applications
Green Synthesis of Calcium Oxide Nanoparticles and Its ApplicationsGreen Synthesis of Calcium Oxide Nanoparticles and Its Applications
Green Synthesis of Calcium Oxide Nanoparticles and Its Applications
 
LCMS PHYTOCHEMICAL ANALYSIS
LCMS PHYTOCHEMICAL ANALYSISLCMS PHYTOCHEMICAL ANALYSIS
LCMS PHYTOCHEMICAL ANALYSIS
 
Cox1998-Automated_RNA_selection.
Cox1998-Automated_RNA_selection.Cox1998-Automated_RNA_selection.
Cox1998-Automated_RNA_selection.
 
Report on molecular biology techniques
Report on molecular biology techniquesReport on molecular biology techniques
Report on molecular biology techniques
 
Real-time PCR.ppt
Real-time PCR.pptReal-time PCR.ppt
Real-time PCR.ppt
 
CHEM3204_PRAC_Manual_2016
CHEM3204_PRAC_Manual_2016CHEM3204_PRAC_Manual_2016
CHEM3204_PRAC_Manual_2016
 
Nucleoside libray e-conference VRX-Harry
Nucleoside libray e-conference VRX-HarryNucleoside libray e-conference VRX-Harry
Nucleoside libray e-conference VRX-Harry
 
Chromatography: Trends and Developments in MAb Screening and Characterization
Chromatography: Trends and Developments in MAb Screening and CharacterizationChromatography: Trends and Developments in MAb Screening and Characterization
Chromatography: Trends and Developments in MAb Screening and Characterization
 
CHM462 Poster Presentation By Alexander Ward (1)
CHM462 Poster Presentation By Alexander Ward (1)CHM462 Poster Presentation By Alexander Ward (1)
CHM462 Poster Presentation By Alexander Ward (1)
 
Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...
Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...
Carboxy-terminal Degradation of Peptides using Perfluoroacyl Anhydrides : C-T...
 
Kieran_ccpNMR
Kieran_ccpNMRKieran_ccpNMR
Kieran_ccpNMR
 
Ugi Poster_for_Open_House
Ugi Poster_for_Open_HouseUgi Poster_for_Open_House
Ugi Poster_for_Open_House
 
Improvement in Protein A Resin Lifetime
Improvement in Protein A Resin LifetimeImprovement in Protein A Resin Lifetime
Improvement in Protein A Resin Lifetime
 
Improvement in Protein A Resin Lifetime
Improvement in Protein A Resin LifetimeImprovement in Protein A Resin Lifetime
Improvement in Protein A Resin Lifetime
 
Protein sequencing by kk sahu
Protein sequencing by kk sahuProtein sequencing by kk sahu
Protein sequencing by kk sahu
 
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
Accessing genetically tagged heterocycle libraries via a chemoresistant DNA s...
 
Analytical toxicology
Analytical toxicologyAnalytical toxicology
Analytical toxicology
 
Dis blue car
Dis blue carDis blue car
Dis blue car
 

Arefeen_Dewan_S_201006_MSc_thesis

  • 1. Identification of a Carboxysomal γ-Carbonic Anhydrase in the Mesophilic Cyanobacterium Anabaena sp. PCC7120 by Dewan Shamsul Arefeen A thesis submitted in conformity with the requirements for the degree of Master of Science Department of Cell and Systems Biology University of Toronto © Copyright by Dewan Shamsul Arefeen (2010)
  • 2. ii Identification of a Carboxysomal γ -Carbonic Anhydrase in the Mesophilic Cyanobacterium Anabaena sp. PCC7120 Dewan Shamsul Arefeen Master of Science 2010 Department of Cell and Systems Biology, University of Toronto Abstract Analysis of the genome of Anabaena sp. PCC7120 reveals that it lacks the gene, ccaA, which encodes the bonafide carboxysomal, β-class carbonic anhydrase (CA) CcaA. However, the carboxysome enriched fraction of Anabaena PCC7120 exhibits CA activity. Bioinformatic analysis reveals that the N-terminal region of the carboxysome protein CcmM has high sequence and structural similarity to the γ-class CA of Methanosarcina thermophila. Recombinantly expressed CcmM is found to be inactive in in-vitro CA assays. E. coli cell extracts containing an overexpressed form of CcmM comprised of the N-terminal 209 amino acids (CcmM209) are also inactive. However, CcmM209 displays CA activity after incubation with the thiol oxidizing agent diamide or when bound to an affinity matrix. It appears that CcmM is indeed a functional γ-CA which is active under oxidizing condition. It is hypothesized that the C-terminal RbcS like domain in CcmM may regulate activity by allowing CcmM activation only when sequestered within the carboxysome.
  • 3. iii Acknowledgements I am greatly thankful to my supervisor, Dr. George Espie, for giving me the opportunity to pursue my Masters. I would like to thank him for providing me with proper guidance, support and constructive criticisms throughout my graduate studies. I particularly respect George for being so understanding and accommodating during personal emergencies. I would like to thank my committee members, Dr. Steven Short and Dr. Tim Westwood, for their criticisms, ideas and advice throughout my thesis. I wish to extend my gratitude to all the members of Espie lab for being instrumental in my academic and personal growth. I would especially like to thank Dr. Yohannes Tadesse for teaching me a multitude of techniques, discipline and patience. Yohannes has always gone beyond my expectations to help me, regardless of the nature of the problem. I would like thank Charlotte de Araujo for helping me with the mass spectrometer. I would like to thank Dr. Anthony So for providing me with recombinant full length CcmM and many of the protocols that I have used in my research project. I am grateful to Ian Buglass who made all the administrative details clear and for his rapid response to any queries. I am thankful to Sarah Gonsalves and Anja Lowrance for making my teaching assistantship in microbiology labs a pleasurable experience. I could not have come this far without the loving support of my family members, Mom, Dad, Ron and Noel. I am thankful to my mother and father-in-law as well as Mumu and Priya. I am thankful to my wife, Armana, who gave me support and encouragement to accomplish everything. Finally I would like to thank God for everything.
  • 4. iv TABLE OF CONTENTS Page ABSTRACT ii ACKNOWLEDGEMENTS iii TABLE OF CONTENTS iv LIST OF ABBREVIATIONS vii LIST OF TABLES x LIST OF FIGURES xi LIST OF APPENDICES xiii INTRODUCTION 1 Cyanobacteria 1 Importance of CO2 concentrating mechanism (CCM) in cyanobacterial photosynthesis and growth 2 Components of the CCM 3 Types of carbonic anhydrases 5 Carboxysomal genes 8 Carboxysomal proteins and their known functions 12 Present model of carboxysome function 16 Research objectives 19 MATERIALS AND METHODS 20 Bioinformatics study 20 Organisms and growth conditions 21 Isolation of carboxysomes 22
  • 5. v Polyacrylamide gel electrophoresis 24 Western blot analysis 24 Recombinant protein expression 25 Mass spectrometric carbonic anhydrase assay 26 CA assay of CcmM 27 Cellulose acetate assay 27 T7 Tag affinity purification and pull down assays 28 Sufonamide column binding assay 29 Protein sequencing 30 CcmM209 and isolation on His.Bind® affinity column 30 RESULTS 32 In Silico Search for a Carboxysomal Carbonic Anhydrase 32 Mass spectrometric CA assays 45 CA Activity in Synechococcus PCC7942 carboxysomes 48 CA Activity in Anabaena PCC7120 carboxysomes 51 Activity of full length CcmM 54 Attempts to activate CcmM 59 Cellulose acetate assay 63 T7 tag - affinity pull down experiments 63 Sulfonamide pull down assays 66 Analysis of Anabaena PCC7120 CcmM209 69 Results summary 74 75
  • 6. vi DISCUSSION Putative carboxysomal carbonic anhydrase in Anabaena PCC7120 75 The most likely candidate 78 Activity of Anabaena PCC7120 carboxysome enriched fraction 79 CcmM Expression and Catalytic activity 81 Identification of putative CA using affinity pull down assays 85 CcmM is an active γ-CA in mesophilic cyanobacteria 86 Proposed model of non-CcaA containing β-carboxysome 87 Conclusion and future directions 91 APPENDIX 100 Calculation of CO2 concentration in mass spectrometer 100 Truncated CcmM209 primers and sequence 101 REFERENCES 92
  • 7. vii LIST OF ABBREVIATIONS °C degree(s) Celsius µ micro (10-6 ) Amp ampicillin AP alkaline phosphatase Arg arginine Asp aspartic acid BCIP 5-bromo-4-chloro-3-indolylphosphate CA carbonic anhydrase cbx carboxysome CCM carbon dioxide concentrating mechanism Ci dissolved inorganic carbon CO2 carbon dioxide Cys cysteine dH2O distilled water DNA deoxyribonucleic acid DTT dithiothreitol EDTA ethylenediaminetetraacetic acid EPPS N-(2-hydroxyethyl)piperazine-N’-(3-propanesulfonic acid) g gram(s) Gln glutamine HCO3 - bicarbonate ion HCR high Ci requiring phenotype HEPES 4-(2-Hydroxyethyl)piperazine-1-ethanesulfonic acid His histidine IgG immunoglobulin G IMAC immobilized metal affinity chromatography IPTG isopropyl-β-D-thiogalactoside kbp kilobase pairs kDa kilodaltons
  • 8. viii Km Michaelis-Menton constant L liter(s) LB Luria-Bertani LC liquid chromatography M moles per liter mg Milligram Mg2+ magnesium ion min minute (s) mL milliliter mM millimolar mol mole (s) MS mass spectrometer NaOH sodium hydroxide NBT nitroblue tetrazolium chloride NCBI National Center for Biotechnology Information OD optical density PAMBS p-aminobenzyl sulfonamide PCC Pasteur Culture Collection PDB protein data bank PPFD photosynthetic photon flux density psi pounds per square inch RMS root mean square rpm revolutions per minute RuBisCO Ribulose-1,5-bisphosphate caboxylase/oxygenase s second(s) SDS sodium dodecyl sulfate SDS-PAGE SDS-polyacrylamide gel electrophoresis Sec secretory Tat twin arginine translocation TBS Tris-buffered saline Tes N-[tris(hydroxymethyl)methyl]-2-aminoethanesulfonic acid
  • 9. ix Triton X-100 t-octylphenoxypolyethoxyethanol Tween-20 polyoxyethylene sorbitan monolaurate UTCC University of Toronto Culture Collection V volt(s) v/v volume by volume W watt(s) w/v weight by volume x g acceleration due to gravity Zn2+ zinc ion
  • 10. x LIST OF TABLES Page Table 1: List of proteins present in β-carboxysome and α-carboxysome. 15 Table 2: List of query sequences used to search the proteome of Anabaena PCC7120 to identify a putative carbonic anhydrase. 21 Table 3: Factors changed in attempts to activate CcmM. 62
  • 11. xi LIST OF FIGURES Page Figure 1: Arrangements of β-carboxysomal genes… 11 Figure 2: Structure of a β-carboxysome… 14 Figure 3: The present model of the carboxysome… 18 Figure 4: Representation of the location of conserved β-CA and α-CA domains 40 Figure 5: Representation of the location of conserved γ-CA domains 41 Figure 6: Amino acid sequence alignment output from ClustalX2 for Cam of Mehanosarcina thermophila and CcmM of selected species of cyanobacteria. 42 Figure 7: 3D structure of Cam of Methanosarcina thermophila and CcmM of Anabaena PCC7120. 44 Figure 8: (a) Mass spectrometric assay for CA activity of E. coli lysate in which CcaA expression was induced. (b) 18 O atom fraction in m/z 49 with respect to time. 47 Figure 9: (a) Mass spectrometric assay for CA activity of Synechococcus PCC7942 carboxysome extract. (b) Log 18 O atom fraction in m/z 49 with respect to time. 50 Figure 10: (a) Mass spectrometric CA assay of carboxysome enriched fraction from Anabaena PCC7120. (b) Log 18 O atom fraction in m/z 49 with respect to time. 53 Figure 11: Western Blot using CcmM antibody for E. coli lysate overexpressing CcmM and Anabaena PCC7120 lysate. 56 Figure 12: (a) Mass spectrometric assay for CA activity of E. coli lysate in 57
  • 12. xii which CcmM expression was induced. (b) Log 18 O atom fraction in m/z 49 with respect to time. Figure 13: (a) Mass spectrometric assay for CA activity of E. coli lysate containing empty pET21 vector. (b) Log 18 O atom fraction in m/z 49 with respect to time. 58 Figure 14: Western blot using CcmM antibody for varying temperature and IPTG treatments during CcmM overexpression. 61 Figure 15: Western blot using CcmM antibody to show binding in anti-T7 agarose. 65 Figure 16: Western blot using CcaA antibody to show CcaA binding in sulfonamide resin. 67 Figure 17: Mass spectrometric analysis of proteins isolated from the sulfonamide column. 68 Figure 18: (a) Mass spectrometric assay for CA activity of E. coli lysate containing CcmM209 treated with 20 mM diamide. (b) Log % 18 O in 13 CO2 with respect to time. 71 Figure 19: (a) Mass spectrometric assay for CA activity of His.Bind resin in which CcmM209 is bound. (b) Log % 18 O in 13 CO2 with respect to time. 72 Figure 20: (a) Mass spectrometric assay for CA activity of His.Bind resin in which E. coli lysate containing empty vector is flowed through. (b) Log % 18 O in 13 CO2 with respect to time. 73 Figure 21: Proposed model for the function of β-carboxysomes lacking the carboxysomal carbonic anhydrase, CcaA. 90
  • 13. xiii LIST OF APPENDICES Calculation of CO2 concentration in mass spectrometer 100 Truncated CcmM209 primers and sequence 101
  • 14. 1 Introduction Cyanobacteria Cyanobacteria are a group of gram-negative, photoautotrophic bacteria that carry out oxygenic photosynthesis. They are found in a wide variety of habitats including freshwater, marine and soil surfaces. The cyanobacteria are responsible for as much as one third of the global photosynthetic carbon dioxide fixation and hence are significant contributors to the biogeochemical cycling of carbon and primary productivity on earth. Beyond fixing carbon dioxide, cyanobacteria play a diverse role in nutrient cycling in the ecosystem. For example, cyanobacteria are the only group of organisms that are able to fix nitrogen under aerobic conditions. The enzyme involved in fixing dinitrogen is nitrogenase and it is irreversibly inactivated in the presence of oxygen (Gallon, 1992). Some strains of cyanobacteria, including Anabaena sp. PCC7120, have evolved specialized cells called heterocyst which enable them to carry out nitrogen fixation in spite of high ambient extracellular O2 concentrations. The ability of cyanobacteria to fix atmospheric nitrogen makes them an important source of ammonia, nitrates and nitrites in diverse habitats from the open oceans to the rice paddy fields of Asia. Recent studies have shown that Anabaena PCC7120 is able to detoxify organophosphorous pesticides, used in agriculture, to harmless organic components (Barton et al., 2004). The ability of this cyanobacterium to detoxify pesticides may play an important role in the future bioremediation of farmland, given the large-scale and increasing use of pesticides in modern intensive agriculture. Presently, with the ever-
  • 15. 2 increasing demand for energy and the diminishing fossil fuel, cyanobacteria are also being looked at as a potential source of biofuels (Hu et al., 2008). The fact that many cyanobacteria have a high efficiency of photosynthesis, the ability to grow non- fastidiously and to fix nitrogen makes them very attractive and potentially lucrative organisms for biofuel generation. To this end, an understanding of the biochemical and genetic mechanisms that underlie their efficient growth and photosynthesis will be vital in the metabolic engineering of stains for industrial scale biofuel production (Ragauskas et al., 2006). Importance of the CO2 concentrating mechanism (CCM) in cyanobacterial photosynthesis and growth Cyanobacteria are extremely primitive organisms dating back 2.7 billion years (Buick, 1992). Over the course of this time photosynthesis has lowered the level of atmospheric carbon dioxide while increasing the level of oxygen. Thus, photosynthesis by cyanobacteria, which requires carbon dioxide, had to adapt to this globally changing environment. Photosynthetic CO2 fixation is catalyzed by the enzyme, ribulose bisphosphate carboxylase/oxygenase (RuBisCO; EC 4.1.1.39), and involves the carboxylation of ribulose-1,5-bisphosphate (RuBP) to produce two molecules of 3- phosphoglycerate (PGA). The catalytic activity of cyanobacterial RuBisCO is relatively slow with a turnover number of 3 s-1 (Schneider et al., 1992) and requires high substrate concentrations for efficient carboxylation (Km(CO2) > 240 µM) (Andrews and Abel, 1981). Not only is RuBisCO slow at atmospheric CO2 concentration but it also has an oxygenase activity which adds O2 instead of CO2 to RuBP, yielding a molecule of PGA
  • 16. 3 and a molecule of phosphoglycolate (Schneider et al., 1992). Phosphoglycolate is metabolized in the glycolate pathway and leads to loss of energy and CO2 by photorespiration. Typically, cyanobacteria live in environments where the O2 / CO2 ratio is about 25:1. Given the kinetic parameters of cyanobacterial RuBisCO (Km = 240 μM CO2, Kcat = 3 s-1 and Km = 1000 μM O2), it would be expected to promote oxygenation over carboxylation (Andrews and Abel, 1981; Jensen and Bahr, 1977; Schneider et al., 1992). However, under normal environmental conditions, direct physiological measurements show that photorespiration is almost nonexistent and that photosynthesis proceeds with remarkable efficiency (Aizawa and Miyachi, 1986)! One widely supported explanation for this discrepancy is that cyanobacteria have evolved mechanisms to increase the level of CO2 around RuBisCO by a collective process called the carbon dioxide concentrating mechanism (CCM) (Badger and Price, 2003), thereby decreasing the intracellular O2 / CO2 ratio to a level that promotes photosynthesis. Components of the CCM The CCM is composed of two primary components, which together can increase the concentration of CO2 around RuBisCO up to 1000 fold (Badger and Price, 2003; Miller and Colman, 1980). First, cyanobacteria use membrane-associated active transporters for bicarbonate, active transporters of carbon dioxide, and bicarbonate/Na+ symporters to create a large internal pool of bicarbonate ions (Badger and Price, 2003). However, RuBisCO cannot use bicarbonate as a substrate and bicarbonate needs to be converted to CO2 before use. Conversion of bicarbonate to CO2 is catalyzed by the enzyme carbonic anhydrase (CA; EC 4.2.1.1). Carbonic anhydrase, along with RuBisCO,
  • 17. 4 in cyanobacteria is localized to polyhedral proteinaceous bodies called the carboxysomes, the second primary component of the CCM (Mckay et al., 1993; Price et al., 1992). Carboxysomes were first characterized in the chemoautotrophic bacterium Halothiobacillus neapolitanus (Shively et al., 1973). Carboxysomes are about 120 nm in diameter. Cross-sections of the carboxysomes of most species show that they are regular hexagons surrounded by a 3 to 4 nm thick protein shell (Cannon et al., 2001). Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) reveals that the shell proteins comprise of about 17% of the total protein content of the carboxysomes (Cannon et al., 2001; Cannon and Shively, 1983). No lipid component has ever been found to be associated with the carboxysome shell or core. The core of the carboxysome is packed with RuBisCO and the name carboxysome is derived from the carboxylase and oxygenase activity of RuBisCO (Pierce et al., 1989; Price and Badger, 1989a). Studies have shown that the packaging of RuBisCO within the carboxysomes enhance the catalytic properties of the enzymes (Cannon et al., 2001). Carboxysomes are classified into two phylogenetic groups based on the form of RuBisCO they contain. Carboxysomes containing Form 1A RuBisCO are classified as α-carboxysomes and carboxysomes containing Form 1B RuBisCO are called β-carboxysomes (Badger and Price, 2003; Cannon et al., 2002). It has been suggested that two primary groups of cyanobacteria can be classified based on their RuBisCO/carboxysome phylogeny. Cyanobacteria with Form 1A RuBisCO are termed α-cyanobacteria and those with Form 1B RuBisCO are termed β-cyanobacteria (Badger and Price, 2003). Carboxysome structure and function have been studied in both α and β-carboxysomes. α-carboxysomes from the chemolithoautotrophic bacterium, Halothiobacillus neapolitanus, have been
  • 18. 5 most widely studied and best characterized (Cannon et al., 2001). β-carboxysome structure has been studied using the model laboratory cyanobacterial species Synechocystis sp. PCC6803 and Synechococcus sp. PCC7942 (Badger and Price, 2003). The presence of carbonic anhydrase in close proximity to RuBisCO in the carboxysomes allows for high levels of CO2 to be produced around the RuBisCO active site, thus facilitating carboxylation over oxygenation (Price and Badger, 1989a). The over-expression of recombinant human CA II in the cytosol of Synechococcus PCC7942 results in the conversion of the bicarbonate pool to carbon dioxide, which then readily diffuses out of the cell (Price and Badger, 1989a). Thus, a cytosol-localized CA is deleterious causing a “short circuit” in the CCM. This “short-circuit” prevents bicarbonate accumulation and the localized formation of CO2 in the vicinity of RuBisCO, thereby drastically reducing the substrate concentration available for CO2 fixation by RuBisCO. Under ambient conditions, this situation ultimately leads to a reduction in photosynthetic efficiency and to a high CO2 requiring phenotype. Mutants with defective carbonic anhydrase also require high CO2 concentrations in their environment to be viable. It is apparent that the localization of carbonic anhydrase and RuBisCO within carboxysomes is vital for efficient photosynthesis in cyanobacteria. Types of carbonic anhydrases The importance of carbonic anhydrase in living organisms is implied by the widespread distribution of this enzyme from mammals to Archaea (Smith and Ferry, 2000). So far, three distinct evolutionary lineages (α, β and γ) of carbonic anhydrases have been characterized. As well, two additional classes (δ and ε) have been proposed
  • 19. 6 based on the occurrence of unique, non-homologous proteins that display CA activity (So et al., 2004; So and Espie, 2005). The α-class was initially identified from animals but examples can be found in plants and eubacteria. α-CAs typically exist as protein monomers that are mostly composed of antiparallel β-strands (Liljas et al., 1972). This single protein has all the necessary structural elements to generate a functional, active site (So and Espie, 2005). The active site of α-CA is located in a 15-Å deep crevice dominated by hydrophobic amino acid side chains at the base of which is a Zn2+ ion invariably coordinated by three histidine residues (Liljas et al., 1972). β-CA’s were initially identified in chloroplast of plants but are now known to be present in various subcellular compartments of many organisms. Interestingly, β-CAs have no amino acid sequence similarity with α-CAs, suggesting that they evolved independently. Unlike α-CA’s, β-CA’s are only functional when oligomerized (Mitsuhashi et al., 2000; Strop et al., 2001). A homodimer is the catalytic core of a β-CA, which may exist as a dimer, tetramer or octamer depending on the species of origin (Mitsuhashi et al., 2000). Dimerization enables formation of the hydrophobic pocket required for CO2 binding and forms the active site at the interface. Zn2+ co-ordination is mediated by a combination of His, Cys and Glu residues depending on the species (Mitsuhashi et al., 2000; Strop et al., 2001). In contrast to α-CAs which is mostly composed of β-sheets, β-CAs contain a number of α-helices. The third distinct class of CA is the γ-CA which was first isolated from the methanogenic archaeon Methanosarcina thermophila (Alber and Ferry, 1994). The γ-CA, called Cam, from M. thermophila is catalytically active when trimerized (Kisker et al., 1996). Cross-sectional profiles of the γ-CA trimer reveal that each monomer resembles
  • 20. 7 an equilateral triangle. Cam has an unusual left-handed β-helix structure containing seven complete turns with an α-helix at the terminal portion forming αβ-helix (Kisker et al., 1996). The active sites are located at the interfaces between two β-helices. The interface is stabilized by H bonds, salt bridges and hydrophobic interactions. The side chain of Arg59 is important since it forms salt bridge with Asp61 of the same monomer and Asp76 of the adjacent monomer. The trimer contains 3 active sties and each monomer contributes His residues located on the surface to coordinate Zn2+ (Kisker et al., 1996). His81 and His122 of one monomer extend from equivalent positions of adjacent turns of the β-helix along with His117 from a second monomer coordinate the zinc ion (Kisker et al., 1996). A water molecule is the fourth Zn2+ ligand and is within hydrogen bonding distance of Gln75. Cam is the only γ-CA which has been shown to have CA activity. Numerous Cam homologs from both plants and Bacteria have been tested for catalytic activity, including CcmM from the cyanobacteria Synechocystis PCC6803 and Synechococcus PCC7942. All were found to lack CA activity, prompting the suggestion that these homologs have evolved a different function and that Cam is a relic. A fourth class of CA named δ-class has been isolated from the marine diatom Thalassiosira weissflogii (Roberts et al., 1997). X-ray absorption spectroscopy of the δ- CA, T. weissflogii CA1 (TWCA1), has shown that it indeed does contain a Zn2+ ion bound by histidine residues. Presently, there are only 4 other proteins that display amino acid sequence similarity to TWCA1 and, thus, its distribution may be restricted to only a small number of diatom species (So and Espie, 2005). A fifth class of CA tentatively named ε-class has been characterized from the chemolithoautotrophic bacterium Halothiobacillus neapolitanus and several marine
  • 21. 8 cyanobacteria (So et al., 2004; So and Espie, 2005). BLAST search results show that this protein is widely distributed among marine cyanobacterial strains but has no amino acid sequence similarity to the α, β, γ and δ CAs. Recent X-ray crystal structure studies on the ε-CA of H. neapolitanus, CsoS3, indicate that it is structurally related to β-CA (Sawaya et al., 2006) in spite of the absence of any primary sequence similarity. The suggestion that CsoS3 is a subclass of β-CA comes from the striking structural similarity of the Zn2+ -containing active site and from the fact that both need to form dimers in order to be active (Sawaya et al., 2006). Thus, CsoS3 is an example of divergent evolution. Proteins with high degree of amino acid sequence homology to CsoS3 have now been identified in a range of chemolithoautotrophic bacteria, hydrogen bacteria and many strains of marine cyanobacteria (So et al., 2004; So and Espie, 2005). In all examples to date, CsoS3 is encoded within the cso operon which encodes all the components for the α-carboxysome. Carboxysomal genes α-carboxysomes are also characterized by the presence of shell proteins encoded by the cso gene cluster (Badger and Price, 2003; Cannon et al., 2002; So et al., 2004). β- carboxysomes are encapsulated by proteins encoded by the ccmKLMN gene cluster (Badger and Price, 2003; Cannon et al., 2002). To date, all cyanobacterial species characterized possess either α-carboxysome or β-carboxysomes, but not both. It has been suggested that two primary groups of cyanobacteria can be classified based on their carboxysome phylogeny. Cyanobacteria with α-carboxysomes are termed α- cyanobacteria and those with β-carboxysomes are termed β-cyanobacteria (Badger and Price, 2003).
  • 22. 9 Studies show that the arrangement of carboxysomal genes varies depending on the type of carboxysomes. The α-carboxysomal genes are arranged into one operon whereas the β-carboxysomal genes are distributed among multiple operons. The genes of the putative operon of α-carboxysomes, in their transcriptional order, are cbbL, cbbS, csoS2, csoS3, csoS4A, csoS4B, csoS1C, csoS1A, csoS1B (Cannon et al., 2003; Shively et al., 1998). The cbbL and cbbS code for the large and small subunit of RuBisCO form 1A; the genes csoS2, csoS3, csoS1C, csoS1A and csoS1B code for carboxysomal shell proteins CsoS2, CsoS3, CsoS1C, CsoS1A and CsoS1B respectively (Cannon et al., 2003). CsoS3 has been identified as a shell associated active carbonic anhydrase while CsoS1 is the major structural protein of the shell (So et al., 2004). Purified CsoS1 spontaneously organizes into hexamers that form sheet-like structures. The hexamers are thought to form the majority of the flat faces of the carboxysome icosahedron. It has recently been established that the genes csoS4A, csoS4B (previously called orfA and orfB) code for proteins CsoS4A and CsoS4B that assemble as pentamers (Tanaka et al., 2008). The construction of large icosahedral structure typically requires a combination of hexameric and pentameric proteins. It is conjectured that the pentamers provide the curvature required at the vertices to form the icosahedron while the hexamers form the flat faces of the icosahedron. It has been postulated that CsoS4A and CsoS4B form the vertices of the carboxysome shell (Tanaka et al., 2008). The β-carboxysomal genes are found scattered throughout the genome in 3 or 4 different clusters as shown by the gene diagram in Figure 1. Typically, these clusters include ccmK2K1LMN and ccmK3K4 that contribute the structural components of the carboxysome. The RbcLXS operon encodes the large and small subunits of RuBisCO
  • 23. 10 (Cannon et al., 2001) and RbcX required for RuBisCO assembly (Saschenbrecker et al., 2007). Depending on the species, the genes ccmO and ccaA may or may not be present and even if they are present then they are found at separate loci than the ccmKLMN and rbcL, rbcS gene cluster. The gene cluster ccmK, ccmL, ccmM and ccmN codes for the polypeptides CcmK, CcmL, CcmM and CcmN respectively. The gene ccmO codes for the polypeptide CcmO and the gene ccaA codes for the carboxysomal carbonic anhydrase, CcaA, of β-carboxysomes. Anabaena PCC7120 possesses β-carboxysomes and the proteins involved in β-carboxysomes will be the focus in the following pages.
  • 24. 11 Anabaena sp. PCC7120 Synechocystis sp. PCC6803 Gloeobacter violaceus PCC7421 Synechococcus sp. PCC7942 = 1 kb rbcccmccm LM K1N K2O 997kb XL 1786 kb K3K4 361kb ccaA? all0863 all0866 all0867 all0868 all0318 all0865 all0864 all0317 alr1524 alr1525 alr1526 S slr0436 OXL 219 kb AK4K3LMN K2K1 2478 kb1742 kb956 kb 2551 kb slr1347 S slr0009 slr0010 slr0012 sll1028 sll1029 slr1838 slr1839 sll1032 sll1031 sll1030 ccm ccm ccmcca rbc L S A X 1595 kb rbcccm NM OLK 1475 kb281 kb KK ccm 1500 kb synpcc7942_0284 synpcc7942_0285 synpcc7942_1425 synpcc7942_1421 synpcc7942_1422 synpcc7942_1423 synpcc7942_1424 synpcc7942_1447 synpcc7942_1426 synpcc7942_1427 synpcc7942_1535 rbccca rbcccm LM K1N K2O 2249 kb XL S 2307 kb ccaA? gll2093 gll2094 gll2091 gll2092 gll2095 gll2096 glr2156 glr2157 glr2158 Figure 1: Arrangements of β-carboxysomal genes from Anabaena sp. PCC7120, Synechocystis sp. PCC6803, Synechococcus sp. PCC7942 and Gleobacter violaceus PCC7421. The ccm genes are depicted by red boxes, the Form 1B RuBisCO genes are depicted by the green boxes and carboxysomal carbonic anhydrase genes are depicted by blue boxes. Putative, uncharacterized carboxysomal carbonic anhydrase is denoted by the yellow boxes. The cyanobase ID for the corresponding gene is denoted below each box.
  • 25. 12 Carboxysomal proteins and their known functions RuBisCO Studies show that the CO2 fixing enzyme RuBisCO is mainly localized to the carboxysomes regardless of the amount of dissolved inorganic carbon or the stage of growth at which the cells are harvested (Mckay et al., 1993). The Form 1B RuBisCO is present as a hexadecameric enzyme composed of eight large and small subunits of RuBisCO encoded by rbcL and rbcS genes. It has been seen that the expression of a simple dimeric form of RuBisCO from Rhodospirillum rubrum in Synechocystis PCC6803 results in the loss of carboxysomes and produces high Ci requiring (HCR) phenotype (Pierce et al., 1989; Price and Badger, 1989a). Mutants lacking structurally intact carboxysomes require high CO2 to survive. This suggests that the localization of RuBisCO and other carboxysomal proteins in the carboxysomes is essential for normal growth. CcaA Many β-carboxysomes possess a carboxysomal carbonic anhydrase, CcaA. Amino acid sequence characterization has shown that CcaA is a β-type CA. Biochemical analysis has shown that up to 97% of the immunologically reactive CA polypeptide is associated with the carboxysomes (So and Espie, 1998). Electron microscopy and immunogold labeling of RuBisCO in Synechocystis PCC6803 mutant lacking CcaA reveal that the carboxysome number, size and shape are similar to wild type cells indicating that CcaA is not essential for maintaining carboxysomal structure or the
  • 26. 13 deposition of RuBisCO within the carboxysomes (So et al., 2002). However, the mutants of Synechocystis PCC6803 that lack CcaA require high CO2 (5% v/v) to survive, indicating that CcaA is vital for normal functioning of the cell (So et al., 2002). Protein- protein interaction studies and protein capture studies have recently shown that CcaA is localized to the carboxysome shell in complex with CcmM and CcmN, where it likely serves to catalyze HCO3 - dehydration in the vicinity of the CO2 fixing enzyme RuBisCO facilitating efficient CO2 fixation and normal growth (Cot et al., 2008). CcmK, CcmL and CcmO CcmK, CcmL and CcmO all play a structural role in carboxysomes as mutants of CcmK, CcmL or CcmO produce malformed carboxysomes and result in a HCR phenotype (Cannon et al., 2002; Price et al., 1993). CcmK and CcmL appear to be present in all β-carboxysomes so far identified while CcmO may or may not be present. CcmK and CcmO have high amino acid sequence similarity to CsoS1 of α-carboxysomes while CcmL has high amino acid sequence similarity to CsoS4A and CsoS4B polypeptides present in α-carboxysomes (Table 1) (Cannon et al., 2002). Four variants (paralogs) of CcmK (CcmK1-K4) are found in Anabaena PCC7120 as well as other species such as Synechocystis PCC6803. The X-ray crystal structure of CcmK2 and CcmK4 from Synechocystis PCC6803 has been elucidated. The crystal structure reveals that the CcmK family of proteins forms hexamer plates with a central pore and, therefore, most likely forms the flat facets of the polyhedral carboxysomal body (Kerfeld et al., 2005). The central pore in the CcmK hexamers consists of positively charged amino acid residues and this may serve a functional role in the carboxysome such as maintaining the
  • 27. 14 metabolite flux between the carboxysomes and the cytosol (Kerfeld et al., 2005). Analysis of the crystal structure of CcmL has shown that CcmL forms pentamers (Tanaka et al., 2008). Hypothetically, these pentameric proteins could reside at the vertices of icosohedral carboxysomes, joining the 20 planar triangular faces that form the surface of the carboxysome (Figure 2) (Tanaka et al., 2008). The deletion of the ccmL gene results in the formation of elongated carboxysomes which further supports the function of CcmL as a structural protein (Price et al., 1993; Tanaka et al., 2008). Pentamers at vertices (CcmL) Hexamers on flat faces (CcmK) Figure 2: A proposed structure of a β-carboxysome showing the localization of pentameric protein, CcmL, and hexameric protein, CcmK forming the carboxysome shell. (Adapted from Tanaka et al., 2008)
  • 28. 15 Table 1: Protein components of β-carboxysomes and α-carboxysomes. β-carboxysome proteins α-carboxysome proteins with β-carboxysome homologs α-carboxysome proteins unique to α-cyanobacteria CcmK1 CcmK2 CcmK3 CcmK4 CsoS1A CsoS1B CsoS1C CcmL CsoS4A, CsoS4B CcmM None CsoS2 CcmN None CcmO CcaA (variable) (deeply divergent) CsoS3 RbcL Form IB RbcL Form IA RbcS RbcS RbcX (Rbc assembly) None
  • 29. 16 CcmM and CcmN CcmM and CcmN are unique to β-carboxysomes as are CsoS2 and CsoS3 to α- carboxysomes (Table 1). The CcmM and CcmN proteins share considerable amino acid sequence similarity among species. The definitive function of CcmM and CcmN has not been determined, but mutants of CcmM and CcmN have HCR phenotype which shows that they are essential for normal carbon assimilation (Ludwig et al., 2000; Price et al., 1993). Mutation in CcmM yields cells that require high CO2 for growth. These cells are able to concentrate Ci internally but are unable to effectively utilize the Ci pool in CO2 fixation (Ludwig et al., 2000). Ultrastructural examination shows the mutants lacked carboxysomes (Ludwig et al., 2000). Yeast two-hybrid analysis and in-vitro pull down experiments show that CcmM N-terminal region can interact and form a complex with both CcmN and CcaA (Cot et al., 2008). Both the C-terminal and N-terminal region of CcmM interact with CcmK and CcmL (Cot et al., 2008). The aforementioned evidence leads to the conclusion that CcmM has a key structural organizational role in the formation of carboxysomes and the bicarbonate dehydration complex. Present model of carboxysome function Research indicates that both α and β carboxysomes contain RuBisCO packed within the core and surrounded by a protein shell. Current models of carboxysome function suggest that HCO3 - and RuBP diffuse from the cytosol to the carboxysome core, possibily via the pores in CcmK hexamers, where HCO3 - is dehydrated to CO2. RuBP binds to RuBisCO and is subsequently carboxylated using the recently formed CO2 as a substrate. In α-carboxysome, HCO3 - dehydration is catalyzed by CsoS3. In some β-
  • 30. 17 cyanobacteria CcaA catalyzes dehydration of HCO3 - . In these strains (Figure 3), it is thought that cytosolic bicarbonate binds to CcmM and is vectorially channelled to CcaA which lies on the interior region of the carboxysome shell, resulting in the dehydration of the bicarbonate within the carboxysome interior (Cot et al., 2008). The localized generation of a high concentration of CO2 near the active site of RuBisCO promotes CO2 fixation and reduces RuBP oxidation by outcompeting O2 for binding sites. Mutants lacking CcaA or CcmM require high CO2 to survive (Ludwig et al., 2000; So et al., 2002). Therefore, the presence of CcmM and CcaA is vital for the dehydration of bicarbonate and to the overall efficiency of the CCM in promoting photosynthetic fixation of CO2. However, recent findings suggest that this model for bicarbonate dehydration may not be applicable to all β-cyanobacteria as genome analysis has revealed that a number of β-cyanobacteria appear to lack a CcaA homolog (So and Espie, 2005)!
  • 31. 18 HCO3 - Rbc 1B Rbc 1B Rbc 1B AA M M KL N Ncarboxysome shell cytosol carboxysome interior CO2 RuBPRuBP Key L= CcmL N=CcmN M=CcmM K=CcmK A=CcaA Rbc=Rubisco Figure 3: The present model of the carboxysome shows that CcmM as the central organizing protein that binds to other carbosysomal shell protein and channels bicarbonate to the CcaA. The CcaA dehydrates the bicarbonate to CO2. The localized generation of a high concentration of CO2 near the active site of RuBisCO promotes CO2 fixation and reduces RuBP oxidation by outcompeting O2 for binding sites. (Adapted from Cot et al., 2008) CO2CO2
  • 32. 19 Research objectives Amino acid sequence similarity searches of the deduced proteome of the β- cyanobacterium Anabaena PCC7120 and a number of other strains have failed to identify homologs of CcaA. Based on this observation, it can be hypothesized that Anabaena PCC7120 does not require CA activity to form a functional carboxysome. Alternatively, Anabaena PCC7120 carboxysomes may contain a CA protein with a novel amino acid sequence that is not identifiable by homology searches. A third possibility is that an existing previously identified protein may substitute the function of CA. My initial goal is to perform a bioinformatics analysis of the proteome to assess the presence or absence of a bicarbonate dehydration complex in Anabaena PCC7120 and to search for a potential / alternate candidate for CcaA in Anabaena PCC7120. The main experimental objectives of my thesis is to determine if carboxysomes of Anabaena PCC7120 possess CA activity and, if so, explore the CA activity of potential candidate protein(s) identified using bioinformatic analysis and determine if the activity is due to a previously identified protein or a novel protein.
  • 33. Materials and methods Bioinformatics study An in silico search of the deduced proteome of Anabaena PCC7120 was conducted to identify potential carboxysomal carbonic anhydrases candidates using the databases at NCBI and Cyanobase. Protein basic local alignment search tool (BLASTP http://blast.ncbi.nlm.nih.gov/Blast.cgi) was used to search the proteome of Anabaena PCC7120 using amino acid query sequences from bonafide carbonic anhydrases of different classes (Table 2). The amino acid sequences in the proteome of Anabaena PCC7120 which had the highest similarity to the query sequences were analyzed using the web based algorithm SignalP 3.0 ( http://www.cbs.dtu.dk/services/SignalP/) using a gram negative bacterial model for signal peptides and peptide cleavage sites. The identified sequences were also analyzed using default parameters using the TatP 1.0 algorithm (http://www.cbs.dtu.dk/services/TatP/) for a specific twin arginine signal sequence with RRNFL motif. Sequence alignments were done using ClustalX2 (http://www.clustal.org/) using default parameters. Predicted 3D structural model of CcmM was created using the online protein modeling algorithm in Phyre (http://www.sbg.bio.ic.ac.uk/phyre/) using default parameters. Superpose (http://wishart.biology.ualberta.ca/SuperPose/) was used to superimpose the predicted structure of CcmM on Cam. The superimposed structures were viewed in Swiss-Pdb Viewer (http://spdbv.vital-it.ch/). The RMS values considering the spatial differences in the α-carbons and all the backbone atoms of the amino acids for the superimposed structures were calculated using iterative magic fit function in Swiss-Pdb Viewer. 20
  • 34. 21 Table 2: List of query sequences used to search the proteome of Anabaena PCC7120 to identify a putative carbonic anhydrase. Query sequence accession number in:Class of carbonic anhydrase Query sequence species Cyanobase NCBI β-class Synechocystis PCC6803 slr1347 AAC46375 Halothibacillus neapolitanus EEG96215ε-class Prochlorococcus marinus MED4 NP_892671 α-class Human AAH27890 γ-class Methanosarcina thermophila 1THJ-A Organisms and growth conditions The filamentous cyanobacterium Anabaena sp. UTCC387 was obtained from the University of Toronto Culture Collection, Toronto, Canada. The culture record indicates that this strain was deposited as Anabaena sp. PCC7120 from the Pasteur Culture Collection, Paris, France. Thus, the strain UTCC387 and PCC7120 are equivalent. Cultures of Anabaena PCC7120 were grown in BG11 medium at 30ºC (Rippka, 1979). Small scale cell cultures were grown in 100 mL of unbuffered BG11 medium in 250 mL Erlenmeyer flasks with constant shaking at 120 rpm. The cultures were continuously illuminated using Cool White and Gro-lux fluorescent lamps with an average
  • 35. 22 photosynthetic photon flux density (PPFD) of 25 umol m-2 s-1 . Large scale cultures were grown in 10 L carboys (Nalgene) using 6 L of BG11 buffered with 25 mM HEPES at pH 8. Typically, these cultures were grown using 5% CO2 (v/v) as the carbon source to enhance growth rate and biomass. The gas stream was then changed to normal air (0.035% CO2 [v/v]) 72 h prior to experiments, in order to enhance additional carboxysome biosynthesis (Mckay et al., 1993). Cultures of the heterotrophic bacterium Escherichia coli strain BL21 (Novagen) were maintained in Luria-Bertani (LB) media (1% [w/v] tryptone, 0.5% [w/v] Bacto yeast extract and 171 mM NaCl) containing 1.4% w/v agar. Transformed E. coli BL21 strains containing pET protein expression vectors were grown in LB media supplemented with 100 μg mL-1 ampicillin. Liquid cultures were grown in 250 mL of LB media in 1 L Belco flasks with shaking at 250 rpm at 37°C for 12 h. Isolation of carboxysomes Prior to experiments, illuminated Anabaena PCC7120 cells were grown with 5% CO2 for 8 days followed by 3 additional days with air bubbling. Cells were collected by centrifugation at 5000 x g for 10 min in 250 mL centrifugation flasks (Nalgene) at room temperature. Cell pellets were pooled by resuspension in a small amount of retained media, the bottles were washed with 5 mL of lysozyme buffer (0.6 M sucrose and 20 mM Tes-NaOH at pH 7.5) and the combined solution was again centrifuged at 5000 x g for 10 min at room temperature. The cell pellet was then resuspended in 40 mL of lysozyme buffer containing 2 mg mL-1 of egg white lysozyme and incubated at 37°C for 2 h in dark with occasional mixing. Following lysozyme treatment, the cell suspension was
  • 36. 23 centrifuged at 5000 x g for 10 min at room temperature and the supernatant was discarded. The precipitated cells were resuspended in 4 mL of lysozyme buffer plus 21 mL of “breaking” buffer (20 mM Tes-NaOH and 5 mM EDTA at pH 7.0) and 1/100 (v/v) dilution of a protease inhibitor cocktail (Sigma). The resuspended cells were kept on ice for 5 min. To disrupt the cells, the cell suspension was passed through an ice-cold French Pressure Cell (Aminco) at 8,000 psi. The lysate was collected, centrifuged at 12,000 x g for 10 minutes at 4°C and the supernatant containing the soluble fraction was retained. To obtain a carboxysome-enriched fraction from the soluble portion of the crude extract, the soluble fraction was diluted with 3 volumes of 1X EM buffer (40 mM EPPS- NaOH and 27 mM MgSO4 at pH 8.0) containing 20% (v/v) Percoll (Amersham Biosciences) and 0.133% Triton X-100. The mixture was allowed to incubate on ice for 30 min to allow for magnesium mediated aggregation of the carboxysome with the Percoll beads. The aggregated carboxysomes were collected by centrifugation at 12,000 x g for 20 min at 4°C. The carboxysomal aggregates were washed twice with 15 mL 0.75X EM buffer containing 1% Triton X-100 and centrifuged at 12,000 x g for 20 min at 4°C to remove thylakoid membranes. A final wash with 15 mL of 0.75X EM buffer was carried out before resuspending the precipitate in 1 mL of 0.75X EM buffer containing 20% glycerol. The partially purified carboxysomes were separated in 100 μL aliquot in microfuge tubes. The carboxysome isolation protocol is based on that developed by So et al and Long et al (Long et al., 2007; So et al., 2002).
  • 37. 24 Polyacrylamide gel electrophoresis Proteins from lysates or carboxysome preparations were separated according to size by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) using Bio-Rad Mini-PROTEAN II electrophoresis system. Polyacrylamide gels (8%, 10% or 12% [w/v]) were prepared as described by Ausuble et al. (1993). Electrophoresis was performed at 75 V for 10 min followed by 150 V for 60-100 min. Following electrophoresis, the gels were either stained for protein or the separated protein was transferred to nitrocellulose membranes for immunoblot analysis. Staining was carried out by immersing the gels in Coomassie Brilliant Blue R-250 (Bio-Rad), 50% (v/v) methanol and 10% (v/v) acetic acid for 1-2 hours. Gels were destained using 50% (v/v) methanol, 10% (v/v) acetic acid for 2-4 hours and dried in cellophane membranes to preserve them. Western blot analysis Polyacrylamide gels from SDS-PAGE were washed with transfer buffer (25 mM Tris-HCl pH 8.3, 190 mM glycine and 20% [v/v] methanol) to remove residual SDS- PAGE buffer. Proteins were transferred onto BioTrace NT pure nitrocellulose membranes (Pall Gelman) using the Mini Trans-Blot system (Bio-Rad) at a constant voltage of 100 V for 2 h. The apparatus was cooled using ice packs. Following transfer, nictrocellulose membranes were incubated overnight in a blocking solution of TBST (20 mM Tris-HCl pH 7.6, 140 mM NaCl, 0.1% [v/v] Tween-20 [Bio-Rad]) and 5% (w/v) skim milk powder (Carnation). The blot was subsequently incubated with primary antibody in TBST and 2% (w/v) gelatin (Bio-Rad) for 1 h at room temperature followed
  • 38. 25 by three washes of five min each with TBST. The antibody was used at a dilution of 1:2000 – 1:3000 depending on the antibody. For detection of the primary antibody, which was generated in rabbits, the blot was treated with goat anti-rabbit IgG conjugated to alkaline phosphatase (AP). The secondary antibody was diluted to 1:2000 in TBST containing 2% (w/v) gelatin (Bio-Rad) and incubated for 45 min at room temperature. Following incubation with the secondary antibody, the blots were washed three times with TBST and treated with AP visualization substrates (40 μL BCIP and 40 μL NBT [Amersham Biosciences]) diluted in 10 mL of AP buffer. The AP visualization substrate and blot were allowed to incubate until formation of purple precipitate after which the blot was washed with distilled water to stop further development. The blot was dried and digitized using a scanner. Recombinant protein expression E. coli cells containing recombinant, full length CcmM were generously provided by Dr. A K C So. The activity of CcmM was investigated using recombinant CcmM expression in E. coli cells (Novagen BL-21). Initially, small scale protein extracts were prepared to confirm the overexpression of CcmM. A single colony from a plate culture was used to inoculate 5 mL of LB media containing 100 µg mL-1 ampicillin at 37°C with shaking at 250 rpm for 5 h. After 5 h, the cells were induced with 1 mM isopropyl-β-D- thiogalactoside (IPTG) at 30°C for 4 hours. After induction, the cells were centrifuged at 8,000 x g for 1 min at 4°C and the pellet collected. The cell pellet was then sonicated 5 times with 5 s pulses at 10 W on ice. The sonicated cells were then centrifuged at 10,000
  • 39. 26 x g and the supernatant was collected. Samples of the supernatant were used to check for the presence of CcmM using PAGE followed by Western Blotting with CcmM antibody. Once the presence of CcmM was confirmed, large-scale cultures of transformed E. coli were grown. Overnight stock cultures of transformed E. coli were grown in 4 tubes containing 20 mL of LB/amp media at 37°C with shaking (250 rpm) for 14 h. The overnight cultures were used to inoculate 4 flasks containing 250 mL of LB/amp at 37°C with shaking (150 rpm) for 3-4 h until an OD600 reading of 0.8. Protein overexpression in the cultures was induced with 1 mM IPTG and the cells were allowed to grow for another 5 hours. After induction, the cultures were centrifuged at 5,000 rpm for 10 min at 4°C to collect the cells. The ice-cold cell pellets were subjected to 4 rounds of sonication at 5 W for 10 s each. The lysed cells were centrifuged and the supernatant was collected. Mass spectrometric carbonic anhydrase assay CA activity was measured using mass spectrometric assay by following the irreversible exchange of 18 O from 13 C18 O2 to H2O (Miller et al., 1997). The 18 O enriched carbonate-bicarbonate mixture was prepared by mixing 20 mg K2 13 C16 O3 (95%) with 0.936 mL of H2 18 O (99%). CA catalyses the following partial reaction: 13 C18 O2 + H2 16 O ↔ H+ + H13 C16 O18 O18 O ↔ 13 C16 O18 O + H2 18 O Repeated cycles of hydration/dehydration cause the irreversible loss of 18 O from 13 C18 O2 and result in the formation of 13 C16 O2 with an m/z of 45. Hence, CA activity can be assayed by measuring the rate of formation of mass 45 13 C16 O2 following the supply of 13 C18 O2 (m/z = 49). The measurement of the formation of m/z 45 13 C16 O2 from m/z 49
  • 40. 27 13 C18 O2 was carried out using an aqueous inlet, magnetic sector mass spectrometer (model MM 14-80SC; VG Gas Analysis) (So et al., 2002). CA assay of CcmM Recombinantly expressed CcmM was assayed for CA activity. E. coli cells overexpressing CcmM were grown at temperatures ranging from 16°C to 37°C using 1 mM or 2 mM of the inducer IPTG. CcmM overexpressed at different temperature and inducer concentration was assayed for CA activity. Zinc (0.05 - 2 mM), dithiothreitol (0.05, 0.1 and 2 mM), bicarbonate (0.6 mM and 20 mM) and RuBisCO (1 mg) was added to the assay buffer in independent CA assay to see if CcmM was activated by the aforementioned small molecules. pH of the assay buffer was changed from 7.2 to 8.5 to see if CcmM was activated by changes in pH. The CcmM lysate was concentrated 10X to ensure that enough protein was being assayed to exhibit CA activity. Oxygen was bubbled for 15 min to see if an oxidative environment activates CcmM. A combination of 1 mM zinc, 0.1 mM DTT and 20 mM bicarbonate at pH 8 was assayed to see if a combination of external factors activates CcmM (Table 3). Cellulose acetate assay In 1.0 L of running buffer (15.74 g TRIS, 0.82 g EDTA, 0.24 g NaCl and 4.41 g boric acid in 1.0 L of ddH2O adjusted to pH 8.9 with NaOH) Titan III Zip Zone cellulose acetate plates (Helena Laboratories, Mississauga) (76 X 76 mm) were soaked in ice-cold running buffer for 1 h. Using Whatman No. 1 chromatography paper, the plates were blotted dry. The cell extract was loaded onto the cellulose acetate plates quickly to
  • 41. 28 prevent the plate from drying using a twelve-lane Super Z applicator at a point approximately 1 cm away from the plate edge. For positive control bovine CA (Sigma C- 4831) was used. The plate was plated upside down as a bridge between two chromatography paper wicks (63 mm apart), dipped in isolated chambers containing the running buffer. Electrophoresis was performed at 200 V (about 5 mA per plate) for 40 to 60 min at 4°C. The electrophoresed plates were soaked in 20 mM Na-barbital buffer (pH 8.30) containing 0.1% (w/v) phenol red dye for 1 min. The cellulose acetate membrane was then placed face up on an ice-cooled aluminum plate and blotted to remove excess buffer from the cellulose acetate membrane. Using an inverted funnel, CO2 was blown on top of the cellulose acetate membrane. Positive CA activity appeared as yellow bands on a red background within 1-2 minutes. To increase the sensitivity, the plate was rinsed in running dH20 for about 15 s followed by the re-addition of phenol barbital buffer and CO2 treatment. The plate was meant to be photographed using a digital camera at the point of maximum contrast or when minor bands appear just before the background becomes yellow. However, the plate could not be photographed due to inadequate camera setup. T7 Tag affinity purification and pull down assays T7 Tag Antibody Agarose purification kit (Novagen 69025-3) was used in the pull down assays. The column was mounted on an appropriate support and then primed with several ml of Bind/Wash Buffer to eliminate air from the bottom frit area. The T7- Tag Antibody Agarose was left to equilibrate to room temperature. The 50% slurry of
  • 42. 29 T7-Tag Antibody Agarose was resuspended and 1 ml was transferred into the chromatography column. The storage buffer was allowed to flow to waste. The resin was equilibrated by washing with ten column volumes (10 mL) of 1X T7•Tag Bind/Wash Buffer at room temperature. The cell extract was brought to room temperature and loaded onto the column and incubated overnight at 4°C on orbital rotator. The flow through was collected in a 15 mL tube. The column was then washed with ten column volumes (10 mL) of 1X T7•Tag Bind/Wash Buffer. The flow through was saved for Western Blot analysis. Then, the Anabaena PCC7120 lysate was added and incubated overnight at 4°C on orbital rotator. The column was then washed with ten column volumes (10 mL) of 1X T7•Tag Bind/Wash Buffer. The flow through was saved for activity and Western Blot analysis. The resin which had been incubated with CcmM and Anabaena PCC7120 extract was resuspended in EPPS-NaOH pH 8 and checked for activity. Sulfonamide column binding assay Carboxysome preparations from previous extractions were pooled. A total volume of 1 mL of carboxysome extract was diluted with 2 mL of binding buffer (100 mM EPPS/NaOH, pH 8 +100 mM MgSO4). 0.5 mL of p-aminobenzyl sulfonamide (PAMBS) agarose was prepared for binding by washing 3 times with 1 mL of binding buffer. The washed PAMBS was placed into a Bio Rad Poly-Prep chromatography column and washed further with 2 mL of binding buffer until nearly dry. Then 3 mL of the carboxysome sample was added to the column and incubated for 1 hour on an orbital rotator. After incubation, the buffer was allowed to flow out of the column and drops 50 to 60 were collected as the first flow through. Then 6 mL of binding buffer was added
  • 43. 30 and drops 50 to 60 collected as second flow through. This was repeated until fourth flow through was collected. After the fourth flow through 50 μL of PAMBS resin was saved for analysis. As a positive control to see if PAMBS resin binds to bonafide carboxysomal CA, CcaA, the above mentioned procedure was carried out and the resin analyzed using Western blot. Protein sequencing The bound protein in the sulfonamide affinity column from Anabaena PCC7120 carboxysomal extract was removed by boiling in 10 mM Tris-HCL buffer for 5 min. The protein that was bound was extracted and sent to the Advance Protein Technology Center at Sick Kids Hospital for processing and sequencing using Liquid Chromatography MS/MS. The protein sequence was analyzed using Scaffold 2 software. CcmM209 and isolation on His.Bind® affinity column A segment of the ccmM gene encoding the N-terminal 209 amino acids was cloned into the pET28A expression vector using PCR based methods by Dr. Y Tadesse in our lab. This region encompasses the entire γ-CA domain of CcmM as predicted by the homology model. The primers used for PCR amplification and the DNA sequence of the insert used to express CcmM209 are in the Appendix. E. coli cells expressing recombinant 6XHis tagged CcmM209 were grown using the same protocol as described previously. Cells from 750 mL of culture were centrifuged and the pellet lysed using BugBuster® reagent from Novagen following the protocol provided with the kit. Cellular debris was removed by centrifuging at 16,000 x g for 20
  • 44. 31 min and the supernatant was collected. Binding of 6XHis-tagged protein was accomplished by passing the supernatant through a His-bind immobilized metal affinity chromatography (IMAC) column at a rate of 6 times the bed volume (1.5 mL) per hour. 400 uL of washed resin was retained for further analysis. Bound protein was eluted from the remaining resin using the previously described method.
  • 45. 32 Results In Silico Search for a Carboxysomal Carbonic Anhydrase. To identify potential carboxysomal carbonic anhydrase candidates within the proteome of the cyanobacterium Anabaena PCC7120, in silico searches of the NCBI database (http://www.ncbi.nlm.nih.gov/) and Cyanobase (http://genome.kazusa.or.jp/cyanobase/) were performed using reference sequences (NC_003272). The initial search query was a 274 amino acid protein (AAC46375, slr1347) from the cyanobacterium Synechocystis PCC6803 which is a bonafide carboxysomal carbonic anhydrase belonging to the β-class of CAs (Cot et al., 2008; So et al., 2002; So and Espie, 1998; So and Espie, 2005). The BLASTP search identified a single, high-probability ortholog within the Anabaena PCC7120 proteome having NCBI accession number NP_486950 and cyanobase ID all2910. The ortholog had 29.9% sequence identity to the bonafide β-CA, slr1347, of Synechocystis PCC6803. Reciprocal BLASTP analysis also revealed that all2910 had greater than 40% amino acid sequence similarity to an additional β-CA within the Synechocystis proteome, namely slr0051, a putative periplasmic protein. Proteins from Synechocystis, the carboxysomal carbonic anhydrase (CcaA) slr1347 and the periplasmic CA slr0051, along with all2910 contained the β-CA superfamily conserved domain motif (Fig. 4). This included amino acid residues involved in the formation of the zinc binding sites, dimer interfaces and active site clefts, all of which were highly conserved in both all2910 and CcaA. However, the β- CA superfamily conserved domain in all2910 and slr0051 was positioned towards the C- terminal end of the protein whereas the conserved domain in CcaA was positioned towards the N-terminal region of the protein (Fig. 4). Analysis of the amino acid 32
  • 46. 33 sequence of all2910 using the web-based algorithm SignalP 3.0 (http://www.cbs.dtu.dk/services/SignalP/) within a gram negative bacterial model revealed that the first 30 N-terminal amino acids of all2910 is a signal peptide with a probability prediction of 1.0. A signal peptidase cleavage site was also predicted between amino acid positions 30 and 31. Analysis of the same amino acid sequence using the TatP 1.0 algorithm (http://www.cbs.dtu.dk/services/TatP/) revealed that the amino acid sequence of all2910 contained a specific twin arginine signal sequence with an RRNFL motif starting at amino acid position 10. Unlike all2910, analysis of CcaA using the SignalP 3.0 server and TatP 1.0 server showed that it did not possess a generic signal peptide or a specific twin arginine signal sequence. A second bonafide carboxysomal carbonic anhydrase is encoded by csoS3 (So et al 2004, So & Espie, 2005) found within the cso operon of α-cyanobacteria and a variety of chemolithoautotrophic bacteria (Badger et al., 2002). Consequently, BLASTP searches of the Anabaena PCC7120 proteome using the ε-type CA, CsoS3, from Halothiobacillus neapolitanus (EEG96215) and Prochlorococcus marinus MED4 (NP_892671) as the query sequences. The searches did not identify proteins with significant amino acid sequence similarity. This was not unexpected as Anabaena PCC7120 does not contain α-carboxysomes or the cso operon. However, the search was necessary to exclude the possibility that Anabaena may have acquired csoS3 independent of the cso operon. T. weissflogii CA1 (TWCA1) is a rare δ-CA which has been discovered in a few species of marine diatoms (Roberts et al., 1997). The Anabaena PCC7120 proteome was
  • 47. 34 searched using TWCA1 (AAX08632) protein sequence as a template. The search did not identify any proteins with significant similarity. Although α-class CA has not been identified as a carboxysome constituent, α-CA homologs have been identified in bacteria. The most relevant example is EcaA from Anabaena PCC7120 and Synechococcus PCC7942 (So et al., 1998; Soltes-Rak et al., 1997). EcaA was identified as a CA homolog prior to the complete sequencing of the Anabaena PCC7120 genome and, consequently, other unidentified homologs may exist. The Anabaena PCC7120 genome database was searched using human CA1 (AAH27890), an α-CA, as the query sequence. The BLASTP search yielded a single polypeptide with 35% amino acid sequence similarity to the query sequence. This protein is identical to the previously identified EcaA and is identified in Cyanobase as all2929. Comparison of the amino acid residues within the α-CA conserved domain of all2929 with human CA1 (Fig 4) revealed a high degree of conservation within the active site residues as well as the zinc – binding residues. Previous study on EcaA by Soltes-Rak et al. (1997) indicated that EcaA cross-reacts with antibody directed against chicken α-CAII (Soltes-Rak et al., 1997). Immunogold labeling of thin sections of Anabaena PCC7120 and Synechococcus PCC7942 using chicken α-CAII (Soltes-Rak et al., 1997) antibody showed that EcaA appeared to reside on the surface of the cells, possibly in the periplasmic space (So et al., 1998; Soltes-Rak et al., 1997). Analysis of the EcaA amino acid sequence using the SignalP 3.0 and TatP 1.0 servers revealed that the first 33 N- terminal amino acids comprised a signal peptide with an estimated probability of 1.0. These results are consistent with the results of Soltes-Rak et al (1997). Again, a twin arginine motif, RRQLL, was identified as the specific signal sequence beginning at
  • 48. 35 residue 6. A signal peptidase cleavage site between amino acid residues 33 and 34 was also predicted. Thus, it appears that the putative periplasmic carbonic anhydrases of Anabaena PCC7120 are exported via the Tat, rather than the Sec, export machinery. Overexpressed EcaA and EcaB from Synechococcus PCC7942 and Synechocystis PCC6803 showed that these putative carbonic anhydrases were inactive (So et al., 1998). Finally, Anabaena PCC7120 proteome was searched to find proteins similar to the γ-type CA, using Cam from Methanosarcina thermophila (Protein Data Bank ID 1THJ_A) as the query. The BLASTP search identified the N-terminal 192 amino acids of the 555 amino acid polypeptide CcmM as possessing a left-hand parallel beta helix gamma CA conserved domain (Fig. 5). This region of CcmM had 40 % sequence identity and 65 % sequence similarity with an e value of 8e-39 . Cam is a homotrimeric protein that coordinates three separate zinc atoms at dimer interfaces (Kisker et al., 1996). Dissecting CcmM further revealed that amino acid residues responsible for the formation of the homotrimer interfaces and active site zinc binding residues, as denoted by the small pyramids in Figure 5, are conserved. This suggests that CcmM may have CA catalytic activity. However, there are distinct differences between Cam and CcmM. First, CcmM is a much larger protein having a C-terminal domain of about 355 amino acids that is absent in Cam. Within the C-terminal domain, CcmM has three RbcS-like motifs that are unlikely to play a role in CA activity. Previous work has suggested that the RbcS- like motifs may be involved in binding RuBisCO within the mature carboxysome structure. Interestingly, the amino acid sequence of Cam, deduced from the gene, indicates that Cam has a signal peptide typical of an extracellular enzyme, and in fact Cam appears to be a periplasmic enzyme in Methanosarcina (Kisker et al., 1996).
  • 49. 36 Analysis of the deduced amino acid sequence of CcmM using the SignalP 3.0 and TatP 1.0 servers shows that it is unlikely to possess a signal peptide with a calculated probability of 0.019. CcmM is therefore unlikely to be exported to the periplasm. Deduced amino acid sequences of Anabaena PCC7120 CcmM (all0865), Synechococcus elongatus PCC6301 CcmM (syc0133_c), Synechococcus elongatus PCC7942 CcmM, Thermosynechococcus eolongatus BP-1 CcmM (tll0944) and Metahnosarcina thermophila Cam (1THJ_A) were aligned using ClustalX2 (http://www.clustal.org/ ) (Figure 6). The N-terminal 210 residues of CcmM proteins were selected and aligned with the Cam. Only the N-terminal residues were chosen so as to truncate the Rbcs like repeats on the C-terminal end which is not present in Cam. The sequence alignment showed that the histidine residues involved in Zn2+ binding (His81, His117 and His122 in Cam) were conserved in all the aligned amino acid sequences among all the species (Kisker et al., 1996). The amino acid residues involved in trimer formation in Cam are Arg59, Asp61 and Asp76 (Kisker et al., 1996). The amino acids, Arg59 and Asp61, of Cam are conserved in all of the aligned amino acid sequences of CcmM. Asp76 of Cam are conserved in Anabaena and Thermosynechococcus CcmM but not by the CcmM of Synechococcus CcmM. In Syncechococcus CcmM the aspartic acid is substituted by a glutamic acid residue. This substitution is unlikely to have major functional impact as both aspartic acid and glutamic acid are negatively charged acidic amino acids. The glutamine residue, Gln75, is thought to form hydrogen bonds to H2O in the active site. It is observed that Gln75 is conserved among all the aligned CcmM and Cam (Kisker et al., 1996). Finally, cysteine residues (Cys194 and Cys200) in Anabaena PCC7120 CcmM and Thermosynechococcus elongatus BP-1 CcmM are conserved but
  • 50. 37 are not conserved in the CcmM of Synechococcus elongatus PCC6301 or Synechococcus elongatus PCC7942. The 3-dimensional crystal structure of Cam has been determined by X-ray diffraction and the spatial coordinates for the amino acids can be accessed in Protein Data Bank (http://www.pdb.org/) using accession number 1THJ (Figure 7a, b Cam). One of the unique features of the structure of Cam is that it contains seven complete turns of a left-handed parallel β-helix. The β-helix is topped by a short α-helix and a second α-helix is formed by the C-terminal portion of the protein. A short segment of the polypeptide chain at the N-terminus is in extended conformation. Cross-section of the β-helix of Cam resembles an equilateral triangle as each section of the β-helix contains three parallel sheets of almost equal length (Kisker et al., 1996). Each of the β-helix contains two β- turns. The X-ray crystal structure of CcmM from Anabaena PCC7120 or any other organism has not yet been determined. In order to compare the structural similarity of CcmM to Cam, which have high amino acid sequence similarity, a 3D structure of CcmM is required. Advances in protein structure determination and computational analysis has led to the development of a suite of programs capable of predicting 3 dimensional protein structure based on primary amino acid sequence. Chief among these are programs that utilize template-based homology modeling, fold-recognition and profile–profile matching algorithms (Kelley and Sternberg, 2009). A 3-D structural model was created for CcmM using the Phyre server (http://www.sbg.bio.ic.ac.uk/phyre/). The entire 555 amino acids of Anabaena PCC7120 CcmM could not be resolved by a single model. The best match model was created using 192 amino acids of CcmM starting from the 4th N-terminal
  • 51. 38 residue having an e-value of 4.57e-14 (Figure 7a,b CcmM). The predicted 3D structure of CcmM is strikingly similar to that of Cam. The left handed parallel β-helix observed in Cam is also observed in CccM. The cross section along the β-helix of the CcmM resembles an equilateral triangle just like that of Cam. The number, location and orientation of α-helices are the same in both Cam and CcmM. However, CcmM appears to have seven complete turns of the left handed parallel β-helix along with an additional β-strand in the region where Cam has the extended conformation. CcmM appears to have one β-turn compared to two β-turns in Cam. Visual inspection revealed that Cam and CcmM have similar structural properties. To assess the goodness of fit between the known crystal structure of Cam and the model for CcmM a superimposition program (Superpose, http://wishart.biology.ualberta.ca/SuperPose/ and SwissPdb, http://spdbv.vital-it.ch/ ) were used to compare mainchain spatial positioning (Figure 7a,b Cam + CcmM superimposed). The RMS (root mean square deviation) of the crystal structure after iterative magic fit function in SwissPdb Viewer using 185 α-carbons was 0.41 Å. The RMS calculated using all 740 atoms of the backbone was 0.52Å. Overall and side chain RMS values could not be obtained as Cam and CcmM do not contain the same number of atoms in the side chains. The Cam structure was determined originally at a resolution of 2.8 Å, therefore, RMS values of 0.41 Å and 0.52Å for α-carbons and backbone atoms respectively shows that the two structures match within the resolution of the available data. The active site Zn2+ -binding residues (His81, His117 and His122 in Cam) are in the same position and orientation in CcmM (His75, His102 and His107). The amino acid residues involved in trimer formation (Arg59, Asp61 and Asp76 in Cam) also appear to
  • 52. 39 be in the same position and orientation in CcmM (Arg57, Asp55 and Asp70 in CcmM). Finally, spatial orientation of the residue responsible for binding to H2O (Gln75 in Cam) is similar in Cam and CcmM (Gln69 in CcmM). The in silico search for a carboxysomal carbonic anhydrase within the proteome of Anabaena PCC7120 revealed that CcmM has sequence similarity within the N- terminal region to γ-CA, Cam, of Methanosarcina thermophila. Structural modeling and analysis revealed that CcmM has high structural similarity to Cam and is therefore a likely candidate CA in Anabaena PCC7120 which lacks a previously characterized carboxysomal CA. The other candidate CAs appears to be periplasmic carbonic anhydrases based on the presence of export leader sequences and other data.
  • 53. 40 Anabaena PCC7120 - all2910 Synechocystis PCC6803 - CcaA Anabaena PCC7120 EcaA - all2929 NP_486969 Human CA1 – AAH27890 Figure 4: Representation of the location of conserved β-CA and α-CA domains in selected proteins from Anabaena PCC7120, Synechocystis PCC6803 and humans obtained from the Conserved Domains Database from NCBI server.
  • 54. 41 Anabaena PCC7120 CcmM – all0865 Methanosarcina thermophila Cam - 1THJ_A Figure 5: Representation of the location of conserved γ-CA domains in selected proteins from Anabaena PCC7120 and Methanosarcina thermophila obtained from the Conserved Domains Database from NCBI server (figures not to scale).
  • 55. 42 Figure 6: Amino acid sequence alignment output from ClustalX2 for Cam of Mehanosarcina thermophila and CcmM of selected species of cyanobacteria. Abbreviation used for species are: Ana for Anabaena PCC7120, Thermo for Thermosynechococcus elongatus BP-1, Methano for Mehanosarcina thermophila, Syn6301 for Synechococcus elongatus PCC6301 and Syn7942 for Synechococcus elongatus PCC7942. Amino acid residues in red boxes denote the Zn2+ binding residues, those in blue boxes are amino acid residues involved in the formation of the trimer and the amino acid residues in the yellow box are those that are involved in hydrogen bonding to water. The green ovals show conserved cysteine residues among Anabaena PCC7120 CcmM and Thermosynechococcus elongatus BP-1 CcmM.
  • 56. 43 a)
  • 57. 44 Figure 7: 3D structure of Cam of Methanosarcina thermophila and CcmM of Anabaena PCC7120. Crystal structure of Cam obtained from x-ray crystallography of γ-CA, Chain A of Cam, is on the top left (PDB ID: 1THJ). On the top right is the computer modeled 3D structure of CcmM of Anabaena PCC7120 using Phyre server. The bottom 3D structure denotes the superposition of Cam and CcmM using Superpose followed by an iterative fit in Swiss-Pdb Viewer. Blue represents regions of β-sheets, red represents regions of α-helices and green represents regions of extended conformation. All images are from the same vantage point. a) is the side view and b) is a cross section through the β-barrel. b)
  • 58. 45 Mass spectrometric CA assays In order to establish a positive control, a bonafide carboxysomal CA, CcaA, from Synechocystis PCC6803, was recombinantly expressed in E. coli and assayed for activity (So and Espie, 2005). CA activity was determined mass spectrometrically as acceleration in the rate of loss of 18 O from 13 C18 O labeled CO2 (Figure 8a). At time 0, 2.5 μL of 13 C 18 O labeled bicarbonate was added to the reaction vial as a substrate for CA. Addition of 13 C 18 O labeled bicarbonate to the cuvette initially resulted in the appearance of three separate CO2 species having a mass/charge (m/z) ratio of 49, 47 and 45, corresponding to 13 C18 O18 O (green line), 13 C18 O16 O (red line) and 13 C16 O16 O (blue line). The mass spectrometer does not detect charged carbonate species as these do not cross the membrane of the inlet system. With time, a steady state reaction rate is achieved which corresponds to the uncatalyzed rate of 18 O loss from CO2. This is reflected in a decrease in m/z =49 and an increase in m/z = 45. Addition of 200 μL of E. coli lysate in which CcaA overexpression had been induced resulted in a rapid loss of the substrate, m/z =49 , and intermediate, m/z =47, along with a rapid increase in the rate of production of the product, m/z =45. The rapid increase in the formation of the product followed by the rapid decrease in the substrate indicates that CA activity was present in the E. coli lysate. The level of m/z=45 reached a plateau as all of the substrate was converted, indicated by near zero levels of m/z=49 and m/z=47. In order to calculate the fold increase in activity from the catalyzed reaction compared to the uncatalyzed reaction, the atom fraction of 18 O in m/z 49 compared to the total 18 O was calculated and plotted on a logarithmic scale (Figure 8b). Trend lines were plotted for the uncatalyzed section of the plot and the catalyzed section of the plot. The
  • 59. 46 equation of the trend line for the uncatalyzed reaction was y = -0.0346x + 1.7404 with a R2 for the equation to be 0.9948. The equation of the trend line for the catalyzed reaction was y = -1.7482x + 8.8172 with an R2 value of 0.9329. The R2 value being close to 1 indicates that the linear trend line is close to the actual data. Comparing the slopes of the uncatalyzed and catalyzed trendline shows a 50 fold increase in the rate of 18 O loss from m/z 49. As a negative control, equal amounts of E. coli lysate harboring the pET15b plasmid alone were added to the reaction vial after the substrate reached steady state reaction. Addition of the negative control lysate did not increase the rate of production of m/z =45 indicating that CA activity was absent.
  • 60. 47 Time (min) 13CO2species(uM) 0.0 2.0 4.0 6.0 8.0 10.0 12.0 14.0 0.0 2.0 4.0 6.0 8.0 10.0 CO2-45, uM CO2-47, uM CO2-49, uM 200 uL of E. coli lysate with overexpressed CcaA added (a) Time (min) Log18O2AtomFraction (b) Uncatalyzed y = -0.0346x + 1.7404 R 2 = 0.9948 With lysate y = -1.7482x + 8.8172 R 2 = 0.9329 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 0.0 2.0 4.0 6.0 8.0 18O Atom Fraction Uncatalyzed trend Trend with lysate Linear (Uncatalyzed trend) Linear (Trend with lysate) 50X Figure 8: (a) Mass spectrometric assay for CA activity of E. coli lysate in which CcaA expression was induced. (b) 18 O atom fraction in m/z 49 with respect to time. The trendlines represent the uncatalyzed reaction and reaction upon addition of lysate.
  • 61. 48 CA Activity in Synechococcus PCC7942 carboxysomes Mass spectrometric assays of concentrated intact cells of Anabaena PCC 7120 and of concentrated whole cell lysates of Anabaena PCC7120 failed to show any detectable CA activity (data not shown). This is consistent with past work that has shown that CA levels in many cyanobacteria are quite low (Ingle and Colman, 1975; So and Espie, 2005). What is clear is that the levels of both periplasmic CA and cytosolic CA are below the level of detection even in concentrated extracts and are therefore unlikely to be confounding factors in subsequent analysis. Similarly, if present, the level of carboxysomal CA in Anabaena carboxysomes is also likely to be low and difficult to detect. To determine the level of CA activity anticipated from carboxysomes containing a bonafide CA, CcaA, a carboxysome enriched fraction from Synechococcus PCC7942 lysate was obtained using standard procedures. Synechococcus PCC7942 is known to contain CcaA and was used to provide a positive control for carboxysome enriched fraction exhibiting CA activity. In these mass spectrometric assays the steady state concentration of CO2 (m/z = 49, 47, 45) was reversed to that observed in the positive control using overexpressed CcaA (Fig. 9a). This was due to the slow deterioration over time in the content of 18 O in the 13 C labeled CO2. Trace amounts of atmospheric moisture containing H2 16 O were inevitably introduced into the vial containing substrate 13 C 18 O. As a result there is less and less of m/z = 49 and increasing amounts of m/z = 47 and m/z = 45. Addition of 200 μL of Synechococcus PCC7942 carboxysome enriched fraction to the reaction cuvette resulted in an increase in the rate of formation of m/z = 45 with the
  • 62. 49 subsequent drop in m/z = 49 and m/z = 47 (Figure 9a). However, the rate of increase of m/z = 45 was not as rapid as in the case of overexpressed CcaA and did not plateau after 3 min. The rate of 18 O loss from m/z 49 was plotted on a logarithmic scale (Figure 9b). The equation of the uncatalyzed steady state reaction was y = - 0.0788x + 1.2838 with an R2 value of 0.9679 and with the carboxysome extract was y = - 0.1968x + 1.6522 with an R2 value of 0.9954. Again, the high R2 value of the trend lines indicates that data and the trend lines have high goodness of fit. The rate with the carboxysome extract was 2.5 fold faster than the uncatalyzed rate in the loss of 18 O from m/z 49 thus indicating 2.5 fold increase in the rate of reaction. The catalysis rate was much lower than of E. coli lysate with overexpressed CcaA. This was not unexpected as it is known that CcaA levels in cyanobacteria are much lower than the amount of CcaA produced by an overexpression vector in E. coli. Moreover, the substrate concentration m/z = 49 in which CA can act on was lower than the CA assay of overexpressed CcaA.
  • 63. 50 Time (min) 13CO2species(uM) (a) 0.0 2.0 4.0 6.0 8.0 10.0 12.0 0.0 2.0 4.0 6.0 8.0 CO2-45, uM CO2-47, uM CO2-49, uM 200 uL of Synechococcus PCC7942 carboxysome extract added Time (min) Log18O2AtomFraction (b) Uncatalyzed y = -0.0788x + 1.2838 R 2 = 0.9679 With Cbx extract y = -0.1968x + 1.6522 R 2 = 0.9954 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 0.0 2.0 4.0 6.0 8.0 18O Atom Fraction Uncatalyzed trend Trend with Cbx extract Linear (Uncatalyzed trend) Linear (Trend with Cbx extract) 2.5X Figure 9: (a) Mass spectrometric assay for CA activity of Synechococcus PCC7942 carboxysome extract (b) Log 18 O atom fraction in m/z 49 with respect to time. The trendlines represent the uncatalyzed reaction and reaction upon addition of carboxysome extract.
  • 64. 51 CA Activity in Anabaena PCC7120 carboxysomes Addition of 200 μL of a carboxysome enriched fraction from Anabaena PCC7120 resulted in an increase in the rate of formation of m/z = 45 indicating the presence of CA activity (Figure 10a). On a protein basis, this increase was similar to that observed for the Synechococcus PCC 7942 carboxysome enriched fraction. Growth, enrichment and activity assays were repeated independently seven different times confirming the presence of CA activity in the carboxysomal extract of Anabaena PCC7120. Addition of 20 μM of ethoxyzolamide, a classical inhibitor of CA, to the carboxysome enriched fractions and assay buffer eliminated CA activity; so did boiling (indicated by dotted line in Figure 10a). The rate of 18 O loss from m/z 49 was plotted on a logarithmic scale (Figure 10b). The equation of the trend line for the uncatalyzed steady state reaction was y = - 0.0636x + 1.6571 with an R2 value of 0.9978 and with the carboxysome extract was y = - 0.1645x + 1.6522 with an R2 value of 0.9954. The R2 value of the trend lines being close to 1 indicates that data and the trend lines have high goodness of fit. The rate with carboxysome extract was 2.6 fold faster than the uncatalyzed rate in the loss of 18 O from m/z 49 thus indicating 2.6 fold increase in the rate of reaction. These results demonstrate a low, but consistently present level of CA activity concentrated with the carboxysome enriched fraction of Anabaena PCC7120. The combined results also show that periplasmic CA or cytosollic CA activity contributed minimally to the activity detected in the carboxysome fraction. Moreover, this activity was eliminated by the addition of a classical CA inhibitor, ethoxyzolamide. Therefore, either there is a novel CA in the carboxysomes or one of the previously identified carboxysomal shell proteins may contribute CA activity. Bioinformatics analysis and in
  • 65. 52 silico search for a CA in Anabaena PCC7120 points towards CcmM as a likely candidate CA.
  • 66. 53 Time (min) 13CO2species(uM) (a) 0.0 2.0 4.0 6.0 8.0 10.0 12.0 0.0 2.0 4.0 6.0 8.0 10.0 CO2-45, uM CO2-47, uM CO2-49, uM 200 uL of Anabaena PCC7120 carboxysome extract added Time (min) Log18O2AtomFraction (b) Uncatalyzed y = -0.0636x + 1.2887 R 2 = 0.9922 With Cbx extract y = -0.1645x + 1.6571 R 2 = 0.9978 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 0.0 2.0 4.0 6.0 8.0 10.0 12.0 18O Atom Fraction Uncatalyzed trend Trend with cbx extract Linear (Uncatalyzed trend) Linear (Trend with cbx extract) 2.6X Figure 10: (a) Mass spectrometric CA assay of carboxysome enriched fraction from Anabaena PCC7120. The dotted line represents the assay with 20 μM of CA inhibitor, etoxyzolamide. (b) Log 18 O atom fraction in m/z 49 with respect to time. The trendlines represent the uncatalyzed reaction and reaction upon addition of carboxysome extract.
  • 67. 54 Activity of full length CcmM CcmM protein was recombinantly expressed in E. coli using the overexpression vector pET21. Western blot analysis was performed using a polyclonal antibody made against Synechococcus PCC7942 CcmM to detect the presence of CcmM in Anabaena PCC7120. Antibody staining and development showed bands at about 50 and 58 KDa in lanes A and C containing 10X and 100X dilution of E. coli lysate in which CcmM was induced respectively (Figure 11). The 50 KDa band was very faint in the 100X dilution and was not detected by the scanner. The estimated size of CcmM in Anabaena PCC7120 is 58 KDa, which is same as the detected band in the western blot. Western blot analysis of Anabaena carboxysome extract in lane D also produced bands at about 50 and 58 KDa with CcmM antibody indicative of the presence of CcmM in Anabaena PCC7120. Lane C which contained the negative control sample, E. coli lysate with induced empty vector, did not cross-react with CcmM antibody. Lysates of E. coli in which CcmM has been induced were assayed for activity. Addition of the lysate did not increase the steady state reaction as indicated by no increase in the rate of formation of m/z = 45 (Figure 12a). The rate of 18 O loss from m/z 49 was plotted on a logarithmic scale (Figure 12b). The equation of the trend line for the uncatalyzed steady state reaction was y = - 0.0915x + 1.433 with an R2 value of 0.9982 and with lysate was y = - 0.1004x + 1.4792 with an R2 value of 0.9986. The R2 value of the trend lines being close to 1 indicates that data and the trend lines have high goodness of fit. Comparing the slopes of the catalyzed and uncatalyzed reaction trend lines reveals that addition of the E. coli lysate with CcmM overexpressed resulted in minimal enhancement (1.1 fold increase) of the reaction. Growth, induction and CA activity assay
  • 68. 55 was repeated independently five times with minimal enhancement of CA activity. The total protein assayed to detect CcmM activity was comparable to the total protein used in CcaA assay. The CA activity enhancement using recombinantly expressed CcmM was minimal, even when compared to CA activity from carboxysomal extracts of Anabaena PCC7120 or Synechococcus PCC7942. E. coli lysate proteins without CcmM overexpressed also resulted in the minimal increase (1.1 fold increase) in 18 O loss from m/z 49 (Figure 13ab). Therefore, it may be concluded that E. coli lysate with overexpressed CcmM does not show enhanced CA activity as it has no increase in CA activity as compared to the control E. coli lysate with empty pET21 vector.
  • 69. 56 M = Size marker A = 10X dilution B = -ve control C = 100X dilution D = Anabaena PCC7120 lysate 75 50 100 37 25 20 M A B C D Figure 11: Western Blot using CcmM antibody for E. coli lysate overexpressing CcmM and Anabaena PCC7120 lysate. Lane M is the marker lane, lane A is the 10X dilution of E. coli lysate overexpressing CcmM, lane B is E. coli lysate containing no CcmM vector, lane C is 100X dilution of E. coli lysate overexpressing CcmM and lane D is Anabaena PCC7120 lysate. It is seen that CcmM corresponding to 58 kDa is expressed in the E. coli as well as in the Anabaena PCC7120 lysate.
  • 70. 57 Time (min) 13CO2species(uM) (a) 0.0 2.0 4.0 6.0 8.0 10.0 0.0 2.0 4.0 6.0 8.0 10.0 CO2-45, uM CO2-47, uM CO2-49, uM 200 uL of E. coli lysate with overexpressed CcmM added Time (min) Log18O2AtomFraction (b) Uncatalyzed y = -0.0915x + 1.433 R 2 = 0.9982 With lysate y = -0.1004x + 1.4792 R 2 = 0.9986 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 0.0 2.0 4.0 6.0 8.0 10.0 12.0 18O Atom Fraction Uncatalyzed trend Trend with lysate Linear (Uncatalyzed trend) Linear (Trend with lysate) 1.1X Figure 12: (a) Mass spectrometric assay for CA activity of E. coli lysate in which CcmM expression was induced. (b) Log 18 O atom fraction in m/z 49 with respect to time. The trend lines represent the uncatalyzed reaction and reaction upon addition of E. coli lysate in which CcmM expression was induced.
  • 71. 58 Time (min) 13CO2species(uM) (a) 0.0 2.0 4.0 6.0 8.0 10.0 0.0 2.0 4.0 6.0 8.0 10.0 CO2-45, uM CO2-47, uM CO2-49, uM 200 uL of E. coli lysate with empty pET21 vector Time (min) Log18O2AtomFraction (b) Uncatalyzed y = -0.0829x + 1.4884 R 2 = 0.9954 With lysate y = -0.0907x + 1.5243 R 2 = 0.9982 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 0.0 2.0 4.0 6.0 8.0 10.0 18O Atom Fraction Uncatalyzed trend Trend with lysate Linear (Uncatalyzed trend) Linear (Trend with lysate) 1.1X Figure 13: (a) Mass spectrometric assay for CA activity of E. coli lysate containing empty pET21 vector. (b) Log 18 O atom fraction in m/z 49 with respect to time. The trend lines represent the uncatalyzed reaction and reaction upon addition of E. coli lysate containing empty pET21 vector.
  • 72. 59 Attempts to activate CcmM Various factors were altered in attempts to activate CcmM as summarized in Table 3. Induction temperature was varied from 16°C to 37°C and the inducer, IPTG, concentration was alternated between 1 or 2 mM to determine if induction temperature and IPTG affected CcmM expression and activity. Equal quantity of total protein from E. coli induced at different temperatures and inducer concentrations were loaded into wells and electrophoresed. Western blot was carried out using CcmM antibody. The Western blot showed that at 37°C using 2 mM IPTG was the optimal temperature for the overexpression of CcmM (Figure 14). CcmM expression levels did not have any effect on CcmM having CA activity. Zinc is a candidate cofactor for activation of CcmM. E. coli lysate with overexpressed CcmM was incubated for 1 h with 0.05, 0.5, 1 and 2 mM Zn2+ and assayed for CA activity. Addition of Zn2+ at any of the tested concentration did not appear to have made CcmM active. Redox active reagent dithiothreitol (DTT) at concentrations of 0.05, 0.10 and 2 mM was added to the overexpressed CcmM extract and in the assay buffer to see if DTT can activate CcmM. DTT did not appear to have activated CcmM. Attempts were made to activate CcmM with excess bicarbonate. Addition of 20 mM bicarbonate did not activate CcmM. As CcmM has RbcS like C- terminal repeats, therefore RuBisCO may interact with CcmM C- terminal end and activate it in the process. 1 mg of purified RuBisCO from spinach (Sigma R8000-1UN) was added to 200 uL of E. coli lysate in which CcmM had been induced and incubated for 15 min. Addition of RuBisCO did not make CcmM CA active. Assaying at a pH range of 7.2 to 8.5 had no effect in activating CcmM. The CcmM lysate from E. coli was concentrated to 10X but that did not result in the activation of the CcmM overexpressed
  • 73. 60 protein. Bubbling of oxygen for 15 min did not activate CcmM. A combination of 1 mM Zinc, 0.1 mM DTT and 20 mM bicarbonate at pH 8 did not activate the CcmM.
  • 74. 61 1 2 3 4 5 6 7 8 250 150 100 75 50 37 25 20 Figure 14: Western blot using CcmM antibody for varying temperature and IPTG treatments during CcmM overexpression. Lane 1 contains –ve control containing E. coli lysate with empty expression vector, lane 2 contains standard protein marker with the size indicated on the left side of the blot, lane 3 contains overexpressed CcmM at 16°C induced with 2 mM IPTG, lane 4 contains overexpressed CcmM at 37°C with 2 mM IPTG for induction. Lanes 5, 6, 7, 8 and 9 contains overexpressed CcmM at 25, 30, 37 and 16°C respectively induced with 1 mM IPTG.
  • 75. 62 Table 3: Factors changed in attempts to activate CcmM. CcmM was overexpressed in E. coli and cell lysate was assayed for CA activity after manipulation of the factors listed below. Factors tested Probable effect on CcmM CA Activity Induction temperature (16 -37°C) Protein Folding No Zinc (0.05 – 2.0 mM) Part of active site No Dithiothreitol (DTT) (0.05, 0.1 & 2 mM) Thioredoxin association with CcmM No Bicarbonate (HCO3 - ) (20 mM) Non-active site regulator No RuBisCO (1 mg in 200 μL of E. coli lysate in which CcmM has been induced) C-terminal repeats of RbcS-like sequence. Possible binding to RbcL or RbcS. No pH (7.2 – 8.5) Enzyme requires optimum pH No 10X concentration Ensure enough protein No Bubbling oxygen for 15 min Oxidative activation No 1 mM Zinc, 0.1 mM DTT and 20 mM bicarbonate at pH 8 Combinatorial activation No
  • 76. 63 Cellulose acetate assay Cellulose acetate assays to separate CA active protein from carboxysome were performed. 0.01 mg, 0.001 mg, 0.0001 mg and 0.00001 mg of bovine CA was loaded as positive control and 0.01 mg of boiled bovine CA was loaded as negative control. Positive control using bovine CA produced transient yellow bands in a red background after passing CO2 over the cellulose acetate plates which disappeared immediately after removing the CO2 stream. This assay could detect a faint band up to the concentration of 0.0001 mg. Considering bovine CA has a molecular mass of 30 KDa, the limit of detection was determined to be 3.3 x 10-12 moles. As bovine CA has one active site per molecule this corresponds to 2 x 1012 active sites. Loading and electrophoresing 0.1 mg of carboxysomes extract yielded no distinct band. Very faint yellowing was observed around the carboxysome loading spot (picture could not be taken due to the lack of appropriate camera). T7 tag - affinity pull down experiments Attempts were made to pull down and concentrate the CA active agent in the Anabaena PCC7120 lysate using T7 tagged CcmM bound to anti-T7 tag resin. Western blot analysis using CcmM antibody showed that CcmM did bind to the anti-T7 resin as indicated by strong CcmM cross-reactivity in lane 3 of Figure 15. The T7-tagged resin with CcmM bound to it did not have CA activity. A 2 L culture of Anabaena PCC7120 was centrifuged and the precipitated cells were lysed in a French press to a volume of 10 mL. The Anabaena PCC7120 lysate was incubated with CcmM bound to the anti-T7 resin. After incubation the lysate was allowed to flow out of the column and the column
  • 77. 64 washed. Incubation of the Anabaena PCC7120 lysate with CcmM bound to anti-T7 resin was done with the aim to pull out proteins in the Anabaena PCC7120 lysate which has affinity to CcmM thus activating it or concentrating the active CA. The resin was then assayed for CA activity. However, no activity could be detected in the resin.
  • 78. 65 75 50 100 150 250 1 2 3 4 5 6 Figure 15: Western blot using CcmM antibody to show binding in anti-T7 agarose. Lane 1 contains the standard protein marker, lane 2 contains Anabaena PCC7120 lysate, lane 3 contains anti-T7 agarose which has been incubated with E. coli lysate expressing CcmM and washed to remove non specific proteins, lane 4 contains anti-T7 agarose negative control, lane 5 contains E. coli lysate expressing CcmM and lane 6 contains E. coli lysate with empty expression vector.
  • 79. 66 Sulfonamide pull down assays Sulfonamide inhibits CA by binding to the active site metal ion (Coleman, 1967). Since sulfonamides bind to the active site of CA, immobilized sulfonamide column can be used to isolate CA. As a positive control E. coli lysate with His tagged CcaA overexpressed was incubated with activated sulfonamide resin and then washed to remove proteins weakly bound. Western blot using CcaA antibody produced bands at 31 kDa and 38 kDa for the resin and eluate as seen in lane 1 and 2 of Figure 16. Very faint bands were observed in similar location in the wash buffer eluate in lane 4. No such bands were observed in the negative control containing E. coli lysate with no insert. Carboxysome extract of Anabaena PCC7120 was mixed with the sulfonamide binding buffer and incubated for an hour with activated sulfonamide resin. The sulfonamide resin was washed to remove proteins that were weakly bound. The sample was sent for protein sequencing to see what proteins bound to the sulfonamide column and if they correspond to any previously characterized protein from Anabaena PCC7120. 62 individual protein sequences were detected by the mass spectrometer. The identified protein sequences were compared to the NCBI proteome database for Anabaena PCC7120. The only identifiable protein related to the CCM was CcmM (Figure 17). The probability of match to CcmM was 100%. The rest of the identified proteins were membrane transporters or other membrane related proteins. Nineteen hypothetical proteins were also identified. The hypothetical proteins were modeled using PHYRE to identify if any of the hypothetical proteins had structural similarity to any previously characterized proteins involved in the CCM. However, all of the models of the hypothetical proteins produced had close similarity to membrane transporters and porins.
  • 80. 67 1 2 3 4 5 250 150 100 75 50 37 25 20 Figure 16: Western blot using CcaA antibody to show CcaA binding in sulfonamide resin. Lane 1 contains E. coli lysate with overexpressed CcaA flow through, lane 2 contains E. coli lysate with overexpressed CcaA resin bound, lane 3 contains the protein marker and lane 4 contains the wash buffer flow through. Lane five is negative control containing E. coli lysate with an empty expression vector.
  • 81. 68 gi|17228360 (100%), 59,464.3 Da carbon dioxide concentrating m echanis m protein [Nostoc sp. PCC 7120], gi|17130210|dbj|BAB72822.1| carbon dioxide concentrating m echanism protein [No 4 unique peptide s, 4 unique spe ctra, 4 total spectra, 57/555 am ino acids (10% coverage) M A V R S T A A P P T P W S R S L A E A Q I H E S A F V H P F S N I I G D V H I G A N V I I A P G T S I R A D E G T P F H I G E N T N I Q D G V V I H G L E Q G R V V G D D N K E Y S V W V G S S A S L T H M A L I H G P A Y V G D N S F I G F R S T V F N A K V G A G C I V M M H A L I K D V E V P P G K Y V P S G A I I T N Q K Q A D R L P D V Q P Q D R D F A H H V I G I N Q A L R A G Y L C A A D S K C I A P L R N D Q V K S Y T S T T V I G L E R S S E V A S N S L G A E T I E Q V R Y L L E Q G Y K I G S E H V D Q R R F R T G S W T S C Q P I E A R S V G D A L A A L E A C L A D H S G E Y V R L F G I D P K G K R R V L E T I I Q R P D G V V A G S T S F K A P A S N T N G N G S Y H S N G N G N G Y S N G A T S G K V S A E T V D Q I R Q L L A G G Y K I G T E H V D E R R F R T G S W N S C K P I E A T S A G E V V A A L E E C I D S H Q G E Y I R L I G I D P K A K R R V L E S I I Q R P N G Q V A P S S S P R T V V S A S S A S S G T A T A T A T R L S T E V V D Q V R Q I L G G G Y K L S I E H V D Q R R F R T G S W S S T G A I S A T S E R E A I A V I E A S L S E F A G E Y V R L I G I D P K A K R R V L E T I I Q R P gi|17228360 (100%), 59,464.3 Da carbon dioxide concentrating m echanis m protein [Nostoc sp. PCC 7120], gi|17130210|dbj|BAB72822.1| carbon dioxide concentrating m echanism protein [No 3 unique peptide s, 3 unique spe ctra, 3 total spectra, 47/555 am ino acids (8% coverage) M A V R S T A A P P T P W S R S L A E A Q I H E S A F V H P F S N I I G D V H I G A N V I I A P G T S I R A D E G T P F H I G E N T N I Q D G V V I H G L E Q G R V V G D D N K E Y S V W V G S S A S L T H M A L I H G P A Y V G D N S F I G F R S T V F N A K V G A G C I V M M H A L I K D V E V P P G K Y V P S G A I I T N Q K Q A D R L P D V Q P Q D R D F A H H V I G I N Q A L R A G Y L C A A D S K C I A P L R N D Q V K S Y T S T T V I G L E R S S E V A S N S L G A E T I E Q V R Y L L E Q G Y K I G S E H V D Q R R F R T G S W T S C Q P I E A R S V G D A L A A L E A C L A D H S G E Y V R L F G I D P K G K R R V L E T I I Q R P D G V V A G S T S F K A P A S N T N G N G S Y H S N G N G N G Y S N G A T S G K V S A E T V D Q I R Q L L A G G Y K I G T E H V D E R R F R T G S W N S C K P I E A T S A G E V V A A L E E C I D S H Q G E Y I R L I G I D P K A K R R V L E S I I Q R P N G Q V A P S S S P R T V V S A S S A S S G T A T A T A T R L S T E V V D Q V R Q I L G G G Y K L S I E H V D Q R R F R T G S W S S T G A I S A T S E R E A I A V I E A S L S E F A G E Y V R L I G I D P K A K R R V L E T I I Q R P Figure 17: Mass spectrometric analysis of proteins isolated from the sulfonamide column. Mass spectrometric analysis identified stretches of amino acid sequences highlighted in yellow. BLASTP search of the NCBI database using the identified amino acid sequences identified the protein to be CcmM of Anabaena PCC7120. b) Second mass spectrometric analysis of the same sample. CcmM of Anabaena PCC7120 was again identified. a) b)
  • 82. 69 Analysis of Anabaena PCC7120 CcmM209 Results of the sulfonamide pull down assay again lead to CcmM as the carboxysomal CA in Anabaena PCC7120, but its lack of CA activity is problematic. Potentially, the long C-terminal region of CcmM, when outside the carboxysome environment, may cause torsional strain on the protein leading to a distorted active site in the N-terminal γ-CA domain resulting in the lack of activity. In order to circumvent the influence of the C-terminal end, a His tagged truncated version of CcmM containing only the N-terminal 209 amino acids (CcmM209) was constructed. CA activity assays of freshly prepared E. coli lysate over-expressing CcmM209 proved to be negative (not shown). However, lysate incubated with the thiol oxidizing agent diamide (20 mM) for 1 hour, exhibited CA activity (Figure 18a). The equation of the trend line for the uncatalyzed steady state reaction was y = -0.0617x + 1.7954 with a R2 value of 0.9994 and the equation of the trend line for diamide treated E. coli lysate expressing CcmM209 was y = -0.1668x + 2.0197 with an R2 value of 0.9986 (Figure 18b). Comparing the slopes of the uncatalyzed reaction to the reaction with diamide treated E. coli lysate expressing CcmM209 showed a 2.7 fold increase from the steady state uncatalyzed rate. CcmM209 was concentrated by binding to the His.Bind affinity resin. After binding, the resin from the His.Bind affinity column was also assayed for CA activity. CcmM209 bound to the resin also exhibited CA activity (Figure 19a). The equation of the trend line for the uncatalyzed steady state reaction was y = -0.0678x + 1.8769 with a R2 for the equation to be 0.9987 and the equation of the trend line for CcmM209 bound to His.Bind affinity column was y = -0.2508x + 2.2235 with an R2 value of 0.9954 (Figure 19b). The resin bound CcmM209 showed a 3.7 fold increase in activity compared to the
  • 83. 70 steady state uncatalyzed rate (Figure 19b). Freshly prepared E. coli lysate in which CcmM209 was not expressed displayed no CA activity nor did the resin through which the lysate was passed (Figure 20ab).