Arabidopsis thaliana cDNA Clone Homologous to the Genes Coding
for Cellulose Synthase
Robert J. Sims and Senthil Sankaralingam
BOT 380C Dr. K. Sathasivan
4/28/97
ABSTRACT
Cellulose is the primary component of plant cell walls. The only known enzyme to be
unique to the cellulose synthetic pathway is cellulose synthase. We have isolated and
partially sequenced a cDNA clone from Arabidopsis thaliana that contains regions of
homology [82%] to the Gossypium hirsutum protein CelA-2. The CelA-2 protein has
been shown to belong to a family of related proteins involved in cellulose synthesis,
specifically, the protein cellulose synthase. Southern analysis, using the cDNA clone as a
probe, showed that this A.thaliana gene is a member of a small gene family, consisting of
3 or 4 genes. mRNA transcription of this cDNA was found to be most prevalent in stem
and flower tissue, sights of active cell division.
INTRODUCTION
Cellulose is the principle structural material in plants, comprising 80% of their dry weight.
It has been estimated that cellulose accounts for over half the carbon in the biosphere;
approximately 1015 kilograms of cellulose is estimated to be synthesized and degraded
annually. The only enzyme known to be unique to the cellulose synthesis pathway is
cellulose synthase. There has been a great deal of effort in order to characterize the
enzyme(s) responsible for the synthesis of cellulose (1,4-B-D-glucan) in higher plants.
Until recently, a catalytic subunit for the synthesis of cellulose had not been characterized
in higher plants (3 and 9). Several genes and operons have been cloned from bacterial,
including Acetobacter xylinum, Escherichia coli, and Agrobacterium tumefaciens, and
plant species, including Gossypium hirsutum and Oryza sativa, which are involved in the
synthesis of cellulose. It appears, via recent studies (3), that there are conserved motifs
among common glycosyltransferases which use UDP-glucose or UDP-glucoseNAc as a
substrate. It appears that each essential region to cellulose synthesis contains a conserved
Asp residue; furthermore, an additional region contains a conserved QXXRW motif (10)
common in related glycosyltransferases. The amino acid sequence of the A. thaliana
cDNA we have isolated contains homologies to two of these essential regions, including
the QXXRW motif. Due to these observations, it is possible that this cDNA may code for
the catalytic subunit of cellulose synthase in A. thaliana. Further characterization by
Southern blot confirms the identity of a small gene family of cellulose synthase proteins
within A.thaliana. Additionally, the levels of gene expression are examined by Northern
blot.
MATERIALS AND METHODS
Isolation of the cDNA clone, Subcloning, and Minipreparation.
An unidentified cDNA insert, contained within the plasmid pZL1, was obtained from
the Arabidopsis Biotechnology Research Center. This plasmid was isolated from
Escherichia coli using a modified alkaline-lysis method (1, 7). The isolated plasmid was
subcloned into the pBluescript (Stratagene) phagemid after it was restriction cut with
EcoRI and HindIII (Promega). The pBluescript, containing the cDNA insert, was then
transformed into the E.coli host XL1-Blue using the CaCl2 method (11). The isolated
plasmid was then purified with a Quiagen Miniprep Kit (8). Selected transformants were
then amplified by PCR (5).
DNA Sequencing and Characterization.
The PCR product from above was sequenced using the Sanger dideoxy chain
termination method (12) along with the Sequenase Version 2.0 DNA sequencing kit (14).
The cDNA insert was also characterized by automated sequencing at the DNA
Sequencing Center (UT Austin). Two primers, SP6 (ATTTAGGTGACACTATAG) and
T7 (TAATACGACTCACTATAGGG), were used to sequence both strands. The insert
sequences were analyzed (ORF, translation, hyrophilicity plots, secondary structure) using
the Mac Vector software package; the sequence and amino acid comparisons were made
on NCBI's Blast server.
Isolation of Genomic DNA and Southern Analysis from A. thaliana.
The genomic DNA was isolated from A.thaliana using standard protocols (2, 15, 13).
The isolated genomic DNA was then digested using EcoRI, HindIII, and BamHI
(Promega). The digested fragments were then run on an 0.7% agarose gel and transferred
to a Zetaprobe nylon membrane (Biorad) via a Southern transfer (16). The membrane was
hybridized using the cDNA insert as a probe, labeled with ý-P32 dATP by random priming
(4).
Isolation of RNA and Northern Analysis from A. thaliana.
Total RNA was isolated from a variety of sources in A.thaliana, including whole plant,
leaf, flower, and stem using standard protocols (11). In addition, total RNA was isolated
from wounded plants at different time intervals (0, 1, 5, 8 hours) after wounding. The
RNA was run on a formaldehyde-1.2% agarose gel and transferred to a Zetaprobe nylon
membrane via a Northern Transfer (11). The RNA was hybridized using the same probe
as above (Southern).
RESULTS
DNA Sequence Analysis.
The recombinant plasmid in pBluescript was found to be 4.6 kb; therefore, the insert is
approximately 1.6 kb in length (pBluescript is 3.0 kb). The pZL1 plasmid is 5.9 kb in
length, with the cDNA insert (CelA homolog); this can be seen in Figure 1. The deduced
ORF for the cDNA insert sequence, along with the translated amino acid sequence found
to be homologous to cellulose synthase, is given for the T7 (A) and SP6 (B) primers in
Figure 2.
Protein Sequence Analysis.
Alignments between the cDNA insert amino acid sequence and three other related
genes are given in Figure 3. The comparisons are with the G.hirsutum CelA-2,
G.hirsutum CelA-1, and O.sativa CelA proteins. The homologies to the cellulose
synthase catalytic subunits are prevalent.
Southern Analysis.
The Southern blot appears to contain several bands for each of the three distinct
restricted samples. This would suggest a small gene family, possible 3 or 4 distinct
members. The Southern blot is shown in Figure 4.
Northern Analysis.
The Northern blot suggests that the highest concentration of mRNA for the cDNA
insert is present in stem and flower. The mRNA in leaves are expressed in much lower
concentrations and undetectable in whole plant. It appears that after wounding, mRNA
transcription of our cDNA insert is not as critical. Repair mechanisms most likely would
dominate over cellular division, where cellulose synthesis would be needed. The Northern
blot is shown in Figure 5. The major mRNA species is around 2.5 kb in length.
Hydrophilicity Analysis.
The Kyte-Doolittle hydrophilicity plots of the T7 and SP6 primed amino acids are
shown in Figures 6A and 6B, respectively. A hypothetical orientation of the cotton celA1
protein in the plasma membrane is given in Figure 6C. The homologies for the cDNA
insert to this cotton protein are given in Figure 6C.
Secondary Structure Analysis.
Proposed secondary structure for T7 and SP6 primed amino acid sequences are shown
in Figures 7A and 7B, respectively. Figure 7A shows inconsistences, perhaps due to
complex or infrequent secondary structure in the cytoplasmic region between regions B
and C, in Figure 6C. Figure 7B shows possible alpha-helices, perhaps corresponding to
the regions F and G, in Figure 6C.
Figure 1. Multi-cloning region in pZL1, where the cDNA insert (cel A) is inserted
between EcoRI and HindIII. The T7 and SP6 primer were later used for sequencing.
Figure 2A. The ORF nucleotide sequence for the T7 primer. The translated sequence is
the region of homology shown in figure 3.
Figure 2B. The ORF nucleotide sequence for the SP6 primer. The translated sequence is
the region of homology shown in figure 3.
Figure 3. The regions marked U-3 and U-4 are predicted to be critical in substrate
binding and catalysis. Important conserved homologies are in bold face; the stared regions
indicate critical amino acids. Adopted Pear, et al (6).
Figure 4. This is a picture of a Southern blot done under low stringency conditions. The
appearance of multiple bands under BamHI cut DNA suggests a small gene family (3 or 4
genes).
Figure 5. Northern blot indicating the varying expression of mRNA. The letters are as
follows (left to right): T8 (8 hours after wounding), T5 (5 hours), T1 (1 hour), T0 (time
0), S (stem), F (flower), L (leaves), W (whole plant). The major mRNA species is around
2.5 kb in length.
Figure 6. The hydrophobic chart for (A) T7 and (B) SP6 primed amino acid sequences (Kyte-Doolittle). Fig. C shows a hypothetical orientation of the cotton celA1 protein in
the plasma membrane. The T7 primer region corresponds to the cytoplasmic region
between the transmembrane region B and C. The SP6 primer region corresponds to the
regions E, F and G (C).
DISCUSSION
This Arabidopsis cDNA insert contains homologies to conserved subunits in plants and
bacteria responsible for the synthesis of cellulose. These homologies suggest that this
clone may have a critical role in the cell wall synthesis of Arabidopsis. There appears to
be four conserved subdomains within this family of cellulose synthase genes. Within these
subdomains, designated U-1 - U-4, there is a highly conserved Asp residue; in subdomain
four, there is a highly conserved QXXRW motif (10). The cDNA insert does contain the
U-3 and U-4 region; however, the U-1 and U-2 regions are not present. The size of the
full length mRNA was found to be 2.5 kb, while our cDNA insert is around 1.6 kb. This
suggests that our cDNA insert is not full length; the full length clone may reveal the U-1
and U-2 regions. Alternatively, these missing regions may not be a part of this gene
product; the G. hirsutum CelA-2 protein does not contain the conserved U-1 region. The
exact implications of this deleted region are not currently understood. Once the
A.thaliana cDNA is fully cloned, further assays can be done to clarify this issue. The
Southern blot analysis shows the existence of a small gene family of cellulose synthase
genes, possibly 3 or 4. Other higher plants (cotton) have shown similar data (6). Perhaps
several related proteins combine to form a multi-protein complex at the plasma membrane
which catalyzes the formation of cellulose. The highest level of mRNA transcription of
the cDNA insert occurs in the stem and flower. During the life cycle of a mature plant,
these are the sites where transcription would be predicted to be the highest. The stem and
flower are the sites of the highest cell division. Once this cDNA is cloned, further
experiments can shed more light on the role of this cDNA in cellulose synthesis.
REFERENCES
1. Birnboim, H.C. 1983. A rapid alkaline extraction method for the isolation of plasmid DNA. Methods. Enzymol. 100:243-255.
2. Chung, K.H., S.S. Korban and D. E. Buetow. 1994. Construction of genomic DNA library using high-molecular weight DNA prepared from peach leaves. Plant Mol. Bio. Reporter 12:304-309.
3. Delmer, D.P. and Amor, Y. 1995. Cellulose biosynthesis. Plant Cell 7:987-1000.
4. Feinberg, A.P. and B. Vogelstein. 1983, 1984 Anal. Biochem. 132:6 and 137:266.
5. Innis, M.A., Gelfand, D.H., Sninsky, J.J., and White., T.J., 1990. PCR Protocols. A guide to methods and applications. Academic Press. pp. 3-13.
6. Pear, J. R., Kawagoe, Y., Schreckengost, W.E., Delmer, D.P., and David M. Stalker. 1996. Higher plants contain homologues of the bacterial celA genes encoding the catalytic subunit of cellulose synthase. Proc. Natl. Acad. Sci. U.S.A. 93:12637-12642.
7. Protocol modified by Drs. Paul Kreig and Doug Melton, Harvard University. Protocols and Applications Guide. 2nd Edition, Promega.
8. QIAprep Plasmid Handbook for Standard Plasmid Miniprep. 1995. QIAGEN, Chadsworth, CA.
9. Ross, P., Mayer, R. and M. Benziman. 1991. Cellulose biosynthesis and function in bacteria. Microbiol. Rev. 55:35-38.
10. Saxena, I.M., Kudlicka, K., Okuda, K. and Brown, R.M., Jr. 1994. Characterization of genes in the cellulose-synthesizing operon (acs operon) of Acetobacter xylinum:
implications for cellulose crystallization. J. Bacteriol. 176: 5735-5752.
11. Sambrook, J., Fritsch, E.F. and Maniatis, T. 1989. Molecular Cloning - A Laboratory Manual. Cold Spring Harbour Press.
12. Sanger, F., S. Nicklen and A.R. Coulson. 1977. DNA Sequencing with chain terminating inhibitors. Proc. Natl. Acad. Sci. U.S.A. 74:5463-5467.
13. Schuler, M.A. and R.E. Zelinski. 1989. Methods in Plant Molecular Biology.Academic Press, Inc., San Diego, pp. 71-73.
14. Sequenase Version 2.0 DNA sequencing kit. Step by Step Protocols. United States Biochemical, Cleaveland, OH. pp. 1-25.
15. Tai, T. and Tangsley, S. 1991. Plant Mol. Bio. Reporter 8:297-303.
16. Zetaprobe Blotting Membranes. 1992. Instruction manual. Bio-Rad Labs. Hercules, CA.