|
|
||||||||
Systemics |
2Department of Botany, University of Florida, Gainesville, Florida 32611 USA; 3The Natural History Museums and Botanical Garden, University of Oslo, P.O. Box 1172 Blindern, NO-0318 Oslo, Norway; 4Molekylärsystematiska laboratoriet, Naturhistoriska riksmuseet, Box 50007, SE-104 05 Stockholm, Sweden; 5Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
Received for publication December 9, 2003. Accepted for publication August 26, 2004.
| ABSTRACT |
|---|
|
|
|---|
Key Words: Amborella AP3 B-class basal angiosperms MADS PI
| INTRODUCTION |
|---|
|
|
|---|
enicová et al., 2003
Some of the best-known MADS-box genes control the organ identity and initiation of the flower (Theißen et al., 1996
, 2000
; Theißen and Saedler, 2001
). Studies in Arabidopsis and Antirrhinum have shown that MADS-box genes encode the B function of floral development, which together with A specifies petals, and with C discriminates stamens from carpels (C only) (Coen and Meyerowitz, 1991
; Meyerowitz et al., 1991
; Coen et al., 1991
). In Arabidopsis, the B function genes are AP3 and PI. Further complexity in the specification of floral organ identity involves the redundant functions of the SEPALLATA MADS-box genes, which are collectively required for petal, stamen, and carpel identities. Protein products of these genes dimerize, and the tetramer (or multimer) configuration is probably the active state in cells (Egea-Cortines et al., 1999
; Honma and Goto, 2000
; Ferrario et al., 2003
).
Two subclades of B-function genes have been recognized, corresponding to AP3- and PI-homologues, respectively (reviewed in Theißen et al., 2000
). Phylogenetic analyses have revealed a complex pattern of gene duplication and divergence in B-class genes. Kramer et al. (1998)
and Kramer and Irish (2000)
suggested that the ancestral gene of the AP3/PI lineages underwent duplication yielding the AP3 and PI lineages at some time well before the diversification of the angiosperms. Another major gene duplication, which yielded the TM6 and euAP3 lineages, is proposed to have occurred in the AP3 lineage just prior to the diversification of the core eudicots (Kramer et al., 1998
; Kramer and Irish, 2000
), a clade that comprises approximately 75% of all flowering plants (Drinnan et al., 1994
).
PI-family genes have a highly conserved region of amino acid sequences at the C-terminal end called the "PI motif." Members of the AP3 family have a less well-conserved "PI derived motif," which can be aligned with the PI motif of the PI family. AP3 genes also have a well-conserved "euAP3" or "paleoAP3" motif at the C-terminal end. A sequence similar to the paleoAP3 motif is also seen in B-class genes of the fern Ceratopteris (Münster et al., 1997
). Therefore, the presence of the paleoAP3 motif is a synapomorphy (shared, derived state) of B- and related gene families (GGM13-, GGM2-, DAL12-, and CJMADS1-homologues), and the loss of the paleoAP3 motif is believed to be a synapomorphy for the PI lineage (Kramer et al., 1998
; Winter et al., 2002a
).
The exon lengths of MADS gene families are well conserved in most cases (Johansen et al., 2002
). For example, exon 6 in both AP3- and PI-homologues is 45 bp long (Johansen et al., 2002
; Winter et al., 2002a
). Additionally, the PI-homologues reported to date are all distinguished from the AP3-homologues by an exon 5 that is 30 bp rather than 42 bp in length (Johansen et al., 2002
; Winter et al., 2002a
). In most other MADS gene families, exons 5 and 6 are generally 42 bp (Johansen et al., 2002
; Winter et al., 2002a
).
The overall framework of angiosperm phylogeny has crystallized (e.g., Qiu et al., 1999
; D. Soltis et al., 2000
; Zanis et al., 2002
) (Fig. 1). A series of studies identified the same early-branching basal angiosperms (Mathews and Donoghue, 1999
; Qiu et al., 1999
; P. Soltis et al., 1999
; Parkinson et al., 1999
; Barkman et al., 2000
; Graham and Olmstead, 2000
; D. Soltis et al., 2000
; Zanis et al., 2002
; Borsch et al., 2003
): Amborella (Amborellaceae) is sister to all other extant angiosperms, either alone, or with the water lilies (Nymphaeaceae). Amborellaceae and Nymphaeaceae are followed by the Austrobaileyales clade (Fig. 1).
|
We sought to improve both the taxonomic coverage and our understanding of phylogenetic relationships of AP3- and PI-homologues among critical lineages of flowering plants. Our goals were to: (1) isolate and sequence both AP3- and PI-homologues from early-branching angiosperms, as well as from several previously unsampled lineages of core eudicots; (2) trace structural changes in the AP3 and PI lineages, including motif regions and lineage-specific insertions/deletions; (3) conduct phylogenetic analyses of B-class genes to determine how well these gene trees track organismal phylogeny based on results from recent multigene studies; and (4) estimate the timing of the duplication event that yielded the AP3 and PI lineages.
| MATERIALS AND METHODS |
|---|
|
|
|---|
RT-PCR and cDNA sequence determination
Reverse transcription was performed following the manufacturer's directions using SuperScript II RNase H-reverse transcriptase (Invitrogen, Carlsbad, California, USA) and polyT primer (5'-CCG GAT CCT CTA GAG CGG CCG C(T)17-3'). PCR (polymerase chain reaction) was performed using a B-class gene-specific primer (5'-GGG GTA CCA AYM GIC ARG TIA CIT AYT CIA AGM GIM G-3') and the polyT primer used in reverse transcription (Kramer et al., 1998
). PCR conditions were those employed by Kramer et al. (1998)
. We also used additional primers for PCR: AP3-PIf (5'-GSK MGI GGI AAG ATC KAG AT-3') and newAP3-PIf (5'-AAC MGG CAR GTG ACG TAY TC-3') (designed by Michael Zanis). PCR bands over 800 bp in size were excised from the agarose gel and purified using the Geneclean II Kit (Q·Bio Gene, Carlsbad, California, USA). Purified DNAs were cloned using the TOPO TA Cloning Kit (Invitrogen). Plasmid DNAs were purified from cloned cells using the e.Z.N.A. Plasmid Miniprep Kit (Omega Bio-tek, Doraville, Georgia, USA). Cycle sequencing reactions were performed using the CEQ DTCS-Quick Start Kit (Beckman Coulter, Fullerton, California, USA), and cDNA sequences were determined using a CEQ 8000 sequencing system (Beckman Coulter). Because the positions of all B-class gene-specific primers that we used in this study were approximately 75 bp after the start codon, 5' end sequences of each gene were not included in this study.
To determine the genomic DNA sequence of Amborella trichopoda, we designed specific primers for Amborella PI-homologues at the MADS domain (AF1P; 5'-AGC GGA ATA CTG AAG AAG GC-3') and the 3' end of the translating region (AF6P; 5'-TGC TGA AGA TTG GGT TGG-3') for PCR and sequencing. We compared the genomic sequence with the cDNA sequence to determine the sizes of the exons and introns. PCR conditions for genomic DNA analyses followed Kim et al. (2001)
.
Alignment
A data set was constructed of 103 previously reported B-function genes obtained from GenBank and 24 new sequences that we produced (excluding Am.tr.AP3-2; see Results and Discussion) (Table 1). Three different alignments, referred to as alignments I, II, and III, were used for phylogenetic analyses. For all three matrices we first aligned the amino acid sequences and used this amino acid alignment to produce a data set of aligned nucleotide sequences. We then analyzed both nucleotide and amino acid sequences phylogenetically (see next section). Alignment I includes all available AP3- and PI-homologues from angiosperms. Amino acid sequences of these genes were aligned using CLUSTAL X (Thompson et al., 1997
) with the default options and then adjusted manually. In alignment II, we divided sequences into small subgroups based on organismal relationships (e.g., basal angiosperms, Magnoliales, early-diverging eudicots, monocots) or sequence similarity (e.g., euAP3 and TM6). We aligned sequences within each subgroup first. We also placed both AP3- and PI-homologues of basal angiosperms (Amborella, Nuphar, and Illicium) in a subgroup because we recognized similar shared amino acid sequences between AP3 and PI of Amborella, including the "DEAER" motif. Based on aligned sequences of this subgroup, other subgroups were subsequently combined into the alignment using the "profile alignment" method in CLUSTAL X (file-to-file alignment). We adjusted previously reported motif regions following Kramer et al. (1998)
. The alignment of the MIK-region in alignment II was the same as that of alignment I; however, alignments I and II differed in the more variable C-domain region. Alignment III includes not only angiosperm sequences, but also three published gymnosperm B-class genes (Mouradov et al., 1998
, 1999
). These gymnosperm sequences were added to alignment II.
|
Phylogenetic analyses
Three different methods of phylogenetic inference were used: maximum parsimony (MP), support weighting (Farris, 2001
), and Bayesian (Huelsenbeck and Ronquist, 2001
). The maximum parsimony analyses for all matrices were conducted using PAUP* 4.0b10 (Swofford, 2001
). The search strategy involved 100 random addition replicates with TBR branch swapping saving all optimal trees. To assess support for each node, bootstrap analyses (Felsenstein, 1985
) were performed using 100 bootstrap replicate heuristic searches with 10 random taxon addition replicates and TBR branch swapping saving all optimal trees.
We applied the support weighting method (Farris, 2001
) to the MIKC-DNA-III and MIK-DNA-III alignments. This method measures the degree to which changes in a character (site) are concentrated in the supported branches of a tree. Jackknife resampling was used to generate randomly selected suites of initial weights in successive support weighting, and this provides a way of assessing the stability of successive weight results (Farris, 2001
). Support values for the support weighting tree were generated by parsimony jackknifing (Farris et al., 1996
) of the original data matrix using 1000 replicates.
Bayesian analyses were conducted using MrBayes 2.01 (Huelsenbeck and Ronquist, 2001
) for MIKC-DNA-III and MIK-DNA-III. We used uniform prior probabilities and the general time-reversible + gamma + I model of molecular evolution. This model of molecular evolution was selected as the optimal model (Akaike information criterion) by ModelTest (Posada and Crandall, 1998
). We ran four chains of Markov Chain Monte Carlo (MCMC), sampling every 1000 generations for 1 000 000 generations, starting with a random tree. Stationarity was reached at approximately generation 35 000; thus, the first 35 trees were the "burn in" of the chain, and phylogenetic inferences are based on those trees sampled after generation 35 000.
Estimation of divergence time
We calculated MP branch lengths and optimized these using PAUP* 4.0b10 onto a phylogenetic tree based on recent multigene studies (Qiu et al., 1999
; P. Soltis et al., 1999
; Zanis et al., 2002
). Two data sets were analyzed: one contained the euAP3 genes, along with the paleoAP3 and PI genes, and the second contained the TM6, plus the paleoAP3 and PI genes. Separate analyses of the MIK and MIKC regions for each data set were conducted. Trees with branch lengths were transformed into ultrametric trees using nonparametric rate smoothing (NPRS) (Sanderson, 1997
) as implemented in TREEEDIT (version 1.0 alpha 10 by A. Rambaut and M. Charleston). The characteristic pollen of the eudicots, combined with their extensive fossil record, places the origin of the eudicots at 125 mya (Hughes, 1994
), one of the firmest dates in the paleobotanical record. This minimum age for eudicots was used to calibrate the tree. This calibration point was alternatively applied to the eudicot node in the AP3 and PI clades of each tree. To compute error estimates for the ages, we reapplied the NPRS procedure to 100 bootstrapped matrices obtained by resampling the data irrespective of codon position using PAUP*4.0b10 (cf. P. Soltis et al., 2002
).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
The 5' ends of the translating region of B-class gene sequences have generally not been determined in previous studies because genes were amplified and sequenced using primers designed for the 5' end of the translating region. Likewise, we did not determine the sequence of this region of most sequences that we report. In our data matrix, which consists of the 24 new full or nearly full-length sequences that we obtained (excluding Am.tr.AP3-2) and other sequences from GenBank, only 23 of 54 PI-homologues and 34 of 72 AP3-homologues include the complete 5' end of the translating region. We may assume that all B-class genes have the same starting point at the 5' end of the translating region in the aligned sequence matrix because the MADS domain is so strongly conserved (Ma and dePamphillis, 2000
) and all previously published sequences having the complete 5' end of the translating region have the same starting point in the aligned sequence matrix. The description we provide dealing with gene length is based on this reasonable assumption. The putative size ranges that we estimate for each gene family are 163239 amino acids (aa) for PI-homologues and 195261 aa for AP3-homologues. Thus, the average size of AP3-homologues is approximately 20 aa larger than PI-homologues.
Structural features
The AP3-homologues of Magnoliales that we report did not possess the PI-derived motif typical of AP3-homologues (Kramer et al., 1998
; Kramer and Irish, 2000
), but rather a deletion of 10 aa unique to this clade (Fig. 2). In the remaining AP3-homologues studied, the PI-derived motif was loosely aligned with the PI motif (Fig. 2). We recognized two different groups of sequences in the PI-derived motif. Group A (hereafter referred to as PI-derived A, abbreviated as PI-dA) occurs in all basal angiosperms, with the exception of Magnoliales, and in early-diverging eudicots (Fig. 2). The PI-dA sequences have almost the same number of amino acids as found in the PI motif (Fig. 2) and are easily aligned with the PI motif. The second group of the PI-derived motif, PI-derived B (abbreviated as PI-dB), occurs in all core eudicots (both TM6 and euAP3 lineages). These PI-dB sequences were difficult to align among themselves, as well as with the PI-dA sequences, and with the PI motif (Fig. 2). Less than half of the consensus sequence of the PI-derived motif (FxFRLQPxSQPNLH) (Kramer et al., 1998
) is identical to the PI-dB sequences. The PI-dB group typically has many histidine (H) and asparagine (N) residues, which are not present (or are rare) in PI-dA (Fig. 2).
|
The translated AP3- and PI-homologues of Amborella (Am.tr.AP3 and Am.tr.PI) share a number of amino acid strings in the first half of exon 7 (also a part of the C-domain), which is the most variable region of B-class proteins (Fig. 3). In contrast, conspecific AP3- and PI-homologues from flowering plants other than Amborella are not easily alignable in this region. The alignment of AP3- and PI-homologues revealed a PI-specific gap (Fig. 3: box B) and two AP3-specific gaps (Fig. 3: boxes C and D).
|
It is also possible that these shared aa strings are homoplasious, having arisen alternatively via convergent mutations in the two genes or gene conversion. The lack of such similarity between AP3- and PI-homologues of other species argues against convergent mutations in the Amborella genes. To test for the possibility that either ancient or recent gene conversion was responsible for the shared aa strings between Amborella AP3- and PI-homologues, we used GENECONV (Sawyer, 1989
), with a range of mismatch penalties (gscale = 0, 1, 2). Similarities between the Amborella genes were not identified as significant "fragments." We therefore conclude that the shared strings are most likely ancestral motifs. If the shared motifs are indeed ancestral between the AP3- and PI-homologues, our results support Amborella as the sister organism to all other extant angiosperms, a placement that has been well supported by plastid, mitochondrial, and nuclear rRNA genes (e.g., Mathews and Donoghue, 1999
; Qiu et al., 1999
; P. Soltis et al., 1999
; Parkinson et al., 1999
; Graham and Olmstead, 2000
; D. Soltis et al., 2000
; Zanis et al., 2002
; Borsch et al., 2003
; Hilu et al., 2003
).
The size of exon 5 in all PI-homologues reported to date is 30 bp. However, exon 5 in other MADS genes (e.g., AP3-, AP1-, and AG-homologues) is generally 42 bp in length (Johansen et al., 2002
; Winter et al., 2002a
). A length of 30 bp for exon 5 was thought to be a general feature of all PI-homologues, but genomic DNA sequences of PI-homologues have been reported only for PI (Tröbner et al., 1992
) and GLO (Goto and Meyerowitz, 1994
), each of which has an exon 5 of 30 bp. However, by sequencing genomic DNA, we determined the size of exon 5 in Amborella to be 42 instead of 30 bp.
Using the confirmed splicing sites from Am.tr.PI derived in this study and previously reported for PI, GLO, AP3, and DEF, we determined the putative splicing sites for all other B-class genes (Fig. 3). We recognized the length of exon 5 in each sequence because the splicing sites of exon 5 of these aligned five genes matched exactly. This comparison revealed that Amborella and Nuphar both have an exon 5 of 14 aa (42 bp), whereas all other angiosperms analyzed have an exon 5 that is 10 aa (30 bp) in length (Fig. 3). Illicium (star anise), representing Austrobaileyales (the sister group to all other angiosperms after Amborella and Nymphaeaceae; the sequence of Illicium PI was kindly provided by Elena Kramer), also has an exon 5 of 10 aa. Because all AP3-homologues, as well as most other MADS families, have an exon 5 of 14 aa, a deletion of four amino acids must have occurred in exon 5 just after the branches leading to Amborella and Nymphaeaceae, providing additional support for their basal placement in angiosperms.
Exon 5 is located in the K3 portion of the K domain, which plays an important role in the specificity and strength of dimerization between AP3 and PI proteins in both Arabidopsis and Antirrhinum (Zachgo et al., 1995
; Riechmann et al., 1996
). In Arabidopsis, deletion of K3 affects the strength of AP3/PI dimerization more than does deletion of the I or C domains (Yang et al., 2003
). In Amborella and Nuphar, the K3 amphipathic-helix motif has a different structure than the (abcdefg)n repeat (a and d being hydrophobic residues) known from core eudicots. Thus, the large deletion in exon 5 that occurred in angiosperms after the branch to Nymphaeaceae may have generated different AP3/PI heterodimerization capacity after the branches to Amborella and Nymphaeaceae.
Certain amino acid residues are crucial for AP3/PI heterodimerization in Arabidopsis: 97-E and 98-N in PI and 98-N and 102-R in AP3 (Yang et al., 2003
). We confirmed that these residues are conserved in most AP3- and PI-homologues of angiosperms. However, some of these residues are not present in AP3- and PI-homologues of Amborella: Am.tr.PI has 97-D, Am.tr.AP3-1 has 98-S, and Am.tr.AP3-2 has 102-Q (Fig. 4). The PI-homologue of Amborella (Am.tr.PI) has 98-N and 102-R, which are conserved residues in most AP3-homologues. One of the AP3-homologues of Amborella (Am.tr.AP3-2) has 97-E and 98-N, which are conserved residues in most PI-homologues, and another Amborella AP3-homologue (Am.tr.AP3-1) has 97-E and 102-R, which are conserved residues in PI- and AP3-homologues, respectively. An AP3/PI homologue from the gymnosperm Gnetum gnemon, GGM2, has 97-E, 98-N, and 102-R (Winter et al., 2002b
; Yang et al., 2003
) (Fig. 4).
|
The C domains apparently signal for assembly of ternary protein complexes for several MADS proteins in core eudicots (Egea-Cortines et al., 1999
; Ferrario et al., 2003
). Indeed, higher-order multimers are probably the active state of B-function MADS-box proteins (Egea-Cortines et al., 1999
; Honma and Goto, 2000
; Theißen, 2001
; Ferrario et al., 2003
), and the composition of possible active multimers may only be limited by (i) available MADS translation products in cells and (ii) the capacity of different AP3 and PI dimers to recognize these translation products for effective multimerization. The strong similarity between Amborella AP3 and PI C-domain amino acid sequences, if it extends functionally to indistinct AP3-/ PI-homologue recognition in vivo, could suggest that the range of possible multimer combinations in Amborella may be substantially greater than that of eudicots or monocots.
Our data suggest that the evolution of B-function MADS-box gene controls in the development of the earliest flowers was rather dynamic, with different "experiments" tried. Amborella trichopoda may have been the most biochemically flexible angiosperm in terms of B-protein multimers that could operate during its reproductive development. However, the amino acid structural evidence we present suggests that this inferred flexibility was rapidly lost before the bulk of the angiosperm radiation occurred. The unique phylogenetic position of Amborella coupled with its apparently ancestral and flexible mode of B-gene function make it a model organism that should be studied more intensively.
Structural evolution: summary
We reconstructed a hypothesis of the structural evolution of B-class genes in angiosperms using a simplified summary phylogeny (Fig. 1). The common ancestor of AP3 and PI possessed an exon 6 of 14 aa (42 bp). An insertion occurred in this ancestral B-class gene to produce an exon 6 of 15 aa (45 bp) (Winter et al., 2002a
). The ancestor of AP3 and PI also possessed an exon 5 of 14 aa (42 bp) (Winter et al., 2002a
). After the duplication and divergence of AP3 and PI, a deletion of four amino acids occurred in exon 5 of PI during the early diversification of the angiosperms. This deletion occurred after the nodes leading to Amborella and Nymphaeaceae, which retain the ancestral state. All other angiosperms have the derived conditionan exon 5 of 10 aa (30 bp).
Prior to the duplication yielding the PI and AP3 lineages, several characteristic motifs must have been present in the ancestral protein. These ancestral motifs include the PI and paleoAP3 motifs (Kramer et al., 1998
; Kramer and Irish, 2000
). Several modifications to the PI-derived motif occurred in the angiosperms. Following the divergence of the AP3 and PI lineages, the PI motif was retained in the PI lineage and was modified in the AP3 lineage to form the PI-derived motif. All angiosperms except the core eudicots have what we term the PI-dA motif. A deletion of 10 aa in this motif unites all members of Magnoliales. The PI-derived motif was also extensively modified in the early evolution of the core eudicots (both in the TM6 and euAP3 lineages) to form what we term the PI-dB motif.
Other structural changes also occurred in the early diversification of the AP3 lineage. For example, a portion of exon 7 was modified to yield the TM6 motif, which characterizes all members of the TM6 clade.
All motifs that are newly reported in this study were consistent in different alignments (see Materials and Methods) that we used in this study.
Phylogenetic analyses
Details of the trees obtained through phylogenetic analyses were sensitive to the alignment used. This is not surprising given the small number of characters (312348 aligned amino acids) and large number of genes (129 sequences). However, in general, the gene trees for both AP3 and PI roughly follow organismal phylogeny as inferred using multiple plastid, mitochondrial, and nuclear rRNA genes (e.g., Mathews and Donoghue, 1999
; Qiu et al., 1999
; P. Soltis et al., 1999
; Parkinson et al., 1999
; Barkman et al., 2000
; Graham and Olmstead, 2000
; D. Soltis et al., 2000
; Zanis et al., 2002
; Borsch et al., 2003
). Some local relationships among genes from closely related taxa were very similar regardless of the alignment, nature of the sequence (DNA or amino acid), or regions of the gene (MIKC or MIK) used, or the lineage investigated (AP3- or PI-homologues). The abbreviation for each matrix is indicated in Table 2; hereafter, we follow these abbreviations.
|
|
|
Most of the clades of sequences recovered within orders (e.g., Magnoliales, Laurales, and Poales) were consistently obtained with high bootstrap support regardless of the alignment used and with both amino acid and nucleotide sequences. For example, the clade of four AP3-homologues of Magnoliaceae, MpMADS7 (Magnolia kobus), Ma.gr.AP3 (Magnolia grandiflora), MfAP3 (Michelia), and LtAP3 (Liriodendron), received 85%, 75%, and 87% bootstrap support in analyses of the MIKC-AA-I, MIKC-AA-II, and MIKC-AA-III alignments, respectively. The clade of three PI-homologues of Magnoliaceae, MpMADS8 (Magnolia kobus), MfPI (Michelia), and LtPI (Liriodendron), received 74%, 70%, and 76% support in the analyses of MIKC-AA-I, MIKC-AA-II, and MIKC-AA-III alignments, respectively.
A well-defined clade of Ranunculales was observed only in the PI lineage in our analyses of the MIK-DNA-III alignment. In other analyses of other data sets, a subset of Ranunculales sequences typically was recovered, but not all Ranunculales sequences formed a clade. Many duplication events have occurred in Ranunculaceae (Kramer et al., 2003
). Hence, the evolutionary history of AP3 and PI in Ranunculaceae is apparently complex. If all orthologues have not been sampled for this clade, it is reasonable to expect that the placement of some sequences in the gene tree may not reflect the organismal tree because of extensive duplication and subsequent diversification.
Genes having relatively short sequences such as PhPI (Peperomia), nmads1 (Oryza), PnPI-1 (Papaver), CMB2 (Dianthus), and RAD2 (Rumex) were especially sensitive to the alignment and the method of analysis used. For example, PhPI was sister to the clade of euAP3 sequences in our analyses of alignment MIKC-AA-I. However, it was sister to the TM6 clade in analyses of alignment MIKC-AA-II. In contrast, PhPI was embedded in the TM6 clade in analyses of the MIK-AA-I alignment and was grouped together with some AP3 genes of Ranunculales (RfAP3-1 and RbAP3-1) in analyses of alignments MIKC-DNA-I and MIK-DNA-I. Lastly, PhPI was embedded in the euAP3 clade in analyses of MIK-DNA-III.
The TM6 lineage (Kramer et al., 1998
) was not recovered in all analyses. In our analyses of data sets MIKC-AA-I, MIKC-DNA-I, MIKC-AA-II, and MIKC-DNA-III, a TM6 clade was observed, but without bootstrap support >50%. Furthermore, the TM6 clade does not include CMB2, an unusual aa sequence that was placed in the TM6 clade by Kramer et al. (1998)
. However, as noted, CMB2 also lacks the TM6 motif and therefore does not appear to belong to the TM6 lineage.
The topology of the support weighted tree (Farris, 2001
) based on nucleotide sequences is very similar to that of the MP trees based on translated amino acids (Fig. 6). Magnoliales, monocots, and core eudicots were recognized in both the PI and AP3 lineages. Amborella is the sister to all other angiosperms in the PI lineage, but Illicium is sister to all other angiosperms in the AP3 lineage; jackknife support for both relationships is <50%. The TM6 and euAP3 lineages were found using support weighting, and CMB2 was included in the TM6 lineage.
Bayesian inference using nucleotide sequences produced a tree similar to those obtained with parsimony (Fig. 7). Many clades received relatively high posterior probabilities, much higher than the corresponding bootstrap values. The tendency of Bayesian analyses to yield posterior probability values higher than bootstrap values has been discussed (e.g., Huelsenbeck and Ronquist, 2001
; Suzuki et al., 2002
). Clades for which we obtained posterior probability values (x100) of 95100 in the MIKC analysis include (Fig. 8): Magnoliales (100), monocots (100), the euAP3 lineage (100), the TM6 lineage (100, including CMB2), and core eudicots (100) for the AP3 lineage; Magnoliales (100), Ranunculales (100), monocots (100), and the core eudicots (99) for the PI lineage.
|
Using our AP3- and PI-homologue data sets and NPRS (Sanderson, 1997
), we estimated that the duplication that produced the AP3 and PI lineages occurred approximately 260 mya (range of 230290 mya) (Table 3). This date places the duplication shortly after the split between extant gymnosperms and angiosperms and on the "stem" (ancestral) lineage of extant flowering plants. Extant seed plants originated approximately 290309.2 mya (Mapes and Rothwell, 1984
, 1991
), and most evidence indicates a very early split between the living gymnosperms and the line leading to angiosperms (P. Soltis et al., 2002
). The absence of AP3- or PI-specific homologues in extant gymnosperms was previously used to suggest that the AP3/PI duplication arose prior to the origin of the angiosperms (Doyle, 1994
; Purugganan et al., 1995
; Kramer et al., 1998
; Becker and Theißen, 2003
). However, few gymnosperms and none of the basalmost angiosperms had been examined, and it was therefore unclear whether the duplication occurred early in angiosperm history, just prior to the origin of angiosperms, early along the stem lineage leading to angiosperms, or even early in seed plant evolution. Although two classes of AP3/PI homologues have been suggested for gymnosperms based on comparison of motifs (Kramer and Irish, 2000
), phylogenetic analyses to date have not supported this distinction (Hasebe, 1999
; Kramer and Irish, 2000
).
|
|
|
|
| FOOTNOTES |
|---|
6 Authors for reprint requests. (dsoltis{at}botany.ufl.edu
and sangtae{at}botany.ufl.edu
) ![]()
| LITERATURE CITED |
|---|
|
|
|---|
Albert V. A. M. H. G. Gustafsson L. Di Laurenzio 1998 Ontogenetic systematics, molecular developmental genetics, and the angiosperm petal. In P. Soltis, D. Soltis, and J. Doyle [eds.], Molecular systematics of plants II, 349374. Kluwer, Boston, Massachusetts, USA
Alvarez-Buylla E. R. S. Pelaz S. J. Liljegren S. E. Gold C. Burgeff G. S. Ditta L. R. De Pouplana L. Martinez-Castilla M. F. Yanofsky 2000 An ancestral MADS-box gene duplication occurred before the divergence of plant and animals. Proceedings of the National Academy of Sciences, USA 97: 5328-5333
Barkman T. J. G. Ghenery J. R. McNeal J. Lyons-Weiler C. W. DePamphilis 2000 Independent and combined analyses of sequences from all three genomic compartments converge on the root of flowering plant phylogeny. Proceedings of the National Academy of Sciences, USA 97: 13166-13171
Becker A. K. Kaufmann A. Freialdenhoven C. Vincent M.-A. Li H. Saedler G. Theißen 2002 A novel MADS-box gene subfamily with a sister-group relationship to class B floral homeotic genes. Molecular Genetics and Genomics 266: 942-950[CrossRef][ISI][Medline]
Becker A. G. Theißen 2003 The major clades of MADS-box genes and their role in the development and evolution of flowering plants. Molecular Phylogenetics and Evolution 29: 464-489[CrossRef][ISI][Medline]
Bodt S. D. J. Raes Y. V. de Peer G. Theißen 2003 And then there were many: MADS goes genomic. Trends in Plant Science 8: 475-483[CrossRef][ISI][Medline]
Borsch T. K. W. Hilu D. Quandt V. Wilde C. Neinhuis W. Barthlott 2003 Noncoding plastid trnT-trnF sequences reveal a well resolved phylogeny of basal angiosperms. Journal of Evolutionary Biology 16: 558-576[CrossRef][ISI][Medline]
Coen E. S. J. S. Doyle M. Romero R. Elliot R. Magrath R. Carpenter 1991 Homeotic genes controlling flower development in Antirrhinum. Development (Supplement) 1: 149-156
Coen E. S. E. M. Meyerowitz 1991 The war of the whorls: genetic interactions controlling flower development. Nature 353: 31-37[CrossRef][Medline]
Drinnan A. N. P. R. Crane S. B. Hoot 1994 Patterns of floral evolution in the early diversification of non-magnoliid dicotyledons (eudicots). Plant Systematics and Evolution supplement 8: 93-122
Doyle J. J. 1994 Evolution of a plant homeotic multigene family: toward connecting molecular systematics and molecular developmental genetics. Systematic Biology 43: 307-328[CrossRef]
Doyle J. J. J. L. Doyle 1987 A rapid DNA isolation from small amount of fresh leaf tissue. Phytochemical Bulletin 19: 11-15
Egea-Cortines M. H. Saedler H. Sommer 1999 Ternary complex formation between the MADS-box proteins SQUAMOSA, DEFICIENS and GLOBOSA is involved in the control of floral architecture in Antirrhinum majus. EMBO Journal 18: 5370-5379[CrossRef][ISI][Medline]
Farris J. S. 2001 Support weighting. Cladistics 17: 389-394[CrossRef][ISI]
Farris J. S. V. A. Albert M. Källersjö D. Lipscomb A. G. Kluge 1996 Parsimony jackknifing outperforms neighbor-joining. Cladistics 12: 99-124[CrossRef][ISI]
Felsenstein J. 1985 Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39: 783-791[CrossRef][ISI]
Ferrario S. R. G. Immink A. Shchennikova J. Busscher-Lange G. C. Angenent 2003 The MADS box gene FBP2 is required for the SEPALLATA function in petunia. Plant Cell 15: 914-925
Graham S. W. R. G. Olmstead 2000 Utility of 17 chloroplast genes for inferring the phylogeny of the basal angiosperms. American Journal of Botany 87: 1712-1730
Goto K. E. M. Meyerowitz 1994 Function and regulation of the Arabidopsis floral homeotic gene PISTILLATA. Genes and Development 8: 1548-1560
Hasebe M. 1999 Evolution of reproductive organs in land plants. Journal of Plant Research 112: 463-474[CrossRef][ISI]
Hilu K. W. T. Borsch K. Müller D. E. Soltis P. S. Soltis V. Savolainen M. W. Chase M. Powell L. A. Alice R. Evans H. Sauquet C. Neinhuis T. A. Sotta J. G. Rohwer C. S. Champbell L. Chatrou 2003 Angiosperm phylogeny based on matK sequence information. American Journal of Botany 90: 1758-1776
Honma T. K. Goto 2000 Complexes of MADS-box proteins are sufficient to convert leaves into floral organs. Nature 409: 525-529[CrossRef][ISI]
Huelsenbeck J. P. F. Ronquist 2001 MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17: 754-755
Hughes N. F. 1994 The enigma of angiosperm origins. Cambridge University Press, Cambridge, UK
Johansen B. L. B. Pedersen M. Skipper S. Frederiksen 2002 MADS-box gene evolution-structure and transcription patterns. Molecular Phylogenetics and Evolution 23: 458-480[CrossRef][ISI][Medline]
Kim S. C.-W. Park Y.-D. Kim Y. Suh 2001 Phylogenetic relationships in family Magnoliaceae inferred from ndhF sequences. American Journal of Botany 88: 717-728