Am. J. Bot. Subscribe to E-TOCs
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (53)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kim, S.
Right arrow Articles by Soltis, D. E.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Kim, S.
Right arrow Articles by Soltis, D. E.
Agricola
Right arrow Articles by Kim, S.
Right arrow Articles by Soltis, D. E.
(American Journal of Botany. 2004;91:2102-2118.)
© 2004 Botanical Society of America, Inc.


Systemics

Phylogeny and diversification of B-function MADS-box genes in angiosperms: evolutionary and functional implications of a 260-million-year-old duplication1

Sangtae Kim2,6, Mi-Jeong Yoo2, Victor A. Albert3, James S. Farris4, Pamela S. Soltis5 and Douglas E. Soltis2,6

2Department of Botany, University of Florida, Gainesville, Florida 32611 USA; 3The Natural History Museums and Botanical Garden, University of Oslo, P.O. Box 1172 Blindern, NO-0318 Oslo, Norway; 4Molekylärsystematiska laboratoriet, Naturhistoriska riksmuseet, Box 50007, SE-104 05 Stockholm, Sweden; 5Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA

Received for publication December 9, 2003. Accepted for publication August 26, 2004.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
B-function MADS-box genes play crucial roles in floral development in model angiosperms. We reconstructed the structural and functional implications of B-function gene phylogeny in the earliest extant flowering plants based on analyses that include 25 new AP3 and PI sequences representing critical lineages of the basalmost angiosperms: Amborella, Nuphar (Nymphaeaceae), and Illicium (Austrobaileyales). The ancestral size of exon 5 in PI-homologues is 42 bp, typical of exon 5 in other plant MADS-box genes. This 42-bp length is found in PI-homologues from Amborella and Nymphaeaceae, successive sisters to all other angiosperms. Following these basalmost branches, a deletion occurred in exon 5, yielding a length of 30 bp, a condition that unites all other angiosperms. Several shared amino acid strings, including a prominent "DEAER" motif, are present in the AP3- and PI-homologues of Amborella. These may be ancestral motifs that were present before the duplication that yielded the AP3 and PI lineages and subsequently were modified after the divergence of Amborella. Other structural features were identified, including a motif that unites the previously described TM6 clade and a deletion in AP3-homologues that unites all Magnoliales. Phylogenetic analyses of AP3- and PI-homologues yielded gene trees that generally track organismal phylogeny as inferred by multigene data sets. With both AP3 and PI amino acid sequences, Amborella and Nymphaeaceae are sister to all other angiosperms. Using nonparametric rate smoothing (NPRS), we estimated that the duplication that produced the AP3 and PI lineages occurred approximately 260 mya (231–290). This places the duplication after the split between extant gymnosperms and angiosperms, but well before the oldest angiosperm fossils. A striking similarity in the multimer-signalling C domains of the Amborella proteins suggests the potential for the formation of unique transcription-factor complexes. The earliest angiosperms may have been biochemically flexible in their B function and "tinkered" with floral organ identity.

Key Words: AmborellaAP3 • B-class • basal angiosperms • MADS • PI


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
MADS (MCM1, AGAMOUS, DEFICIENS, and SRF)-box genes, which encode transcription factors, play important roles in the development of plants, animals, and fungi (Schwarz-Sommer et al., 1990 ; Shore and Sharrocks, 1995 ; Theißen and Saedler, 1995 ; Theißen et al., 1996 , 2000 ; Riechmann and Meyerowitz, 1997 ; Becker and Theißen, 2003 ). Almost all known MADS-domain proteins from vascular plants share a conserved structural organization, the so-called MIKC-type domain structure, including MADS (M), intervening (I), keratin-like (K), and C-terminal (C) domains (Theißen et al., 1996 , 2000 ; Münster et al., 1997 ; Becker and Theißen, 2003 ; but see Alvarez-Buylla et al., 2000 ). Phylogenetic analyses have revealed that the MADS-box gene family is composed of several well-defined clades (e.g., Doyle, 1994 ; Purugganan et al., 1995 ; Theißen et al., 1996 , 2000 ; Purugganan, 1997 ; Münster et al., 1997 ; Johansen et al., 2002 ; Bodt et al., 2003 ; Parenicová et al., 2003 ; Becker and Theißen, 2003 ); the members of each clade share similar expression patterns and highly related functions (Theißen and Saedler, 1995 ; Theißen et al., 1996 , 2000 ; Becker and Theißen, 2003 ).

Some of the best-known MADS-box genes control the organ identity and initiation of the flower (Theißen et al., 1996 , 2000 ; Theißen and Saedler, 2001 ). Studies in Arabidopsis and Antirrhinum have shown that MADS-box genes encode the B function of floral development, which together with A specifies petals, and with C discriminates stamens from carpels (C only) (Coen and Meyerowitz, 1991 ; Meyerowitz et al., 1991 ; Coen et al., 1991 ). In Arabidopsis, the B function genes are AP3 and PI. Further complexity in the specification of floral organ identity involves the redundant functions of the SEPALLATA MADS-box genes, which are collectively required for petal, stamen, and carpel identities. Protein products of these genes dimerize, and the tetramer (or multimer) configuration is probably the active state in cells (Egea-Cortines et al., 1999 ; Honma and Goto, 2000 ; Ferrario et al., 2003 ).

Two subclades of B-function genes have been recognized, corresponding to AP3- and PI-homologues, respectively (reviewed in Theißen et al., 2000 ). Phylogenetic analyses have revealed a complex pattern of gene duplication and divergence in B-class genes. Kramer et al. (1998) and Kramer and Irish (2000) suggested that the ancestral gene of the AP3/PI lineages underwent duplication yielding the AP3 and PI lineages at some time well before the diversification of the angiosperms. Another major gene duplication, which yielded the TM6 and euAP3 lineages, is proposed to have occurred in the AP3 lineage just prior to the diversification of the core eudicots (Kramer et al., 1998 ; Kramer and Irish, 2000 ), a clade that comprises approximately 75% of all flowering plants (Drinnan et al., 1994 ).

PI-family genes have a highly conserved region of amino acid sequences at the C-terminal end called the "PI motif." Members of the AP3 family have a less well-conserved "PI derived motif," which can be aligned with the PI motif of the PI family. AP3 genes also have a well-conserved "euAP3" or "paleoAP3" motif at the C-terminal end. A sequence similar to the paleoAP3 motif is also seen in B-class genes of the fern Ceratopteris (Münster et al., 1997 ). Therefore, the presence of the paleoAP3 motif is a synapomorphy (shared, derived state) of B- and related gene families (GGM13-, GGM2-, DAL12-, and CJMADS1-homologues), and the loss of the paleoAP3 motif is believed to be a synapomorphy for the PI lineage (Kramer et al., 1998 ; Winter et al., 2002a ).

The exon lengths of MADS gene families are well conserved in most cases (Johansen et al., 2002 ). For example, exon 6 in both AP3- and PI-homologues is 45 bp long (Johansen et al., 2002 ; Winter et al., 2002a ). Additionally, the PI-homologues reported to date are all distinguished from the AP3-homologues by an exon 5 that is 30 bp rather than 42 bp in length (Johansen et al., 2002 ; Winter et al., 2002a ). In most other MADS gene families, exons 5 and 6 are generally 42 bp (Johansen et al., 2002 ; Winter et al., 2002a ).

The overall framework of angiosperm phylogeny has crystallized (e.g., Qiu et al., 1999 ; D. Soltis et al., 2000 ; Zanis et al., 2002 ) (Fig. 1). A series of studies identified the same early-branching basal angiosperms (Mathews and Donoghue, 1999 ; Qiu et al., 1999 ; P. Soltis et al., 1999 ; Parkinson et al., 1999 ; Barkman et al., 2000 ; Graham and Olmstead, 2000 ; D. Soltis et al., 2000 ; Zanis et al., 2002 ; Borsch et al., 2003 ): Amborella (Amborellaceae) is sister to all other extant angiosperms, either alone, or with the water lilies (Nymphaeaceae). Amborellaceae and Nymphaeaceae are followed by the Austrobaileyales clade (Fig. 1).



View larger version (18K):
[in this window]
[in a new window]
 
Fig. 1. Summary of angiosperm phylogeny (Qiu et al., 1999 ; P. Soltis et al., 1999 ; Zanis et al., 2002 ; D. Soltis et al., 2000 ). Some of the structural changes in AP3 and PI identified or localized in this study are plotted on the tree

 
To understand the diversification of B-class genes during the evolutionary history of angiosperms, phylogenetic analyses using B-class genes from representatives of all major lineages of angiosperms are needed. However, sequences of the AP3- and PI-homologues identified to date only represent core- and early-diverging eudicots and some magnoliid taxa (Kramer et al., 1998 ; Kramer and Irish, 2000 ). Critical lineages of basal angiosperms, including all of the early-branching taxa (Amborella, Nymphaeaceae, and Austrobaileyales), have not yet been analyzed for B-class genes. Furthermore, only a few sequences have so far been obtained for other basal lineages, such as Chloranthaceae (e.g., Chloranthus) and the magnoliid clade of Laurales (avocado relatives), Piperales (black pepper relatives), Canellales (mountain pepper relatives), and Magnoliales (magnolia relatives). In the eudicots (Fig. 1), AP3- and PI-homologues have been isolated from several early-diverging eudicots (Papaveraceae and Buxaceae, the poppy and boxwood families, respectively), but in the core eudicots, only the rosid (rose relatives), Caryophyllales (beet relatives), and asterid (e.g., Antirrhinum) clades have so far been sampled. No data are available for Saxifragales (saxifrage relatives), Santalales (mistletoes), or Gunnerales (gunneras). The latter lineage has recently been shown to be sister to all other core eudicots (D. Soltis et al., 2003 ).

We sought to improve both the taxonomic coverage and our understanding of phylogenetic relationships of AP3- and PI-homologues among critical lineages of flowering plants. Our goals were to: (1) isolate and sequence both AP3- and PI-homologues from early-branching angiosperms, as well as from several previously unsampled lineages of core eudicots; (2) trace structural changes in the AP3 and PI lineages, including motif regions and lineage-specific insertions/deletions; (3) conduct phylogenetic analyses of B-class genes to determine how well these gene trees track organismal phylogeny based on results from recent multigene studies; and (4) estimate the timing of the duplication event that yielded the AP3 and PI lineages.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
RNA extraction
RNA extraction kits available commercially (e.g., RNA WIZ Kit from Promega, RNeasy Plant Mini Kit from Qiagen) are designed for use based on investigations of model plants (e.g., Arabidopsis). These kits have rarely been tested on other flowering plants, such as lineages of basal angiosperms. In our initial RNA extractions of basal angiosperm taxa (e.g., Amborella and Nuphar), we employed commercially available kits, and these efforts failed. We therefore developed our own protocol for isolating high yields of RNA from these difficult plant tissues. Our method combined parts of the CTAB DNA extraction protocol (Doyle and Doyle, 1987 ) with the subsequent use of the RNeasy Plant Mini Kit (Qiagen, Stanford, California, USA) (Detailed method provided at http://www.flmnh.ufl.edu/soltislab/soltis_lab_protocols.htm). This protocol has worked well in all of the basal angiosperm taxa that we have tested. One problem with this method, however, is DNA contamination. We therefore treated all samples with DNase (DNA free; Ambion, Austin, Texas, USA) before subsequent experiments.

RT-PCR and cDNA sequence determination
Reverse transcription was performed following the manufacturer's directions using SuperScript II RNase H-reverse transcriptase (Invitrogen, Carlsbad, California, USA) and polyT primer (5'-CCG GAT CCT CTA GAG CGG CCG C(T)17-3'). PCR (polymerase chain reaction) was performed using a B-class gene-specific primer (5'-GGG GTA CCA AYM GIC ARG TIA CIT AYT CIA AGM GIM G-3') and the polyT primer used in reverse transcription (Kramer et al., 1998 ). PCR conditions were those employed by Kramer et al. (1998) . We also used additional primers for PCR: AP3-PIf (5'-GSK MGI GGI AAG ATC KAG AT-3') and newAP3-PIf (5'-AAC MGG CAR GTG ACG TAY TC-3') (designed by Michael Zanis). PCR bands over 800 bp in size were excised from the agarose gel and purified using the Geneclean II Kit (Q·Bio Gene, Carlsbad, California, USA). Purified DNAs were cloned using the TOPO TA Cloning Kit (Invitrogen). Plasmid DNAs were purified from cloned cells using the e.Z.N.A. Plasmid Miniprep Kit (Omega Bio-tek, Doraville, Georgia, USA). Cycle sequencing reactions were performed using the CEQ DTCS-Quick Start Kit (Beckman Coulter, Fullerton, California, USA), and cDNA sequences were determined using a CEQ 8000 sequencing system (Beckman Coulter). Because the positions of all B-class gene-specific primers that we used in this study were approximately 75 bp after the start codon, 5' end sequences of each gene were not included in this study.

To determine the genomic DNA sequence of Amborella trichopoda, we designed specific primers for Amborella PI-homologues at the MADS domain (AF1P; 5'-AGC GGA ATA CTG AAG AAG GC-3') and the 3' end of the translating region (AF6P; 5'-TGC TGA AGA TTG GGT TGG-3') for PCR and sequencing. We compared the genomic sequence with the cDNA sequence to determine the sizes of the exons and introns. PCR conditions for genomic DNA analyses followed Kim et al. (2001) .

Alignment
A data set was constructed of 103 previously reported B-function genes obtained from GenBank and 24 new sequences that we produced (excluding Am.tr.AP3-2; see Results and Discussion) (Table 1). Three different alignments, referred to as alignments I, II, and III, were used for phylogenetic analyses. For all three matrices we first aligned the amino acid sequences and used this amino acid alignment to produce a data set of aligned nucleotide sequences. We then analyzed both nucleotide and amino acid sequences phylogenetically (see next section). Alignment I includes all available AP3- and PI-homologues from angiosperms. Amino acid sequences of these genes were aligned using CLUSTAL X (Thompson et al., 1997 ) with the default options and then adjusted manually. In alignment II, we divided sequences into small subgroups based on organismal relationships (e.g., basal angiosperms, Magnoliales, early-diverging eudicots, monocots) or sequence similarity (e.g., euAP3 and TM6). We aligned sequences within each subgroup first. We also placed both AP3- and PI-homologues of basal angiosperms (Amborella, Nuphar, and Illicium) in a subgroup because we recognized similar shared amino acid sequences between AP3 and PI of Amborella, including the "DEAER" motif. Based on aligned sequences of this subgroup, other subgroups were subsequently combined into the alignment using the "profile alignment" method in CLUSTAL X (file-to-file alignment). We adjusted previously reported motif regions following Kramer et al. (1998) . The alignment of the MIK-region in alignment II was the same as that of alignment I; however, alignments I and II differed in the more variable C-domain region. Alignment III includes not only angiosperm sequences, but also three published gymnosperm B-class genes (Mouradov et al., 1998 , 1999 ). These gymnosperm sequences were added to alignment II.


View this table:
[in this window]
[in a new window]
 
Table 1. Sequences used in this study. The GenBank accession numbers and the number of colonies that we sequenced are indicated for genes cloned in this study (highlighted in boldface). Angiosperm species are arranged by families and higher groups according to the APG II (2003) system

 
Aligned amino acids were converted to DNA sequences using the program AA2DNA (provided by Jongmin Nam). Separate matrices of DNA and amino acid sequences of alignments I, II, and III, for the entire gene (MIKC) and only the M-, I-, and K-domains were constructed for phylogenetic analyses.

Phylogenetic analyses
Three different methods of phylogenetic inference were used: maximum parsimony (MP), support weighting (Farris, 2001 ), and Bayesian (Huelsenbeck and Ronquist, 2001 ). The maximum parsimony analyses for all matrices were conducted using PAUP* 4.0b10 (Swofford, 2001 ). The search strategy involved 100 random addition replicates with TBR branch swapping saving all optimal trees. To assess support for each node, bootstrap analyses (Felsenstein, 1985 ) were performed using 100 bootstrap replicate heuristic searches with 10 random taxon addition replicates and TBR branch swapping saving all optimal trees.

We applied the support weighting method (Farris, 2001 ) to the MIKC-DNA-III and MIK-DNA-III alignments. This method measures the degree to which changes in a character (site) are concentrated in the supported branches of a tree. Jackknife resampling was used to generate randomly selected suites of initial weights in successive support weighting, and this provides a way of assessing the stability of successive weight results (Farris, 2001 ). Support values for the support weighting tree were generated by parsimony jackknifing (Farris et al., 1996 ) of the original data matrix using 1000 replicates.

Bayesian analyses were conducted using MrBayes 2.01 (Huelsenbeck and Ronquist, 2001 ) for MIKC-DNA-III and MIK-DNA-III. We used uniform prior probabilities and the general time-reversible + gamma + I model of molecular evolution. This model of molecular evolution was selected as the optimal model (Akaike information criterion) by ModelTest (Posada and Crandall, 1998 ). We ran four chains of Markov Chain Monte Carlo (MCMC), sampling every 1000 generations for 1 000 000 generations, starting with a random tree. Stationarity was reached at approximately generation 35 000; thus, the first 35 trees were the "burn in" of the chain, and phylogenetic inferences are based on those trees sampled after generation 35 000.

Estimation of divergence time
We calculated MP branch lengths and optimized these using PAUP* 4.0b10 onto a phylogenetic tree based on recent multigene studies (Qiu et al., 1999 ; P. Soltis et al., 1999 ; Zanis et al., 2002 ). Two data sets were analyzed: one contained the euAP3 genes, along with the paleoAP3 and PI genes, and the second contained the TM6, plus the paleoAP3 and PI genes. Separate analyses of the MIK and MIKC regions for each data set were conducted. Trees with branch lengths were transformed into ultrametric trees using nonparametric rate smoothing (NPRS) (Sanderson, 1997 ) as implemented in TREEEDIT (version 1.0 alpha 10 by A. Rambaut and M. Charleston). The characteristic pollen of the eudicots, combined with their extensive fossil record, places the origin of the eudicots at 125 mya (Hughes, 1994 ), one of the firmest dates in the paleobotanical record. This minimum age for eudicots was used to calibrate the tree. This calibration point was alternatively applied to the eudicot node in the AP3 and PI clades of each tree. To compute error estimates for the ages, we reapplied the NPRS procedure to 100 bootstrapped matrices obtained by resampling the data irrespective of codon position using PAUP*4.0b10 (cf. P. Soltis et al., 2002 ).


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
We report 25 new B-class gene sequences, which have been cloned from nine taxa including seven basal angiosperms (Amborella, Nuphar, Illicium, Persea, Asimina, Eupomatia, and Magnolia) and two eudicots (Gunnera and Ribes) (Table 1). None of these sequences is closely related to the B-sister gene family (Becker et al., 2002 ); when we include B-sister sequences in our analyses, our sequences were placed in the B-class clade instead of the B-sister clade. Because we obtained only an incomplete sequence from one of the AP3-homologues of Amborella (Am.tr.AP3-2), we excluded this sequence from the phylogenetic analyses and used it for sequence comparison only.

The 5' ends of the translating region of B-class gene sequences have generally not been determined in previous studies because genes were amplified and sequenced using primers designed for the 5' end of the translating region. Likewise, we did not determine the sequence of this region of most sequences that we report. In our data matrix, which consists of the 24 new full or nearly full-length sequences that we obtained (excluding Am.tr.AP3-2) and other sequences from GenBank, only 23 of 54 PI-homologues and 34 of 72 AP3-homologues include the complete 5' end of the translating region. We may assume that all B-class genes have the same starting point at the 5' end of the translating region in the aligned sequence matrix because the MADS domain is so strongly conserved (Ma and dePamphillis, 2000 ) and all previously published sequences having the complete 5' end of the translating region have the same starting point in the aligned sequence matrix. The description we provide dealing with gene length is based on this reasonable assumption. The putative size ranges that we estimate for each gene family are 163–239 amino acids (aa) for PI-homologues and 195–261 aa for AP3-homologues. Thus, the average size of AP3-homologues is approximately 20 aa larger than PI-homologues.

Structural features
The AP3-homologues of Magnoliales that we report did not possess the PI-derived motif typical of AP3-homologues (Kramer et al., 1998 ; Kramer and Irish, 2000 ), but rather a deletion of 10 aa unique to this clade (Fig. 2). In the remaining AP3-homologues studied, the PI-derived motif was loosely aligned with the PI motif (Fig. 2). We recognized two different groups of sequences in the PI-derived motif. Group A (hereafter referred to as PI-derived A, abbreviated as PI-dA) occurs in all basal angiosperms, with the exception of Magnoliales, and in early-diverging eudicots (Fig. 2). The PI-dA sequences have almost the same number of amino acids as found in the PI motif (Fig. 2) and are easily aligned with the PI motif. The second group of the PI-derived motif, PI-derived B (abbreviated as PI-dB), occurs in all core eudicots (both TM6 and euAP3 lineages). These PI-dB sequences were difficult to align among themselves, as well as with the PI-dA sequences, and with the PI motif (Fig. 2). Less than half of the consensus sequence of the PI-derived motif (FxFRLQPxSQPNLH) (Kramer et al., 1998 ) is identical to the PI-dB sequences. The PI-dB group typically has many histidine (H) and asparagine (N) residues, which are not present (or are rare) in PI-dA (Fig. 2).



View larger version (71K):
[in this window]
[in a new window]
 
Fig. 2. A part of the alignment of the C domain of AP3- and PI-homologues showing various motifs. Gray boxes in AP3 indicate a deletion found only in Magnoliales. The PI-derived A (PI-dA) motif and PI-derived B (PI-dB) motif in the previously recognized PI-derived motif and the TM6 motif (thick-lined box) are newly recognized in this study. The most frequent amino acid at each site in the TM6 motif is highlighted in boldface

 
Kramer et al. (1998) described the paleoAP3 motif and its modification following an apparent duplication to form the TM6 and euAP3 clades. Their TM6 clade was recovered in phylogenetic analyses with low bootstrap support and possessed the paleoAP3 motif (Kramer et al., 1998 ). The TM6 lineage shares the paleoAP3 motif with basal angiosperms and early-diverging eudicots, and the PI-dB motif with the euAP3 gene lineage (Fig. 2). We discovered a region of 10 amino acids that represents a previously undescribed motif that characterizes hypothetical proteins previously referred to as TM6 clade members. This "TM6 motif" is located just before the PI-dB region with AVAFANGVxNL, the most frequent amino acids at each position (Fig. 2). The TM6 lineage is therefore easily identified by this newly recognized motif. Although Kramer et al. (1998) placed CMB2 (Dianthus) in their TM6 lineage, the CMB2 aa sequence does not have the newly reported TM6 motif. In our phylogenetic analyses (see below: phylogenetic analyses) CMB2 fluctuated in position and did not always appear with TM6 genes.

The translated AP3- and PI-homologues of Amborella (Am.tr.AP3 and Am.tr.PI) share a number of amino acid strings in the first half of exon 7 (also a part of the C-domain), which is the most variable region of B-class proteins (Fig. 3). In contrast, conspecific AP3- and PI-homologues from flowering plants other than Amborella are not easily alignable in this region. The alignment of AP3- and PI-homologues revealed a PI-specific gap (Fig. 3: box B) and two AP3-specific gaps (Fig. 3: boxes C and D).



View larger version (72K):
[in this window]
[in a new window]
 
Fig. 3. Translations of the AP3- and PI-homologues of Amborella (Am.tr.AP3 and Am.tr.PI) show several shared amino acid strings, including a prominent "DEAER motif" (gray box) in exon 7, the most variable part of the gene. PI-specific (box B) and AP3-specific (boxes C and D) gaps were recognized. Shared amino acid residues between Am.tr.AP3 and Am.tr.PI (hollow boxes) are highlighted in bold. Vertical lines indicate putative splicing sites recognized from the comparison of confirmed splicing sites (solid bars) in Am.tr.PI, PI (Goto and Meyerowitz, 1994 ), GLO (Tröbner et al., 1992 ), AP3 (Purugganan and Suddith, 1999 ), and DEF (Schwarz-Sommer et al., 1992 ). Box A indicates a four-amino acid deletion found in exon 5 of all PI-homologues except Amborella and Nuphar. AP3- and PI-homologues of Amborella, Nuphar (Nymphaeaceae), and Illicium (Austrobaileyales), respectively, are compared in detail at the bottom of the figure. *Provided by E. Kramer; unpublished data

 
Twelve amino acid sites, including a "DEAER" motif, which is a string of five amino acids, were identical between Am.tr.AP3 and Am.tr.PI in the 24 aligned amino acids of this variable region of exon 7 (Fig. 3). This shared aa sequence between the AP3- and PI-homologues of Amborella was not found in any other angiosperm, including Nuphar and Illicium (Fig. 3). In the absence of evidence for gene conversion acting among MADS-box genes, we believe that the most parsimonious interpretation of the DEAER motif is that it is an ancestral feature among flowering plant B-class proteins that was subsequently modified in both the AP3 and PI lineages leading to all other angiosperms. This is supported by our failure to locate the DEAER motif in gymnosperm B-class and B-sister protein sequences (Becker et al., 2002 ). Thus, this feature apparently evolved along the branch leading to flowering plants (its ancestral and unknown stem lineage, now extinct) because there is no indication that it predates the angiosperm-gymnosperm split. QxAL in exon 7 and LExQNK in exon 6 are supporting examples of shared amino acid strings found only in Am.tr.AP3 and Am.tr.PI (Fig. 3).

It is also possible that these shared aa strings are homoplasious, having arisen alternatively via convergent mutations in the two genes or gene conversion. The lack of such similarity between AP3- and PI-homologues of other species argues against convergent mutations in the Amborella genes. To test for the possibility that either ancient or recent gene conversion was responsible for the shared aa strings between Amborella AP3- and PI-homologues, we used GENECONV (Sawyer, 1989 ), with a range of mismatch penalties (gscale = 0, 1, 2). Similarities between the Amborella genes were not identified as significant "fragments." We therefore conclude that the shared strings are most likely ancestral motifs. If the shared motifs are indeed ancestral between the AP3- and PI-homologues, our results support Amborella as the sister organism to all other extant angiosperms, a placement that has been well supported by plastid, mitochondrial, and nuclear rRNA genes (e.g., Mathews and Donoghue, 1999 ; Qiu et al., 1999 ; P. Soltis et al., 1999 ; Parkinson et al., 1999 ; Graham and Olmstead, 2000 ; D. Soltis et al., 2000 ; Zanis et al., 2002 ; Borsch et al., 2003 ; Hilu et al., 2003 ).

The size of exon 5 in all PI-homologues reported to date is 30 bp. However, exon 5 in other MADS genes (e.g., AP3-, AP1-, and AG-homologues) is generally 42 bp in length (Johansen et al., 2002 ; Winter et al., 2002a ). A length of 30 bp for exon 5 was thought to be a general feature of all PI-homologues, but genomic DNA sequences of PI-homologues have been reported only for PI (Tröbner et al., 1992 ) and GLO (Goto and Meyerowitz, 1994 ), each of which has an exon 5 of 30 bp. However, by sequencing genomic DNA, we determined the size of exon 5 in Amborella to be 42 instead of 30 bp.

Using the confirmed splicing sites from Am.tr.PI derived in this study and previously reported for PI, GLO, AP3, and DEF, we determined the putative splicing sites for all other B-class genes (Fig. 3). We recognized the length of exon 5 in each sequence because the splicing sites of exon 5 of these aligned five genes matched exactly. This comparison revealed that Amborella and Nuphar both have an exon 5 of 14 aa (42 bp), whereas all other angiosperms analyzed have an exon 5 that is 10 aa (30 bp) in length (Fig. 3). Illicium (star anise), representing Austrobaileyales (the sister group to all other angiosperms after Amborella and Nymphaeaceae; the sequence of Illicium PI was kindly provided by Elena Kramer), also has an exon 5 of 10 aa. Because all AP3-homologues, as well as most other MADS families, have an exon 5 of 14 aa, a deletion of four amino acids must have occurred in exon 5 just after the branches leading to Amborella and Nymphaeaceae, providing additional support for their basal placement in angiosperms.

Exon 5 is located in the K3 portion of the K domain, which plays an important role in the specificity and strength of dimerization between AP3 and PI proteins in both Arabidopsis and Antirrhinum (Zachgo et al., 1995 ; Riechmann et al., 1996 ). In Arabidopsis, deletion of K3 affects the strength of AP3/PI dimerization more than does deletion of the I or C domains (Yang et al., 2003 ). In Amborella and Nuphar, the K3 amphipathic-helix motif has a different structure than the (abcdefg)n repeat (a and d being hydrophobic residues) known from core eudicots. Thus, the large deletion in exon 5 that occurred in angiosperms after the branch to Nymphaeaceae may have generated different AP3/PI heterodimerization capacity after the branches to Amborella and Nymphaeaceae.

Certain amino acid residues are crucial for AP3/PI heterodimerization in Arabidopsis: 97-E and 98-N in PI and 98-N and 102-R in AP3 (Yang et al., 2003 ). We confirmed that these residues are conserved in most AP3- and PI-homologues of angiosperms. However, some of these residues are not present in AP3- and PI-homologues of Amborella: Am.tr.PI has 97-D, Am.tr.AP3-1 has 98-S, and Am.tr.AP3-2 has 102-Q (Fig. 4). The PI-homologue of Amborella (Am.tr.PI) has 98-N and 102-R, which are conserved residues in most AP3-homologues. One of the AP3-homologues of Amborella (Am.tr.AP3-2) has 97-E and 98-N, which are conserved residues in most PI-homologues, and another Amborella AP3-homologue (Am.tr.AP3-1) has 97-E and 102-R, which are conserved residues in PI- and AP3-homologues, respectively. An AP3/PI homologue from the gymnosperm Gnetum gnemon, GGM2, has 97-E, 98-N, and 102-R (Winter et al., 2002b ; Yang et al., 2003 ) (Fig. 4).



View larger version (49K):
[in this window]
[in a new window]
 
Fig. 4. Alignment of K1 subdomain. The motifs containing 97-E and 98-N in PI and 98-N and 102-R in AP3 are important for dimerization in Arabidopsis (Yang et al., 2003 ). These residues are substituted in Amborella: 97-D in Am.tr.PI, 98-S in Am.tr.AP3–1, and 102-Q in Am.tr.AP3–2

 
Differences in Amborella of amino acid residues required for heterodimerization in Arabidopsis suggest that Amborella B-function proteins may not have the same capacity to heterodimerize as do those of Arabidopsis (and perhaps other angiosperms). Furthermore, because hypothetical Amborella AP3 and PI proteins share residues at these sites with GGM2, which forms a homodimer (Winter et al., 2002b ), both Amborella AP3- and PI-homologues may have the ability to homodimerize, which has only been demonstrated so far for PI-homologues of some monocots (Winter et al., 2002b ). Recent studies using transgenic Arabidopsis plants indicate that the C terminus of AP3 is sufficient to confer AP3 functionality on the paralogous PI protein (Lamb and Irish, 2003 ). This finding, when extended to Amborella and its indistinct AP3 and PI C domains, further supports the possibility that Am.tr.AP3-1/ Am.tr.AP3-1, Am.tr.AP3-2/Am.tr.AP3-2, and Am.tr.AP3-1/ Am.tr.AP3-2 might be capable of homo- and heterodimerization.

The C domains apparently signal for assembly of ternary protein complexes for several MADS proteins in core eudicots (Egea-Cortines et al., 1999 ; Ferrario et al., 2003 ). Indeed, higher-order multimers are probably the active state of B-function MADS-box proteins (Egea-Cortines et al., 1999 ; Honma and Goto, 2000 ; Theißen, 2001 ; Ferrario et al., 2003 ), and the composition of possible active multimers may only be limited by (i) available MADS translation products in cells and (ii) the capacity of different AP3 and PI dimers to recognize these translation products for effective multimerization. The strong similarity between Amborella AP3 and PI C-domain amino acid sequences, if it extends functionally to indistinct AP3-/ PI-homologue recognition in vivo, could suggest that the range of possible multimer combinations in Amborella may be substantially greater than that of eudicots or monocots.

Our data suggest that the evolution of B-function MADS-box gene controls in the development of the earliest flowers was rather dynamic, with different "experiments" tried. Amborella trichopoda may have been the most biochemically flexible angiosperm in terms of B-protein multimers that could operate during its reproductive development. However, the amino acid structural evidence we present suggests that this inferred flexibility was rapidly lost before the bulk of the angiosperm radiation occurred. The unique phylogenetic position of Amborella coupled with its apparently ancestral and flexible mode of B-gene function make it a model organism that should be studied more intensively.

Structural evolution: summary
We reconstructed a hypothesis of the structural evolution of B-class genes in angiosperms using a simplified summary phylogeny (Fig. 1). The common ancestor of AP3 and PI possessed an exon 6 of 14 aa (42 bp). An insertion occurred in this ancestral B-class gene to produce an exon 6 of 15 aa (45 bp) (Winter et al., 2002a ). The ancestor of AP3 and PI also possessed an exon 5 of 14 aa (42 bp) (Winter et al., 2002a ). After the duplication and divergence of AP3 and PI, a deletion of four amino acids occurred in exon 5 of PI during the early diversification of the angiosperms. This deletion occurred after the nodes leading to Amborella and Nymphaeaceae, which retain the ancestral state. All other angiosperms have the derived condition—an exon 5 of 10 aa (30 bp).

Prior to the duplication yielding the PI and AP3 lineages, several characteristic motifs must have been present in the ancestral protein. These ancestral motifs include the PI and paleoAP3 motifs (Kramer et al., 1998 ; Kramer and Irish, 2000 ). Several modifications to the PI-derived motif occurred in the angiosperms. Following the divergence of the AP3 and PI lineages, the PI motif was retained in the PI lineage and was modified in the AP3 lineage to form the PI-derived motif. All angiosperms except the core eudicots have what we term the PI-dA motif. A deletion of 10 aa in this motif unites all members of Magnoliales. The PI-derived motif was also extensively modified in the early evolution of the core eudicots (both in the TM6 and euAP3 lineages) to form what we term the PI-dB motif.

Other structural changes also occurred in the early diversification of the AP3 lineage. For example, a portion of exon 7 was modified to yield the TM6 motif, which characterizes all members of the TM6 clade.

All motifs that are newly reported in this study were consistent in different alignments (see Materials and Methods) that we used in this study.

Phylogenetic analyses
Details of the trees obtained through phylogenetic analyses were sensitive to the alignment used. This is not surprising given the small number of characters (312–348 aligned amino acids) and large number of genes (129 sequences). However, in general, the gene trees for both AP3 and PI roughly follow organismal phylogeny as inferred using multiple plastid, mitochondrial, and nuclear rRNA genes (e.g., Mathews and Donoghue, 1999 ; Qiu et al., 1999 ; P. Soltis et al., 1999 ; Parkinson et al., 1999 ; Barkman et al., 2000 ; Graham and Olmstead, 2000 ; D. Soltis et al., 2000 ; Zanis et al., 2002 ; Borsch et al., 2003 ). Some local relationships among genes from closely related taxa were very similar regardless of the alignment, nature of the sequence (DNA or amino acid), or regions of the gene (MIKC or MIK) used, or the lineage investigated (AP3- or PI-homologues). The abbreviation for each matrix is indicated in Table 2; hereafter, we follow these abbreviations.


View this table:
[in this window]
[in a new window]
 
Table 2. Summary of results of parsimony analyses for 10 different data matrices. Matrix names are abbreviated by their meaning. For example, MIKC-AA-I refers to M-, I-, K-, and C-domain regions, amino acid sequence, and alignment I. Dash means <50% bootstrap support. The "X" indicates the node was not recognized in the strict consensus tree

 
The basalmost sequences of the AP3 and PI lineages were nearly the same in every maximum parsimony analysis (Table 2; Fig. 5), although the internal support for relationships was low (<50% bootstrap support) based on analyses of both amino acid and DNA sequences. In the trees based on amino acids of the MIKC region, Am.tr.PI, followed by two PI-homologues of Nuphar (Nymphaeaceae), Nu.ad.PI and Nu.va.PI, are typically the subsequent sisters to all remaining PI-homologues. Similarly, considering the AP3-homologues, Am.tr.AP3 plus two AP3-homologues of Nuphar (Nu.va.AP3-1 and Nu.va.AP3-2) are the sister group to all remaining AP3-homologues in all amino acid analyses except MIK-AA-III (Table 2). For amino acid sequences of the MIKC regions, trees from the three different alignments agreed that Amborella and Amborella + Nuphar are the sisters to all other sequences in the PI and AP3 lineages, respectively. However, Illicium is sister to all other AP3 sequences in all analyses based on nucleotide sequences using both MP and support weighting (Table 2; Fig. 6), although support for this relationship is weak: <50% bootstrap value in all DNA MP analyses and <50% jackknife value on the support weighted tree (Fig. 6).



View larger version (32K):
[in this window]
[in a new window]
 
Fig. 5. Strict consensus of 72 MP trees (CI = 0.355) using M-, I-, K-, and C-domain regions of amino acid sequences (MIKC-AA-III), shown as a phylogram. Numbers above the branches are bootstrap values. Only values over 50% are indicated

 


View larger version (30K):
[in this window]
[in a new window]
 
Fig. 6. Phylogenetic tree resulting from analysis of nucleotides using Support Weighting with jackknife values from non-weighted analysis

 
Several clades of AP3- and PI-homologues that correspond to well-supported organismal clades were consistently recognized, including Magnoliales and monocots. The euAP3 clade, which was previously described (Kramer et al., 1998 ), was recovered in most analyses, but not all (Table 2). The Magnoliales clade in the PI lineage and the euAP3 clade in the AP3 lineage were recognized in all analyses (Table 2).

Most of the clades of sequences recovered within orders (e.g., Magnoliales, Laurales, and Poales) were consistently obtained with high bootstrap support regardless of the alignment used and with both amino acid and nucleotide sequences. For example, the clade of four AP3-homologues of Magnoliaceae, MpMADS7 (Magnolia kobus), Ma.gr.AP3 (Magnolia grandiflora), MfAP3 (Michelia), and LtAP3 (Liriodendron), received 85%, 75%, and 87% bootstrap support in analyses of the MIKC-AA-I, MIKC-AA-II, and MIKC-AA-III alignments, respectively. The clade of three PI-homologues of Magnoliaceae, MpMADS8 (Magnolia kobus), MfPI (Michelia), and LtPI (Liriodendron), received 74%, 70%, and 76% support in the analyses of MIKC-AA-I, MIKC-AA-II, and MIKC-AA-III alignments, respectively.

A well-defined clade of Ranunculales was observed only in the PI lineage in our analyses of the MIK-DNA-III alignment. In other analyses of other data sets, a subset of Ranunculales sequences typically was recovered, but not all Ranunculales sequences formed a clade. Many duplication events have occurred in Ranunculaceae (Kramer et al., 2003 ). Hence, the evolutionary history of AP3 and PI in Ranunculaceae is apparently complex. If all orthologues have not been sampled for this clade, it is reasonable to expect that the placement of some sequences in the gene tree may not reflect the organismal tree because of extensive duplication and subsequent diversification.

Genes having relatively short sequences such as PhPI (Peperomia), nmads1 (Oryza), PnPI-1 (Papaver), CMB2 (Dianthus), and RAD2 (Rumex) were especially sensitive to the alignment and the method of analysis used. For example, PhPI was sister to the clade of euAP3 sequences in our analyses of alignment MIKC-AA-I. However, it was sister to the TM6 clade in analyses of alignment MIKC-AA-II. In contrast, PhPI was embedded in the TM6 clade in analyses of the MIK-AA-I alignment and was grouped together with some AP3 genes of Ranunculales (RfAP3-1 and RbAP3-1) in analyses of alignments MIKC-DNA-I and MIK-DNA-I. Lastly, PhPI was embedded in the euAP3 clade in analyses of MIK-DNA-III.

The TM6 lineage (Kramer et al., 1998 ) was not recovered in all analyses. In our analyses of data sets MIKC-AA-I, MIKC-DNA-I, MIKC-AA-II, and MIKC-DNA-III, a TM6 clade was observed, but without bootstrap support >50%. Furthermore, the TM6 clade does not include CMB2, an unusual aa sequence that was placed in the TM6 clade by Kramer et al. (1998) . However, as noted, CMB2 also lacks the TM6 motif and therefore does not appear to belong to the TM6 lineage.

The topology of the support weighted tree (Farris, 2001 ) based on nucleotide sequences is very similar to that of the MP trees based on translated amino acids (Fig. 6). Magnoliales, monocots, and core eudicots were recognized in both the PI and AP3 lineages. Amborella is the sister to all other angiosperms in the PI lineage, but Illicium is sister to all other angiosperms in the AP3 lineage; jackknife support for both relationships is <50%. The TM6 and euAP3 lineages were found using support weighting, and CMB2 was included in the TM6 lineage.

Bayesian inference using nucleotide sequences produced a tree similar to those obtained with parsimony (Fig. 7). Many clades received relatively high posterior probabilities, much higher than the corresponding bootstrap values. The tendency of Bayesian analyses to yield posterior probability values higher than bootstrap values has been discussed (e.g., Huelsenbeck and Ronquist, 2001 ; Suzuki et al., 2002 ). Clades for which we obtained posterior probability values (x100) of 95–100 in the MIKC analysis include (Fig. 8): Magnoliales (100), monocots (100), the euAP3 lineage (100), the TM6 lineage (100, including CMB2), and core eudicots (100) for the AP3 lineage; Magnoliales (100), Ranunculales (100), monocots (100), and the core eudicots (99) for the PI lineage.



View larger version (35K):
[in this window]
[in a new window]
 
Fig. 7. Phylogenetic tree from Bayesian analysis. Numbers above branches are posterior probabilities multiplied by 100

 
Estimation of divergence time
The evolution of MADS-box genes has involved a series of gene duplications and subsequent diversification (e.g., Purugganan, 1997 ; Kramer et al., 1998 , 2003 ; Theißen et al., 2000 ; Nam et al., 2003 ). A duplication yielding the AP1-like and SEPALLATA-like plus AGL6-like genes occurred approximately 374 million years ago (Nam et al., 2003 ), and the ages of several other prominent MADS-box gene duplications have also been estimated (e.g., Purugganan et al., 1995 ; Purugganan, 1997 ; Nam et al., 2003 ). However, the age of the AP3/PI duplication could not be estimated in these earlier studies because of the accelerated rates of AP3 and PI sequence evolution relative to other MADS-box genes (Purugganan et al., 1995 ; Purugganan, 1997 ; Nam et al., 2003 ). Here we focus only on B-class genes and use nonparametric rate smoothing (NPRS), which accommodates rate heterogeneity by permitting rates to vary among branches, thereby allowing us to estimate the timing of the AP3/PI duplication. Although the effectiveness of NPRS in accommodating rate heterogeneity has not been adequately tested and NPRS may overcompensate for rate inconstancy (Sanderson and Doyle, 2001 ), estimates in other studies based on NPRS, penalized likelihood (Sanderson, 2002 ), and Bayesian methods (Thorne and Kishino, 2002 ) are similar (e.g., S. Renner, University of Missouri, St. Louis, personal communication; C. Bell et al., University of Florida, unpublished data). We conclude that NPRS provides reasonable estimates for data sets with heterogeneous rates.

Using our AP3- and PI-homologue data sets and NPRS (Sanderson, 1997 ), we estimated that the duplication that produced the AP3 and PI lineages occurred approximately 260 mya (range of 230–290 mya) (Table 3). This date places the duplication shortly after the split between extant gymnosperms and angiosperms and on the "stem" (ancestral) lineage of extant flowering plants. Extant seed plants originated approximately 290–309.2 mya (Mapes and Rothwell, 1984 , 1991 ), and most evidence indicates a very early split between the living gymnosperms and the line leading to angiosperms (P. Soltis et al., 2002 ). The absence of AP3- or PI-specific homologues in extant gymnosperms was previously used to suggest that the AP3/PI duplication arose prior to the origin of the angiosperms (Doyle, 1994 ; Purugganan et al., 1995 ; Kramer et al., 1998 ; Becker and Theißen, 2003 ). However, few gymnosperms and none of the basalmost angiosperms had been examined, and it was therefore unclear whether the duplication occurred early in angiosperm history, just prior to the origin of angiosperms, early along the stem lineage leading to angiosperms, or even early in seed plant evolution. Although two classes of AP3/PI homologues have been suggested for gymnosperms based on comparison of motifs (Kramer and Irish, 2000 ), phylogenetic analyses to date have not supported this distinction (Hasebe, 1999 ; Kramer and Irish, 2000 ).


View this table:
[in this window]
[in a new window]
 
Table 3. Divergence times between AP3 and PI lineages estimated directly from original matrices and from bootstrapped matrices using NPRS. The "TM6 data set" includes the TM6 lineage (plus paleoAP3 and PI genes) but excludes the euAP3 lineage; the "euAP3 data set" indi cates the converse

 
In summary, our results suggest that the AP3/PI duplication occurred shortly after the divergence of extant gymnosperms and angiosperms, well before the oldest flowering plant fossils (generally placed in the range of 125–131.8 mya; Hughes, 1994 ; P. Soltis et al., 2002 ). Importantly, this implies that the joint expression of AP3 and PI may not have immediately resulted in the formation of petals, structures for which they control the development in extant angiosperms. Recognizable flowers did not appear for perhaps another 100 million years after the estimated timing of the AP3/PI gene duplication. The co-expression of AP3-and PI-homologues could nevertheless reflect an evolutionary innovation of animal-attractive, petal-like organs prior to the recognition of flowering plants in the fossil record. Transference of B-class gene function from control over gymnosperm sex determination to petal-like and true petal organ identity (Albert et al., 1998 ) is an attractive hypothesis meriting further research.


View this table:
[in this window]
[in a new window]
 
Table 1. Continued

 

View this table:
[in this window]
[in a new window]
 
Table 1. Continued

 

View this table:
[in this window]
[in a new window]
 
Table 1. Continued

 

    FOOTNOTES
 
1 The authors thank E. Kramer for sharing an unpublished amino acid sequence of Illicium PI and M. Zanis for determining some sequences, M. Buzgo, S. Brockington, G. Theißen, and H. Ma for helpful comments and discussion, and J. Nam for allowing us to use his reverse-translation utility AA2DNA. This work was supported by NSF grant PGR-0115684, the University of Florida Research Foundation, and the Post-doctoral Fellowship Program of Korea Science and Engineering Foundation (KOSEF) for S. Kim. Back

6 Authors for reprint requests. (dsoltis{at}botany.ufl.edu and sangtae{at}botany.ufl.edu ) Back


    LITERATURE CITED
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 LITERATURE CITED
 
APG II. 2003 An updated classification of the angiosperms. Botanical Journal of the Linnean Society 141: 399-436[CrossRef]

Albert V. A. M. H. G. Gustafsson L. Di Laurenzio 1998 Ontogenetic systematics, molecular developmental genetics, and the angiosperm petal. In P. Soltis, D. Soltis, and J. Doyle [eds.], Molecular systematics of plants II, 349–374. Kluwer, Boston, Massachusetts, USA

Alvarez-Buylla E. R. S. Pelaz S. J. Liljegren S. E. Gold C. Burgeff G. S. Ditta L. R. De Pouplana L. Martinez-Castilla M. F. Yanofsky 2000 An ancestral MADS-box gene duplication occurred before the divergence of plant and animals. Proceedings of the National Academy of Sciences, USA 97: 5328-5333[Abstract/Free Full Text]

Barkman T. J. G. Ghenery J. R. McNeal J. Lyons-Weiler C. W. DePamphilis 2000 Independent and combined analyses of sequences from all three genomic compartments converge on the root of flowering plant phylogeny. Proceedings of the National Academy of Sciences, USA 97: 13166-13171[Abstract/Free Full Text]

Becker A. K. Kaufmann A. Freialdenhoven C. Vincent M.-A. Li H. Saedler G. Theißen 2002 A novel MADS-box gene subfamily with a sister-group relationship to class B floral homeotic genes. Molecular Genetics and Genomics 266: 942-950[CrossRef][ISI][Medline]

Becker A. G. Theißen 2003 The major clades of MADS-box genes and their role in the development and evolution of flowering plants. Molecular Phylogenetics and Evolution 29: 464-489[CrossRef][ISI][Medline]

Bodt S. D. J. Raes Y. V. de Peer G. Theißen 2003 And then there were many: MADS goes genomic. Trends in Plant Science 8: 475-483[CrossRef][ISI][Medline]

Borsch T. K. W. Hilu D. Quandt V. Wilde C. Neinhuis W. Barthlott 2003 Noncoding plastid trnT-trnF sequences reveal a well resolved phylogeny of basal angiosperms. Journal of Evolutionary Biology 16: 558-576[CrossRef][ISI][Medline]

Coen E. S. J. S. Doyle M. Romero R. Elliot R. Magrath R. Carpenter 1991 Homeotic genes controlling flower development in Antirrhinum. Development (Supplement) 1: 149-156

Coen E. S. E. M. Meyerowitz 1991 The war of the whorls: genetic interactions controlling flower development. Nature 353: 31-37[CrossRef][Medline]

Drinnan A. N. P. R. Crane S. B. Hoot 1994 Patterns of floral evolution in the early diversification of non-magnoliid dicotyledons (eudicots). Plant Systematics and Evolution supplement 8: 93-122

Doyle J. J. 1994 Evolution of a plant homeotic multigene family: toward connecting molecular systematics and molecular developmental genetics. Systematic Biology 43: 307-328[CrossRef]

Doyle J. J. J. L. Doyle 1987 A rapid DNA isolation from small amount of fresh leaf tissue. Phytochemical Bulletin 19: 11-15

Egea-Cortines M. H. Saedler H. Sommer 1999 Ternary complex formation between the MADS-box proteins SQUAMOSA, DEFICIENS and GLOBOSA is involved in the control of floral architecture in Antirrhinum majus. EMBO Journal 18: 5370-5379[CrossRef][ISI][Medline]

Farris J. S. 2001 Support weighting. Cladistics 17: 389-394[CrossRef][ISI]

Farris J. S. V. A. Albert M. Källersjö D. Lipscomb A. G. Kluge 1996 Parsimony jackknifing outperforms neighbor-joining. Cladistics 12: 99-124[CrossRef][ISI]

Felsenstein J. 1985 Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39: 783-791[CrossRef][ISI]

Ferrario S. R. G. Immink A. Shchennikova J. Busscher-Lange G. C. Angenent 2003 The MADS box gene FBP2 is required for the SEPALLATA function in petunia. Plant Cell 15: 914-925[Abstract/Free Full Text]

Graham S. W. R. G. Olmstead 2000 Utility of 17 chloroplast genes for inferring the phylogeny of the basal angiosperms. American Journal of Botany 87: 1712-1730[Abstract/Free Full Text]

Goto K. E. M. Meyerowitz 1994 Function and regulation of the Arabidopsis floral homeotic gene PISTILLATA. Genes and Development 8: 1548-1560[Abstract/Free Full Text]

Hasebe M. 1999 Evolution of reproductive organs in land plants. Journal of Plant Research 112: 463-474[CrossRef][ISI]

Hilu K. W. T. Borsch K. Müller D. E. Soltis P. S. Soltis V. Savolainen M. W. Chase M. Powell L. A. Alice R. Evans H. Sauquet C. Neinhuis T. A. Sotta J. G. Rohwer C. S. Champbell L. Chatrou 2003 Angiosperm phylogeny based on matK sequence information. American Journal of Botany 90: 1758-1776[Abstract/Free Full Text]

Honma T. K. Goto 2000 Complexes of MADS-box proteins are sufficient to convert leaves into floral organs. Nature 409: 525-529[CrossRef][ISI]

Huelsenbeck J. P. F. Ronquist 2001 MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17: 754-755[Abstract/Free Full Text]

Hughes N. F. 1994 The enigma of angiosperm origins. Cambridge University Press, Cambridge, UK

Johansen B. L. B. Pedersen M. Skipper S. Frederiksen 2002 MADS-box gene evolution-structure and transcription patterns. Molecular Phylogenetics and Evolution 23: 458-480[CrossRef][ISI][Medline]

Kim S. C.-W. Park Y.-D. Kim Y. Suh 2001 Phylogenetic relationships in family Magnoliaceae inferred from ndhF sequences. American Journal of Botany 88: 717-728[Abstract/Free