|
|
||||||||
Systematics |
Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut 06520 USA
Received for publication February 27, 2004. Accepted for publication September 21, 2004.
| ABSTRACT |
|---|
|
|
|---|
Key Words: Adoxaceae Bayesian relaxed clock Caprifoliaceae Dipsacales local clocks molecular clock nonparametric rate smoothing penalized likelihood
| INTRODUCTION |
|---|
|
|
|---|
|
Several methods have recently been proposed to deal with non-rate constancy among lineages (Sanderson, 1997
; Thorne et al., 1998
; Huelsenbeck et al., 2000
; Yoder and Yang, 2000
; Kishino et al., 2001
; Sanderson, 2002
; Thorne and Kishino, 2002
). These methods variously "relax" the null hypothesis of a uniform rate of molecular evolution across all lineages, allowing different parts of a tree to experience different rates. In this paper we apply a variety of these dating methods (local clocks, nonparametric rate smoothing, penalized likelihood, and Bayesian relaxed clock methods) to estimate the age of the Dipsacales and divergence times for the major lineages within this clade. We compare the results of different strategies, using several chloroplast markers (separately and in combination), and the influence of fossil calibrations and maximum and minimum constraints in such analyses. Our results bear on the tempo of diversification and morphological evolution in Dipsacales but also provide a concrete comparison of methods that may bear on their use in other organisms.
| MATERIALS AND METHODS |
|---|
|
|
|---|
The use of fossils in concert with molecular data can take two forms: (1) they can serve as fixed "calibration" points used to calculate absolute branching times, or (2) they may serve as maximum or minimum age "constraints" (Sanderson, 1997
). For this study we selected four fossils that we believe can be confidently placed in the Dipsacales tree to use as separate calibration points (Fig. 1B), as well as minimum age constraints in concert with other fossils. In several analyses, we also explored the use of a maximum age constraint. Dipsacales is nested within eudicots, and can be no older than eudicots. The eudicots can be rather confidently dated by their distinctive and pervasive tricolpate pollen, which appears in the fossil record at about 125 mya (Doyle and Donoghue, 1993
; Magallón et al., 1999
; Sanderson and Doyle, 2001
).
Absolute ages in this paper are based on the 1999 Geological Time Scale from the Geological Society of America (GSA, 2004
). In all cases we have assigned a range of ages to fossils, as opposed to a single fixed date.
For each of the methods used, we examined the consistency of our fossil calibration points in reference to one another using the combined data set. Since all of these fossil placements are minimum age estimates, an inferred age of a clade (based on calibration with another fossil) was judged to be inconsistent only if it was inferred to be younger than a "known" age based on a fossil assigned to that clade.
Phylogenetic estimation and tests for rate constancy
Maximum likelihood methods were employed to infer both tree topologies and branch lengths from three coding (rbcL, ndhF, matK) and three non-coding (trnL, atpB-rbcL intergenic spacer, and a matK intron) chloroplast regions. These data were all previously published (Bell et al., 2001
). The taxon sample represents all major lineages within Dipsacales, including representatives of all the traditional families, subfamilies, and tribes. All data sets analyzed for this study are available from TreeBASE (http://www.treebase.org) or by request from the first author. For each data set, a series of likelihood ratio tests (LRTs) was performed to select a model for maximum likelihood searches using PORN* (Bell, 2001
). All searches were performed with the computer software PAUP*b10 (Swofford, 2002
). Parameters for the searches were estimated on a tree topology obtained in a maximum likelihood search using a Jukes-Cantor (JC; Jukes and Cantor, 1969
) model of sequence evolution. Parameters for each model were then fixed, and heuristic searches were run using 100 random taxon additions and tree-bisection-reconnection (TBR) branch swapping. A final round of searching entailed estimating all model parameters simultaneously using the tree obtained in the previous step as the starting tree and nearest-neighbor interchange (NNI) branch swapping.
Following each search, a LRT was used to test for departure from clock-like evolution in the molecular sequence data (Felsenstein, 1981
). For the three coding regions, analyses were performed on two data sets: (1) including all three codon positions, and (2) including just third codon positions. In all cases, rate constancy among lineages was rejected (Table 1). For the remaining analyses, the maximum likelihood tree from the combined data, including coding and non-coding regions, was used (Fig. 1B).
|
Local molecular clocks
In a local molecular clock model, different rates can be assigned to different clades in a tree. All tests for local molecular clocks were performed using the software PAML, version 3.01b (Yang, 2000
; see Yoder and Yang, 2000
) and the tree in Fig. 1.
Because the LRT of rate hypotheses among lineages is valid only when a null hypothesis is specified (Yoder and Yang, 2000
), we initially divided the tree into two ratesone rate for "herbaceous" and another rate for "non-herbaceous" taxa. This was based on previous analyses suggesting that rates might differ according to plant habit (Gaut et al., 1996
; Laroche et al., 1997
; Gaut, 1998
). The shift to herbaceousness appears to have occurred at least five separate times in Dipsacales (Donoghue et al., 2003
). However, our sampling for this analysis covers only three occurrences: (1) Triosteum, (2) the Adoxina clade, and (3) the Valerina clade (see Fig. 1B). The other two transitions from woody to herbaceous occur within Sambucus (Eriksson and Donoghue, 1997
), from which we have sampled just one woody representative. Because there is no reason to think that all three lineages would necessarily have the same rate of molecular evolution, we next specified a local molecular clock with four ratesone for each of the three herbaceous lineages and another for the non-herbaceous taxa. Confidence intervals for local and global molecular clocks were calculated using BASEML in the PAML software package.
Nonparametric rate smoothing
An alternative to estimating divergence times with global or local molecular clocks, which specify an explicit parametric model of sequence evolution, is nonparametric rate smoothing (NPRS; Sanderson, 1997
). NPRS estimates rates and times using a least-squares smoothing criterion that penalizes rapid rate changes from branch to branch in a phylogeny. All NPRS analyses were performed using the maximum likelihood tree topology in Fig. 1, and branch lengths were estimated for each data partition using its "best fitting" model as determined by a series of LRTs.
Two separate NPRS analyses were performed. We first fixed the age of the root at 1.0, smoothed all branches relative to that, and used a single fossil calibration to calculated absolute divergence dates. In a second NPRS analysis we employed minimum and maximum age constraints. In this case the root of the tree was constrained to a maximum age of 125 mya (see calibration points), and other fossils were used as minimum age constraints.
Penalized likelihood
Penalized likelihood (PL; Sanderson, 2002
) is a semiparametric smoothing method. Like NPRS, PL assumes that there is an autocorrelation of substitution rates and attempts to minimize rate changes between ancestral/descendant branches on a tree (i.e., at the nodes). PL attempts to combine the statistical power of parametric methods (models of molecular evolution) with the robustness of nonparametric methods. A smoothing parameter (
) can vary from very small, in which case each branch of the phylogeny has a different substitution rate (saturated model), to very large, in which parameters are essentially clock-like. The crux of the penalized likelihood method is determining the optimal smoothing level. The program r8s (Sanderson, 2003
) implements a data-driven cross-validation procedure that systematically prunes terminals from the tree, then estimates parameters from the submatrix and a given smoothing value. It then tries to predict the data for pruned taxa using the estimated parameters. Finally, it calculates a chi-squared error associated with the difference between the predicted and observed data. The optimal smoothing level is chosen as the one that minimizes the chi-squared error (Sanderson, 2002
).
As in the case of NPRS, we ran two separate PL analyses: (1) we fixed the age of the root at 1.0, smoothed all branches, and calibrated with a single fossil, and (2) we employed minimum and maximum age constraints. Confidence intervals around the age estimates for all nodes were calculated using nonparametric bootstrapping (Baldwin and Sanderson, 1998
). This procedure assumes that bootstrap estimates are normally distributed, which might not be the case.
Bayesian relaxed clock using Markov chain Monte Carlo (MCMC)
Bayesian methods (Thorne et al., 1998
; Kishino et al., 2001
; Thorne and Kishino, 2002
) that relax a strict molecular clock were also used to estimate divergence times using MULTIDIVTIME (available from J. Thorne, North Carolina State University). This parametric approach relaxes the assumption of a strict molecular clock with a continuous autocorrelation of substitution rates across the phylogeny, and allows the use of several calibrations/ time constraints.
Because these analyses eventually trim off the outgroup, we added three additional sequences from GenBank: Pittosporum tobira (rbcL, U50261; ndhF, AF130201) and Pittosporum undulatum (matK, AJ429374). The non-coding regions were not included in the Bayesian analyses due to alignment difficulties. The three genes were analyzed separately as well as combined. A simultaneous analysis was also performed to account for the differences in the substitution process between genes using the program MULTIDIVTIME. An additional analysis was performed after partitioning the data by codon position and analyzing the three-codon positions simultaneously.
Divergence date estimation with MULTIDIVTIME involved two steps. First, ESTBRANCHES was run to estimates branch lengths from the data and a fixed tree topology using the F84 (Felsenstein, 1984
; Kishino and Hasegawa, 1989
) model of sequence evolution. This allows rates to vary among sites following a discrete gamma distribution with four rate categories (Yang, 1994
) along with their variance-covariance matrix. Parameters for the F84 +
model were estimated using the BASEML program in PAML (Yang, 2000
). Estimated parameters are presented in Table 2.
|
The following prior distributions were used in these analyses: 120 mya (SD = 60 mya) for the expected time between tip and root if there were no constraints; 0.0006 (SD = 0.0003) substitutions per site per million year for the rate of the root node; 0.01 (SD = 0.01) for the parameter that determines the magnitude of autocorrelation per million years; and 125 mya for the largest value of the time unit between the root and the tips.
Three separate Bayesian analyses where performed: (1) one in which all of the data were combined into a single dataset prior to analyses; (2) one in which all genes shared a common value for the autocorrelation parameter; and (3) one in which each gene had its own autocorrelation parameter.
| RESULTS |
|---|
|
|
|---|
|
|
Local molecular clocks
All LRTs rejected (P < 0.00001) the hypothesis of the presence of local clocks among lineages, regardless of what data partition was being tested, or whether two rates or four rates were specified. As in the global clock analyses, local clock estimates for the age of Dipsacales appear to be too old, for the most part suggesting a pre-angiosperm, Triassic origin of Dipsacales.
Again, age estimates for all nodes varied considerably among genes and non-coding regions as well as if only third positions were used. Rates inferred from these analyses did support the hypothesis that rates of molecular evolution are higher in herbaceous plants than in their woody relatives, regardless of whether a two-rate model was imposed (herbaceous rate > non-herbaceous rate) or a four-rate model (Triosteum rate > Adoxina rate > Valerina rate > non-herbaceous rate; for all partitions of the data). Based on a likelihood ratio test, these data do not support the hypothesis that there is a uniform rate across the different herbaceous clades (
= 175.4, df = 2, P < 0.001).
Nonparametric rate smoothing (NPRS)
Age estimates from the NPRS analyses are presented in Table 4. Unlike the analyses based on global and local molecular clocks, NPRS estimates ages that generally fall within the age range of angiosperms. Ages estimated for Dipsacales using NPRS ranged from 52.94 mya (matK intron) to 124.12 mya (combined coding for third-codon positions). With the exception of the matK intron, all other partitions or combinations of partitions suggest an origin of Dipsacales at or prior to the Cretaceous-Tertiary (K/T) boundary. In the NPRS analysis of the combined data, the origin of Dipsacales is placed at 76.76 mya (± 4.4), Caprifoliaceae in the vicinity of the K/T boundary (65.00 mya ± 3.7), Adoxaceae in the Early Eocene (54.68 mya ± 4.3), and core Valerianaceae in the Miocene (12.22 mya ± 3.1).
Penalized likelihood (PL)
Smoothing levels (
) for all analyses tended to be small (between 0.0001 and 0.01), suggesting substantial rate heterogeneity among lineages (Sanderson, 2002
). Like the age estimates obtained from NPRS, estimates for the age of Dipsacales are generally more in line with the angiosperm fossil record than those obtained with global and local clocks. Age estimates for the root node ranged from 51.76 mya (for the matK intron) to 176.9 mya (ndhF all positions) (see Table 4). With the exception of the rbcL data, PL estimated older dates for Dipsacales than NPRS. Only the ndhF data estimated dates that we would reject as being too old. When a maximum age constraint of 125 mya was placed at the root of Dipsacales, age estimates for the entire clade were pushed up against this barrier (i.e., ca. 125 mya; data not presented). Age estimates for most of the clades within Dipsacales were only slightly affected by the use of the maximum age constraint.
Bayesian estimation of divergence times
Ages estimated using Thorne's Bayesian relaxed clock method are presented in Tables 4 and 5. Prior and posterior distributions for the analyses with fossil constraints were all fairly similar, which may suggest that most of the information concerning age estimates is attributable to the prior distributions, rather than the data. However, when age constraints were not imposed, the prior distributions of divergence times showed a greater degree of variation than the posterior distributions, indicating that much of the information concerning branching times can be attributed to the sequence data (or estimated branch lengths from the sequence data). Simultaneous analyses of all three coding regions (as well as codon positions) resulted in estimates with smaller variance (see Table 5). In agreement with the PL combined results, all Bayesian analyses suggest a Late Cretaceous origin for Dipsacales, as well as for Adoxaceae and Caprifoliaceae.
|
| DISCUSSION |
|---|
|
|
|---|
In the case of Dipsacales, this realization has important consequences, as we can distinguish between those fossils for which we lack evidence about whether they actually fall within or outside of the crown group (Weigela, Heptacodium, and Patrinia), and Diplodipelta, for which we have evidence that it falls outside of its crown group (Dipelta). Manchester and Donoghue (1995)
argued that the two species of Diplodipelta together represent the extinct sister group of Dipelta, with three extant species. Thus, of the four fossils used in our analyses, we are most certain that Diplodipelta does not cause the underestimation problem described above. The Weigela, Heptacodium, and Patrinia fossils, on the other hand, might be nested within their respective crown groups, in which case our default treatment of them as falling outside of the crown will underestimate the subtending divergence times. We think this may be why overly young dates are estimated for the Dipelta/ Abelia divergence when these were used as calibration points (Table 3). An alternative explanation would suppose that the fossils of Weigella, Heptacodium, and Patrinia really do belong outside of their respective crown groups, but that the fossil record is poor and the known fossils happen not to fall close in time to the split from the sister group, but instead fall close to the first split within the crown group.
Our greater confidence in the phylogenetic position of Diplodipelta provides, we believe, the best justification for relying on it for inferring divergence times in Dipsacalesbetter than the fact that it yielded no inconsistencies with the ages of the other fossils in our cross-validation tests. Diplodipelta, as the oldest of the four fossils included, is not expected to yield divergence times that are too young for the clades represented by the other much younger fossils if these really are nested within their crown groups.
Divergence time variation among methods and data sets
It would be comforting to find a convergence on similar age estimates using different methods and sources of data. In Dipsacales we have not been so lucky and need to select among analyses in order to proceed. The likelihood ratio test (LRT) provides an explicit means for selecting among methods that assume a molecular clock and those that do not, though it has been suggested that the LRT may not be sufficiently sensitive to detect localized deviations from clock-like evolution (Conti et al., 2002
). This is in contrast to others who feel that the global LRT represents a conservative test for rate constancy (Sanderson, 1998
). In Dipsacales, rejection of the clock is also suggested by the age estimates obtained when a clock is imposed. That is, global and local molecular clock analyses both estimate dates for Dipsacales that are grossly inconsistent with the fossil record. Specifically, Dipsacales is inferred to be older than eudicots and even angiosperms. This phenomenon has been seen in analyses of other taxonomic groups (e.g., Martin et al., 1993
; Heckman et al., 2001
; Soltis et al., 2002
).
In rejecting clock models, one faces the choice among alternative methods that relax the clock assumption. One short-coming of the NPRS method was highlighted by Sanderson (2002
, 2003
). NPRS may overfit the data, leading to rapid fluctuations in rates in regions of a phylogeny where there are short branches. Data sets with little information content (i.e., few inferred substitutions across the phylogeny) may tend to have zero-length branches in areas that are fairly unresolved, which would result in the appearance of rapid rate fluctuation. Sanderson (2002)
demonstrated that PL always out-performed NPRS when data departed from a constant rate and when cross-validation was used to determine the optimal smoothing level. He also pointed out that when the smoothing factor is small, the model is overfit and small changes in the data (i.e., subsamples constructed during pruning) will lead to large changes in parameter estimates. Consequently, one would expect some intermediate (or larger) level of smoothing to provide the best cross-validation score.
These observations may help to explain the differences we see between our NPRS and PL results for Dipsacales. In fact, in Dipsacales we find that NPRS yields consistently younger ages of major clades than PL or Bayesian. Because NPRS is sensitive to major variation in branch lengths in a tree (Sanderson, 2002
), the presence of many short-internal branches could potentially lead NPRS to push the age of the root node further back in time to compensate for extreme transitions in rates from parent to offspring branches (i.e., by increasing the lengths of the short-internal branches). We suspect that we might be observing the opposite situation in Dipsacales, with several long internal branches, and shorter ones towards the tips (with only a few exceptions). Under these circumstances, NPRS may pull the root of the tree closer to the present in order to fit the data to a situation where there are decelerations in rates across nodes, potentially leading to an underestimate in divergence times.
It is noteworthy that PL and Bayesian approaches applied to Dipsacales converge on rather similar results. This may not be that surprising and may be a more general phenomenon, due to the fact that both methods weight a likelihood function by some costthe roughness penalty in the PL method and a prior distribution in the Bayesian method. A primary concern with Bayesian approaches is how sensitive the posterior distributions of parameters are to the prior distributions. This issue has so far received very little attention (but see Kishino et al., 2001
; Wiegmann et al., 2003
; Yang and Yoder, 2003
). The greatest differences between the PL and Bayesian estimates involved the Bayesian analyses in which each gene had its own autocorrelation parameter. Because these genes all belong to the same chloroplast genome, they may share a tendency to change rates in the same direction on certain branches due to common biological factors, such as generation time and population size (Thorne and Kishino, 2002
). On this basis, we favor the Bayesian estimates that assume a common autocorrelation parameter.
With respect to different genes and different partitions within genes, the high level of variation in age estimates is noteworthy. Whether this level of variation is common or unusual is difficult to judge, as such comparisons have seldom been reported (but see Sanderson and Doyle, 2001
; Soltis et al., 2002
). In the face of such variation, we believe that the best results might be obtained from combined data sets. Combining data sets may dampen the impact of any one (potentially aberrant) gene on the overall result. On the other hand, there may be significant heterogeneity among gene regions that would render it problematical to apply a single evolutionary model. In the future, the development of methods that allow different models to be applied in the same analysis to separate partitions is especially promising (e.g., MrBayes [Huelsenbeck and Ronquist, 2001
]).
Overall, our observation of great variation in age estimates based on different methods and on different gene regions suggests that estimates based on a single method applied to a single gene should be treated very cautiously. With respect to methods, those that impose global or local clocks may often be rejected by the data. PL may generally outperform NPRS (Sanderson, 2002
), which may be especially sensitive to major changes in branch length in a tree. Bayesian approaches, though promising, have not been very thoroughly explored with respect to the sensitivity to different prior distributions. With respect to choosing among competing results, we favor PL and Bayesian methods as applied to the combined data. Encouragingly, for Dipsacales at least, they yield similar results.
Timing the radiation of Dipsacales
Based on the arguments above, we set aside global and local clock and NPRS age estimates for Dipsacales. The following discussion is based, therefore, on the largely congruent PL and Bayesian results from our combined data set. Because these genes all belong to the same chloroplast genome, they may share a tendency to change rates in the same direction on certain branches due to common biological factors, such as generation time and population size (Thorne and Kishino, 2002
). On this basis, we favor the Bayesian estimates that assume a common autocorrelation parameter.
The first major split within crown Dipsacales appears to have taken place in the mid-Cretaceous, by 11193 mya (Albian to Cenomanian). We note that our estimates for Dipsacales are 2030 million years older than those of Backlund (1996)
and Wikstrom et al. (2001)
. Crown Caprifoliaceae and Adoxaceae also probably originated prior to the Cretaceous/ Tertiary (K/T) boundary, ca. 9175 mya and 8160 mya, respectively. Within Adoxaceae, the basal split between the Viburnum lineage and Adoxoideae may have taken place close to the K/T boundary. Within Caprifoliaceae, stem Diervilleae, Caprifolieae plus Heptacodium, and probably Linnina, were likely in existence before the Tertiary, although differentiation within their respective crown groups probably largely took place in the Tertiary.
In the Tertiary, a number of major Dipsacales clades appear to have originated during the Eocene. Within Adoxaceae the split between Sambucus and the herbaceous Adoxina lineage occurred in the Late Eocene according to the PL estimate, but in the Oligocene based on the combined Bayesian analyses. Within Caprifoliaceae we see the differentiation of Caprifolieae into several extant lineages, and, likewise, the beginning of the diversification of Linnaeeae and Diervilleae. Although the split between stem Valerianaceae and Triplostegia plus Dipsacaceae may date to the Paleocene, we infer that the initial diversification of crown Valerianaceae probably occurred in the Eocene, as did the split between the Triplostegia and Dipsacaceae lines.
According to the combined Bayesian analyses, the differentiation of Adoxoideae may have taken place in the Oligocene, and perhaps also the origin of core Valerianaceae. In either case, Adoxoideae and core Valerianaceae appear to have differentiated by the end of the Miocene. It would appear that the diversification of these major herbaceous lineages within Dipsacales generally coincided with the spread of colder, less equable, and in some places drier, climates around the Northern Hemisphere (Morley, 2000
; Zachos et al., 2001
).
This study, together with our ongoing analyses within Valerianaceae (Bell, 2004
; Bell and Donoghue, In press
), suggests that many extant species within Dipsacales have originated since the Miocene, probably mostly within the last 10 million years. Specifically, the two major herbaceous cladesthe core Valerianaceae and the Dipsacaeaeseem to have radiated relatively recently. Together, these two clades account for nearly half of the 1100 species of Dispacales. In the case of core Valerianaceae, this diversification appears to correlate with the spread of the clade from Asia to Europe and then to the New World, mostly in alpine habitats. An especially major radiation appears to have occurred with the occupation of South America by one major clade within Valeriana, which now contains some 150 species. The radiation of Dipsacaceae, in contrast, has mainly taken place in relatively recently derived semi-arid areas around the Mediterranean basin. Understanding the timing of the radiation of the two major woody lineagesViburnum (ca. 165 species) and Lonicera (ca. 180)requires the inclusion of additional species. However, our analyses indicate that these two lineages are more ancient than the major herbaceous lineages, and preliminary molecular phylogenetic analyses within these groups (Donoghue et al., 2004
; Winkworth and Donoghue, 2004
; J. Li et al., unpublished data) imply that at least the basal splits within them occurred earlier in the Tertiary.
Several ecological and biogeographic aspects of the Dipsacales radiation are noteworthy. First, Dipsacales probably originated well before the spread of the Northern Hemisphere temperate-zone climate and vegetation types with which they are mainly associated today. The more limited distribution of such climates during the Cretaceous and early Tertiary (Wolfe, 1978
; Upchurch and Wolfe, 1987
; Zachos et al., 2001
) implies that Dipsacales may have (1) occupied limited temperate territory and remained low in diversity until the expansion of the temperate zone in the mid-Tertiary; (2) diversified in more tropical areas, where they do not occur today, followed by the extinction of early lineages; or (3) initially adapted to living in more tropical habitats, but did not undergo significant diversification until they evolved adaptations that allowed their entry into the temperate zone. It may be possible in the future to choose among these alternatives, depending on the discovery of Cretaceous fossils and/or additional phylogenetic analyses within key lineages. For example, recent analyses have suggested that the root of Viburnum falls between V. clemensiae, which occupies subtropical forests in Borneo, and a clade containing the remainder of the species, almost all of which live in temperate forests (Donoghue et al., 2004
; Winkworth and Donoghue, 2004
). If this is substantiated and if this basal split dates at least to the Eocene, V. clemensiae might represent a remnant of an earlier phase of Dipsacales diversification in more tropical regions.
Second, many major lineages originated in an Eocene time frame that would have allowed spread around the Northern Hemisphere through the North Atlantic and/or the Bering land bridges (Donoghue et al., 2001
; Sanmartin et al., 2001
; Tiffney and Manchester, 2001
). Dipsacales today provide many instances of Old World/New World disjunctions, some of these probably dating to the Eocene (e.g., several major clades within Viburnum, Lonicera, and possibly Diervilleae, but see Donoghue et al., 2001
, on the later). However, most Dipsacales clades with such intercontinental disjunctions are younger, and movements between the Old and New World in these cases presumably occurred through Beringia (e.g., disjunctions within Sambucus, Adoxa, Triosteum, Symphoricarpos, Linnaea, and Valeriana).
Third, as noted already, the major herbaceous clades within Dipsacales appear to have originated in the Oligocene or Miocene, though at least in the two largest herbaceous clades most of the species diversity may be accounted for by more recent radiations into new regions and habitats. Specifically, much of the diversification within Valeriana is accounted for by the recent radiation of a South American clade, and Dipsacaceae diversity has been elevated recently in connection with the occupation of semi-arid habitats.
Finally, it is interesting to consider the evolution of morphological disparity in relation to the timing of diversification within Dipsacales. The basal split between Caprifoliaceae and Adoxaceae represents the greatest shift, measured either in molecular terms or using morphological characters. In particular, these two clades differ greatly and consistently (with little homoplasy) in many flower characters (Donoghue et al., 2003
): Adoxaceae have radial flowers, small calyx lobes, rotate corollas, short styles, lobed stigmas, and lack nectaries of unicellular hairs; Caprifoliaceae have bilaterally symmetrical flowers, large calyx lobes, tubular corollas, long styles, unlobed stigmas, and produce nectaries of unicellular hairs. The accumulation of so many differences between these clades favors either a concerted and relatively rapid shift in a suite of morphological characters related to pollination, or a scenario involving stepwise diversification over a longer period of time, followed by the extinction of "intermediate" forms before the radiation of crown Adoxaceae and Caprifoliaceae. The Adoxaceae, with ca. 200 species, is presently less species diverse than Caprifoliaceae, with over 900 species. The radiation of crown Caprifoliaceae may have commenced some 1015 million years before that of crown Adoxaceae, but the major cause of the difference in species number appears to have been the radiation of the herbaceous Valerina clade within Caprifoliaceae, and especially the relatively recent and rapid diversification of Dipsacaceae and core Valerianaceae. The herbaceous habit evolved independently at least three times within Adoxaceae (once at the base of Adoxina and twice within Sambucus; Eriksson and Donoghue, 1997
), but these shifts apparently were not associated with major changes in diversification rate. The key to understanding the high species diversity of the Valerina clade might lie not in herbaceousness alone, but rather in the radiation of several of its sublineages into newly available habitats.
Summary
Our analyses of Dipsacales provide a concrete example of the great extent to which age estimates can vary depending on the use of different fossil calibration points, different analytical methods, and different genes. Methods that assume either a global or local clock yield age estimates for Dipsacales that are likely to be too old based on our knowledge of the angiosperm fossil record. NPRS yields considerably younger age estimates for Dipsacales than either PL or Bayesian approaches, which may reflect sensitivities of NPRS to the distribution of long and short branches in a tree (Sanderson, 2002
, 2003
). The great variance observed here suggests that age estimates based on single genes and/or on single estimation methods should be treated very cautiously.
The convergence of combined PL and Bayesian analyses on a set of similar age estimates for the Dipsacales provides a basis for further work on the timing of their radiation in relation to biogeography and morphological evolution. Despite the absence of an unequivocal macrofossil record of Dipsacales from the Cretaceous, our analyses strongly imply that Dipsacales originated by the mid-Cretaceous, well before previous age estimates, and that its two major lineages diverged by the start of the Tertiary. This is consistent with reports of possible caprifolioid pollen from the Cretaceous (Muller, 1981
). Several major lineages appear to have diversified during the Eocene, but the major herbaceous lineages began their diversification in the Miocene, coincident with significant climate change. Much of the diversity in these lineages is probably accounted for by even more recent diversification associated with the occupation of two new regionsalpine habitats in South America by core Valerianaceae and semi-arid Mediterranean areas by core Dipsacaceae. The greatest morphological disparity is associated with the divergence of the older lineages, whereas the generation of the greatest species diversity appears to be associated with several younger and less disparate clades.
|
|
| FOOTNOTES |
|---|
2 Author for correspondence, present address: Department of Biology, Xavier University of Louisiana, New Orleans, LA 70125, phone: (504) 520-5011. E-mail: cbell3{at}xula.edu ![]()
| LITERATURE CITED |
|---|
|
|
|---|
Backlund A. M. J. Donoghue 1996 Morphology and phylogeny of the order Dipsacales. Department of Systematic Botany, Uppsala University, Uppsala, Sweden
Baldwin B. G. M. J. Sanderson 1998 Age and rate of diversification of the Hawaiian silversword alliance (Compositae). Proceedings of the National Academy of Sciences, USA 95: 9402-9406
Bell C. D. 2001 PORN*: a hierarchical likelihood ratio calculator for LINUX. http://www.phylodiversity.net/pornstar. Yale University, New Haven, Connecticut, USA
Bell C. D. 2004 Preliminary phylogeny of Valerianaceae (Dipsacales) inferred from nuclear and chloroplast DNA sequence data. Molecular Phylogenetics and Evolution 31: 340-350[CrossRef][ISI][Medline]
Bell C. D. M. J. Donoghue 2003 Phylogeny and biogeography of Morinaceae (Dipsacales) based on nuclear and chloroplast DNA sequences. Organisms, Diversity, and Evolution 3: 227-237
Bell C. D. M. J. Donoghue In press Phylogeny and biogeography of Valerianaceae (Dipsacales) with special reference to the South American valerians. Organisms, Diversity, and Evolution.
Bell C. D. E. J. Edwards S.-T. Kim M. J. Donoghue 2001 Dipsacales phylogeny based on chloroplast DNA sequences. Harvard Papers in Botany 6: 481-499
Conti E. J. Eriksson T. Schonenberger K. Sytsma D. Baum 2002 Early Tertiary out-of-India dispersal of Crypteroniaceae: evidence from phylogeny and molecular dating. Evolution 56: 1931-1942[CrossRef][ISI][Medline]
Donoghue M. J. 1983 The phylogenetic relationships of Viburnum. In N.I. Platnick and V.A. Funk [eds.], Advances in Cladistics, vol. 2, 143166. Columbia Univ. Press, New York, New York, USA
Donoghue M. J. B. G. Baldwin J. Li R. C. Winkworth 2004 Viburnum phylogeny based on chloroplast trnK intron and nuclear ribosomal ITS DNA sequences. Systematic Botany 29: 188-198[CrossRef][ISI]
Donoghue M. J. C. D. Bell J. Li 2001 Phylogenetic patterns in Northern Hemisphere plant geography. International Journal of Plant Sciences 162: S41-S52[CrossRef]
Donoghue M. J. C. D. Bell R. C. Winkworth 2003 The evolution of reproductive characters in Dipsacales. International Journal of Plant Sciences 164: S453-S464[CrossRef]
Donoghue M. J. T. Eriksson P. A. Reeves R. G. Olmstead 2001 Phylogeny and phylogenetic taxonomy of Dipsacales, with special reference to Sinadoxa and Tetradoxa (Adoxaceae). Harvard Papers in Botany 6: 459-479
Donoghue M. J. R. G. Olmstead J. F. Smith J. D. Palmer 1992 Phylogenetic relationships of Dipsacales based on rbcL sequences. Annals of the Missouri Botanic Garden 79: 333-345
Dorofeev P. I. 1963 The Tertiary floras of western Siberia. Technical report, Izdatelbstvo Akedemia Nauk SSSR, Moscow, Russia
Doyle J. A. M. J. Donoghue 1993 Phylogenies and angiosperm diversification. Paleobiology 19: 141-167[Abstract]
Eriksson T. M. J. Donoghue 1997 Phylogenetic analysis of Sambucus and Adoxa (Adoxoideae, Adoxaceae) based on nuclear ribosomal ITS sequences and preliminary morphological data. Systematic Botany 22: 555-573[CrossRef][ISI]
Felsenstein J. 1981 Evolutionary trees from DNA sequences: a maximum likelihood approach. Journal of Molecular Evolution 17: 368-376[CrossRef][ISI][Medline]
Felsenstein J. 1984 Distance methods for inferring evolutionary trees: a justification. Evolution 38: 16-24[CrossRef][ISI]
Gaut B. S. 1998 Molecular clocks and nucleotide substitution rates in higher plants. Evolutionary Biology 30: 93-120[ISI]
Gaut B. S. B. R. Morton B. C. Mc Caig M. T. Clegg 1996 Substitution rate comparisons between grasses and palms: synonymous rate differences at the nuclear gene adh parallel rate differences at the plastid gene rbcL. Proceedings of the National Academy of Sciences, USA 93: 10274-10279
Geological Society of America. 2004 GSA geologic time scale. http:// www.geosociety.org/science/timescale/timescl.htm
Heckman D. S. D. M. Geiser B. R. Eidell R. L. Stauffer N. L. Kardos S. B. Hedges 2001 Molecular evidence for the early colonization of land by fungi and plants. Science 293: 1129-1133
Herendeen P. S. P. R. Crane A. N. Drinnan 1995 Fagaceous flowers, fruits, and capsules from the Campanian (Late Cretaceous) of central Georgia, USA. International Journal of Plant Sciences 156: 93-116[CrossRef]
Huelsenbeck J. P. B. Larget D. L. Swofford 2000 A compound Poisson process for relaxing the molecular clock. Genetics 154: 1879-1892
Huelsenbeck J. P. F. Ronquist 2001 MrBayes: Bayesian inference of phylogeny. Bioinformatics 17: 754-755
Judd W. S. R. W. Sanders M. J. Donoghue 1994 Angiosperm family pairs: preliminary phylogenetic analyses. Harvard Papers in Botany 5: 1-51
Jukes T. H. C. R. Cantor 1969 Evolution of protein molecules. In H. N. Munro [ed.], Mammalian protein metabolism III. 21132. Academic Press, New York, New York, USA
Kishino H. M. Hasegawa 1989 Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in Hominoidea. Journal of Molecular Evolution 29: 170-179[CrossRef][ISI][Medline]
Kishino H. J. L. Thorne W. J. Bruno 2001 Performance of a divergence time estimation method under a probabilistic model of rate evolution. Molecular Biology and Evolution 18: 352-361
Lancucka-Srodoniowa M. 1967 Two new genera: Hemiptelea Planch. and Weigela Thurnb. in the younger Tertiary of Poland. Acta Palaeobotany 8: 1-17
Laroche J. P. Li L. Maggia J. Bousquet 1997 Molecular evolution of angiosperm mitochondrial introns and exons. Proceedings of the National Academy of Sciences, USA 94: 5722-5727
Magallón S. P. R. Crane P. S. Herendeen 1999 Phylogenetic pattern, diversity, and diversification of eudicots. Annals of the Missouri Botanical Garden 86: 297-372[CrossRef][ISI]
Manchester S. R. M. J. Donoghue 1995 Winged fruits of Linnaeeae (Caprifoliaceae) in the Tertiary of western North America: Diplodipelta gen. nov. International Journal of Plant Sciences 156: 709-722[CrossRef]
Manchester S. R. L. Hably 1997 Revision of "Abelia" fruits from the Paleogene of Hungary, Czech Republic, and England. Review of Palaeobotany and Palynology 96: 321-240
Martin W. D. Lydiate H. Brinkmann G. Forkmann H. Saedler R. Cerff 1993 Molecular phylogenies in angiosperm evolution. Molecular Biology and Evolution 10: 140-162[Abstract]
Morley R. J. 2000 Origin and evolution of tropical forests. Wiley, Chichester, UK
Muller J. 1981 Fossil pollen records of extant angiosperms. Botanical Review 47: 1-142
Nikitin V. P. 1976 Flora Mamontovoj Gory. Nauka, Moscow, Russia
Ozaki K. 1980 Late Miocene Tatsumitoge flora of Tottori Prefecture, southwest Honshu, Japan. Science Reports of the Yokohama National University 2: 40-42
Pyck N. P. Roels E. Smets 1999 Tribal relationships in Caprifoliaceae: evidence from a cladistic analysis using ndhF sequences. Systematics and Geography of Plants 69: 145-159[CrossRef]
Pyck N. E. Smets 2000 A search for the position of the seven-son flower (Heptacodium, Dipsacales): combining molecular and morphological evidence. Plant Systematics and Evolution 225: 185-199
Reid E. M. M. E. J. Chandler 1926 Catalogue of Cenozoic plants in the Department of Geology. In The Bembridge flora, vol. 1. British Museum (Natural History), London, UK
Sanderson M. J. 1997 A nonparametric approach to estimating divergence times in the absence of rate constancy. Molecular Biology and Evolution 14: 1218-1231[ISI]
Sanderson M. J. 1998 Estimating rates and time in molecular phylogenies: beyond the molecular clock?. In D. E. Soltis, P. S. Soltis, and J. J. Doyle [eds.], Molecular systematics of plants II, 242264. Kluwer, Boston, Massachusetts, USA
Sanderson M. J. 2002 Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach. Molecular Biology and Evolution 19: 101-109
Sanderson M. J. 2003 r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics 19: 301-302
Sanderson M. J. J. A. Doyle 2001 Sources of error and confidence intervals in estimating the age of angiosperms from rbcL and 18S rDNA data. American Journal of Botany 88: 1499-1516
Sanmartin I. H. Enghof F. Ronquist 2001 Patterns of animal dispersal, vicariance and diversification in the Holarctic. Biological Journal of the Linnaean Society 73: 345-390
Sims H. J. P. S. Herendeen P. R. Crane 1998 New genus of fossil Fagaceae from the Santonian (Late Cretaceous) of central Georgia, USA. International Journal of Plant Sciences 159: 391-404[CrossRef]
Soltis P. S. D. E. Soltis M. W. Chase 1999 Angiosperm phylogeny inferred from multiple genes: a research tool for comparative biology. Nature 402: 402-404
Soltis D. E. P. A. Soltis M. W. Chase M. E. Mort D. C. Albach M. Zanis V. Savolainen W. H. Hahn S. B. Hoot M. F. Fay M. Axtell S. M. Swensen L. M. Price W. J. Kress K. C. Nixon J. S. Farris 2000 Angiosperm phylogeny inferred from 18S rDNA, rbcL, and atpB sequences. Botanical Journal of the Linnaean Society 133: 381-461[CrossRef]
Soltis P. S. D. E. Soltis V. Savolainen P. R. Crane T. G. Barraclough 2002 Rate heterogeneity among lineages of tracheophytes: integration of molecular and fossil data and evidence for molecular living fossils. Proceedings of the National Academy of Science, USA 99: 4430-4435
Swofford D. L. 2002 PAUP* 4.0: phylogenetic analysis using parsimony (* and other methods). Sinauer, Sunderland, Massachusetts, USA
Thorne J. L. H. Kishino 2002 Divergence time and evolutionary rate estimation with multilocus data. Systematic Biology 51: 689-702[CrossRef][ISI][Medline]
Thorne J. L. H. Kishino I. S. Painter 1998 Estimating the rate of evolution of the rate of molecular evolution. Molecular Biology and Evolution 15: 1647-1657[Abstract]
Tiffany B. H. S. R. Manchester 2001 The use of geological and paleontological evidence in evaluating plant phylogeographic hypotheses in the Northern Hemisphere Tertiary. International Journal of Plant Sciences 162: S3-S17[CrossRef]
Upchurch G. R. J. A. Wolfe 1987 Mid-Cretaceous to early Tertiary vegetation and climate: evidence from fossil leaves and woods. In E. M. Friis, W. G. Chaloner, and P. R. Crane [eds.], The origin of angiosperms and their biological consequences, 75105. Cambridge University Press, Cambridge, UK
Wiegmann B. M. D. K. Yeates J. L. Thorne H. Kishino 2003 Time flies, a new molecular time-scale for Brachyceran fly evolution without a clock. Systematic Biology 52: 745-756[CrossRef][ISI][Medline]
Wikstrom N. V. Savolainen M. W. Chase 2001 Evolution of the angiosperms: calibrating the family tree. Proceedings of the Royal Society of London, B 268: 2211-2220[Medline]