Reconstruction of Microsatellite Mutation History ... - Semantic Scholar

3 downloads 0 Views 158KB Size Report
David Weetman,1 Lorenz Hauser2 and Gary R. Carvalho ... expand (Rubinsztein et al. 1995 .... and Nei 1987), with Phylip version 3.5c (Felsenstein 1993).
Copyright  2002 by the Genetics Society of America

Reconstruction of Microsatellite Mutation History Reveals a Strong and Consistent Deletion Bias in Invasive Clonal Snails, Potamopyrgus antipodarum David Weetman,1 Lorenz Hauser2 and Gary R. Carvalho Department of Biological Sciences, University of Hull, Hull HU6 7RX, United Kingdom Manuscript received May 9, 2002 Accepted for publication July 15, 2002 ABSTRACT Direct observations of mutations and comparative analyses suggest that nuclear microsatellites show a tendency to expand, with reports of deletion biases limited to very long alleles or a few loci in multilocus studies. Here we investigate microsatellite evolution in clonal snails, Potamopyrgus antipodarum, since their introduction to Britain in the 19th century, using an analysis based on minimum spanning networks of multilocus microsatellite genotypes. British populations consist of a small number of highly distinct genotype groups with very few outlying genotypes, suggesting clonal lineages containing minor variation generated by mutation. Network patterns suggest that a single introduced genotype was the ancestor of all extant variation and also provide support for wholly apomictic reproduction within the most common clonal lineage (group A). Microsatellites within group A showed a strong tendency to delete repeats, with an overall bias exceeding 88%, irrespective of the exact method used to infer mutations. This highly unusual pattern of deletion bias is consistent across populations and loci and is unrelated to allele size. We suggest that for persistence of microsatellites in this clone, some change in the mutation mechanism must have occurred in relatively recent evolutionary time. Possible causes of such a change in mechanism are discussed.

M

ICROSATELLITES are short, tandemly repeated sequences of noncoding DNA that are widely used as markers in studies of population differentiation, relatedness, and genetic mapping (see Goldstein and Schlo¨tterer 1999). Mutations at microsatellites primarily involve slipped-strand mispairing events leading to the insertion or deletion of a number of repeat units during mitotic cell divisions (Eisen 1999). Many studies directly recording the direction of microsatellite mutations report a bias toward addition of repeats (e.g., Weber and Wong 1993; Amos et al. 1996; Primmer et al. 1996), with comparative, population genetic, and phylogenetic studies also suggesting that microsatellites show a tendency to expand (Rubinsztein et al. 1995; Amos 1999; Cooper et al. 1999; Zhu et al. 2000; Neff and Gross 2001). Examples of microsatellites exhibiting a deletion bias are rare and appear to be limited to a few loci in multilocus studies (DiRienzo et al. 1998; Udupa and Baum 2001) or to very long alleles at a particular locus (Wierdl et al. 1997; Ellegren 2000; Xu et al. 2000; Huang et al. 2002). To our knowledge there are no published records of deletion biases across a range of nuclear microsatellites in any organism. Naturally occurring mutations are difficult to observe

1 Corresponding author: Molecular Ecology and Fisheries Genetics Group, Department of Biological Sciences, University of Hull, Hull HU6 7RX, United Kingdom. E-mail: [email protected] 2 Present address: School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA 98195-5020.

Genetics 162: 813–822 (October 2002)

because of their rarity, but population bottlenecks present the opportunity to study microsatellite evolution indirectly (Estoup and Cornuet 1999). Asexual (apomictic) species may provide particularly informative models, because genotypes are inherited as complete linkage groups, with mutation as the only source of variation between mothers and daughters (Sunnucks et al. 1996). Thus, data can be analyzed by network methods (reviewed by Posada and Crandall 2001), treating multilocus genotypes as haplotypes, and such studies may be relatively free of the population genetic assumptions that burden some indirect analyses of microsatellite evolution (Amos 1999). We utilized a severe bottleneck, arising from a founder event, to study microsatellite mutations in an invasive clone of the hydrobiid snail Potamopyrgus antipodarum. The species is native to New Zealand, where diploid sexual populations are thought to produce occasional triploid apomicts (Dybdahl and Lively 1995). Following an introduction to the River Thames in the late 19th century (Smith 1889), P. antipodarum spread rapidly throughout the United Kingdom and continental Europe (Hughes 1996) and has recently colonized North America (Zaranko et al. 1997). Sexual reproduction has never been recorded in United Kingdom populations (Hughes 1996), and genetic studies using multilocus minisatellites (Hauser et al. 1992) and randomly amplified polymorphic DNAs (RAPDs; Jacobsen et al. 1996) have suggested that only a few distinct clones were introduced, corresponding to previously described morphotypes (Warwick 1952). The aims of our study were to use minimum spanning

814

D. Weetman, L. Hauser and G. R. Carvalho

networks, based on extant multilocus genotypes, to reconstruct the evolutionary history of microsatellite mutations since the introduction of P. antipodarum to Britain. This was achieved by undertaking a survey of British populations to identify multilocus genotype groups and identify their correspondence to previously identified morphological-genetic clonal groups. We report that the populations screened contain three major clonal genotype groups, corresponding to the known morphotypes, and another previously unidentified clone, with evidence for either extremely rare clonal lineages or similarly rare interclonal hybridization. Within the most common clonal group, which appears to be entirely apomictic in our study populations, microsatellites have evolved via a mutational process highly biased toward deletion of repeats.

MATERIALS AND METHODS Samples and screening: Samples of 100 P. antipodarum were collected from eight geographically isolated populations around Britain (locations are shown and described in Figure 1). DNA was extracted following homogenization of whole snails and the eight populations were screened at four microsatellite loci: Pa112, Pa143, Pa254, and Pa121 (extraction, PCR, and scoring procedures are described in Weetman et al. 2001). Three of these populations (TBR, IMF, and BMA), which consisted almost entirely of a single clonal group, were screened at three additional loci: Pa217, Pa56, and Pa132 (Weetman et al. 2001). Other populations were not scored at these three loci either because they contained a single clone with no microsatellite variability or because they included high frequencies of other clones in which the three extra loci could not be scored unambiguously. Data analysis: From the data with four scored microsatellites, we calculated a genetic distance measure based on the number of shared alleles per locus between two individuals, Dxy (Samadi et al. 1999), and produced a pairwise distance matrix from the mean Dxy over the four loci. From this matrix we constructed a tree using the neighbor-joining method (Saitou and Nei 1987), with Phylip version 3.5c (Felsenstein 1993). In the three populations (TBR, IMF, and BMA) for which data from seven loci were available, we constructed minimum spanning networks from the multilocus microsatellite genotypes, first assuming full interconnection between the populations and, second, assuming that the populations were isolated from one another. To construct the networks, we broadly followed the guidelines of Excoffier and Smouse (1994). Our initial step was to determine which were the nearest neighboring genotypes for each genotype, using the criterion of the fewest number of mutations. These connection data were used to produce a first spanning network. It then proved possible to infer the oldest genotype(s) on the basis of frequency, position, and number of connections and thus to infer also the direction of flow of mutations. This allowed a substantial reduction in the number of possible spanning networks, by minimizing the number of mutational steps in pathways. For the final stage of resolution we assumed that connections between low-frequency genotypes and (very) high-frequency (older) genotypes were more likely than those between two low-frequency genotypes, provided the number of mutations were equal (Smouse 1998). In the networks, a single mutation was defined as a change in allele state from one size to another, irrespective of the

number of repeat unit changes involved. This is likely to underestimate the total number of mutations, but was appropriate because an independent analysis of the data using simulations suggests that most of the loci show a mutation process deviating substantially from a single-step mutation model (D. Weetman, L. Hauser and G. R. Carvalho, unpublished results). However, this bias is likely to be relatively small because most of the changes observed involved shifts of only one to three repeat units. From the minimum spanning networks, we counted the number of inferred mutations representing positive and negative changes in allele size. Two estimates for the percentage of deletions were calculated; the first used only the number of inferred mutations in the network, irrespective of the number of snails that shared genotypes. The second estimate assumed that the frequency of different nonancestral genotypes was proportional to the frequency of occurrence of the mutations that produced them. Therefore, a separate mutation was assumed to have given rise to each individual. The first counting method follows parsimony in that it ignores the frequency of genotypes and is likely to underestimate the total number of historical mutations. By contrast, the second method implicitly assumes that the frequency of a genotype is proportional to its age (Watterson and Guess 1977; Excoffier and Smouse 1994) and is more likely to overestimate mutations. These two estimates were calculated to provide approximate lower and upper boundaries for the unknown true number of mutations. One-tailed binomial probabilities were computed to test whether apparent biases observed were significantly ⬎50% of deletions. Where significant biases were found, we also calculated a lower 95% binomial confidence limit (Zar 1984) for the estimate of the percentage of deletions. We found only one case in which two alleles at the same locus changed simultaneously between an inferred mutant individual and its most probable progenitor, and for this ambiguity we recorded each mutational alternative. In a few instances within the networks, more than one alternative pathway of approximately equal probability to a genotype was found, creating loops in the network. Loops may reflect homoplasies or be the result of recombination (Crandall and Templeton 1993; Posada and Crandall 2001) so the genotypes involved were examined to determine whether the loop could be resolved by the assumption of a single recombination event. In cases of loops, the percentage of deletions was calculated once for each mutational pathway involved in each loop, creating alternative mutational scenarios. In fact, the particular alternative scenario assumed had no effect on the estimate of the percentage of deletions, until the data were subdivided into individual loci within populations, because each set of alternatives involved the same number of additions and deletions. Thus, estimates of the percentage of deletions for the maximum and minimum number of possible mutations are given only at the stage when specific loci are examined. Finally the relationship between the size of progenitor alleles (in repeat units) and the sign of the mutational change (addition or deletion) was tested by logistic regression (in SPSS version 9), using pooled data from all populations and loci to provide suitable statistical power for the test.

RESULTS

Genotypic composition of the sampled populations: Thirty-four multilocus genotypes (based on four loci) were found in the British populations surveyed. Thirty of these clustered into four widely divergent groups,

Potamopyrgus Microsatellite Deletion Bias

815

Figure 1.—Phenetic tree based on the neighbor-joining method depicting distance (Dxy) among the multilocus microsatellite genotypes detected in eight British populations of P. antipodarum. Proportionate frequencies of genotype groups (genotypes within dashed circles) in each population are shown in the pie charts: white, clone A; gray, clone B; striped, clone C; stippled, clone D; black, other ungrouped genotypes. Sampling locations of the populations are shown on the map, their numbers corresponding to those on the pie charts. The populations are TBR, disused reservoir by River Thames in Barnes, London; CAE, stream flowing into the sea by Caerhays Castle, Cornwall; HAA, stream below Harlech Castle, Gwynedd; BMA, Budworth Mere (lake), Cheshire; IMF, river near Fleshwick, Isle of Man; WBB, coastal pond at West Barns, near Dunbar, Lothian; DUN, stream inland from Dunstanburgh Castle; STA, agricultural drainage ditches near Stainfield, Lincolnshire.

corresponding to four putative clonal lineages (Figure 1). Clonal lineage A was by far the most common and widespread of these groups, comprising ⵑ67% of all the snails screened, and was the only lineage present in four of the populations (TBR, IMF, DUN, and STA) and the most common in a further two (BMA and HAA). All locations dominated by clone A are freshwater sites, whereas the other two, WBB (dominated by clonal group B) and CAE (a mixture of three major clonal groups) are coastal, brackish-water locations (see Figure 1). Outside of these groupings, the four other rare genotypes identified comprised only 0.6% of the total sample. All were highly divergent from both the major groups and one another (Figure 1). Microsatellite variation within clonal group A: Figure 2 shows a minimum spanning network for multilocus genotypes from three of the populations (BMA, IMF, and TBR) based on seven microsatellite loci (genotypes

are listed in Table 1). Two very high-frequency genotypes are central within the network and are separated by a difference (of a single step) at only one allele, with low-frequency genotypes radiating from each. This link between the two dominant genotypes represents the only parsimonious connection between the group of genotypes found in the BMA population and those in the IMF and TBR populations. The lack of interconnection between BMA and IMF confirmed our a priori expectation that the populations would be isolated, and so networks are also shown for each of the populations treated independently (Figure 3). Two outlying genotypes can be identified (31 and 34 in Figures 2 and 3) by the high number of mutations separating these from their nearest neighbor. Genotype 34 falls outside the clone A lineage group (Figure 1), and although this is not the case for genotype 31, both are excluded from further analysis of patterns of variation within clone A. Of the

816

D. Weetman, L. Hauser and G. R. Carvalho Figure 2.—Network of multilocus microsatellite genotypes (based on seven loci) found in three of the populations surveyed (TBR, IMF, and BMA), assuming fully interconnected populations. Genotypes are shown as circles whose area is proportional to their frequency of occurrence. In most cases the length of connections is proportional to the number of inferred mutational changes between genotypes, but bars are also provided to show the number of changes. Deletions are shown as thick bars; additions are shown as thin bars. The central dashed line illustrates the division between the genotypes found in TBR/IMF and BMA, the only overlap occurring with central genotypes 1 and 7. Pathways of approximately equal probability are shown as more than one connection to a genotype. Genotype numbers correspond to those in Table 1.

loops involved in the networks in Figures 2 and 3, only that leading to genotype 29 could conceivably have been the result of a recombination event between the two most probable progenitors, genotypes 10 and 15. Deletion bias within clone A: The highest-frequency genotypes in the networks shown in Figures 2 and 3 fulfilled a number of criteria expected for candidate ancestral genotypes (see discussion). On the basis of this inference, mutations were assumed to flow from these central nodes (genotypes 1 and 7, respectively in Figures 2 and 3) outward toward the tips of the network. An obvious feature of the mutational pathways inferred is the relatively high number of events representing a reduction in allele size. From the network that assumes full interconnection between populations (Figure 2), the estimated percentage of deletions was 88.1% (N ⫽ 42, P Ⰶ 0.001, 95% lower confidence limit ⫽ 76.7%), and a consistently high bias was found across populations (Table 2) when these are treated independently (Figure 3), irrespective of whether or not the estimate incorporated genotype frequencies (Table 2). When data from the population networks (Figure 3) were subdivided to examine the bias for specific loci, a similar general pattern emerged (Table 3). No loci showed a significant percentage of additions and, depending on how mutations were counted, either three or five of the seven loci showed a significant percentage of deletions (Table 3). There was no relationship between allele size (in repeat units) and the direction of mutations (logistic regression, P ⬎ 0.7). All the shifts in allele size inferred from the minimum spanning networks involved changes of a maximum of four repeat

units, with most involving changes of three repeat units or less (see Table 1). DISCUSSION

Clonal composition of British populations: Microsatellite variation among the sampled populations shows a pattern consistent with the existence of a small number of highly divergent clonal lineages within which minor variation has been generated by mutation since the introduction of P. antipodarum to Britain in the late 19th century. The genetic distance between clonal groups is too great to allow for the possibility that one clone has given rise to another by mutation since invasion of Britain. Within groups, variation is minor and variant genotypes tend to be specific to particular populations (see Table 1) suggesting independent origins by mutation. These patterns support the hypothesis that only a few founder clones were introduced to Britain (Hauser et al. 1992). In previous genetic studies using multilocus minisatellites (Hauser et al. 1992) and RAPDs (Jacobsen et al. 1996), the morphotypes of clones A, B, and C (Warwick 1952) were found to correspond to distinct genotypes. In this study we did not attempt to match every individual morphotype to a microsatellite genotype. However, morphological examinations of shells suggested that morphotypes of clones A, B, and C were present in roughly the frequencies within populations revealed by subsequent microsatellite analysis. Thus, morphological identification of clones, multilocus minisatellites, RAPDs, and multilocus microsatellites all appear to produce concordant results.

95 1 2 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

81 0 1 0 0 2 1 1 2 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 0 0 0 0 0

IMF 8 0 0 0 0 0 65 0 0 0 0 0 0 0 0 1 3 2 1 4 4 1 3 1 1 0 0 0 0 0 0 1 1 1 2 1

BMA 184 1 3 1 1 2 66 1 2 1 1 1 1 1 1 1 3 2 1 4 4 1 3 1 1 1 1 1 1 1 1 1 1 1 2 1

兺 219/219/225 219/219/225 219/219/225 213/219/219 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/219 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/225 219/219/219 219/219/225 219/219/225 219/219/225 213/219/219 219/219/219 219/219/225

Pa112 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/169 148/157/166 148/157/163 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/157/166 148/151/166 148/157/166 148/157/166 148/157/163 148/151/166 148/157/163 148/157/163 148/157/166 148/151/163 148/157/166 148/157/169

Pa143 129/147/171 129/147/167 129/147/171 129/147/171 129/147/171 129/147/171 129/147/171 129/147/171 129/147/169 129/147/171 129/147/171 127/147/171 129/147/171 129/147/175 129/147/171 129/147/171 129/147/171 129/147/171 129/147/171 129/147/171 127/147/171 129/147/169 129/147/171 129/147/171 129/147/171 127/147/171 129/147/171 127/145/171 129/147/171 127/147/171 129/147/171 129/145/171 127/147/171 129/147/167 129/147/169 127/147/171

Pa254 103/109/118 103/109/118 103/109/118 103/109/118 109/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 106/109/109 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118 103/109/118

Pa121 171/171/171 171/171/171 171/171/171 171/171/171 171/171/171 169/169/171 171/171/171 165/171/171 171/171/171 169/171/171 171/171/171 171/171/171 171/171/171 171/171/171 171/171/171 165/167/167 171/171/171 165/171/171 171/171/171 169/171/171 171/171/171 171/171/171 167/171/171 169/169/169 165/169/169 171/171/171 169/169/169 171/171/171 169/171/171 169/169/169 165/167/171 171/171/171 167/169/171 171/171/171 171/171/171 169/171/171

Pa217

151/151/151 151/151/151 149/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 149/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 151/151/151 149/151/151 149/151/151 151/151/151 151/151/151 149/151/151 151/151/151 151/151/151 151/151/151 149/151/151 151/151/151

Pa56

176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/191 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/188 176/176/191 176/176/191 176/176/191 176/176/191 176/176/191

Pa132

Frequencies of each genotype within each population (TBR, IMF, and BMA) and their total (兺) are shown. The size of the PCR product amplified for each allele is given in base pairs. See Table 3 for details of the microsatellite sequences.

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36

TBR

Multilocus microsatellite genotypes, based on seven loci, found within the (triploid) clone A lineage group

TABLE 1

Potamopyrgus Microsatellite Deletion Bias 817

818

D. Weetman, L. Hauser and G. R. Carvalho

Figure 3.—Networks constructed from multilocus microsatellite genotypes assuming separate, unconnected populations. See Figure 2 legend for further explanation.

Identification of rare genotypes: A previously unidentified clone, which we have called clone D, was found as the most common genotype in the CAE population from Cornwall. Although we did not identify this as a distinct morphotype, there has been anecdotal description of a fourth potential morphological strain (T. Warwick, personal communication). This additional clonal genotype appears to be uncommon and to have a restricted distribution. TABLE 2 Number of inferred mutations (N ) within each of the populations of clone A (from network in Figure 3), counting either the number of genotypes (G) or their frequencies (F ) Population TBR IMF BMA

Count

N

% deletion

95% LCL

G F G F G F

4 5 19 21 22 41

75.0 80.0 78.9** 81.0** 95.5**** 97.6****

58.1 62.2 80.2 88.9

Lower 95% confidence limits (95% LCL) for the percentage of deletions are shown where estimates are significantly different from zero (one-tailed binomial probability). **P ⬍ 0.01; ****P Ⰶ 0.001.

Four other genotypes, represented by a total of just five individuals, were identified. These may constitute highly mutated individuals originating from the major clones, extremely rare clones, or they may be hybrids between genotypes from within clonal groups. With the possible exception of the individual from the very large and relatively variable BMA population, it is unlikely that rather distantly related genotypes such as these could be mutant descendants of the four major clonal lineages. A study using allozymes to investigate the genetics of P. antipodarum in Britain reported evidence for multiple clones, which did not correspond to morphological types (Foltz et al. 1984). Although this contradicts all genetic data from DNA markers (above) and was probably questionable because of technical considerations (Hauser et al. 1992; Hughes 1996), we cannot exclude the possibility that other rare clones do exist within Britain. Very rare hybridization events (through sexual reproduction) between clones might occur and have been recorded in other primarily asexual snails (e.g., Samadi et al. 1999), but if this is the case here, based on the allele composition of the very rare genotypes (data not shown), it does not appear that they could have arisen from recombination between any pair of genotypes recorded within the clonal groups. Clonal dynamics: Clonal lineage A is dominant through-

Potamopyrgus Microsatellite Deletion Bias

819

TABLE 3 As Table 2, but with the number of inferred mutations within each population shown separately for each of the seven loci screened Maximum Locus Pa112 (AAG)6-10 Pa143 (AAT)7-14 Pa254 (GA)20-44a Pa121 (ATT)8-12 Pa217 (CA)11-14 Pa56 (TG)14-15a Pa132 (TAA)13-18

Count

N

% deletion

G F G F G F G F G F G F G F

4 7 6 6 12 17 3 3 17 24 6 7 2 9

100 100** 66.7 66.7 91.7** 92.3** 33.3 33.3 100**** 100**** 100* 100* 50 88.9*

Minimum 95% LCL 65.2

66.1 68.3

83.8 87.3 60.7 65.2 57.1

N

% deletion

3 5 5 5 8 9 3 3 14 20 5 6 2 9

100 100 60 60 87.5* 88.9 33.3 33.3 100**** 100**** 100* 100* 50 88.9*

95% LCL 54.9

53.0 57.1

80.7 86.1 54.9 60.7 57.1

In some cases alternative mutational pathways involving different loci were present in the network (Figure 3), so the maximum and minimum possible number of mutations are shown for each locus. *P ⬍ 0.05; **P ⬍ 0.01; ****P Ⰶ 0.001. a Repeat motifs have point interruptions—sequences given in Weetman et al. (2001).

out freshwater habitats, with B almost entirely restricted to brackish locales and C found only in the population from Wales (HAA). This description of clonal frequencies, habitat types, and geographical locations corresponds closely to those noted ⬎50 years ago (reported in Warwick 1952, 1969), on the basis of morphological descriptions of types A, B, and C. This suggests rather stable clonal dynamics, which may be related to the small number of clonal genotypes, apparent lack of conspecific sexual competitors, and an absence of parasites, thought to balance advantages of apomictic reproduction in New Zealand populations (Lively 1987). These factors are also likely to explain, at least in part, the remarkable success of P. antipodarum as an invasive species. Following the introduction to Britain, P. antipodarum has spread throughout much of continental Europe, which appears to be dominated by the same clonal groups A and B as Britain (Warwick 1969; Jacobsen et al. 1996). Criteria for inclusion in the study of within-lineage microsatellite evolution: The microsatellites used in this study were isolated from individuals of lineage A and produced unambiguous triploid microsatellite patterns at all seven loci only when amplifying DNA from this clone. For this reason and because of the numerical dominance of this clonal group in our samples, we concentrated our study of within-clonal-lineage microsatellite evolution on group A individuals. This strategy also greatly limited the potential for null alleles, which are of particular concern in studies of polyploid species. Moreover, we did not consider individuals from two small popu-

lations (DUN and STA) because, although of group A, they appeared to be invariant from the main clone A genotype, at least as determined by the four loci at which they were screened (genotype with frequency of 503 in Figure 1). Group A individuals from the HAA and CAE populations were also excluded because the potential for occurrence of any interclonal hybridization would be relatively higher where there was a mixture of clones. Thus, we concentrated the analysis on three populations, TBR, IMF, and BMA. Identification of an ancestral introduced genotype and apomixis within clonal lineage A: The minimum spanning networks based on multilocus microsatellite genotypes showed the general form of star phylogenies, with a high-frequency genotype in the center linked to several much lower-frequency genotypes and a general decrease in frequency toward the tips of the network. Star phylogenies are characteristic of recent expansion from a small number of ancestral haplotypes (Avise 2000), which can be inferred from (1) their high frequency, (2) their interior position in the network, (3) their high number of connections to other haplotypes, and (4) their relatively wide geographical distribution (Watterson and Guess 1977; Donnelly and Tavare´ 1986; Crandall and Templeton 1993; Excoffier and Smouse 1994; Crandall 1996; Avise 2000; Posada and Crandall 2001). These diagnostics all apply to the minimum spanning networks produced from the multilocus microsatellite genotypes in the three populations studied. In the network assuming full interconnection between the populations there are two high-frequency genotypes,

820

D. Weetman, L. Hauser and G. R. Carvalho

separated by a single mutation (see Figure 2), one dominant in the TBR and IMF populations and the other in the BMA population. To determine whether one of these was likely to be a single common ancestor we genotyped a small number (N ⫽ 5 per population) of clonal group A individuals from the geographically dispersed STA, CAE, and HAA populations at a later date (DNA from DUN individuals had unfortunately degraded). All 15 of these extra samples from STA, CAE, and HAA were of genotype 1, the most common in TBR and IMF. Thus, in addition to its frequency, position, and number of connections in the networks, because of this widespread distribution, this is likely to be the single ancestral genotype for clone A that was introduced to Britain. The most common genotype in BMA (genotype 7) may be the result of a mutation very early in the colonization process. This suggestion is supported by the number of connections to other genotypes originating from genotype 7, which is only slightly lower than those from genotype 1. Minimum spanning networks have been more commonly used in analyses of mitochondrial DNA (Avise 2000) but have also been applied to studies of microsatellites on the human Y chromosome (e.g., Karafet et al. 1999) in which nonrecombinant haplotypes are transmitted intact except for mutation. Our networks, obtained from triploid data, correspond closely to the form of minimum spanning networks constructed from haploid data (see examples for mtDNA star phylogenies in Avise 2000), with their central high-frequency genotypes and relatively few genotypes involved in loops. We found only one genotype that could have potentially arisen via recombination, in that this would resolve a loop in the networks (Crandall and Templeton 1993; Posada and Crandall 2001). However, because the genotype involved was only two mutational steps away from the core of the network and this was an isolated possible incidence of recombination, it is difficult to argue that recombination is a more probable explanation than homoplasy. We consider it more likely that reproduction within these populations is entirely apomictic, suggesting concordance between microsatellite data and the lack of evidence of sexual reproduction in P. antipodarum in Britain (Hughes 1996). Microsatellites show a strong and consistent bias toward deletion of repeats: For interconnected populations and pooled loci we estimated that ⬎88% of the mutations inferred from the minimum spanning networks were deletions of repeats. Analysis based on unconnected populations suggested estimates of 75 or 80% for TBR, 79 or 81% for IMF, and 95 or 97% for BMA, respectively, depending on whether the genotypes or their frequencies were counted as mutations. Although the estimate for TBR is based on very limited data, the consistency across populations suggests a deletion bias approaching or exceeding 80%. It is conceivable that

estimates based on pooled data might be biased by the effect of a single locus, notably Pa217, which showed not only 100% deletions, but also the highest mutation rate (D. Weetman, L. Hauser and G. R. Carvalho, unpublished results). However, 100% of deletions were found for two of the other loci, albeit based on fewer mutations, and a bias of between 87.5 and 92% (depending on the method of estimation) was found for the second most variable locus Pa254. Indeed, only one of the seven loci showed ⬍50% of deletions and this was from only three inferred mutations. Therefore, we suggest that a strong bias toward deletions may be a common feature of the microsatellites of clone A of P. antipodarum in Britain. To our knowledge, this is the first example of a consistent deletion bias across a sample of nuclear microsatellite loci in any organism. Presence of microsatellites with a strong deletion bias suggests a recent change in mutation mechanism: The existence of a strong deletion bias raises the question of how microsatellites could have persisted in this clone of P. antipodarum. Two models of microsatellite evolution could provide an explanation, if the observed deletion bias is caused by a nonrandom sample of loci. First, a bias toward deletions for long microsatellite alleles, with an insertion bias for shorter alleles, could preserve a stable overall allele distribution (Xu et al. 2000). Here, however, this is inapplicable because the alleles of most of our loci were of only short to moderate length (see Table 3) and also we found no relationship between allele size and the direction of change. Second, constraints on the total length of linked loci might lead to insertion biases for some loci, coupled with deletions for others (Dermitzakis et al. 1998). This possibility is difficult to rule out, but it seems unlikely since none of the seven loci had a significant tendency to expand, though other studies have found significant expansion biases to be common. Therefore, to explain the existence of both the extreme directional bias and the presence of mutable loci in P. antipodarum it would seem that some factor affecting the direction of mutations has changed in relatively recent evolutionary time. Possible causes of a change in mutation mechanism leading to a deletion bias: Could loss of meiosis caused by conversion from diploid sexuality to triploid apomixis have altered the mutation process? Although some DNA slippage-related mutations might occur just prior to meiosis via gene conversion type events (Amos 1999), microsatellite mutations occur primarily during mitosis (reviewed by Sia et al. 1997a; Eisen 1999), so lack of meiosis per se is probably an unlikely explanation. Moreover, all the inferred mutations involved changes of a maximum of four repeat units, which would suggest that they have arisen wholly by slippage, and not by some unusual mutation mechanism. Chenuil et al. (1997) reported that tetraploid sexual barbels (Barbus sp.) appear to have fewer and shorter dinucleotide microsatel-

Potamopyrgus Microsatellite Deletion Bias

lites than three other diploid cyprinid species examined, suggesting that perhaps polyploidy, rather than asexuality, may be associated with reduced microsatellite size caused by a mutation bias toward deletions. A change in ploidy level might be a mechanistic cause of a change in mutation process; for example, the extra polymerase or mismatch-repair (MMR) proteins encoded by one or more extra sets of chromosomes might disrupt the MMR protein complexes formed and so affect their function. Currently, the age of the original genotype of clone A of P. antipodarum is unknown but an explanation based on change in ploidy would rely on a relatively recent conversion to triploidy to be consistent with the continued existence of microsatellites. A deletion bias might also be favored by selection for reduced genome size. Indeed, the only published report of a deletion bias that is consistent across microsatellites is in mitochondrial microsatellites of yeast, Saccharomyces cerevisiae (the nuclear microsatellites showed a very strong insertion bias; Sia et al. 2000). Mitochondrial genomes are thought to be under selection to reduce their size (Selosse et al. 2001), and for opportunistic organisms, because of the expected inverse correlation between genome size and development rate (Charlesworth et al. 1996; Hughes 1999), selection would be predicted to act against the accumulation of large amounts of “junk” DNA (Charlesworth et al. 1996). For P. antipodarum clones invading Britain, successful persistence and spread may hinge on fast development, providing the ability to rapidly increase numbers following founder events or other population bottlenecks. Selection acting on individual microsatellite mutations seems unlikely because of their marginal effect on total genome size, but variants for both polymerase-␦ (Kokoska et al. 2000) and MMR proteins (Sia et al. 1997b), which cause preferential deletion of repeat units, have been found in S. cerevisiae and could have a more substantial effect in the genome upon which selection can act. Such a variant might have become fixed by chance in clonal group A of P. antipodarum, but because of the consistency of the deletion bias across populations, either the variant must have been introduced or it must have occurred very early in the colonization process. MMR variants that raise the mutation rate in clones of Escherichia coli adapting to new environments (“mutators”) offer a substantial, but transitory, advantage (de Visser et al. 1999). However, a polymerase or MMR variant that changes the indel ratio, rather than rate, of mutations provides the potential to regulate repetitive DNA, and so could be under long-term positive selection in an asexual colonist such as P. antipodarum. We are grateful to Bill Amos, Brian Charlesworth, Africa Go´mez, David Lunt, and Craig Primmer for comments on an earlier version of this manuscript and to Cock van Oosterhout for helpful discussions. The manuscript was much improved following comments on a previ-

821

ous version from two anonymous referees. This study was supported by N.E.R.C. (UK) grant GR9/04307.

LITERATURE CITED Amos, W., S. J. Sawcer, R. W. Feakes and D. C. Rubinsztein, 1996 Microsatellites show mutational bias and heterozygote instability. Nat. Genet. 13: 390–391. Amos, W., 1999 A comparative approach to the study of microsatellite evolution, pp. 66–79 in Microsatellites: Evolution and Applications, edited by D. B. Goldstein and C. Schlo¨tterer. Oxford University Press, Oxford. Avise, J. C., 2000 Phylogeography: The History and Formation of Species. Harvard University Press, Cambridge, MA. Charlesworth, B., P. Sniegowski and W. Stephan, 1996 The evolutionary dynamics of repetitive DNA in eukaryotes. Nature 371: 215–220. Chenuil, A., E. Desmarais, L. Poyaud and P. Berrebi, 1997 Does polyploidy lead to fewer and shorter microsatellite in Barbus (Teleostei: Cyprinidae)? Mol. Ecol. 6: 169–178. Cooper, G., N. J. Burroughs, D. A. Rand, D. C. Rubinsztein and W. Amos, 1999 Markov chain Monte Carlo analysis of human Y-chromosome microsatellites provides evidence of biased mutation. Proc. Natl. Acad. Sci. USA 96: 11916–11921. Crandall, K. A., 1996 Multiple interspecies transmission of human and simian T-cell leukemia/lymphoma virus type I sequences. Mol. Biol. Evol. 13: 115–131. Crandall, K. A., and A. R. Templeton, 1993 Empirical tests of some predictions from coalescent theory with applications to intraspecific phylogeny reconstruction. Genetics 134: 959–969. Dermitzakis, E. T., A. G. Clark, C. Batargias, A. Magoulas and E. Zouros, 1998 Negative covariance suggests mutation bias in a two-locus microsatellite system in the fish Sparus aurata. Genetics 150: 1567–1575. de Visser, J. A. G. M., C. W. Zeyl, P. J. Gerrish, J. L. Blanchard and R. E. Lenski, 1999 Diminishing returns from mutation supply rate in asexual populations. Science 283: 404–406. DiRienzo, A., P. Donnelly, C. Toomajian, B. Sisk, A. Hill et al., 1998 Heterogeneity of microsatellite mutations within and between loci, and implications for human demographies. Genetics 148: 1269–1284. Donnelly, P. and S. Tavare´, 1986 The ages of alleles and a coalescent. Adv. Appl. Probab. 18: 1–19. Dybdahl, M., and C. M. Lively, 1995 Diverse, endemic and polyphyletic clones in mixed populations of a freshwater snail (Potamopyrgus antipodarum). J. Evol. Biol. 8: 385–398. Eisen, J. A., 1999 Mechanistic basis for microsatellite instability, pp. 34–48 in Microsatellites: Evolution and Applications, edited by D. B. Goldstein and C. Schlo¨tterer. Oxford University Press, Oxford. Ellegren, H., 2000 Heterogeneous mutation processes in human microsatellite DNA sequences. Nat. Genet. 24: 400–402. Estoup, A., and J.-M. Cornuet, 1999 Microsatellite evolution: inferences from population data, pp. 49–65 in Microsatellites: Evolution and Applications, edited by D. B. Goldstein and C. Schlo¨tterer. Oxford University Press, Oxford. Excoffier, L., and P. E. Smouse, 1994 Using allele frequencies and geographic subdivision to reconstruct gene trees within species: molecular variance parsimony. Genetics 136: 343–359. Felsenstein, J., 1993 PHYLIP (Phylogeny Inference Package), version 3.5c. Department of Genetics, University of Washington, Seattle. Foltz, D. W., H. Ochman, J. S. Jones and R. K. Selander, 1984 Genetic heterogeneity within and among morphological types of the parthenogenetic snail, Potamopyrgus jenkinsi (Smith 1889). J. Molluscan Stud. 50: 242–245. Goldstein, D. B., and C. Schlo¨tterer, 1999 Microsatellites: Evolution and Applications. Oxford University Press, Oxford. Hauser, L., G. R. Carvalho, R. N. Hughes and R. E. Carter, 1992 Clonal structure of the introduced freshwater snail Potamopyrgus antipodarum (Prosobranchia: Hydrobiidae), as revealed by DNA fingerprinting. Proc. R. Soc. Lond. Ser. B 249: 19–25. Huang, Q. Y., F. H. Xu, H. Shen, H. Y. Deng, Y. J. Liu et al., 2002

822

D. Weetman, L. Hauser and G. R. Carvalho

Mutation patterns at dinucleotide microsatellite loci in humans. Am. J. Hum. Genet. 70: 625–634. Hughes, A. L., 1999 Adaptive Evolution of Genes and Genomes. Oxford University Press, New York. Hughes, R. N., 1996 Evolutionary ecology of parthenogenetic strains of the prosobranch snail, Potamopyrgus antipodarum (Gray) ⫽ P. jenkinsi (Smith). Malacol. Rev. Suppl. 6: 101–113. Jacobsen, R., V. E. Forbes and O. Skovgaard, 1996 Genetic population structure of the prosobranch snail Potamopyrgus antipodarum (Gray) in Denmark using PCR-RAPD fingerprints. Proc. R. Soc. Lond. Ser. B 263: 1065–1070. Karafet, T., L. P. Osipova, O. L. Posukh, V. Wiebe and M. F. Hammer, 1999 Y chromosome microsatellite haplotypes and the history of Samoyed-speaking populations in northwest Siberia, pp. 249–265 in Microsatellites: Evolution and Applications, edited by D. B. Goldstein and C. Schlo¨tterer. Oxford University Press, Oxford. Kokoska, R. J., L. Stefanovic, J. DeMai and T. D. Petes, 2000 Increased rates of genomic deletions generated by mutations in the yeast gene encoding DNA polymerase delta or by decreases in the cellular levels of DNA polymerase delta. Mol. Cell. Biol. 20: 7490–7504. Lively, C. M., 1987 Evidence from a New Zealand snail for the maintenance of sex by parasitism. Nature 328: 519–521. Neff, B. D., and M. R. Gross, 2001 Microsatellite evolution in vertebrates: inference from AC dinucleotide repeats. Evolution 55: 1717–1733. Posada, D., and K. A. Crandall, 2001 Intraspecific gene genealogies: trees grafting into networks. Trends Ecol. Evol. 16: 37–45. Primmer, C. R., H. Ellegren, N. Saino and A. P. Møller, 1996 Directional evolution in germline microsatellite mutations. Nat. Genet. 13: 391–393. Rubinsztein, D. C., W. Amos, J. Leggo, S. Goodburn, R. L. Margolis et al., 1995 Microsatellite evolution—evidence for directionality and variation in rate between species. Nat. Genet. 10: 337–343. Saitou, N., and M. Nei, 1987 The neighbour-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4: 406–425. Samadi, S., J. Mava´rez, J. P. Pointier, B. Delay and P. Jarne, 1999 Microsatellite and morphological analysis of population structure in the parthenogenetic freshwater snail Melanoides tuberculata: insights into the creation of clonal variability. Mol. Ecol. 8: 1141– 1153. Selosse, M.-A., B. Albert and B. Godelle, 2001 Reducing the genome size of organelles favours gene transfer to the nucleus. Trends Ecol. Evol. 16: 135–141. Sia, E. A., S. Jinks-Robertson and T. D. Petes, 1997a Genetic control of microsatellite stability. Mutat. Res. 383: 61–70.

Sia, E. A., R. J. Kokoska, M. Dominska, P. Greenwell and T. D. Petes, 1997b Microsatellite instability in yeast: dependence on repeat unit size and DNA mismatch repair genes. Mol. Cell. Biol. 17: 2851–2858. Sia, E. A., C. A. Butler, M. Dominska, P. Greenwell, T. D. Fox et al., 2000 Analysis of microsatellite mutations in the mitochondrial DNA of Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. USA 97: 250–255. Smith, E. A., 1889 Notes on British Hydrobiidae with a description of a supposed new species. J. Conchol. 6: 142–146. Smouse, P. E., 1998 To tree or not to tree. Mol. Ecol. 7: 399–412. Sunnucks, P., P. R. England, A. C. Taylor and D. F. Hales, 1996 Microsatellite and chromosomal evolution of parthenogenetic Sitobion aphids in Australia. Genetics 144: 747–756. Udupa, S. M., and M. Baum, 2001 High mutation rate and mutational bias at (TAA)n microsatellite loci in chickpea (Cicer arietinum L.). Mol. Genet. Genomics 265: 1097–1103. Warwick, T., 1952 Strains in the mollusc Potamopyrgus jenkinsi (Smith). Nature 169: 551–552. Warwick, T., 1969 Systematics of the genus Potamopyrgus (Hydrobiidae) in Europe and causation of the keel in this clone. Proc.Third Eur. Malacol. Congr. 9: 301–302. Watterson, G. A., and H. A. Guess, 1977 Is the most frequent allele the oldest? Theor. Popul. Biol. 11: 141–160. Weber, J. L., and C. Wong, 1993 Mutation of human short tandem repeats. Hum. Mol. Genet. 2: 1123–1128. Weetman, D., L. Hauser and G. R. Carvalho, 2001 Isolation and characterization of di- and trinucleotide microsatellites in the freshwater snail Potamopyrgus antipodarum. Mol. Ecol. Notes 1: 185–187. Wierdl, M., M. Dominska and T. D. Petes, 1997 Microsatellite instability in yeast: dependence on the length of the microsatellite. Genetics 146: 769–779. Xu, X., M. Peng, Z. Fang and X. Xu, 2000 The direction of microsatellite mutations is dependent upon allele length. Nat. Genet. 24: 396–399. Zar, J. H., 1984 Biostatistical Analysis. Prentice-Hall International, London. Zaranko, D. T., D. G. Farara and F. G. Thompson, 1997 Another exotic mollusc in the Laurentian Great Lakes: the New Zealand native Potamopyrgus antipodaum (Gray 1843) (Gastropoda, Hydrobiidae). Can. J. Fish. Aquat. Sci. 54: 809–814. Zhu, Y., D. C. Queller and J. E. Strassmann, 2000 A phylogenetic perspective on sequence evolution in microsatellite loci. J. Mol. Evol. 50: 324–338. Communicating editor: G. B. Golding