The genetic basis of eyespot size in the butterfly Bicyclus anynana: an analysis of line crosses

Play all audios:

The tropical butterfly Bicyclus anynana shows wide variation in the size of the eyespots on the ventral side of its wings. Differences in the environmental temperature during late larval and

early pupal stages are a major source of this variation, but variation also exists within temperatures. Using lines selected at a single temperature for large and small eyespots, and a

number of crosses derived from these lines, we studied the genetic basis of eyespot size in B. anynana. We applied Lande’s modification of the Castle–Wright (C–W) estimator to estimate the

minimum number of genes contributing to the difference between the two lines. Estimates indicated that at least five genes are involved. As the C–W estimator is based on a number of

simplifying assumptions, we tested to what extent one of these assumptions (additive gene action) is actually met. Joint-scaling tests indicated that the assumption of additivity is not

satisfied and that dominance and probably epistasis play a role. Because reciprocal crosses were available we looked for evidence for sex-linkage and cytoplasmic effects. No evidence for

cytoplasmic effects was found, but the data were consistent with the presence of one or more loci on the X-chromosome. The results are discussed in the context of the current model of

eyespot formation.

Butterfly wing patterns in general, and butterfly eyespots in particular, have proven very useful objects in the study of the genetic and developmental bases of phenotypic evolution.

Although spectacularly diverse, butterfly wing patterns nonetheless possess an underlying simplicity: the nymphalid groundplan (Nijhout, 1991). This groundplan — the collection of homologies

among pattern elements — has been modified in numerous ways to generate patterns that are involved in camouflage, deflection of predators, mimicry, and communication between the sexes.

Experimental investigation of wing patterns is greatly facilitated by their structural simplicity: the colour patterns are mosaics of tiny, differently pigmented scales that cover the wings

like tiles on a roof. Nijhout (1980) conducted a series of transplant and cautery experiments in early pupae of Precis coenia that resulted in his formulation of the morphogen-sink model. In

this model, cells at the centre (or focus) of the future adult pattern element (e.g. an eyespot) produce a chemical (the morphogen) that diffuses into surrounding cells. These surrounding

cells respond to different concentrations of the morphogen by producing differently coloured scales (see French & Brakefield (1992) for an alternative model, in which the focus acts as a

sink rather than a source).

Evolutionary change of wing patterns requires a genetic basis of pattern variation, which can be examined using spontaneous or induced mutations, or in artificial selection experiments.

These approaches have been used extensively in the tropical butterfly Bicyclus anynana [see Brakefield (1998) and Brakefield & French (1999) for summaries]. Selection experiments have

uncovered considerable amounts of genetic variation in various characteristics of eyespots in Bicyclus. Not all variation in this species has a genetic basis, however. Successive generations

of Bicyclus show distinct phenotypes in response to wet–dry seasonality. In the wet season individuals are conspicuous because of the large eyespots on the ventral side of their wings,

whereas in the dry season individuals are cryptic because their eyespots are greatly reduced in size. In the laboratory, rearing larvae at a high (27°C) or low (17°C) temperature results in

the wet season form or dry season form, respectively. Intermediate temperatures yield intermediate phenotypes, indicative of a continuous reaction norm. This polyphenism is thought to have

evolved as a balance between selection for crypsis in the cool, dry season and selection for large eyespots (that can act as deflection devices) in the warm, wet season. Results from field

experiments support this hypothesis (N. Reitsma, G. Engelhard, and P. Brakefield, unpubl. data).

Here we report on a study of the genetic basis of eyespot size in Bicyclus anynana. A pair of lines was selected for large eyespots (the HIGH line) and for small eyespots (the LOW line) on

the ventral side of the hindwings. These lines, and a number of derived line crosses, have been used by Brakefield et al. (1996) to produce initial estimates of the minimum number of loci

contributing to the difference in eyespot size applying Lande’s (1981) modification of the Castle–Wright (C–W) estimator (Castle, 1921; Wright, 1968). In this article we will provide

additional (and more detailed) information to that reported by Brakefield et al. (1996). Because the C–W estimator assumes equal, additive allelic effects we examined whether an additive

model actually suffices to explain the data. Finally, because reciprocal crosses were available for all line crosses it was also possible to look for both maternal effects and effects of

loci on sex chromosomes.

The stock population was established from more than 80 gravid females collected in 1988 near Nkhata Bay in Malawi. It is maintained at an adult population size of 600–800 with some overlap

of generations. Larvae feed on young maize plants; adult butterflies are given mashed banana.

The selection lines were established by truncation selection on the size of the largest (fifth) eyespot on the ventral hind wing. At least 40 females were used as selected parents in each

line after being held in a mating cage with about 100 males with extreme large or small eyespots (under these conditions estimates of Ne will be at least 50; see Saccheri et al., 1999).

Selection for the first 10 generations was applied to butterflies reared at 20°C. The lines responded readily to selection due to the presence of considerable amounts of additive genetic

variance (similar selection experiments yielded realized heritabilities of about 0.4). In order to increase phenotypic variation, selection was then continued by rearing the LOW line at 23°C

for 10 generations and the HIGH line at 18°C for six generations. In each generation 300–400 individuals of each sex were measured with high repeatability using an image analysis system.

All crosses were made at the end of selection with about 50 individuals of each sex. The rearing was carried out over two generations because it was logistically impossible to rear all

lines/line crosses at the same time. Each line/line cross was reared in a separate cage at 23°C, 70% relative humidity, and under a 12 h:12 h light:dark cycle. 23°C is an intermediate

temperature with respect to both seasonal forms so that temperature-effects on the lines/line crosses are minimized.

Eyespots in Bicyclus consist of a white centre, a black disc, and a gold outer ring. Eyespot size was measured as the diameter of the black disc of the fifth eyespot on the ventral hind

wing. In the LOW line it sometimes occurred that the black ring was absent; in these cases the diameter of the white centre of the eyespot was measured. Forewing length was taken as a

measure of overall body size. Measurements were made with high repeatability using a binocular microscope fitted with a micrometer.

The Shapiro–Wilk test and Levene’s test were used to assess departures from normality and homogeneity of variances, respectively; Tukey’s studentized range test was used to find out which

line crosses differed from each other. The line crosses will be considered separately in the analyses of cytoplasmic effects and sex linkage, but they will be pooled in the analyses of the

number of genes and of the composite effects.

When referring to line crosses the female parent will be given first. For example, the F1 cross ‘H × L’ refers to the progeny of a cross between HIGH line females and LOW line males. Their

offspring (=F2) is referred to as ‘HL × HL’. ‘HL × H’ is an example of a backcross, in this case between H × L (F1) females and HIGH line males.

The C–W method assumes that loci are unlinked and that alleles have equal, additive effects both within and among loci. Furthermore, all alleles that increase the magnitude of the focal

trait should be fixed in one parental (HIGH) line while all alleles that decrease the value of that trait should be fixed in the other parental (LOW) line. Violations of one or more of these

assumptions will generally result in underestimating the number of effective factors nE (Cockerham, 1986; Zeng et al., 1990; Zeng, 1992).

A consequence of linkage is that estimates of nE should be smaller than the haploid number of chromosomes plus the mean number of recombination events per gamete. Assuming one or two

recombination events per chromosome, estimates of nE for B. anynana (n=13) cannot exceed 26–39. The claim by Turner & Sheppard (1975) that crossing over is absent in females of two

Heliconius species (and some other butterfly species) is interesting in this context. As we have no data on recombination frequencies in Bicyclus we cannot use the modified C–W estimators

given by Zeng (1992) and Zeng et al. (1990) that take linkage into account.

When the assumptions hold the number of genes responsible for the difference between the parental lines is given by:

where μi is the mean of the parental line i and σ2s is the segregational variance (the excess variance that appears in the F2). A biased estimator of nE is

where the extra terms in the numerator are the sampling variances of the means of the parental lines.

When data from the parental lines, the F1, the F2, and the backcrosses B1 and B2 are available four ways of estimating σ2s are possible (see formulas 4a–d in Lande, 1981). Corresponding to

the four ways of estimating there are four ways of estimating Var[σ2s] (formulas 7 and 8a–d in Lande, 1981). Instead of these four interrelated estimates one can also combine all the

information into one estimate (Cockerham, 1986). Because these sampling variances require that the trait values are normally distributed, empirical standard errors were obtained by a

bootstrap procedure (Efron & Tibshirani, 1993; Manly, 1997) in which 1000 values of n^E were calculated by sampling with replacement from the pooled raw data.

The C–W model depends critically on the assumption of additive allelic effects. The availability of line crosses enables one to obtain an impression of the relative contributions of

additive, dominance, and epistatic effects to the differentiation of the two parental lines. Here only models with additive and additive + dominance effects will be considered. A model that

includes epistasis (containing six parameters) cannot be evaluated because data from only six lines/line crosses are available.

In a joint-scaling test, see Lynch & Walsh, (1998) for details, weighted least-square regression is used to estimate the parameters of an additive model (i.e. the expected mean phenotype of

the F2 μ0, and the composite additive effect αci), or an additive + dominance model (μ0, αci and the composite dominance effect δ1c). If the assumption of normality is met, then a χ2

statistic for goodness-of-fit can be used to compare the estimates with the observed means. Starting with the simplest model, higher-order composite effects are added until the predictions

are no longer significantly different from the observations. The test can be applied to variances in an analogous manner. For variances, only results for an additive model will be given

because further partitioning of the segregational variance into dominance and epistatic components gives statistically questionable results (Lynch & Walsh, 1998).

The basic statistics for each line/line cross are given in Table 1. Most lines/line crosses show little (0.05 < P < 0.01) or no departures from normality; however, the LOW line (both sexes),

H × L (females), L × H (females), and the stock (females) deviate markedly (P < 0.01 or P < 0.001) from normality. As the HIGH and LOW lines may be at the limits of their phenotypic

expression, we might expect their phenotypic distributions to be skewed to the left and to the right, respectively. This expectation is borne out for the LOW line, but not for the HIGH line.

Neither logarithmic transformations (cf. chapter 10 in Wright, 1968) nor Box–Cox transformations were able to remove these departures; in fact, these transformations made matters worse

(data not shown).

The means of the reciprocal crosses of the F1 differ in females (F1,227=19.49, P < 0.0001; a Mann–Whitney test – necessary because of non-normality — confirms this result) but not in males

(F1,204=1.56, P=NS); both crosses have equal variances (Females: F1,227=1.76, P=NS; Males: F1,204=0.59, P=NS). The reciprocal crosses of the F2 differ in their means in both females

(F1,283=4.54, P=0.0341) and males (F1,300=15.99, P < 0.0001); the variances do not differ (Females: F1,283=0.30, P=NS; Males: F1,300=0.86, P=NS). Highly significant differences occur among

the means of the backcrosses to the HIGH line (Females: F3,577=13.08, P < 0.0001. According to Tukey’s studentized range test differences exist between HL × H and H × LH, HL × H and H × HL,

LH × H and H × HL, and LH × H and H × LH; Males: F3,603=4.67, P=0.0031. Tukey’s studentized range test reveals a difference between LH × H and HL × H). No significant differences are found

among the variances (Females: F3,577=2.15, P=NS; Males: F3,603=0.58, P=NS). No differences occur among the means of the backcrosses to the LOW line (Females: F3,440=1.75, P=NS; Males:

F3,512=1.94, P=NS); their variances show little (Females: F3,440=2.94, P=0.0331) or no (Males: F3,512=0.52, P=NS) heterogeneity.

In both sexes only 4 out of 15 lines/line crosses (including the stock) show a significant linear relationship between eyespot diameter and forewing length. After ‘controlling’ for size

effects using the eyespot diameter/forewing length ratio (as in Brakefield et al., 1996), a significant linear relationship occurs in 6 lines/line crosses. Size effects appear to have little

consequences for estimates of nE, (however, see below).

Differences between reciprocal F1, F2, and backcross populations might be caused either by maternal effects or by sex-linked genes, or both. Taking into account the fact that in butterflies

females are the heterogametic sex and males the homogametic sex, the preceding section will now be reconsidered to see if there is evidence for maternal effects and/or sex-linked genes

affecting eyespot size. The sex chromosomes will be referred to as X and Y although Z (=X) and W (=Y) are frequently used for Lepidoptera. As female butterflies are XY, effects of loci on

the Y-chromosome are confounded with cytoplasmic effects; the term ‘cytoplasmic effect’ therefore covers both effects.

The males of the reciprocal F1 crosses have both an X-chromosome from the HIGH line and one from the LOW line, but they differ in the origin of their cytoplasm (females differ in the origin

of both their X-chromosome and their cytoplasm). Because no significant difference occurs between the F1s in males, a cytoplasmic effect seems unlikely. X-linkage would show up as a

difference between reciprocal crosses in F1 females and a resemblance between the females and their fathers. H × L females are indeed significantly smaller than L × H females. The magnitude

of this difference (0.155 ± 0.0351 mm) accounts for 9.3% of the difference between females of the parental lines (1.659 ± 0.0181 mm).

One half of the females of both F2 populations contain X-chromosomes from the HIGH line, the other half X-chromosomes from the LOW line. One F2 cross has ‘HIGH line cytoplasm’, however,

while the other F2 consists of ‘LOW line cytoplasm’. A difference was found, but the evidence is weak. The males of the two F2 populations differ in both their cytoplasm and their

X-chromosomes. Ignoring the small difference between the reciprocal crosses in females (i.e. assuming no cytoplasmic effect) the highly significant difference between the crosses (which is

in the predicted direction) in males might be a result of sex-linkage. The magnitude of this difference (0.123 ± 0.0306 mm) accounts for 9.3% of the difference between males of the parental

lines (1.318 ± 0.0249 mm). Note that in this latter case the 9.3% difference is actually caused by two X-chromosomes rather than one, but that only 50% of the individuals of both F2

populations differ in two X-chromosomes.

Two of the backcrosses provide an opportunity for assessing cytoplasmic effects in females. The female offspring of the cross between H × L females and LOW line males and between L × H

females and LOW line males both received their X-chromosome from their LOW line father, but they received different cytoplasms. The means of the two crosses do not differ, however, again

supporting the conclusion that cytoplasmic effects are absent. The same line of reasoning can be applied when considering the female offspring of crosses between L × H and H × L females and

HIGH line males. These crosses, too, do not differ significantly.

The basic statistics are given in Table 1. When the variances of the lines/line crosses are plotted against the means and the assumptions of the C–W method are met, this should give a

triangular pattern with F1 and backcross populations at the midpoints of the edges connecting the parental and F2 populations. Figure 1(a) shows substantial deviations from this pattern;

log-transforming the data removes some dependence of the variances on the means, but still does not yield the triangle (Fig. 1b).

The relationship between means and variances of eyespot size for lines/line crosses and the stock in males (–▴–) and females (–▪–) for (a) raw data (mm) and (b) log-transformed data. H, HIGH

line; L, LOW line; BH, backcross to the HIGH line; BL, backcross to the LOW line; S, unselected stock.

The females in the F1 and the males in the F2 show marked departures from normality; the females of the backcross to the LOW line differ only slightly from normality. Neither logarithmic

transformations nor Box-Cox transformations were able to remove these departures (data not shown). The variances of the backcrosses are significantly different in females (F1,1023=28.10, P <

0.0001), but not in males (F1,1121=2.83, P=NS).

Estimates of nE are given in Table 2. Estimates based on pooled raw data indicate that at 23°C at least 6–12 loci in females and 5–6 loci in males contribute to the difference in eyespot

size between the HIGH and the LOW line. Using a combined estimate of σ2s (Cockerham, 1986) yields estimates (SE) of 6.51 (1.01) and 5.73 (0.51) for females and males, respectively.

Dividing the diameter of the black ring by the length of the forewing gives estimates and associated standard errors that are comparable to those based on raw data (see also table 1 in

Brakefield et al. 1996; note that the values reported by Brakefield et al. are slightly different because the sampling variances of the means of the parental lines were not taken into

account). Another way of removing the effect of wing size is to use the residuals of a linear regression of eyespot diameter on forewing length. Both estimates and standard errors based on

these residuals are very close to the estimates based on uncorrected diameters. Box–Cox transformations (Males: λ=1.124; Females: λ=1.236) yield the smallest estimates.

The estimates of nE and their standard errors obtained from the bootstrap procedure are somewhat higher than the raw data values in males. In females the standard errors of estimates 1 and 3

are considerably higher than the raw data standard errors.

So far, estimates of nE have been obtained by computing the segregational variance as a linear function of the observed phenotypic variances within lines. However, the joint-scaling test

applied to variances (see below) yields least-square estimates of σ2s and its sampling variance that can be used to calculate n^E and its standard error. These estimates (SE) are 5.79 (0.84)

for females and 5.15 (0.50) for males.

Tables 3 and 4 shows the results of joint-scaling tests applied to the means of the line crosses. The additive model is clearly insufficient to explain the data in both males (χ24=26.69, P <

0.0001) and females (χ24=57.78, P < 0.0001). Including dominance still gives a poor fit (Males: χ23=15.71, P=0.0015; Females: (χ23=33.75, P < 0.0001), although the improvement is

considerable [Males: Λ= 10.98, Pr(χ21 ≥ 10.98)=0.0005; Females: Λ= 24.03, Pr(χ21 ≥ 24.03) < 0.0001].

The results of a joint-scaling test applied to the variances of the line crosses are given in Tables 5 and 6. Estimates (SE) of Var(P1), Var(P2), and the segregational variance Var(S) are

0.0396 (0.0029), 0.203 (0.0014), and 0.0421 (0.0037), respectively, for males, and 0.0673 (0.0064), 0.0241 (0.0079), and 0.0595 (0.0084) for females. The results give strong support to the

conclusion that the additive model should be rejected (Males: χ23=58.86, P < 0.0001; Females: χ23=347.01, P < 0.0001).

Using the C–W estimator we find that at least 5–14 loci in females and 5–9 loci in males contribute to the difference in mean eyespot size between the HIGH line and the LOW line at 23°C.

These numbers are equal to or smaller than the haploid number of chromosomes in B. anynana (n=13). The estimates and their standard errors are fairly stable across the different ways of

estimating nE in males. In females, however, especially estimates 1 and 3 and their standard errors are considerably higher than estimates 2 and 4. Given the assumptions of the C–W estimator

the figures in Table 2 may seriously underestimate nE, although the large sample sizes may reduce the consequences of these violations to some extent. Furthermore, Lynch & Walsh (1998)

emphasize ‘that each estimate only applies to the specific pair of parental lines and that substantial differences would be likely if other parental stocks were used’ (p. 238).

Even a brief inspection of Fig. 1 makes clear that additive and additive + dominance models are insufficient to explain the data. With an additive model we would expect the means of both the

F1 and F2 to lie halfway between the parental means. The mean of the F1 lies actually closer to the mean of the LOW line, thus suggesting partial dominance. (Note that the LOW line may be

truncated by the lower limit for the expression of eyespot size). The mean of the F2, however, lies closer to the mean of the HIGH line. This shift might be a result of the breakdown of

linkage disequilibrium or to differences in rearing conditions (e.g. temperature) because rearing of the F1 and F2 could not be performed simultaneously. A third explanation might be that

the F2 is more sensitive to temperature (i.e. shows more phenotypic plasticity) than either the parental lines or the F1. The parental lines (especially the LOW line) are less plastic than

the stock (cf fig. 6 in Brakefield et al., 1996), and the reaction norms of the F2 differ markedly from those of the stock (P.J. Wijngaarden and P.M. Brakefield, unpubl. obs.);

unfortunately, the reaction norms of the F1 have never been assessed.

Joint scaling tests confirm the insufficiency of additive and additive + dominance models. Including epistasis might give an adequate fit between model and data, but our number of lines did

not allow an evaluation of such a model. Despite this lack of additivity considerable amounts of additive genetic variance were available to create the difference between the HIGH and LOW

line. It is well known that dominance and epistasis can contribute to the additive genetic variance (Lynch & Walsh, 1998). Unfortunately, these different sources of additive variance are

hard to disentangle experimentally.

In Nijhout’s (1980) model of eyespot formation the individual scale cells that make up the eyespots produce only one pigment depending on the morphogen level relative to certain threshold

values in these cells. It should therefore not have come as a surprise that simple additive or additive + dominance models do not work. Transplant experiments showed that the response to

selection on the size of eyespots on the dorsal wing surfaces was mainly caused by changes in the activity of the focal cells and, to a lesser extent, by changes in the sensitivity of the

surrounding cells (Monteiro et al., 1994). Unless the sensitivities of these surrounding cells can be kept constant eyespot size will have an epistatic component. Similar results were

obtained by Nijhout & Paulsen (1997) when they explored the behaviour of a one-dimensional diffusion gradient and threshold model that might apply in many developmental systems (including

butterfly eyespots). The model contained six parameters, each controlled by a single locus with two alleles. Although the alleles were assumed to have additive effects, dominance and a

substantial amount of epistasis showed up as emergent properties.

The difference in mean eyespot size in the HIGH and LOW lines is the result of artificial selection, initially at a single intermediate temperature. A similar difference in phenotype can be

obtained by rearing larvae at different temperatures (i.e. through phenotypic plasticity). Investigations of the physiological basis of this environmentally induced variation in eyespot size

have shown that the dynamics of ecdysteroid hormones shortly after pupation play an important role in the response to rearing temperature (see Koch et al., 1996; Brakefield et al., 1998).

In addition, individuals carrying the mutant Bigeye allele show substantially larger ventral eyespots than wild-type individuals at any particular temperature, an effect that probably

involves the response to focal signalling (Brakefield et al., 1996). It seems likely that alleles of genes, which influence both of these classes of change in eyespot size, will contribute

to the difference between the selected lines analysed in this report. A challenge for future research is to map and identify at least some of these genes using both a candidate gene approach

for developmental genes such as those of the hedgehog signalling pathway (Carroll et al., 1994; Brakefield et al., 1996; Keys et al., 1999), and for genes involved in the hormonal

regulation of phenotypic plasticity in eyespot size (Brakefield et al., 1998).

We found no evidence for cytoplasmic effects (i.e. maternal effects or effects of loci on the Y-chromosome). Differences between reciprocal crosses were found that are consistent with

effects of one or more X-linked loci. In both cases where X-linkage might become apparent the differences between the reciprocal crosses accounted for 9.3% of the difference between the

parental lines. That these differences account for the same percentage is consistent with the hypothesis that dosage compensation is absent in butterflies (Johnson & Turner, 1979) because

they are caused by different numbers of X-chromosomes. X-linkage is common for quantitative traits in Lepidoptera and is thought to play an important role in species differences (Sperling,

1994). Its implications for B. anynana are difficult to assess as long as we do not know to what extent differences in eyespot size can be attributed to differences in source activity and

differences in threshold values.

We are grateful to Russ Lande for his comments on an earlier version of the manuscript.

Present address: Laboratory of Genetics, Wageningen University, Dreijenlaan 2, NL-6703 HA, Wageningen, The Netherlands

Institute of Evolutionary and Ecological Sciences, Leiden University, PO Box 9516, RA Leiden, NL-2300, The Netherlands

Anyone you share the following link with will be able to read this content: