PLoS Genetics
Home Polygenic adaptation of rosette growth in Arabidopsis thaliana
Polygenic adaptation of rosette growth in <i>Arabidopsis thaliana</i>
Polygenic adaptation of rosette growth in Arabidopsis thaliana

The authors have declared that no competing interests exist.

Article Type: research-article Article History
Abstract

The rate at which plants grow is a major functional trait in plant ecology. However, little is known about its evolution in natural populations. Here, we investigate evolutionary and environmental factors shaping variation in the growth rate of Arabidopsis thaliana. We used plant diameter as a proxy to monitor plant growth over time in environments that mimicked latitudinal differences in the intensity of natural light radiation, across a set of 278 genotypes sampled within four broad regions, including an outgroup set of genotypes from China. A field experiment conducted under natural conditions confirmed the ecological relevance of the observed variation. All genotypes markedly expanded their rosette diameter when the light supply was decreased, demonstrating that environmental plasticity is a predominant source of variation to adapt plant size to prevailing light conditions. Yet, we detected significant levels of genetic variation both in growth rate and growth plasticity. Genome-wide association studies revealed that only 2 single nucleotide polymorphisms associate with genetic variation for growth above Bonferroni confidence levels. However, marginally associated variants were significantly enriched among genes with an annotated role in growth and stress reactions. Polygenic scores computed from marginally associated variants confirmed the polygenic basis of growth variation. For both light regimes, phenotypic divergence between the most distantly related population (China) and the various regions in Europe is smaller than the variation observed within Europe, indicating that the evolution of growth rate is likely to be constrained by stabilizing selection. We observed that Spanish genotypes, however, reach a significantly larger size than Northern European genotypes. Tests of adaptive divergence and analysis of the individual burden of deleterious mutations reveal that adaptive processes have played a more important role in shaping regional differences in rosette growth than maladaptive evolution.

The rate at which plants grow is a major functional trait in plant ecology. However, little is known about its genetic variation in natural populations. Here, we investigate genetic and environmental factors shaping variation in the growth rate of Arabidopsis thaliana and ask whether genetic variation in plant growth contributes to adaptation to local environmental conditions. We grew plants under two light regimes that mimic latitudinal differences in the intensity of natural light radiation, and measured plant diameter as it grew over time. When the light supply was decreased, plant diameter grew more slowly but reached a markedly larger final size, confirming that plants can adjust their growth to prevailing light conditions. Yet, we also detected significant levels of genetic variation both in growth rate and in how the growth dynamics is adjusted to the light conditions. We show that this variation is encoded by many loci of small effect that are hard to locate in the genome but overall significantly enriched among genes associated with growth and stress reactions. We further observe that Spanish genotypes tended to reach, on average, a significantly larger rosette size than Northern European genotypes. Tests of adaptive divergence indicate that these differences may reflect adaptation to local environmental conditions.

Wieters,Steige,He,Koch,Ramos-Onsins,Gu,Guo,Sunyaev,de Meaux,and Nordborg: Polygenic adaptation of rosette growth in Arabidopsis thaliana

Introduction

Growth rate is a crucial component of individual fitness, as it reflects the capacity of the organism to acquire resources and conditions reproductive output [1,2]. In experimental evolutionary studies, relative growth rate provides a measure of microbial adaptation in response to selection [3]. In plants, however, little is known about the evolutionary processes that influence variation in plant growth rate, despite its cornerstone importance in plant ecology [46].

Four processes may explain variation in growth rate: random evolution due to drift, plasticity, adaptation or maladaptation. Plasticity describes the immediate adjustment of plant growth rate in response to environmental modifications [7]. Such change may occur as a passive consequence of resource limitations. Plant growth, for example, is slower in drought conditions or at lower temperatures [8,9]. Plastic adjustments of plant growth, however, can also actively contribute to maintaining fitness under challenging conditions. For example, shade avoidance allows plants to outgrow neighbors competing for light [10]. Such reactions may allow the organism to maintain high fitness when the environment becomes challenging, without having to evolve genetically [11].

As the distribution range of a species expands, plastic modifications may become insufficient to adjust fitness, and genetic variation may be required for local adaptation [7,12]. There is clear evidence that genetic variation in plastic life history traits such as flowering time or seed dormancy contributes to the evolution of life-history decisions that are tailored to the local optimal growth season [1316]. Surprisingly, the extent to which genetic variation in plant growth rate itself contributes to local adaptation is not known. Answering this question requires that the effect of natural selection on phenotypic divergence be disentangled from the effect of drift [17].

Genetic variation in growth rate may also arise in the absence of a compelling environmental change, as a consequence of population genetics processes. In bottlenecked populations, or in the aftermath of rapid range expansion, increased drift hampers the efficient removal of deleterious mutations, and individuals may become less fit [1822]. Because plant growth is a component of fitness, genotypes carrying a larger burden of deleterious mutations may show decreased growth. Genetic variation in growth rates may thus also reflect maladaptation resulting from decreased population size.

The annual species A. thaliana has become a model system for both molecular and evolutionary biology, and it is well suited for determining the ecological and evolutionary significance of plant growth rates [23,24]. A. thaliana individuals can adjust their growth rate plastically to maintain their fitness. Plant rosettes grow to a larger diameter when light becomes limited [10,25]. Ample genetic variation in plant growth rates has also been documented in this species [2628]. In addition, there is evidence that the resources allocated to growth are not identical throughout the species’ range, because trade-offs between growth rate and development change with latitude (reviewed in [12,29,30]). Furthermore, traits related to how resources are allocated to growth, such as growth inhibition upon the activation of plant defense, or plant dwarfism, have been associated with adaptation [3135]. In summary, adaptive variation in the rate of plant growth may have evolved in A. thaliana. At the same time, the maladaptive or neutral evolution of a decreased growth rate cannot be excluded a priori. Indeed, A. thaliana has experienced recent severe bottlenecks in parts of its range, such as in Northern European or Chinese populations, which locally increased the rate of genetic drift and led to an accumulation of deleterious genetic variants [3638]. Neutral evolutionary forces could therefore also have modified growth rate in these populations.

To determine the roles of deleterious variation, adaptive evolution and/or plasticity in the genetic variation among plant growth rates, we analyzed variation among rosette growth rates across genotypes sampled from four broad regions (China, Spain, Northern and Western Europe). To assess the relative roles of genetic and plastic variation, we grew plants under two light regimes that mimicked constitutive latitudinal differences in natural light intensity and characterized genetic variation in growth plasticity. This analysis reveals significant regional differences in growth dynamics, most of which have a polygenic basis. Population genetics analyses indicate that local selective pressures have helped shape this variation.

Materials and methods

Phenotypic analysis and estimation of growth rate parameters

We chose 278 genotypes of Arabidopsis thaliana originating from 220 locations distributed throughout 4 regions for phenotypic analyses of growth rate variation (Northern Europe, Western Europe, Spain and Central-Eastern China, S1 Table and S1 Fig). A PCA confirmed that genotypes within these regions formed distinct phylogeographic clusters (S2 Fig), whose specific evolutionary history has been previously documented [13,3739].

Seeds were stratified for 3 days at 4°C in the dark on wet paper, and six replicate seedlings per genotype were replanted, each in one 6x6 cm round pots containing soil (“Classic” from Einheitserde) mixed with perlite. Growth was measured in a split-plot design, under two light regimes, high light (HL) and low light (LL) in the same chamber but in successive independent trials. Plants were grown in a temperature-controlled walk-in growth chamber (Dixell, Germany) set at 20°C day and 18°C night, and watered once a week. For each light regime, pots were randomized within three blocks of 8 trays with 7x5 pots, with one replicate of each genotype in each block. Trays were randomized and the rows in the trays were rotated every two to three days to account for variability within the chamber. The plants were exposed to light for 12 h with LEDs (LED Modul III DR-B-W-FR lights by dhlicht) set to 100% intensity of blue (440nm), red (660nm) and white (HL conditions) or 30% of red and blue plus 100% of white light (LL conditions), followed by a 10 min far-red light pulse to simulate sunset (40% intensity at 735nm). The total measured light intensity was 224 +/- 10 μmol/m2s in HL and 95 +/- 7 in LL. These two light regime mimick latitudinal differences in natural light intensity (S3 Fig).

Individual plants were photographed approximately bi-weekly with a Canon EOS 5D Mark III digital camera until days 46 (8 weeks) and 89 (13 weeks), for those grown under the HL and LL regimes, respectively (image data is available on dryad, doi:10.5061/dryad.s1rn8pk5m) [40]. We only measured diameter for one time point per week, but included additional measurements if it was necessary to fit the logistic curves. Flowering time was measured as days to first flower opening. For genotypes without a flowering individual by the end of the experiment, a flowering time value of 59 or 90 days (last date that flowering was scored) was assigned to HL and LL plants, respectively. Since only 37% of the plants in the experiment flowered, we also used flowering time data from the 1001 Genomes project, according to which flowering was scored at 10 and 16°C for 177 genotypes [39]. A measure of the diameter of each plant (defined as the longest distance between two leaves) was extracted at least once a week with ImageJ (v.1.50b, [41]). In a preliminary experiment conducted on a subset of 17 genotypes, we used Rosette Tracker, an ImageJ tool [42], to show that diameter correlated positively with rosette area under both light regimes (HL r = 0.83, p<3.2e-5, LL r = 0.56, p<0.0186). We confirmed that plant diameter accurately predicts rosette area on this larger data set (r = 0.929, p = <2.2e-16 in HL). Rosette diameter was therefore used to determine the increase in rosette area over time.

We conducted two additional experiments to test the ecological relevance of rosette growth variation measured under controlled conditions. First, all genotypes were grown under HL conditions, in 5 replicates. In this experiment, instead of rosette diameter, we measured hypocotyl length after 15 days, to quantify variation in seedling growth. We further weighted 3-week old plants with a precision balance (Sartorius AC 210 P with accuracy of 0.1 mg) to quantify variation in plant biomass. We also followed a similar experimental design to measure the diameter of plants grown outdoor in 2 replicates in the field of the Cologne Institute of Plant Sciences. Sand was used instead of soil in 9 cm diameter pots. Seeds were sown in September 2016, which corresponds to the native season in the area and put outside after a week.

Statistical analysis of genetic variance

All following analyses were conducted using R (version 3.6.3) [43], and function names refer to those in the R package mentioned unless otherwise noted. We provide an Rmarkdown script detailing the statistical analysis of phenotypic variation (S1 File).

The split-plot design of our study allowed us to conduct the analysis in successive steps. First, we extracted three parameters that together provided a comprehensive description of individual rosette growth. For this, rosette diameter measurements over time (our input phenotype) were modeled as a three-parameter logistic growth using the drm function from the drc package in R [44]. The three following growth parameters were extracted: final size (FS, largest estimated rosette diameter in cm), slope (factor of magnification in the linear phase) and t50 (inflection point; time at which growth is maximum and half of FS has been reached, which quantifies the duration of the rosette area growth phase in number of days). We show examples for the estimation of growth rate in S4 Fig For each parameter, a genotypic mean correcting for block, tray and position effect was computed with a generalized linear model with a Gaussian error distribution and the following model: parameter~accession+block+tray/(row+col)+error. Genotypic means in HL and LL were extracted separately, because light treatments had to be performed in separate trials. To quantify the plasticity of growth to the light regime of each genotype, we correlated genotypic means in LL against HL and extracted the residuals. GxE estimates thus quantify the deviation of the response of a given genotype from the mean response of all genotypes for the respective parameter. The estimate increases with the magnitude of growth plasticity induced by a decrease in light intensity (S5 Fig). Broad-sense heritability (H2) was determined for each trait in each environment as previously reported [45]. Briefly, genetic and environmental variances were estimated using the lme function from the nlme package [46], with the block as fixed and genotype as random effect and heritability was determined as the ratio of genetic variance over the total variance. The heritability of GxE was not estimated, because we quantified plasticity on the basis of changes in genotypic means in the two light conditions. For this reason, we also computed trait pseudo-heritability, which is based on the genome-wide association study (GWAS) mixed model (see below) and allowed us to estimate the proportion of the observed phenotypic variance that is explained by genotypic relatedness for all traits [47].

To assess the correlation of phenotypic traits with climatic variables, we investigated solar radiation estimates, temperature, precipitation, humidity and wind speed with 2.5-min grid resolution (WorldClim2 database, [48], accessed on March 20, 2018) and soil water content [49]. Following [45], we estimated the mean over the putative growth season for each genotype in addition to the annual averages.

Because of the strong correlations between climatic variables, we conducted principal component analyses (PCAs) to combine the data. We analyzed annual average radiation separately and combined the other variables into the PCAs: growing season data, variables related to precipitation and to temperature. Raw climatic data and the principle components (PCs) are in S1 Table and the loadings of the PCA are in S11 Table.

Regional differences in mean growth parameters were tested with a multivariate analysis, using the manova function, the matrix of growth parameters (genetic means) or plasticity and the following model: growth~ population*light regime. Significance levels were determined by the Pillai test.

For univariate analysis, we used GLMs to test the effect of population of origin on the genotypic means. A Gaussian distribution was taken for error distribution, and the dispersion parameter was estimated by the glm function. Group means were compared with the glht function (which performs general linear hypothesis testing) and plotted on boxplots using the cld function, both of the multcomp package [50].

Pairwise trait correlations within and across populations were calculated with the cor.test function (Pearson’s product-moment correlation), and p-values were established using the lmekin function in R, which includes a kinship matrix of individuals (see below) and thus corrects for population structure (after [29]). We used the corrplot function from the corrplot package to plot correlations [51]. Plots were modified using inkscape version 0.92.3 (inkscape.org, [52]). Significance levels were adjusted for false-discovery rates with the function p.adjust.

Genome-wide association studies

Genomic data were available for 231 of the 278 genotypes included in the phenotypic analysis, i.e. for 84 genotypes from Northern Europe (NE, predominantly Sweden), 3 from Western Europe (WE), 119 from Spain (SP) and 15 from China (CH) [38,39]⁠. Chinese genotypes were excluded from the GWAS because of their limited number (15 genotypes) and their strong genetic divergence ([38], S2 Fig). In total, the growth parameters of 203 and 201 genotypes grown under HL and LL, respectively, were used for the GWAS. Genome-wide association studies were conducted using the method from [53]. The corresponding GWAS package was downloaded from https://github.com/arthurkorte/GWAS. Single Nucleotide Polymorphisms (SNPs) with minor allele frequency below 5% or with more than 5% missing data were removed from the genotype matrix, resulting in a matrix of 1,448,192 SNPs, produced with vcftools (—012 recode option) (version 0.1.15, [54]). A kinship matrix was computed with the emma.kinship function. For each growth parameter, genotypic means were used as phenotype measurements. We performed GWAS across the European sample of genotypes but also within region (Spain and Northern Europe). For each SNP, the script output delivered p-values, which were Bonferroni corrected for multiple testing, and effect sizes. For each trait, we estimated pseudo-heritability, the proportion of the observed phenotypic variance that is explained by the estimated relatedness (e.g. kinship matrix, [47]). To identify candidate genes underpinning significant GWAS associations, we calculated the linkage disequilibrium (LD) around the SNP of interest and selected all genes that were in a genomic window with LD above 0.5 within a 250 kb window around the SNP. The LD was calculated as the Pearson correlation between the frequencies of allele pairs. Additionally, we downloaded an annotation of loss-of-function (LOF) variants [55] and performed a GWAS association following the procedure described above except that the SNP data set was replaced with the LOF data set, which assigned one of two states (functional or LOF), for each genotype and each of the 2500 genes with known LOF alleles.

Validation of the polygenic signal

In humans, where population structure and environmental variation are correlated, insufficient correction of the genetic associations caused by shared ancestry has been shown to create spurious associations [5658]. Even though environmental variance is much better controlled in common garden experiments including kinship as a covariate, association tests can still be confounded by genetic relatedness [57]. This is of particular concern when many trait/SNP associations are below the Bonferroni significance threshold. The rate of false positive was not excessively inflated by genomic differentiation between regions, because GWAS performed within regions (Northern Europe or Spain) had similar p-value distributions than GWAS performed on the complete phenotypic dataset (S6 and S7 Figs). We nevertheless used two additional approaches to confirm the polygenic basis of traits. First, we examined whether phenotypic variation could be predicted by polygenic scores derived from sub-significant GWAS hits, with p<10−4. To this end and for each trait, we calculated genotypic means, performed GWAS, as described above, and computed polygenic scores following [53]. SNPs with a significant association with the phenotype were pruned to remove SNPs standing in strong linkage disequilibrium with plink version 1.90 [59], following [56]. The plink -clump function was set to select SNPs below a (GWAS) P- value threshold of 0.0001, start clumps around these index SNPs in windows of 1 Mb, and remove all SNPs with P < 0.01 that are in LD with the index SNPs. The SNP with the lowest p-value in a clump was retained for further analysis. Briefly, input files, including allele frequencies for all SNPs, all SNPs with GWAS p-values lower than 10−4 and their effect size estimates were created with a custom R script. We defined each genotype as its own population (genotypes from Spain and Northern Europe were grouped in regions). Scripts were downloaded from https://github.com/jjberg2/PolygenicAdaptationCode. The pipeline was run with default parameters, and polygenic scores (Z-scores) were estimated. We used three approaches to validate the relevance of GWAS associations for predicting the phenotype. First, we used 80% of the genotypes and used the resulting GWAS association to compute a polygenic score for the remaining genotypes. Second, we took two replicates to compute the polygenic scores, and tested whether it predicted the phenotype of the third replicate. Third, we correlated the phenotypic values predicted by polygenic scores (calculated this time on the basis of all three replicates) with the observed phenotypic value. We repeated this replacing the set of SNP associated at p<10−4 with 1000 sets of an equal number of randomly chosen SNPs. We then compared the correlation of polygenic scores to the input phenotype for SNPs associated at subsignificant level (p<10–4) to the correlation expected for random sets of SNPs. Correlations were calculated with the R function cor.test (Pearson’s product-moment correlation).

In addition, we investigated functional enrichment among genes within 10kb of GWAS associated SNPs. To assign a single GWAS p-value for each gene, either we assigned for each trait the lowest p-value of SNPs within the gene, or, if no SNP was within the gene, we assigned the p-value from the physically closest SNP [47]. When there were GWAS hits in the vicinity of duplicated genes, we removed tandem duplicated genes within a 10-gene sliding window. For this, we first aligned all TAIR10 genes against each other by using BLAST (version 2.9.0, available at https://blast.ncbi.nlm.nih.gov). Then the duplicated genes were selected as genes with an e-value <1e-30. Finally, tandem duplicated genes identified with gene distance <10-genes were filtered out to avoid inflated functional enrichments. If the polygenic signal only due to insufficient correction for population structure, we expect that similar functions will be enriched among population structure outliers and among genes with low GWAS p-values. We thus computed Fst-values for each gene with the F_ST.stats function of the PopGenome library [60]tween Spain and Northern Europe. Negative Fst values were set to zero.

Enrichments were tested as previously described [61]. To call GO enrichment significant, we determined the conservative threshold p = 0.008. This threshold was determined as the 0.01% quantile of the p-value distribution when GO enrichments were tested for 1000 random sets of the same number of SNP. To assess similarity between traits in Gene Ontology (GO) enrichments, we calculated graph-based similarity with the GOSemSim package [62]. A distance matrix was estimated with average connectivity between the GO terms. The clustered GO categories were then plotted as a dendrogram with the plot.phylo function from the ape package (version 5.3, [63]). GO categories enriched at p-value below 0.001 were highlighted. The distribution of enriched GO categories was evaluated by visual inspection.

Testing for adaptation or maladaptation

For population genetics analyses, we sampled one genotype at random whenever plants were sampled in the same location, acquiring a total of 220 genotypes. As a proxy for the genomic load imposed by deleterious mutations, the number of derived non-synonymous mutations per haploid genome has been proposed [64]. This approach was not possible here because the genomes of individuals from China and Europe were sequenced in different labs, and the depth and quality of sequencing varied too much to make a fair comparison. Instead, we used two data sets that together catalogued LOF alleles after controlling as much as possible for heterogeneity in sequencing quality: one that included European genotypes [55] and a more recent data set that included Chinese genotypes [65]. As an estimate of the individual burden of deleterious mutations, we counted the number of LOF alleles for each individual and tested whether individuals with a larger number tended to have a lower growth rate using the Spearman rank correlation.

To search for footprints of adaptive evolution, we computed an Fst value between Spain and Northern Europe for each SNP in the GWAS analysis using the R-package hierfstat and the basic.stats function [66]. Negative Fst values were set to zero, and the quantile function was used to calculate the 95th percentile. The Fst distribution of SNPs associated with any GWAS (p<10e-4) was compared to the genome-wide distribution with a Kolmogorov-Smirnov test. We also computed the likelihood that its 95th percentile was greater than the 95th percentile of 10 000 random samples of an equally large set of SNPs. To compare the phenotypic differentiation of traits, Qst values for the phenotypic traits were estimated as previously described [45]. Briefly, Qst was estimated as VarB / (VarW + VarB), where VarW is the genotypic variance within and VarB between regions. These variances were estimated with the lme function of the nlme package [46], with the block as fixed and population/genotype as random effect. We extracted the intercept variance for VarB and the residual variance for VarW. Since replicates were taken from the selfed progeny of each genotype, VarB and VarW are broad-sense genetic variance components. To reveal signatures of local adaptation, the Qst of each trait was compared to the 95th percentile of the Fst distribution (between Spain and Northern Europe) [67,68]. We verified that outlier Qst values were unlikely to arise randomly. For this, we permuted phenotypic data by randomizing genotype labels and verified that the difference between observed Qst and 95th percentile of Fst was significantly greater than for randomized Qst, following [45]. In a second approach, we used a multivariate normal distribution to generate phenotypic divergence based on the kinship matrix to generate an expected Qst distribution [69]. Finally, we applied the over-dispersion test (Qx test), which compares polygenic scores computed for associated versus random SNPs (null model), in a process similar to a Qst/Fst comparison, but assuming that each population is composed of the selfing progeny of one genotype [53]. A Qx significantly larger than the Qx computed for the null model indicates that polygenic trait prediction is more differentiated than expected from the kinship matrix and can be taken as an indication that the trait has evolved under divergent selection, either within or between regions [53].

Results & discussion

Ecological relevance of rosette growth variation

On the basis of the more than 15,000 rosette images we collected, we used rosette diameter as a proxy to describe rosette growth variation with three parameters; each refers to the ways in which growth can differ among genotypes: i) the time until the exponential growth phase is reached (t50), ii) the speed of growth during the linear growth phase (slope) and iii) the final size (FS) at which rosette diameter plateaus at the end of the rosette growth phase (Fig 1 and S2 and S3 Tables). Of the parameters, FS displayed the highest broad-sense heritability, in plants grown under both regimes: high light (HL, H2 = 0.636) and low light (LL, H2 = 0.794, S4 Table). Trait variation measured in controlled settings sometimes fails to reflect variation expressed in natural conditions [70,71]. This is not the case for rosette growth variation in A. thaliana. FS in HL conditions correlated positively with plant biomass (r = 0.267, p-value = 4.6e-5) and seedling growth (r = 0.372, p-value = 9.1e-7) in the growth chamber. FS measured in HL also correlated positively with plant diameter measured under natural light in the field (r = 0.263, p-value = 0.0009). This indicates that a significant part of the variation we report is ecologically relevant.

Regional growth rate estimates in HL and LL.
Fig 1

Regional growth rate estimates in HL and LL.

Predicted growth curves averaged over region (from drm function). The growth curves were estimated from diameter measurements at different time points. Diameter measurements for HL are from day 11 to 46 and for LL from day 24 to 89. An illustration of the parameters that are estimated from these growth curves are included in the plot (Final Size is a diameter, t50 a time point and Slope the fold increase in the linear phase). HL (dashed line), LL (solid line), China (orange), Northern Europe (green), Spain (purple) and Western Europe (red).

Environmental plasticity has the strongest impact on plant growth variation

Light regimes revealed that plasticity has the strongest impact on rosette growth (Fig 1, MANOVA HL vs LL: F = 2275.37, df = 1, p-value = <2.2e-16, Table 1). In plants grown under LL, the maximum growth rate was delayed and rosette growth plateaued at a larger size (Table 1 and Fig 2). This observation was in agreement with the reduced relative growth rate reported in many plant species when light supply decreases, whereas the larger FS reflected the expected shade avoidance reaction [72]. We observed that plants reached a larger diameter (and rosette area) by elongating their petiole and minimizing leaf blade overlap in LL, a reaction known as the shade avoidance response. This strong modification of leaf shape may explain the predominant impact of environmental variation we report here (Table 1). Nevertheless, we detect significant levels of genetic variation in growth plasticity to light (F = 2.0, df = 270, p<2.2e-16). We quantified growth plasticity as the individual deviation of the genotypic mean of each genotype in HL and LL from the average reaction of the population to the change in light regime (S8 Fig).

Significant regional differentiation of Final Size and t50 in HL and LL.
Fig 2

Significant regional differentiation of Final Size and t50 in HL and LL.

A.thaliana genotypes are grouped based on geographical origin. Box plots show regional variation in Final Size (upper row) and t50 (lower row) for HL (left) and LL (right). Groups that do not share a letter are significantly different according to Tukey’s HSD (p-value < 0.05). Region information: China (CH, n = 20), Northern Europe (NE, n = 58), Spain (SP, n = 119) & Western Europe (WE, n = 29).

Table 1
The multivariate analysis was conducted on the estimates of FS, t50 and slope for all 270 genotypes in three replicates and accounting for block effects nested within light treatment.
Multi- and uni-variate analyses of growth variation in response to light regime, genotype and their interaction.
Multivariate analysis (MANOVA)Final Sizet50Slope
ResponsedfFp-valueFp-valueFp-valueFp-value
Block427.5< 2.2E-1630.15< 2.2E-1612.249.8e-1030.50< 2.2E-16
Light regime17388.7< 2.2E-1615687.23< 2.2E-169289.44< 2.2E-1634.017.2e-9
Genotype2793.6< 2.2E-168.43< 2.2E-162.51< 2.2E-162.036< 2.2E-16
Light*Genotype2702.0< 2.2E-162.97< 2.2E-161.592.3e-071.614.4e-08

Spanish genotypes show the most vigorous rosette growth

We found evidence for rosette growth variation across regions (MANOVA in Table 1 and Figs 1 and 2). Within Europe, Spanish genotypes reached the largest FS in both HL and LL plants (Tables 1 and S5, MANOVA: F = 16.37, df = 3, p-value = 5.35e-10). Although the growth slope did not differ significantly across regions, we observed that, under HL conditions, Spanish genotypes reached 50% of their FS (= t50) significantly later than the genotypes originating from Northern Europe (t50 = 15.17 vs 13.79, respectively, GLHT z = 3.061, p-value = 0.011, Fig 2C). This effect was also observed for plants grown under LL conditions but we detected no regional difference in GxE (Fig 2D). Since Spain and Northern Europe do not differ in their average flowering time (S9 and S10 Figs), the larger rosette size observed in Spain is not due to an extension of the duration of vegetative growth in this population.

Chinese genotypes show that growth rate variation is constrained in evolution

Despite a long history of population isolation that was magnified by a strong bottleneck after the last glacial period [38,73], the growth rate of Chinese genotypes was comparable to that shown by most European genotypes (S5 Table and Fig 2A and 2B). Under LL conditions, Chinese genotypes showed lower t50 and FS values only when compared to Spanish genotypes (S5 Table and Fig 2A–2D). Under HL, genotypes from China did not differ significantly from those from any other region (Figs 2 and S11). The analysis of Chinese genotypes indicates that the phenotypic evolution of rosette growth does not scale with the extent of genetic divergence (Fst between Europe and China is 0.057 on average, with a standard deviation of 0.147, and much greater than Fst between Spain and Northern Europe, KS test, D = 0.39, p<2.2e-16). A parsimonious explanation to the fact that growth rate has not significantly changed despite extensive population divergence, is that the evolution of growth rate is likely to be constrained by stabilizing selection around a growth optimum [1].

The Chinese population was also the only one to show a difference in GxE (S8 Fig). Compared to Spanish genotypes, Chinese genotypes displayed a GxE that was lower for t50 and higher for slope (t50: GLHT z = 2.748, p-value = 0.028; slope: GLHT z = -3.224, p-value = 0.006; S8 Fig). When grown under the LL regime, these genotypes displayed a lower FS than genotypes from Spain. In contrast, within Europe, we observed no significant difference in the growth plasticity of plants in relation to light regime, despite the fact that Northern populations are exposed to lower average light intensity (S3 Fig).

GWAS reveal only two SNPs significantly associated with rosette growth variation

We used GWAS to determine the genetic basis of variation in growth rate within Europe (Figs 3 and S12S17). The sample size (15) and strong population structure of Chinese genotypes precluded their inclusion in this analysis (S2 Fig). Henceforth, we focused on the analysis of genetic variation within and among European populations. Overall, we found few significant genetic associations, indicating that genetic variance for growth rate is generally polygenic. One SNP (chromosome 1, position 24783843) associated with t50 variations in LL plants (effect size = -2.475, p-value = 2.6E-9, Fig 3 and S6 Table). A second SNP (chromosome 3, position 951043) was significantly associated with the slope of rosette diameter growth in HL plants within Spain (effect size = 1.229, p-value = 8.4E-7, Fig 3 and S6 Table) and was polymorphic only in the Spanish set of genotypes. This SNP was within a 1Mb DNA fragment showing strong local LD and enclosing 21 genes. Two additional SNPs were associated with GxE for FS and t50 in HL plants in Northern Europe, respectively, with p-values just below the Bonferroni threshold (S6 Table). Yet, we found no SNP significantly associated with FS above the Bonferroni threshold, although FS is the most heritable trait (S4 Table). Diverse genetic setups can result in such polygenic architecture: large effect size variants that are too rare to be detected, many variants with effect sizes too small to be individually significant, or the presence of multiple alleles at causal loci that will blur the genetic association signal [47,74]. Local genetic variation in slope and t50, growth parameters which display moderate but significant genetic variance, appear to be controlled by low-frequency variants of comparatively larger effect, since some of them were associated above Bonferroni threshold (S6 Table). This genetic architecture resembles that reported in the same species for flowering time [75,76]. In contrast to slope and t50, variation in FS appeared more polygenic since it has the highest heritability and no SNP association above Bonferroni confidence levels.

GWAS-results for 4 phenotypes.
Fig 3

GWAS-results for 4 phenotypes.

Manhattan plots of GWAS of t50 in LL with all European genotypes (a), with a peak on Chromosome 1, GxE of Final Size with all European genotypes (b) with a peak on Chromosome 2, Slope in HL within Spain (c) with a peak on Chromsome 3 and t50 in HL in Northern Europe (d) with a peak on Chromosome 1. The dotted line shows the corresponding Bonferroni threshold adjusted for a p-value of 0.05.

Polygenic scores and functional enrichments confirm the polygenic basis of growth variation

Traits with polygenic architecture are controlled by variation in many loci of low frequency and/or low effect sizes and dissecting their evolution is arguably a major challenge today in evolutionary biology [7780]. Specifically, random SNPs with outlier frequency are not always sufficiently corrected for with the kinship matrix and these may give rise to spurious associations. Studies of polygenic traits such as human height have shown that residual effects of population structure can give signals of genetic association [56,81]. Similar effects were also encountered in studies of phenotypic variation in plant systems [57]. They are expected whenever environmental variance co-varies with population structure, as is likely the case in human studies, but can also persist in common garden studies if populations are geographically differentiated in the genetic component of the trait. To confirm the polygenic basis of growth variation, we evaluated the biological relevance of marginally significant genetic associations. The associated sets were composed of 22 to 37 unlinked SNPs. We used their effect sizes to compute polygenic scores for each parameter [53]. We first used to use 80% of the data to identify SNPs associating with rosette growth and test whether they can be used to correctly predict the phenotype of the remaining 20% of the data. This approach, however, did not yield significant predictions (rho = 0.07979094, p = 0.6189), which is not surprising because it usually does not perform well in structured populations [82]. We took a second approach to measure polygenic score accuracy. We used two of the three replicates to compute polygenic scores and tested whether they correlated significantly with the phenotype measured independently in the third replicate (S7A Table). The correlation was highest for FS measured in LL plants (Rho = 0.567, p = <2.2e-16). In fact, FS, the most heritable trait, could be predicted with the highest accuracy in plants grown under both light regimes (S7A Table). When we used random sets of SNPs as input, the computed polygenic scores were significantly correlated with the observed phenotype, indicating that population structure contributes to a significant but small fraction of the variance in polygenic scores. Nevertheless, with this third approach, we showed that polygenic scores computed on the effect sizes of SNPs associated at sub-significant level were markedly more correlated with the observed phenotype than those computed with random SNP sets (S7B Table). This confirms that sub-significant genetic associations, despite their marginal significance, effectively recapitulate some of the traits’ heritability.

We further asked whether sub-significant associations could collectively reveal the specific molecular basis of each trait. We selected SNPs showing a sub-significant association (p<0.0001) and investigated functional enrichment among genes that mapped within 10kb of the SNP. To consolidate our confidence in the functional enrichment, we also pruned tandem duplicates from the annotated set, and determined a p-value threshold that was below the level of significance that can be obtained with GWAS on a permuted data set (see Materials and Methods). While the results reveal many categories without an easily interpretable link to growth, many traits showed functional enrichment within gene ontology (GO) categories, whose link to growth has been documented (S9 Table). For example, genes associated with variation in FS, the most polygenic trait, were enriched among genes involved in the growth-related functions “cotyledon development”, “auxin polar transport” and “response to mechanical stimulus”(p-value = 0.0053 or lower). Interestingly, mechanical stimuli have been shown to strongly influence seedling growth, and we observed that FS correlated with hypocotyl length and biomass in 3-week-old plants (S18 Fig, [82]). Additionally, several categories related to defense and stress reactions, such as “response to salt stress”, “response to chitin”, “regulation of defense response to fungus” and “negative regulation of defense response”, were enriched. Variation in stress-related functions is known to have an impact on plant growth in A. thaliana [33]. Furthermore, we also found that SNPs associated with FS plasticity to light are enriched among genes involved in the shade avoidance response (p = 0.0023), by which plants exposed to limited light conditions increase stem elongation [10,83]. Associated genomic regions included, for example, PHY RAPIDLY REGULATED 2 (PAR2, AT3G58850), a negative regulator of shade avoidance [84] or LONG HYPOCOTYL UNDER SHADE (BBX21, AT1G75540), a regulator of de-etiolation and shade avoidance [85]. Altogether, functional enrichments among genes located in the vicinity of GWAS hits indicated that a biological signal is detectable among sub-significant genetic associations.

As shown above, population structure impacts the results of GWAS and population structure outliers may drive this signal of association. Indeed, genes with elevated Fst reflecting population structure or even regional adaptation of (other) traits could create spurious associations with traits that have a distinct genetic basis but are also differentiated between regions. We thus verified that functional enrichment among genes with SNP associations were different from those observed among genes with elevated Fst. We determined enriched GO categories among genes in the vicinity of GWAS associated loci (p<10–4) and among genes ranked by Fst between Spain and Northern Europe. We visualized overlaps in functional enrichment by clustering GO terms on the basis of the genes they shared (S19 Fig). The enrichment based on Fst revealed three strongly enriched GO terms: “organ morphogenesis”, “circadian rhythm” and “virus-induced gene silencing” (p = 0.0009 or lower, S10 Table). The enrichment in GO category “circadian rhythm” may reflect the local adaptation to Northern variations in day length [86,87]. Genes close to SNPs associated with the different growth parameters, however, had clearly distinctive patterns of functional enrichment (S19 Fig and S10 Table). We therefore argue that even though population structure outliers may create some false-positive associations, the polygenic pattern of association that we observe at sub-significant level cannot be explained by the history of population divergence alone.

No association between per-individual burden and growth

In areas located at the edge of the distribution range of A. thaliana, populations may have accumulated an excess of deleterious mutations in the aftermath of their genetic isolation [39,88]. This could have resulted in a mutational load that would have decreased fitness components such as plant growth, because it influences the resources available for the production of progeny [1,20,22]. We thus hypothesized that the lower FS observed in Northern Europe may result from maladaptive forces associated with the demographic history of the region.

This hypothesis could not be supported. No significant difference was detected in total number of LOF mutations per genome in Northern Europe compared to Spain (GLHT: z-value = 0.634, p-value = 0.526, S20 Fig). This observation has been previously reported [55]. In addition, we detected no significant correlation between the number of LOF alleles per genome and the average final size in HL or LL plants within Europe (r in HL = 0.079, p = 0.262, r in LL = 0.029, p = 0.684). Furthermore, we observed no significant difference in growth between Northern European and Chinese populations, despite their significantly higher burden of LOF alleles per genome (GLHT China versus Northern Europe: z-value = -20.259, p-value = <1e-4, S21 Fig, [65]). Therefore, we conclude that the individual burden of LOF mutations is unrelated to rosette growth variation.

We reasoned that lower growth rate might also be associated with a small subset of LOF mutations. To test this hypothesis, we investigated genetic associations between LOF alleles and the three growth parameters (see Materials and Methods). This analysis is similar to a GWAS, but utilises information on approximately 2500 genes that have at least one loss-of-function allele in any of the 1001 Genomes lines [39,55]. We detected no association between LOF alleles and FS, yet there was a significant association of LOF variation at gene AT2G17750 with variation in both t50 in LL plants and t50 plasticity (Fig 4, effect size = -3.542, p-value = 7.49e-6, gene-Fst = 0.113, and effect size = -4.470, p-value = 4.99e-6 for t50 and t50 plasticity, respectively). AT2G17750 encodes the NEP-interacting protein (NIP1) active in chloroplasts, which was reported to mediate intra-plastidial trafficking of an RNA polymerase encoded in the nucleus [89]. NIP1controls the transcription of the rrn operon in protoplasts or amyloplasts during seed germination and in chloroplasts during later developmental stages [89]. The LOF variant is present primarily in Northern Europe (MAF = 16 and 0.8% in Northern Europe and Spain, respectively) but is unlikely to be deleterious: it correlates with a decrease of t50, which is a faster entry in the exponential growth phase indicative of increased growth vigor (Fig 4). Taken together, this result does not support the hypothesis that decreased FS in Northern Europe or China is controlled by deleterious variation.

Loss-of-function association and phenotype of t50 in LL/GxE.
Fig 4

Loss-of-function association and phenotype of t50 in LL/GxE.

Manhattan plot of a GWAS with loss-of-funcion alleles and t50LL (a) and t50GxE (c) as input phenotypes with the same association (AT2G17750) above the Bonferroni threshold (dashed line). Boxplot of the phenotype of t50LL (b) and t50GxE (d) versus the allele state at AT2G17750 (0 means functional, 1 is a loss-of-function). The colors separate the populations into Spain (purple) and Northern Europe (green).

FS variation might reflect local adaptation at the regional scale

During the growth season, Northern European A. thaliana populations are exposed to lower average temperatures (S3 Fig). Smaller rosettes are more compact, and increased compactness is often observed in populations adapted to cold temperatures [32,9092]. Freezing tolerance, which was indeed reported to be higher in Northern Europe, is associated with functions affecting rosette size [93]. We thus hypothesized that the decreased FS and t50 observed for Northern European genotypes grown under both light regimes is the result of polygenic adaptation to lower average temperatures. We used the 14 to 47 LD-pruned set of SNPs associating in GWAS at a sub-significant level (p<1e-4) to compute polygenic scores for each genotype and each trait, and used Qx, a summary statistic that quantifies their variance across locations of origin. A Qx value outside of neutral expectations inferred from the kinship variance in the population, indicates excess differentiation of polygenic scores, as expected if individual populations evolved under divergent selection [53]. We observed that all traits displayed a strongly significant Qx (S8 Table). The differentiation of polygenic scores between the individual populations of origin suggests that divergent selection may be acting locally. Local adaptation has indeed been reported at this scale in this species [94]. This result should however be taken with caution, because, like the GWAS hits it is using, the Qx statistics is sensitive to population structure outliers. Clearly, population structure might underpin more of the GWAS signal detected for slope or t50, which are markedly less heritable than FS.

Interestingly, we observed that FS measured in HL and t50 measured in LL displayed polygenic scores that differed significantly between regions (p-value = 0.0162 and 0.0309, respectively, S22 Fig). We thus further tested whether, at the phenotypic level, regional differentiation in growth rate departed from neutral expectations. We first investigated whether variants associated with phenotypic variation in rosette diameter showed increased genetic differentiation. Compared to the Fst distribution of 10 000 random sets of SNPs, the 95th percentile of 1360 SNPs associating with all three parameters was always higher (p<10−4). Thus, associated SNPs are collectively more likely to be differentiated than the rest of the genome. This pattern is not caused by the confounding effect of population structure, because the functional enrichments are mostly specific to the phenotypes (S19 Fig). We note, however, that a few spurious genetic associations could contribute to both higher Fst and over-dispersion of polygenic scores [5658]. Additional evidence based on approaches independent of GWAS is therefore required to support the adaptive significance of regional differences in growth rate in Europe. To this end, we used the population kinship matrix to parameterize a multivariate normal distribution and predict the amount of additive phenotypic divergence expected if the trait evolves neutrally [69]. We observed that differentiation for FS measured in HL plants was marginally more differentiated than predicted under neutral conditions (Qst = 0.325, p-value = 0.085, Fig 5). The other parameters did not depart from neutrality (Qst ranging from 0.029 to 0.27, min p = 0.11, Fig 5). Since the divergent Chinese population indicates that the unconstrained evolution of growth rate variation is unlikely, this test might be overly conservative. In addition, it predicts the divergence in additive genetic variance, but in the selfing species A. thaliana, the whole genetic variance, i.e. broad sense heritability, can contribute to adaptation. We also compared the distribution of phenotypic variation within and between regions to the SNP Fst-distribution [68]. We used the Fst between Northern Europe and Spain as an estimate for nucleotide differentiation and compared it to the differentiation of these populations at the phenotypic level (Qst) [13,45,67]. For FS and t50, the Qst was significantly greater than genetic differentiation at 95% of single nucleotide (Table 2). This suggests that selective forces have contributed to the regional adaptation of FS in Europe. Other climatic components like temperature could also have strong effects on growth differences between populations. Nevertheless, we detected only weak correlations between growth variation and temperature at the location of origin (S23 Fig and S11 Table), suggesting that growth rate could be locally adapted to the conditions prevailing in each region. The environmental factors contributing to adaptive divergence in plant growth thus remain to be determined in this species.

Expected distribution for quantitative trait differentiation between the Spanish and Northern European population.
Fig 5

Expected distribution for quantitative trait differentiation between the Spanish and Northern European population.

Qst. The expectation is based on a multivariate normal distribution assuming a neutral trait with polygenic basis. Vertical lines indicate observed Qst for the individual growth parameters, FS (Solid line), t50 (Dashed line), Slope (Dot line), in HL (orange) and LL (cyan). The red arrows show the 90th, 95th and 99th percentiles of the distribution.

Table 2
FS and t50 quantitative differentiation (Qst) exceed differentiation given by single SNPs.
TraitQstPercentile of Fst
FSHL0.37996.57
FSLL0.28295.80
t50HL0.30095.95
t50LL0.18994.79
SLHL0.08193.23
SLLL0.01091.62

Qst for each trait measured in HL and LL plants. Linear mixed models were used to quantify the ratio of genetic variation between versus within Spain and Northern Europe (Qst). The 95th percentile of the distribution for single SNP Fst between these two regions was 0.205. Permutations confirmed that this test is conservative (see Materials and Methods). HL: plants grown under high light regime, LL: plants grown under low Light regime, FS: Final Size, t50: time to maximum growth and SL: slope.

Conclusion

Our comprehensive analysis of genetic diversity in rosette growth rate, within and between three broad regions of the distribution area of A. thaliana, reveals the environmental and evolutionary factors that control this complex trait, which is of central importance for plant ecology. We show that plastic reactions to light intensity have the strongest impact on variation in rosette growth rates. Yet, we also provide evidence for significant genetic variation within and between regions. We observed that Spanish genotypes show more vigorous rosette growth and reach the larger size, regardless of light conditions. Although GWAS reveal very few associations that pass Bonferroni correction, analyses of functional enrichments and polygenic scores demonstrate that the polygenic basis of trait variation can also be explored in the presence of moderately significant genetic associations. The greater phenotypic differentiation observed within Europe compared to between Europe and China, a pattern opposite to measures of genetic divergence, provides a strong indication that stabilizing selective forces constrain the evolution of growth rate over time. The analysis of polygenic scores and patterns of differentiation suggests that much of the variation observed within Europe has been shaped by natural selection, rather than by the burden imposed by deleterious mutations. Leveraging polygenic associations in local adaptation studies remains challenging [78]. Methodological developments that improve the use polygenic associations for the study of local adaptation are needed to consolidate these conclusions. Understanding the potential of polygenic trait architectures will help better integrate complex traits in our understanding of the genetic processes underpinning ecological specialization [6,95].

Acknowledgements

We thank Prof. Andreas Beyer and Prof. Arthur Korte for advice regarding GWAS analyses and Emily Wheeler, Boston, for editorial assistance.

References

TMitchell-olds. Genetic constraints on life-history. Evolution (N Y). 1996;50: 140145.

AJWilson, JMPemberton, JGPilkington, THClutton-Brock, DWColtman, LEBKruuk. Quantitative genetics of growth and cryptic evolution of body size in an island population. Evol Ecol. 2007;21: 337356. 10.1007/s10682-006-9106-z

SFElena, RELenski. Evolution experiments with microorganisms: The dynamics and genetic bases of adaptation. Nat Rev Genet. 2003;4: 457469. 10.1038/nrg1088

JPGrime. Vegetation classification by reference to strategies. Nature. 1974;250: 2631.

JPGrime. Evidence for the Existence of Three Primary Strategies in Plants and Its Relevance to Ecological and Evolutionary Theory. Am Nat. 1977;111: 11691194. 10.1086/283244

KJRPByers, SXu, PMSchlüter. Molecular mechanisms of adaptation and speciation: why do we need an integrative approach? Mol Ecol. 2017;26: 277290. 10.1111/mec.13678

LMChevin, RLande. When do adaptive plasticity and genetic evolution prevent extinction of a density-regulated population? Evolution (N Y). 2010;64: 11431150. 10.1111/j.1558-5646.2009.00875.x

JABac-Molenaar, CGranier, JJBKeurentjes, DVreugdenhil. Genome wide association mapping of time-dependent growth responses to moderate drought stress in Arabidopsis. Plant Cell Environ. 2016;39: 88102. 10.1111/pce.12595

CKörner. Paradigm shift in plant growth control. Curr Opin Plant Biol. 2015;25: 107114. 10.1016/j.pbi.2015.05.003

10 

CLBallaré, RPierik. The shade-avoidance syndrome: Multiple signals and ecological consequences. Plant Cell Environ. 2017;40: 25302543. 10.1111/pce.12914

11 

MPigliucci. Evolution of phenotypic plasticity: Where are we going now? Trends Ecol Evol. 2005;20: 481486. 10.1016/j.tree.2005.06.001

12 

MTakou, BWieters, SKopriva, GCoupland, ALinstädter. Linking genes with ecological strategies in Arabidopsis thaliana. J Exp Bot. 2019;70: 11411151. 10.1093/jxb/ery447

13 

IKronholm, FPicó, CAlonso-Blanco, JGoudet, Jde Meaux. Genetic basis of adaptation in Arabidopsis thaliana: Local adaptation at the seed dormancy QTL DOG1. Evolution (N Y). 2012;66: 22872302. 10.1111/j.1558-5646.2012.01590.x

14 

EKerdaffrec, DLFiliault, AKorte, ESasaki, VNizhynska, ÜSeren, et al Multiple alleles at a single locus control seed dormancy in Swedish Arabidopsis. Elife. 2016;5: e22502 10.7554/eLife.22502

15 

JARNavarro, MWilcox, JBurgueño, CRomay, KSwarts, STrachsel, et al A study of allelic diversity underlying flowering-time adaptation in maize landraces. Nat Genet. 2017;49: 476480. 10.1038/ng.3784

16 

PWHughes, WJJSoppe, MCAlbani. Seed traits are pleiotropically regulated by the flowering time gene PERPETUAL FLOWERING 1 (PEP1) in the perennial Arabis alpina. Mol Ecol. 2019;28: 11831201. 10.1111/mec.15034

17 

RCLewontin, JKrakauer. Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms. Genetics. 1973;74: 175195.

18 

OHallatschek, PHersen, SRamanathan, DRNelson. Genetic drift at expanding frontiers promotes gene segregation. Proc Natl Acad Sci U S A. 2007;104: 1992619930. 10.1073/pnas.0710150104

19 

LExcoffier, MFoll, RJPetit. Genetic Consequences of Range Expansions. Annu Rev Ecol Evol Syst. 2009;40: 481501. 10.1146/annurev.ecolsys.39.110707.173414

20 

YWilli, MFracassetti, SZoller, JVan Buskirk. Accumulation of Mutational Load at the Edges of a Species Range. Mol Biol Evol. 2018;35: 781791. 10.1093/molbev/msy003

21 

SKlopfstein, MCurrat, LExcoffier. The Fate of Mutations Surfing on the Wave of a Range Expansion. Mol Biol Evol. 2006;23: 482490. 10.1093/molbev/msj057

22 

MTakou, THämälä, EKoch, KASteige, HDittberner, LYant, et al Maintenance of adaptive dynamics and no detectable load in a range-edge out-crossing plant population. bioRxiv. 2020 10.1101/709873

23 

TMitchell-Olds, JSchmitt. Genetic mechanisms and evolutionary significance of natural variation in Arabidopsis. Nature. 2006;441: 94752. 10.1038/nature04878

24 

JBergelson, FRoux. Towards identifying genes underlying ecologically relevant traits in Arabidopsis thaliana. Nat Rev Genet. 2010;11: 867879. 10.1038/nrg2896

25 

PHornitschek, M VKohnen., SLorrain, JRougemont, KLjung, ILópez-Vidriero, et al Phytochrome interacting factors 4 and 5 control seedling growth in changing light conditions by directly controlling auxin signaling. Plant J. 2012;71: 699711. 10.1111/j.1365-313X.2012.05033.x

26 

FVasseur, MExposito-Alonso, OJAyala-Garay, GWang, BJEnquist, DVile, et al Adaptive diversification of growth allometry in the plant Arabidopsis thaliana. Proc Natl Acad Sci. 2018;115: 34163421. 10.1073/pnas.1709141115

27 

JABac-Molenaar, DVreugdenhil, CGranier, JJBKeurentjes. Genome-wide association mapping of growth dynamics detects time-specific and general quantitative trait loci. J Exp Bot. 2015;66: 55675580. 10.1093/jxb/erv176

28 

MHanemian, FVasseur, EMarchadier, EGilbault, JBresson, IGy, et al Transcriptional natural variation at FLM induces synergistic pleiotropy in Arabidopsis thaliana. bioRxiv. 2019; 658013 10.1101/658013

29 

SGlander, FHe, GSchmitz, AWitten, ATelschow, Jde Meaux. Assortment of Flowering Time and Immunity Alleles in Natural Arabidopsis thaliana Populations Suggests Immunity and Vegetative Lifespan Strategies Coevolve. Genome Biol Evol. 2018;10: 22782291. 10.1093/gbe/evy124

30 

MDebieu, CTang, BStich, TSikosek, SEffgen, EJosephs, et al Co-Variation between Seed Dormancy, Growth Rate and Flowering Time Changes with Latitude in Arabidopsis thaliana. PLoS One. 2013;8: 112. 10.1371/journal.pone.0061075

31 

MMVetter, IKronholm, FHe, HHäweker, MReymond, JBergelson, et al Flagellin perception varies quantitatively in arabidopsis thaliana and its relatives. Mol Biol Evol. 2012;29: 16551667. 10.1093/molbev/mss011

32 

YLuo, AWidmer, SKarrenberg. The roles of genetic drift and natural selection in quantitative trait divergence along an altitudinal gradient in Arabidopsis thaliana. Heredity (Edinb). 2015;114: 2208. 10.1038/hdy.2014.89

33 

MTodesco, SBalasubramanian, TTHu, MBTraw, MHorton, PEpple, et al Natural allelic variation underlying a major fitness trade-off in Arabidopsis thaliana. Nature. 2010;465: 632636. 10.1038/nature09083

34 

TZüst, AAAgrawal. Trade-Offs Between Plant Growth and Defense Against Insect Herbivory: An Emerging Mechanistic Synthesis. Annu Rev Plant Biol. 2017;68: 513534. 10.1146/annurev-arplant-042916-040856

35 

NHDavila Olivas, EFrago, MPMThoen, KJKloth, FFMBecker, JJAvan Loon, et al Natural variation in life history strategy of Arabidopsis thaliana determines stress responses to drought and insects of different feeding guilds. Mol Ecol. 2017; 29592977. 10.1111/mec.14100

36 

IKronholm, OLoudet, Jde Meaux. Influence of mutation rate on estimators of genetic differentiation—lessons from Arabidopsis thaliana. BMC Genet. 2010;11: 114. 10.1186/1471-2156-11-1

37 

AFulgione, MKoornneef, FRoux, JHermisson, AMHancock. Madeiran Arabidopsis thaliana Reveals Ancient Long-Range Colonization and Clarifies Demography in Eurasia. Mol Biol Evol. 2017 10.1093/molbev/msx300

38 

YZou, XHou, QWu, JChen, ZLi, THan, et al Adaptation of Arabidopsis thaliana to the Yangtze River basin. Genome Biol. 2017; 111. 10.1186/s13059-016-1139-1

39 

1001 Genomes Consortium T. 1,135 Genomes Reveal the Global Pattern of Polymorphism in Arabidopsis thaliana—suppl. Material. Cell. 2016;166.

40 

Wieters B, de Meaux, J. Phenotype pictures of Arabidopsis thaliana in high light and low light conditions [Data set]. 2020 10.5061/dryad.s1rn8pk5m

41 

WRasband. ImageJ. U.S. National Institutes of Health 2012.

42 

JDe Vylder, FVandenbussche, YHu, WPhilips, DVan Der Straeten. Rosette Tracker: an open source image analysis tool for automatic quantification of genotype effects. 2012 10.1104/pp.112.202762

43 

R-Development-Core-Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2008 Available: http://www.r-project.org

44 

CRitz, FBaty, JCStreibig, DGerhard. Dose-response analysis using R. PLoS One. 2015;10: 113. 10.1371/journal.pone.0146021

45 

HDittberner, AKorte, TMettler-Altmann, APMWeber, GMonroe, Jde Meaux. Natural variation in stomata size contributes to the local adaptation of water-use efficiency in Arabidopsis thaliana. Mol Ecol. 2018; 40524065. 10.1111/mec.14838

46 

JPinheiro, DBates, SDebRoy, DSarkar, RCTeam. nlme: linear and nonlinear mixed effects models. 2018.

47 

AKorte, AFarlow. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods. 2013;9: 2938. 10.1186/1746-4811-9-29

48 

SEFick, RJHijmans. Worldclim 2: New 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology. 2017.

49 

ATrabucco, RZomer. Global soil water balance geospatial database. CGIAR Consortium for Spatial Information. Publ Online. 2010 Available: http://www.cgiar-csi.org

50 

THothorn, FBretz, PWestfall, RMHeiberger, ASchuetzenmeister, SScheibe. Package ‘multcomp’—Simultaneous Inference in General Parametric Models. 2017.

51 

TWei, VSimko. R package “corrplot”: Visualisation of a Correlation Matrix (Version 0.84). 2017.

52 

The Inkscape Team. Inkscape. 2007. Available: inkscape.org

53 

JJBerg, GCoop. A Population Genetic Signal of Polygenic Adaptation. PLoS Genet. 2014;10 10.1371/journal.pgen.1004412

54 

PDanecek, AAuton, GAbecasis, CAAlbers, EBanks, MADePristo, et al The variant call format and VCFtools. Bioinformatics. 2011;27: 21562158. 10.1093/bioinformatics/btr330

55 

JMonroe, TPowell, NPrice, JMullen., AHoward, KEvans, et al Drought adaptation in Arabidopsis thaliana by extensive genetic loss-of- function. Elife. 2018;7: 221. 10.7554/eLife.41038

56 

MSohail, RMMaier, AGanna, ABloemendal, ARMartin, MCTurchin, et al Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies. Elife. 2019;8: 117. 10.7554/eLife.39702

57 

MAFustier, NEMartínez-Ainsworth, JAAguirre-Liguori, AVenon, HCorti, ARousselet, et al Common gardens in teosintes reveal the establishment of a syndrome of adaptation to altitude. PLoS Genetics. 2019 10.1371/journal.pgen.1008512

58 

JJBerg, AHarpak, NSinnott-Armstrong, AMJoergensen, HMostafavi, YField, et al Reduced signal for polygenic adaptation of height in UK Biobank. Elife. 2019;8: 147. 10.7554/eLife.39725

59 

Purcell S, Chang C. PLINK 1.9. 2019. Available: www.cog-genomics.org/plink/1.9/

60 

BPfeifer, UWittelsburger, SERamos-Onsins, MJLercher, UWittelsbürger, SERamos-Onsins, et al PopGenome: An Efficient Swiss Army Knife for Population Genomic Analyses in R. Mol Biol Evol. 2014;31: 19291936. 10.1093/molbev/msu136

61 

FHe, ALArce, GSchmitz, MKoornneef, PNovikova, ABeyer, et al The Footprint of Polygenic Adaptation on Stress-Responsive Cis-Regulatory Divergence in the Arabidopsis Genus. Mol Biol Evol. 2016;33: 20882101. 10.1093/molbev/msw096

62 

GYu, FLi, YQin, XBo, YWu, SWang. GOSemSim: An R package for measuring semantic similarity among GO terms and gene products. Bioinformatics. 2010;26: 976978. 10.1093/bioinformatics/btq064

63 

EParadis, SBlomberg, BBolker, JBrown, JClaude, HSCuong, et al Package ‘ape’: Analyses of Phylogenetics and Evolution. 2019.

64 

YBSimons, GSella. The impact of recent population history on the deleterious mutation load in humans and close evolutionary relatives. Curr Opin Genet Dev. 2016;41: 150158. 10.1016/j.gde.2016.09.006

65 

YCXu, XMNiu, XXLi, WHe, JFChen, YPZou, et al Adaptation and phenotypic diversification in arabidopsis through loss-of-function mutations in protein-coding genes. Plant Cell. 2019;31: 10121025. 10.1105/tpc.18.00791

66 

JGoudet. HIERFSTAT, a package for R to compute and test hierarchical F -statistics. Mol Ecol Notes. 2005;5: 184186. 10.1111/j.1471-8278

67 

TLeinonen, RJSMcCairns, RBO’Hara, JMerilä. Q(ST)-F(ST) comparisons: evolutionary and ecological insights from genomic heterogeneity. Nat Rev Genet. 2013;14: 17990. 10.1038/nrg3395

68 

MCWhitlock, FGuillaume. Testing for spatially divergent selection: Comparing QST to FST. Genetics. 2009;183: 10551063. 10.1534/genetics.108.099812

69 

EMKoch. The effects of demography and genetics on the neutral distribution of quantitative traits. Genetics. 2019;211: 13711394. 10.1534/genetics.118.301839

70 

BBrachi, NFaure, MHorton, EFlahauw, AVazquez, MNordborg, et al Linkage and association mapping of Arabidopsis thaliana flowering time in nature. PLoS Genet. 2010;6: 40 10.1371/journal.pgen.1000940

71 

CKülheim, JÅgren, SJansson. Rapid regulation of light harvesting and plant fitness in the field. Science (80-). 2002;297: 9193. 10.1126/science.1072359

72 

KAFranklin. Shade avoidance. New Phytol. 2008;179: 930944. 10.1111/j.1469-8137.2008.02507.x

73 

ADurvasula, AFulgione, RMGutaker, SIAlacakaptan, PJFlood, CNeto, et al African genomes illuminate the early history and transition to selfing in Arabidopsis thaliana. Proc Natl Acad Sci. 2017;114: 201616736 10.1073/pnas.1616736114

74 

ESasaki, PZhang, SAtwell, DMeng, MNordborg. " Missing " G x E Variation Controls Flowering Time in Arabidopsis thaliana. PLoS Genet. 2015; 118. 10.1371/journal.pgen.1005597

75 

BMéndez-Vigo, NHGomaa, CAlonso-Blanco, FXavier Picó. Among- and within-population variation in flowering time of Iberian Arabidopsis thaliana estimated in field and glasshouse conditions. New Phytol. 2013;197: 13321343. 10.1111/nph.12082

76 

CShindo, MJAranzana, CLister, CBaxter, CNicholls, MNordborg, et al Role of FRIGIDA and FLOWERING LOCUS C in determining variation in flowering time of Arabidopsis. Plant Physiol. 2005;138: 11631173. 10.1104/pp.105.061309

77 

YZan, ÖCarlborg. A Polygenic Genetic Architecture of Flowering Time in the Worldwide Arabidopsis thaliana Population. Jde Meaux, editor. Mol Biol Evol. 2019;36: 141154. 10.1093/molbev/msy203

78 

NPrice, LLopez, AEPlatts, JRLasky. In the presence of population structure: From genomics to candidate genes underlying local adaptation. Ecol Evol. 2020; 18891904. 10.1002/ece3.6002

79 

MWellenreuther, BHansson. Detecting Polygenic Evolution: Problems, Pitfalls, and Promises. Trends Genet. 2016;32: 155164. 10.1016/j.tig.2015.12.004

80 

KCsilléry, ARodríguez-Verdugo, CRellstab, FGuillaume. Detecting the genomic signal of polygenic adaptation and the role of epistasis in evolution. Mol Ecol. 2018;27: 606612. 10.1111/mec.14499

81 

JKPritchard, ADi Rienzo. Adaptation—not by sweeps alone. Nat Rev Genet. 2010;11: 665667. 10.1038/nrg2880

82 

JBraam, RWDavis. Rain-, wind-, and touch-induced expression of calmodulin and calmodulin-related genes in Arabidopsis. Cell. 1990 10.1016/0092-8674(90)90587-5

83 

DLFiliault, JNMaloof. A genome-wide association study identifies variants underlying the Arabidopsis thaliana shade avoidance response. PLoS Genet. 2012;8: 112. 10.1371/journal.pgen.1002589

84 

JBou-Torrent, IRoig-Villanova, AGalstyan, JFMartínez-García. PAR1 and PAR2 integrate shade and hormone transcriptional networks. Plant Signal Behav. 2008;3: 453454. 10.4161/psb.3.7.5599

85 

HEHoltan, SBandong, CMMarion, LAdam, STiwari, YShen, et al Bbx32, an arabidopsis b-box protein, functions in light signaling by suppressing HY5-regulated gene expression and interacting with STH2/BBX21. Plant Physiol. 2011;156: 21092123. 10.1104/pp.111.177139

86 

ADe Montaigu, AGiakountis, MRubin, RTóth, FCremer, VSokolova, et al Natural diversity in daily rhythms of gene expression contributes to phenotypic variation. Proc Natl Acad Sci U S A. 2015;112: 905910. 10.1073/pnas.1422242112

87 

MJSalmela, CWeinig. The fitness benefits of genetic variation in circadian clock regulation. Curr Opin Plant Biol. 2019;49: 8693. 10.1016/j.pbi.2019.06.003

88 

SPeischl, IDupanloup, MKirkpatrick, LExcoffier. On the accumulation of deleterious mutations during range expansions. Mol Ecol. 2013;22: 59725982. 10.1111/mec.12524

89 

JAzevedo, FCourtois, M-AHakimi, EDemarsy, TLagrange, SLerbs-mache, et al Intraplastidial trafficking of a phage-type RNA polymerase is mediated by a thylakoid RING-H2 protein. PNAS. 2008; 27. 10.1073/pnas.0800909105

90 

TJonas, CRixen, MSturm, VStoeckli. How alpine plant growth is linked to snow cover and climate variability. J Geophys Res Biogeosciences. 2008;113: 110. 10.1029/2007JG000680

91 

SGByars, WPapst, AAHoffmann. Local adaptation and cogradient selection in the alpine plant, Poa hiemata, along a narrow altitudinal gradient. Evolution (N Y). 2007;61: 29252941. 10.1111/j.1558-5646.2007.00248.x

92 

BLi, J-ISuzuki, THara. Latitudinal variation in plant size and relative growth rate in Arabidopsis thaliana. Oecologia. 1998;115: 293301. 10.1007/s004420050519

93 

MWHorton, GWillems, ESasaki, MKoornneef, MNordborg. The genetic architecture of freezing tolerance varies across the range of Arabidopsis thaliana. Plant Cell Environ. 2016;39: 25702579. 10.1111/pce.12812

94 

LFrachon, CBartolli, SCarrère, OBouchez, AChaubet, MGautier, DRoby, FRoux. A Genomic Map of Climate Adaptation in Arabidopsis thaliana at a Micro-Geographic Scale. Frontiers in Plant Science. 2018 10.3389/fpls.2018.00967

95 

NBarghi, JHermisson, CSchlötterer. Polygenic adaptation: a unifying framework to understand positive selection. Nat Rev Genet. 2020 10.1038/s41576-020-0250-z