PLoS ONE
Home Quantile regression in genomic selection for oligogenic traits in autogamous plants: A simulation study
Quantile regression in genomic selection for oligogenic traits in autogamous plants: A simulation study
Quantile regression in genomic selection for oligogenic traits in autogamous plants: A simulation study

Competing Interests: The authors have declared that no competing interests exist.

¤

Current address: Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

Article Type: research-article Article History
Abstract

This study assessed the efficiency of Genomic selection (GS) or genome‐wide selection (GWS), based on Regularized Quantile Regression (RQR), in the selection of genotypes to breed autogamous plant populations with oligogenic traits. To this end, simulated data of an F2 population were used, with traits with different heritability levels (0.10, 0.20 and 0.40), controlled by four genes. The generations were advanced (up to F6) at two selection intensities (10% and 20%). The genomic genetic value was computed by RQR for different quantiles (0.10, 0.50 and 0.90), and by the traditional GWS methods, specifically RR-BLUP and BLASSO. A second objective was to find the statistical methodology that allows the fastest fixation of favorable alleles. In general, the results of the RQR model were better than or equal to those of traditional GWS methodologies, achieving the fixation of favorable alleles in most of the evaluated scenarios. At a heritability level of 0.40 and a selection intensity of 10%, RQR (0.50) was the only methodology that fixed the alleles quickly, i.e., in the fourth generation. Thus, it was concluded that the application of RQR in plant breeding, to simulated autogamous plant populations with oligogenic traits, could reduce time and consequently costs, due to the reduction of selfing generations to fix alleles in the evaluated scenarios.

Oliveira,Nascimento,Nascimento,Sant'Anna,Romero,Azevedo,Bhering,Moura,and Rahimi: Quantile regression in genomic selection for oligogenic traits in autogamous plants: A simulation study

Introduction

In mid-2019, the world population reached 7.7 billion inhabitants and a further rise to 9.7 billion by 2050 is estimated [1]. Thus, more food must be produced to feed this population, although agricultural areas are increasingly limited and concerns about the negative environmental impacts of food production are growing [2, 3].

Since the Green Revolution in the 1960s, which caused a boost in the production potential of several crops, it is generally expected that plant breeding efforts will be able to secure the required yield gains [4]. The productivity of coffee trees, for example, has increased considerably, and one of the main reasons is the use of improved cultivars. In Brazil, coffee cultivars that were released and are still in use, e.g., “Mundo Novo”, are 240% more productive than introduced varieties [5]. Plant breeding programs, aside from focusing on higher yields, require the improvement of several other traits [6], e.g., the development of plants with a more appropriate architecture for higher plant density and mechanical management, better resistance and tolerance to biotic and abiotic stresses, adaptation to and stability in different cultivation environments, and a higher fruit and grain quality [711].

To meet the growing producer, consumer and market demand, a complex, continuous and dynamic breeding process is required [4], resulting in costly long-term projects for the development of superior cultivars. The developmental period of an improved cultivar of a perennial species can be over 25 years [12] and for annual species approximately 12 years. Thus, the search for procedures capable of providing superior genotypes in less time and, consequently, at a lower cost, has been intensified [4, 13, 14].

With a view to reducing the time demand and increasing selection accuracy, Meuwissen et al. [15] proposed the genome-wide selection (GWS). This kind of selection uses direct DNA information based on molecular markers to predict the genomic estimated breeding value (GEBV) of an individual, which is a measure used to select the best individuals, according to their merit within the population. The main advantage of GWS, compared to phenotypic selection, is that the GEBV of individuals whose phenotypes were not yet collected can be estimated, thus resulting in a reduced generation interval and an increase in genetic gain [1618].

The possibilities of applying GWS in autogamous plant breeding have been described in the literature. According to Heffner et al. [19], the prediction accuracy of GWS was superior to phenotypic selection in wheat. In simulated scenarios to improve oligogenic traits in Coffea Arabica, with different population densities and sizes, Romero [20] tried to determine the generation in which a favorable allele is fixed. As a result, the author observed that in small populations (as commonly used in breeding programs, e.g., for coffee), favorable alleles were fixed in the sixth generation (F6), while in large populations, fixation occurred in the fifth generation (F5). The GWS was also successfully applied in other crops, such as rice [21], oats [22] and barley [23].

An alternative and still little explored methodology for GWS studies is Quantile Regression (QR) [24]. Such methodology, unlike traditional methods based on averages, allows to adjust regression models throughout the distribution of the dependent variable, does not require assumptions about the distribution of the error and is robust to outliers. Parameter estimation is based on the weighted absolute errors method [25]. To deal with dimensionality problems in GWS studies, which are common in the marker matrix, Li and Zhu [26] proposed the Regularized Quantile Regression (RQR). The use of RQR in a GWS study was proposed by Nascimento et al. [27], in order to estimate GEBV for different quantiles of the phenotype of interest [28, 29]. In their study, Nascimento et al. [27] used RQR to estimate GEBV from simulated data with scenarios with different skewness levels in the phenotype distribution. The results of the RQR were compared to those of the BLASSO (Bayesian Least Absolute Shrinkage and Selection Operator) method, and the authors observed a lower mean square error of the former. The results indicated the viability of this alternative for GWS analysis, even in scenarios without skewness of the phenotype distribution. The approaches RQR and BLASSO were also used by Santos et al. [30] to estimate the genetic merit in pigs for asymmetric traits related to the pig carcass, and observed equally or more accurate results by RQR than BLASSO for all evaluated traits.

In spite of the interesting and promising results, RQR has not yet been evaluated throughout an entire breeding process, considering the reproductive system of a plant species. Thus, this study evaluated whether the use of RQR in GWS, for simulated data of autogamous plants with oligogenic traits, at different selection and heritability levels, allows the fixation of favorable alleles in earlier generations than the commonly used GWS methodologies. The results of the predictive capacity, mean and genotypic variance obtained by RQR were compared with traditional methods of genomic selection, specifically with RR-BLUP and BLASSO.

Material and methods

Population simulation

For this study, a 1040 cM genome was simulated, using software GENES [31], with markers spaced 1 cM apart, with eight linkage groups, resulting in a total of 1048 markers [32]. Oligogenic traits controlled by four loci were simulated, located in four different linkage groups, with uniform effect and absence of dominance and epistasis.

The F1 population was established by crossing contrasting parents, thus generating gametes for the formation of the F2 population, consisting of 625 individuals. Once the base genome was generated, genotypic values and three sets of phenotypic values were simulated, at heritability levels (h2) of 0.10, 0.20 and 0.40. To determine the genotypic values (vgi), the following equation was used:

where vgi is the genotypic value of individual i; μ the genotype population mean (here μ = 1.0); ai the additive effect of individual i, with ai=k=14ρkαk, where ρk = 2.5 is the effect of the favorable allele with the same contribution to the whole locus k; αk the contribution of locus k (1, 0 or -1 for genotypic classes AA, Aa and aa, respectively); and di is the dominance effect, assumed to be null in this study (di = 0).

The phenotypic values (vfi) were determined by the following equation:

where vfi is the phenotypic value of individual i; vgi the genotypic value of individual i; εi the environmental effect generated according to a normal distribution, where mean and variance are compatible with the specific trait heritability (εiN(0,σe2)), with σe2=σg2(1h2)h2, where σg2 is the genetic variance [33]. The phenotypic and genotypic simulated data sets are freely accessible at https://zenodo.org/record/4292736#.X8BDrmhKjIU.

The advanced generations F3, F4, F5 and F6 were obtained from F2 as base generation by selfing. The individuals with the highest GEBV obtained by an adjusted GWS model in F2 were selected. The number of selected individuals depends on the selection intensity. The selection/simulation process of the progenies was repeated until the sixth generation. The generations F3 to F6, with 200 individuals [34], were generated from the genotype of the selected individuals, simulating a selfing process. The F3 to F6 populations were simulated using software R [35].

Genomic prediction

Based on the simulated F2 population, it was stipulated that 80% of the individuals would belong to the estimation population and 20% to the validation population. The genomic genetic values of the individuals were estimated by RQR [26] based on different quantiles (0.10, 0.50 and 0.90), using RR-BLUP [15] and BLASSO [36]. Two selection intensities (10% and 20%) and three heritability levels (h2 = 0.10, 0.20 and 0.40) were considered, and each evaluated scenario was simulated 30 times. For all evaluated methods, the general GWS model was considered [15]:

where yi is the ith observation of phenotype y (i = 1,2,…,625); μ the overall mean; gj the effect of the jth marker (j = 1,2,…,1040); xij are the elements of the incidence matrix of marker j in individual i, with parameterization 1, 0 and −1; and ei is the ith observation of the random error eiN(0,σe2).

The parameters of model (2) were estimated by three methodologies: RQR, at three quantiles (τ = 0.10, 0.50 and 0.90), BLASSO and RR-BLUP.

In RQR, the marker effects are computed by solving the following optimization problem [26]:

where j=1p|gj| is the sum of the absolute values of the regression coefficients, λ the penalty parameter, n = 625, p = 1040 and ρτ(.), called “check function” by Koenker and Bassett (1978), and defined by:
in this study, τ = 0.1; τ = 0.5 and τ = 0.9.

Note that, in the RQ, the coefficients are estimated from the minimization of the weighted sum of the vertical distances between the observed and estimated values [25]. For that, linear programming algorithms are used [37, 38]. One of the methods used to estimate these coefficients is the Simplex Method. For details on the Simplex Method, please consult Koenker [38].

After estimating the regression coefficients (marker effects), for the three quantiles considered (τ = 0.1; τ = 0.5 and τ = 0.9), the GEBV of the ith individual, based on quantile τ (GEBVi(τ)), was calculated by the following equation:

where g^(τ) is the estimated effect of the jth SNP marker based on quantile τ (τ = 0.1; τ = 0.5 e τ = 0.9). The GEBVs were also determined by BLASSO and RR-BLUP, according to the equation:
where g^j is the effect of the jth SNP marker, estimated by the two said methods.

According to the recommendation of Santos et al. [30], the penalty parameter λ of RQR was defined as half the penalty parameter resulting from the BLASSO method.

To compare the analyzed methodologies, the predictive capacity (ry,y^) of the methods was calculated, which is the correlation coefficient between the observed phenotypic (y) and the estimated genomic value (y^) in each generation. The genotypic means and variances in each generation were also determined.

Based on Eq 1, with μ = 1.0 and ai=k=14ρkαk=10, it can be said that favorable alleles in a given generation are fixed when the genotypic mean of a population reaches 11 and variance zero.

Computational aspects

The calculations and estimates were performed in the R program [35]. The function used to estimate the regression parameters at the three quantiles was rq of the quantreg package [39]. The regression coefficients were estimated by RR-BLUP using the mixed.solve function of package rrBLUP [40]. The Bayesian models were adjusted with the bglr function in package BGLR [41], with 100,000 iterations for the MCMC (Markov chain Monte Carlo) algorithms, of which 10,000 were discarded as burn-in, to ensure chain heating and a thin of 5. Convergence analysis was performed based on the criteria proposed by Raftery and Lewis [42] and Heidenberg and Welch (1983) [43].

Results and discussion

The mean estimates of the predictive capacity (PC) in scenarios with heritability of 0.10 varied between 0.20 (RQR (0.10)) and 0.45 (BLASSO and RR-BLUP) in generation F2 (Fig 1); between 0.40 (RQR (0.10)) and 0.5 (BLASSO and RR-BLUP) at heritability 0.20 (Fig 2); and between 0.65 (RQR (0.10) and RQR (0.90)) and 0.70 (RR- BLUP) in the scenarios with heritability of 0.40 (Fig 3). In general, PC rises as heritability increases (Figs 13). This result was already expected, since traits with higher heritability are less affected by environmental variation, facilitating the breeding process [44].

Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).
Fig 1

Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).

Considering a heritability of 0.10 and two selection intensities. (A) SP = 10%; (B) SP = 20%.

Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).
Fig 2

Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).

Considering a heritability of 0.20 and two selection intensities. (A) SP = 10%; (B) SP = 20%.

Average predictive capacity (Y axis) of the models evaluated over five generations (x axis).
Fig 3

Average predictive capacity (Y axis) of the models evaluated over five generations (x axis).

Considering a heritability of 0.4 and two selection intensities. (A) SP = 10%; (B) SP = 20%.

Over the generations, the PC estimates decreased, to values close to zero in F6 in several scenarios (Figs 13). This result can be explained by the fact that the model to predict GEBV in the F3 –F6 generations was adjusted in F2. Specifically, since selection occurs over generations, the allele frequency of the initial generation changes, which leads to a reduction in the marker-QTL linkage disequilibrium (LD). Over the generations, Sant’Anna et al. [33] observed a drop in LD, which is reflected in the predictive capacity of the model for autogamous populations. In allogamous species on the other hand, LD is dissipated by advancing a single generation, resulting in a low efficiency of GS procedures based on models adjusted in previous populations.

There was an increase in means over the generations and a decrease in the genotypic variances to values close to zero from the third generation onwards, for all evaluated methods (Table 1 and S1S3 Figs). These results are in line with the theory of quantitative population genetics, which states that in response to directional selection, the allele frequency of traits with few major-effect loci changes rapidly, inducing a phenotypic response [45]. In this way, the population mean increases since the allele value is positive in the simulation process.

Table 1
Means and mean genotypic variances (n = 30) of five generations in response to selection by five predictive methods, based on three heritability levels (0.10, 0.20 and 0.40) and two selection intensities (10 and 20%).
h2MethodsSPF3F4F5F6
GMGVGMGVGMGVGMGV
0.10BLASSO10%6.44±0.309.03±1.357.78±0.987.66±2.229.14±1.335.56±4.2610.45±0.961.46±1.69
20%5.76±0.2010.55±0.907.73±0.488.48±1.529.09±0.985.74±2.8310.15±1.062.24±1.99
RRBLUP10%6.48±0.319.05±1.307.89±0.798.14±2.109.45±1.034.82±2.4110.24±1.221.78±2.38
20%5.82±0.2110.34±1.027.81±0.598.46±1.479.14±0.985.90±2.9210.31±0.922.30±2.93
RQR (0.10)10%4.19±1.0511.97±2.445.91±1.579.92±2.977.89±1.936.10±3.808.54±2.192.21±3.01
20%3.69±0.9314.00±2.015.44±1.1311.24±2.087.73±1.687.04±2.928.39±1.724.50±3.57
RQR (0.50)10%4.34±0.8212.07±2.596.47±1.499.14±3.268.20±1.866.16±4.649.42±2.062.01±2.95
20%3.80±0.7413.63±2.305.88±1.2010.93±2.478.39±1.577.16±5.199.74±1.632.24±2.78
RQR (0.90)10%4.78 ±0.8311.12±2.426.71 ±1.208.83±3.577.96±2.115.15± 4.538.76±2.672.40±2.90
20%4.47±0.8012.02±2.016.39±1.209.64±3.107.85±1.576.90±3.539.09±2.233.25±3.39
0.20BLASSO10%7.49±0.296.02±0.799.95±0.403.04±0.9210.75±0.370.82±1.1810.95±0.160.21±0.58
20%6.90±0.207.38±0.669.70±0.353.65±0.7910.68±0.271.13±0.8910.90±0.230.37±0.79
RRBLUP10%7.47±0.306.21±0.919.97±0.412.94±0.9010.73±0.270.94±0.8410.80±0.430.64±1.35
20%6.90±0.187.37±0.689.70±0.193.86±0.6510.65±0.281.30±1.1810.88±0.260.40±0.79
RQR (0.10)10%6.40±0.757.42±1.907.85±1.145.84±1.789.01±1.553.86±2.289.25±2.001.85±2.53
20%5.84±0.5910.02±1.697.86±0.816.63±1.499.02±1.224.72±1.9410.02±1.501.78±1.98
RQR (0.50)10%6.69±0.506.95±1.659.41±0.784.38±1.7410.72±0.551.07±1.9410.98±0.140.11±0.59
20%6.36±0.648.24±2.139.12±0.945.35±2.5310.56±0.721.53±2.1810.92±0.260.34±1.02
RQR (0.90)10%6.32±0.627.68±1.788.01±1.255.99±2.089.26±1.613.29±2.6110.30±1.420.88±1.80
20%6.09±0.498.67±1.848.08±1.016.92±2.149.25±1.374.29±2.8010.20±1.264.29±2.69
0.40BLASSO10%8.01±0.164.79±0.6010.47±0.231.71±0.6710.84±0.250.54±0.8110.99±0.030.03±0.13
20%7.31±0.156.76±0.6210.18±0.212.70±0.6310.82±0.200.63±0.6810.95±0.100.20±0.37
RRBLUP10%7.98±0.204.95±0.6910.45±0.261.83±0.8710.87±0.210.45±0.6610.94±0.150.20±0.50
20%7.32±0.136.64±0.4910.18±0.262.60±0.7310.43±0.170.62±0.6210.94±0.210.20±0.62
RQR (0.10)10%7.97±0.244.81±0.7310.56±0.301.47±0.9710.95±0.160.21±0.6710.96±0.230.16±0.86
20%7.37±0.236.28±0.6410.27±0.292.35±0.8310.95±0.170.20±0.4910.97±0.120.12±0.51
RQR (0.50)10%8.29±0.223.95±0.6510.86±0.210.50±0.6910.95±0.180.16±0.5711.00±0.000.00±0.00
20%7.54±0.175.93±0.6410.68±0.231.11±0.7610.96±0.090.14±0.3110.98±0.100.09±0.40
RQR (0.90)10%7.74±0.305.45±1.1210.17±0.392.68±1.2810.64±0.391.28±1.3410.77±0.620.64±1.31
20%7.15±0.256.91±0.929.98±0.443.32±1.5910.54±0.411.73±1.5610.73±0.640.80±1.65

h2: heritability; SP: selection intensity; GM: genotypic mean; GV: genotypic variance; RQR: regularized quantile regression; BLASSO: Bayesian Lasso. Genotypic means and variances in F2 were 1.10 ± 0.00 and 12.19 ± 0.00, respectively, in all tested scenarios.

The results of fixation or non-fixation of favorable alleles were different in the evaluated scenarios. In scenarios with trait heritability of 0.10, the RQR (τ = 0.1) with a selection intensity of 10% failed to fix the favorable alleles until the sixth generation, as it did not reach the genotypic mean 11 (8.54 ± 2.19) (Table 1). When selection was based on the models RQR (τ = 0.1) or BLASSO, with a selection intensity of 20%, the favorable alleles were not fixed until the sixth generation, as the genotypic variance did not reach zero by these methods (4.50 ± 3.57 and 2.24 ± 1.99, respectively) (Table 1). For the other methods, even in low-heritability scenarios, the favorable alleles could be fixed until the sixth generation (Table 1). According to Goddard [46], the speed at which a population increases or decreases the level of an allele depends on its initial frequency. Thus, the greatest difficulty in fixing alleles in a low-heritability scenario may be due to the greater environmental effect that affects the estimation of GEBV, making it even more difficult to select individuals with the desired alleles than in the other scenarios, where the disturbing environmental effect is lower.

For the scenarios with a trait heritability of 0.20 and regardless of the selection intensity, the tested methods allowed the fixation of favorable alleles until the sixth generation, except for RQR (τ = 0.90), at a selection intensity of 20% (Table 1). Selection based on the BLASSO models, at an intensity of 10%, and on RQR (τ = 0.50), at intensities of 10 and 20%, reached fixation in F5 (Table 1).

Moreover, at a heritability level of 0.40, RQR (τ = 0.50) at a selection intensity of 10% allowed the establishment of favorable alleles as early as in the fourth generation, with a genotypic mean of 10.86 ± 0.21 and genotypic variance of 0.50 ± 0.69, while the other methods allowed allele fixation in the fifth or sixth generation (Table 1). With these results, there was a reduction of one (h2 = 0.20) or two generations (h2 = 0.40) in the fixation process of favorable alleles. The reduction of generations in a plant breeding program is decisive in view of the savings in terms of time, efforts and costs. In coffee for example, one selection generation lasts on average six years [47], i.e., by this technique, the breeding process can be considerably reduced, thus reducing the time required to develop genetically superior genotypes and, consequently, save costs.

Although the BLASSO and RR-BLUP methods had the highest predictive capacity in F2 in all evaluated scenarios, the results in relation to favorable allele fixation were equal to or lower than by RQR (0.50).

Generally speaking, the breeding process by RQR can be equal to or faster than by the standard GS methodologies. Although to date little explored in breeding, the RQR method has been shown to be very promising for genomic selection and association studies, in both plant and animal breeding [2730, 48]. In this study, RQR (τ = 0.50) fixed the favorable alleles in the fourth generation (F4) in the scenario with a heritability of 0.4 and selection intensity of 10%. The efficiency of RQR, in contrast with the traditional methods, based on conditional means, can be explained by the possibility of fitting models at different levels (quantiles) of the phenotype distribution, and consequently making a more thorough study of the phenomenon of interest possible [24]. Specifically, for highly skewed phenotypic distributions, the results of quantile models that allow a quantile fitting far from the mean are interesting. In an evaluation of quantiles 0.25 and 0.75 for right- and left-skewed distributions, respectively, Nascimento et al. [27] and Barroso et al. [28] observed that these models have a higher predictive capacity and lower mean square errors than the traditional GS methodologies, respectively.

In this study, since the data were generated assuming a normal (symmetrical) distribution, better results were expected from mean- or median-based methodologies. However, the best results were based on medians, which may be related to the rarity of occurrence, both in simulated and in practical processes, of a perfectly symmetrical distribution. Thus, a median-based methodology such as RQR (τ = 0.50) can better describe the functional relationship between the dependent and explanatory variables and is robust to outliers, in cases of symmetry deviations in the phenotype distribution [25, 38].

Conclusions

The use of Regularized Quantile Regression models proved effective in genomic selection studies, for allowing an accelerated development of superior genotypes in relation to traditional GS methodologies. Among the simulated conditions, the configuration of Regularized Quantile Regression (τ = 0.50), at a heritability of 0.40 and selection intensity of 10% was the most efficient, since favorable alleles could be fixed more quickly, as early as in the fourth generation.

References

Organização das Nações Unidas [homepage na Internet]. População mundial deve chegar a 9.7 bilhões de pessoas em 2050, diz relatório da ONU [acesso em 25 jan 2020]. Disponível em: https://nacoesunidas.org/populacao-mundial-deve-chegar-a-97-bilhoes-de-pessoas-em-2050-diz-relatorio-da-onu/.

HCJGodfray, JRBeddington, IRCrute, LHaddad, DLawrence, JFMuir, et al Food Security: The Challenge of Feeding 9 Billion People. Science. 2010; 327(5967): 812818. 10.1126/science.1185383

MCHunter, RGSmith, MESchipanski, LWAtwood, DAMortensen. Agriculture in 2050: Recalibrating targets for sustainable intensification. Bioscience. 2017; 67(4): 386391.

JNCobb, PSBiswas, JDPlatten. Back to the future: revisiting MAS as a tool for modern plant breeding. Theor Appl Genet. 2019; 132 (3): 647667. 10.1007/s00122-018-3266-4

OGuerreiro Filho, MAPRamalho, VTAndrade. Alcides Carvalho and the selection of catua cultivar: Interpreting the past and drawing lessons for the future. Crop Breed Appl Biotechnol. 2018; 18(4): 460466.

DAkdemir, WBeavis, RFritsche-Neto, AKSingh, JIsidro-Sánchez. Multi-objective optimized genomic breeding strategies for sustainable food improvement. Heredity (Edinb). 2019; 122(5):672683. 10.1038/s41437-018-0147-1

PFByrne, GMVolk, CGardner, MAGore, PWSimon, SSmith. Sustaining the future of plant breeding: The critical role of the USDA-ARS national plant germplasm system. Crop Sci. 2018; 58(2): 451468.

PBarbosa I de, WGda Costa, MNascimento, CDCruz, ACBde Oliveira. Recommendation of Coffea arabica genotypes by factor analysis. Euphytica. 2019; 215(10): 110.

JLado, AIMoltini, VEsteban, GRodríguez, PArcia, MRodríguez, et al Integration of sensory analysis into plant breeding: review. Agrociencia Uruguay. 2019; 23(01): 115.

10 

LMarie, CAbdallah, CCampa, PCourtel, MBordeaux, LNavarini, et al G × E interactions on yield and quality in Coffea arabica: new F1 hybrids outperform American cultivars. Euphytica. 2020; 216(5): 117.

11 

TASetotaw, ETCaixeta, EMZambolim, TVSousa, AAPereira, ACBaião, et al Genome Introgression of Híbrido de Timor and Its Potential to Develop High Cup Quality C. arabica Cultivars. J Agric Sci. 2020; 12(4): 6476.

12 

AValencia, AYMorales, PMoncada, HAlfonso, JCHerrera. Introgression of the SH3 gene resistant to rust (Hemileia vastatrix) in improved lines of CASTILLO ® variety (Coffea arabica L.). J Plant Breed Crop Sci. 2017; 9: 130138.

13 

ERAlkimim, ETCaixeta, TVSousa, AAPereira, ACBde Oliveira, LZambolim, et al Marker-assisted selection provides Arabica coffee with genes from other Coffea species targeting on multiple resistance to rust and coffee berry disease. Mol Breed. 2017; 37(1): 6.

14 

TVSousa, ETCaixeta, ERAlkimim, ACBOliveira, AAPereira, NSSakiyama, et al Early selection enabled by the implementation of genomic selection in Coffea arabica breeding. Front Plant Sci. 2019; 9: 112. 10.3389/fpls.2018.01934

15 

THEMeuwissen, BJHayes, MEGoddard. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001; 157(4): 18191829.

16 

MDV de M DeResende, PPSLopes, RLda Silva, IEPires, RL DaSilva, IEPires. Seleção genômica ampla (GWS) e maximização da eficiência do melhoramento genético. Pesqui Florest Bras. 2008; (56): 6377.

17 

JCrossa, PPérez, Gde los Campos, GMahuku, SDreisigacker, CMagorokosho. Genomic selection and prediction in plant breeding. J Crop Improv. 2011; 25(3): 239261.

18 

JCrossa, PPérez-Rodríguez, JCuevas, OMontesinos-López, DJarquín, Gde los Campos, et al Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 2017; 22(11): 961975. 10.1016/j.tplants.2017.08.011

19 

ELHeffner, J-LJannink, MESorrells. Genomic Selection Accuracy using Multifamily Prediction Models in a Wheat Breeding Program. The Plant Genome Journal. 2011; 4(1): 6575.

20 

Romero JV. Seleção genômica usando genotipagem de baixa saturação no melhoramento genético do cafeeiro. Viçosa. Tese [Doutorado em Genética e Melhoramento]. Universidade Federal de Viçosa; 2017.

21 

JSpindel, HBegum, DAkdemir, PVirk, BCollard, Redona, et al Genomic selection and association mapping in rice (Oryza sativa): effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines. PLoS Genet. 2015; 11(2): 125.

22 

FGAsoro, MANewell, WDBeavis, MPScott, J-LJannink. Accuracy and Training Population Design for Genomic Selection on Quantitative Traits in Elite North American Oats. Plant Genome. 2011; 4(2): 132144.

23 

ZShengqiang, JCMDekkers, RLFernando, JLJannink. Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: A barley case study. Genetics. 2009; 182(1): 355364. 10.1534/genetics.108.098277

24 

RKoenker, GBassett. Regression Quantiles. Econometrica. 1978; 46(1): 3335

25 

LHao, DQNaiman. Quantile regression. New Delhi: Sage publications; 2007.

26 

YLi, JZhu. L1-norm quantile regression. J Comput Graph Stat. 2008; 17(1): 163185.

27 

MNascimento, FF eSilva., MDVde Resende, CDCruz, ACCNascimento, JMSViana, et al Regularized quantile regression applied to genome-enabled prediction of quantitative traits. Genet Mol Res. 2017; 16(1): 112. 10.4238/gmr16019538

28 

LMABarroso, MNascimento, ACCNascimento, FFSilva, NVLSerão, CDCruz, et al Regularized quantile regression for SNP marker estimation of pig growth curves. J Anim Sci Biotechnol. 2017;8(1):19. 10.1186/s40104-017-0187-z

29 

MNascimento, ACCNascimento, FF e.Silva, LDBarili, NMDo Vale, JECarneiro, et al Quantile regression for genome-wide association study of flowering time-related traits in common bean. PLoS One. 2018; 3(1): 114. 10.1371/journal.pone.0190303

30 

PMdos Santos, ACCNascimento, MNascimento, FFonseca e Silva, CFAzevedo, RRMota, et al Use of regularized quantile regression to predict the genetic merit of pigs for asymmetric carcass traits. Pesqui Agropecu Bras. 2018; 53(9): 10111017.

31 

CDCruz. GENES—Software para análise de dados em estatística experimental e em genética quantitativa. Acta Sci—Agron. 2013; 35(3): 271276.

32 

Lda Costa e Silva, CDCruz, MAMoreira, EGde Barros. Simulation of population size and genome saturation level for genetic mapping of recombinant inbred lines (RILs). Genet Mol Biol. 2007;30(4):11011018.

33 

CSant’ Anna I de, RADiniz Cabral Ferreira, MNascimento, GNSilva, VQCarneiro, CDCruz, et al Multigenerational prediction of genetic values using genome-enabled prediction. PLoS One. 2019; 14(1): 114.

34 

AFerreira, MFda Silva, Lda Costa e Silva, CDCruz. Estimating the effects of population size and type on the accuracy of genetic maps. Genet Mol Biol. 2006;29(1):187192.

35 

R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria 2020; URL http://www.R-project.org/.

36 

GDe Los Campos, HNaya, DGianola, JCrossa, ALegarra, EManfredi, et al Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics. 2009; 182(1): 375385. 10.1534/genetics.109.101501

37 

EN daSilva, JúniorSabino da Silva Porto. Istema financeiro e crescimento econômico: uma aplicação de regressão quantílica. Econ Apl. 2006; 10(3):425442.

38 

RKoenker. Regressão quantílica. Nova York: Cambridge University Press; 2005.

39 

Koenker R. quantreg: Quantile regression. R package version 4.91. http://CRAN.Rproject.org/package=quantreg, 2015.

40 

JBEndelman. Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP. The Plant Genome. 2011; 4(3): 250255.

41 

CAMPOS e RODRIGUEZ, Bayesian Generalized Linear Regression. URL: http://cran.r-project.org/web/packages/BGLR/index.html, 2015.

42 

AERaftery, SMLewis. Comment: One long run with diagnostics: Implementation strategies for Markov chain Monte Carlo. Stat Sci. 1992; 7(4): 493497.

43 

PHeidelberger, PDWelch. Simulation Run Length Control in the Presence of an Initial Transient. Oper Res. 1983.

44 

ALegarra, CRobert-Granié, EManfredi, JMElsen. Performance of genomic selection in mice. Genetics. 2008; 180(1): 611618. 10.1534/genetics.108.088575

45 

Falconer, D.S. and Mackay, T.F.C., 1996. Introduction to quantitative genetics, Longman. Essex, England.

46 

HBGoddard ME. Genomic selection. Genetics in the Third Millennium. 2015; 12(4): 37943805.

47 

ASda Conceição, LCFazuoli, MTBraghini. Avaliação e seleção de progênies F3 de cafeeiros de porte baixo com o gene SH3 de resistência a Hemileia vastatrix berk. et br. Bragantia. 2005; 64(4): 547559.

48 

MNascimento, ACCNascimento, JCMDekkers, NVLSerão. Using quantile regression methodology to evaluate changes in the shape of growth curves in pigs selected for increased feed efficiency based on residual feed intake. Animal. 2019; 13(5): 10091019. 10.1017/S1751731118002616