PLoS ONE
Home Keep Garfagnina alive. An integrated study on patterns of homozygosity, genomic inbreeding, admixture and breed traceability of the Italian Garfagnina goat breed
<i>Keep Garfagnina alive</i>. An integrated study on patterns of homozygosity, genomic inbreeding, admixture and breed traceability of the Italian Garfagnina goat breed
Keep Garfagnina alive. An integrated study on patterns of homozygosity, genomic inbreeding, admixture and breed traceability of the Italian Garfagnina goat breed

Competing Interests: The authors have declared that no competing interests exist.

Article Type: research-article Article History
Abstract

The objective of this study was to investigate the genetic diversity of the Garfagnina (GRF) goat, a breed that currently risks extinction. For this purpose, 48 goats were genotyped with the Illumina CaprineSNP50 BeadChip and analyzed together with 214 goats belonging to 9 other Italian breeds (~25 goats/breed), whose genotypes were available from the AdaptMap project [Argentata (ARG), Bionda dell’Adamello (BIO), Ciociara Grigia (CCG), Di Teramo (DIT), Garganica (GAR), Girgentana (GGT), Orobica (ORO), Valdostana (VAL) and Valpassiria (VSS)]. Comparative analyses were conducted on i) runs of homozygosity (ROH), ii) admixture ancestries and iii) the accuracy of breed traceability via discriminant analysis on principal components (DAPC) based on cross-validation. ROH analyses was used to assess the genetic diversity of GRF, while admixture and DAPC to evaluate its relationship to the other breeds. For GRF, common ROH (more than 45% in GRF samples) was detected on CHR 12 at, roughly 50.25–50.94Mbp (ARS1 assembly), which spans the CENPJ (centromere protein) and IL17D (interleukin 17D) genes. The same area of common ROH was also present in DIT, while a broader region (~49.25–51.94Mbp) was shared among the ARG, CCG, and GGT. Admixture analysis revealed a small region of common ancestry from GRF shared by BIO, VSS, ARG and CCG breeds. The DAPC model yielded 100% assignment success for GRF. Overall, our results support the identification of GRF as a distinct native Italian goat breed. This work can contribute to planning conservation programmes to save GRF from extinction and will improve the understanding of the socio-agro-economic factors related with the farming of GRF.

Dadousis,Cecchi,Ablondi,Fabbri,Stella,Bozzi,and Gallego Romero: Keep Garfagnina alive. An integrated study on patterns of homozygosity, genomic inbreeding, admixture and breed traceability of the Italian Garfagnina goat breed

Introduction

Local breeds, which usually consist of a small number of animals, are increasingly recognized by E.U. action plans as a key feature of unique rural landscapes and agroecosystems. For example, local breeds i) are rustic and adapted to their local environment, ii) represent a significant economic resource and have been used for many years in manufacture of niche products, especially in mountainous regions, iii) represent an important and usually unique genetic resource that could be essential to address the future changes in climate or disease outbreaks [1], and iv) play an important role for the preservation of human cultural inheritance. For small ruminants in particular, which can adapt to marginal and difficult production environments, the provision of ecosystem services is of major importance. In mountainous regions of the Mediterranean basin, grazing can be used as a measure of protection against avalanches in winter and fire outbreaks during the summer period. Controlled grazing is a cost-effective, non-polluting, nontoxic, nearly carbon-neutral and effective technique to prevent fire propagation. In this context, goat grazing has been proposed as an eco-friendly solution for wildfire prevention [2]. Moreover, climate change has been considered as an additional challenge to the sustainability of livestock systems (e.g., health and productivity) and local breeds may help overcome this challenge by their ability to adapt to heterogeneity in the regions they are reared. Despite these favourable traits, the relatively low productivity of local unimproved breeds often contributes to low farmer’s income and endangers their existence.

The widespread presence and adaptation of goats (Capra hircus) to a variety of agro-ecological conditions worldwide is well documented [3]. The demographic history of the domesticated goat relates closely to that of human civilization. Sheep, cattle, pigs and goats were among the earliest domesticated ungulates [4, 5]. Based on the Domestic Animal Diversity Information System (DAD-IS) data, 21 goat breeds have already gone extinct (18 of which were reared in the regions of Europe and the Caucasus) and 41 are in a critical situation (all of which are from Europe and the Caucasus) [6]. In Italy, 3 goat breeds are extinct and 12 more are considered to be critically endangered.

The Garfagnina (GRF) is an Italian goat breed at risk of extinction. The latest assessment (October 2019) reported 1,468 animals from 29 different farms (Associazione Regionale Allevatori Toscana-ARAT, personal communication). GRF is mostly reared for dairy production and is found in in central Italy, specifically in the hills and mountains of the northwest Tuscan Apennine area. The origins of this population are not clear. However, it is very likely that this breed was a result of crossings between native goats from the Alps and from the Tuscan-Emilian Apennines. Moreover, local breeders report that the population has been reared for generations for its milk and meat production [7]. This breed has also been linked to the production of typical products, such as the “Controneria” meat kid and the “Caprino delle Apuane” cheese. As it has been reported by Martini et al. [7], GRF goats are usually milked by hand.

To support the management and conservation of this breed, and to provide support to the farmers and to the general region where the breed is reared, a few studies have investigated i) various production characteristics [7, 8], ii) its adaptive profile (via physiological, haematological, biochemical and hormonal parameters) [9] and iii) its resistance to diseases [10, 11]. Martini et al. [7] investigated various zootechnical characteristics of GRF breed in comparison to other Italian and foreign goat breeds. Based on their results, the authors suggested the development of a breeding scheme based on pure bred animals. Nevertheless, no whole genome analysis has been conducted yet to investigate the genomic background of GRF and its ancestry. Genomic information can quantify the genetic diversity among breeds and identify common ancestry and thus inform the planning of conservation programmes.

The GoatSNP50 BeadChip (http://www.goatgenome.org; [12]) released in 2013, together with the recent and publicly available results of the goat AdaptMap project [3], offered us the opportunity to investigate the genetic diversity of GRF and to carry out comparative analyses with other Italian native goat breeds.

The objectives of the present study were to investigate the genetic diversity of GRF by using single nucleotide polymorphisms (SNP) data and to assess its genetic relationship to the other native Italian goat breeds included in the AdaptMap dataset. A unified procedure on admixture, discriminant analysis and runs of homozygosity (ROH) was applied. The latter was used to assess the level of genetic diversity of GRF, while admixture and discriminant analysis to evaluate the relationship of GRF to other breeds. In brief, admixture quantifies proportions of ancestries per individual from pre-defined groups, while discriminant analysis classifies a given sample to one of the groups analyzed. Runs of homozygosity have been widely used in livestock to assess the degree of genomic inbreeding (FROH), i.e. estimates of inbreeding coefficients based on molecular data [1316]. They consist of long consecutive homozygous DNA segments which are distributed along the genome. Overall, our results suggest a distinct genetic pool of GRF with low levels of genomic inbreeding.

Materials and methods

Ethics statement

Blood sampling for GRF goats was conducted by veterinarians and no invasive procedures were applied. Thus, in accordance to the 2010/63/EU guide and the adoption of the Law D.L. 04/03/2014, n.26 by the Italian Government, an ethical approval was not required for our study.

Genomic data

Five millilitres of blood were collected from each of 48 female GRF goats (between 2 and 9 years old; all registered in the herdbook) from a pool of 269 females in the Garfagnina district (Media Valle del Serchio, Lucca, Central Italy). Animals were genotyped with the Illumina GoatSNP50 BeadChip (Illumina Inc., San Diego, CA) containing 53,347 SNPs [17]. Genomic data of nine Italian autochthonous goat breeds, namely Argentata dell’Etna (ARG), Bionda dell’Adamello (BIO), Ciociara Grigia (CCG), Di Teramo (DIT), Garganica (GAR), Girgentana (GGT), Orobica (ORO), Valdostana (VAL) and Valpassiria (VSS) were downloaded from the online repository (https://datadryad.org/stash/dataset/doi:10.5061/dryad.v8g21pt) of the ADAPTmap project [3, 18]. The breeds were selected based on breed abbreviation in the PLINK sample information file (.fam) downloaded from the repository and breed description (code and country) reported in Table 1 of [18]. The two datasets were merged and quality control was conducted in PLINK v1.9 [19, 20] based on the following criteria: i) only autosomes were kept, ii) call rate per SNP >95% and iii) missing values per sample <10%. After editing, 260 samples and 48,716 SNP were retained (Table 1). The distribution of SNP per chromosome (CHR) is presented in S1 Fig.

Table 1
Names of breeds, breed code and number of animals analyzed before (pre-QC) and after (post-QC) quality control per breed.
Breed nameBreed codeNo. pre-QCNo. post-QC
Argentata dell’EtnaARG2524
Bionda dell’AdamelloBIO2424
Ciociara GrigiaCCG1919
Di TeramoDIT2424
GarganicaGAR2020
GirgentanaGGT3030
GarfagninaGRF4848
OrobicaORO2423
ValdostanaVAL2424
ValpassiriaVSS2424

Runs of homozygosity

Analysis of ROH was conducted in R (v. 3.5.0) using the package detectRUNS v. 0.9.5 [21, 22]. The consecutive method [23] that runs under the main function consecutiveRUNS.run was adopted. The required parameters were set to: i) minimum number of 15 SNPs/ROH, ii) 1 Mbp minimum length of ROH and iii) allow one heterozygous SNP within an ROH (to account for genotyping errors). In addition, ROH lengths were split into five classes (0–2, 2–4, 4–8, 8–16 and >16 Mbp). For each of the class and breed, descriptive statistics of ROH per breed, per chromosome, per SNP and per length class were estimated. Principal component analysis (PCA) was used to identify (dis)similarities among breeds, relative to the average number of ROH identified per chromosome. PCA was applied via a singular value decomposition (prcomp function in R [21]). In addition, genomic inbreeding (FROH) was calculated per breed (defined as the length of ROH over the total autosomal length per goat and summed over all goats per breed). Regions with a high frequency of ROH (≥45%) were detected and genes located within ± 1 Mbp were annotated by using the Capra hircus ARS1 (http://www.ensembl.org/index.html) and the variant effect predictor (https://www.ensembl.org/Tools/VEP) Ensembl databases.

Population stratification and ancestry

PCA and admixture analysis were used to infer the presence of distinct populations based on the SNP data. PCA was conducted on the matrix of genotypes. The proportion of mixed ancestry in the breeds was assessed by the ADMIXTURE 1.22 software [24, 25]. The number of ancestries (K) to be retained in the admixture analysis (K = 2 to 10) was evaluated via a 10-fold cross-validation (CV). The final selection on the number of ancestries was done by inspecting the CV error.

Discriminant analysis of principal components

Discriminant analysis was applied to assess the breed traceability of the GRF goats using SNP data. To achieve this, the methodology of discriminant analysis of principal components (DAPC) [26] implemented in the adegenet R package [21, 27, 28] was adopted. In brief, DAPC is a 2-step approach: firstly, a PCA on the matrix of the genotypes is conducted and then, a small number of selected PCs (instead of the original SNP genotypes) is used as an input for the linear discriminant analysis (LDA). The selection on the optimal number of PCs to be further used in the LDA is done via cross-validation (CV) where the data is split in training and validation sets. The following criteria were implemented for selection of PCs: i) 10-fold CV with 30 repetitions, ii) a maximum number of 259 PCs were tested, and iii) the number of PCs to be retained was based on number of PCs associated with the highest mean success. Three different scenarios of DAPC were applied as described below:

    Scenario 1 (supervised learning). The full dataset was analyzed simultaneously. In this scenario, all available data were used for model training and the discriminant functions were extracted based on all animals. This is not, however, a real case scenario, since the discriminant functions were developed utilizing the entire data set. The objective for a practical application is to identify an external individual membership to a group (i.e., external validation). Hence, two more scenarios were developed adopting a CV scheme also for the discriminant function.

    Scenario 2 (semi-supervised learning). Assessment of correct assignment of GRF goats was done via a semi-supervised CV (CVSS). Five GRF goats were sampled representing the testing set of the DAPC analysis. The training set (TRN) was constituted by the remaining of 43 GRF goats plus all the goats from the other breeds. The five GRF samples were classified in one of the 10 breeds presented in the TRN set via the function predict.dapc. The procedure was repeated 10 times and results were averaged over the 10 repetitions.

    Scenario 3 (unsupervised learning). Assignment of GRF goats in a breed but without the presence of any GRF goats in the TRN set (unsupervised CV; CVUS). This scenario could also be viewed as a method to assess the genomic similarity of the GRF with the rest of the breeds (i.e., type of clustering). The approach was similar to Scenario 2 other than the testing population consisted of the entire GRF set and GRF samples had to be classified in one or more of the other 9 breeds. To increase the number of the tested samples in each round of the CV, 80% of GRF breed was sampled. Moreover, to test for the effect of the size in model training for the assignment of GRF, different proportions of TRN set were sampled (20, 30, …, 90%) 10 times each, and results were averaged over 10 repetitions. In other words, the size of the TRN set varied between 42 to 191 goats. All nine breeds were present in each scenario and all GRF goats were used in this scenario.

The terms (semi/un)-supervised should not be confused with the terminology used in machine learning. The introduction of these terms has been used in the manuscript to distinguish among the three approaches that were used in the DAPC analysis, and, although they are, up to a point, analogous with the same terms used in machine learning they are not identical.

Results

Runs of homozygosity

Summary results of the detected ROH regions as either total counts or averaged based on the number of samples per breed are presented in Table 2 and Fig 1a, respectively. A relatively high number of ROH was detected for GRF (n = 2,450), which was the third largest among the breeds in the study. The greatest number of ROH among all breeds was for GGT (n = 2,762) followed by ORO (= 2,693), while the smallest was found for ARG (n = 465). For GRF, the number of ROH per Capra hircus chromosome (CHI) varied from 35 (CHI25) to 158 (CHI1). The maximum length of ROH per chromosome was found on CHI1 (568,887,711bp) and the minimum on CHI23 (113,323,345bp). In general, the total length of ROH per CHR followed the same pattern of the total ROH number per CHR (Fig 1b).

Summary results of runs of homozygosity (ROH): (a) average number detected per breed, (b) number and length of ROH per chromosome (CHR) in Garfagnina breed, (c) frequency distribution of the number of ROH in the breeds analyzed per length class and (d) frequency distribution of the number of ROH in different length classes per breed.
Fig 1

Summary results of runs of homozygosity (ROH): (a) average number detected per breed, (b) number and length of ROH per chromosome (CHR) in Garfagnina breed, (c) frequency distribution of the number of ROH in the breeds analyzed per length class and (d) frequency distribution of the number of ROH in different length classes per breed.

ARG: Argentata dell’Etna; BIO: Bionda dell’Adamello; CCG: Ciociara Grigia; DIT: Di Teramo; GAR: Garganica; GGT: Girgentana; GRF: Garfagnina; ORO: Orobica; VAL: Valdostana and VSS: Valpassiria; In (a) horizontal bars within each boxplot represent the median, and red rhombus the mean.

Table 2
Total number of runs of homozygosity (ROH) detected per breed.
BreedNo. ROH
ARG465
BIO814
CCG496
DIT813
GAR730
GGT2,762
GRF2,450
ORO2,693
VAL1,613
VSS768

For all breeds analyzed except CCG and DIT, the number of ROH relative to the length on the genome decreased with an increase in length (Fig 1c and 1d). In DIT samples, ROH were more frequent in length classes of 4–8 and 8–16 Mbp compared to 2-4Mbp. The percentage of ROH with a length >16Mbp per breed varied between 0.89% to 13.53% for ORO and DIT, respectively. For small ROH length (<2Mbp) the proportion over the total number detected reached ~78% in ARG, while only 35% of ROH was observed for DIT. The pattern of ROH length class was similar among GRF, GGT and ORO with ~50% of ROH having a length < 2Mbp, ~25% between 2-4Mbp, ~13% between 4-8Mbp, ~5% and ~2% between 8-16Mbp and > 16Mbp, respectively (S1 Table).

For GRF, common ROH (more than 45% in the GRF samples analyzed) were detected on CHI12, between ~34.6–35.3 Mbp (Fig 2, Table 3). In total, 14 SNP were contained in this region. The same area was also present in DIT, while a broader region (~33,9–36.5 Mbp) was shared among ARG, CCG, and GGT. To identify similarities among breeds relative to the number of ROH per chromosome, a PCA was conducted on the average number of ROH identified per chromosome. In addition, a heatmap on the actual number of ROH per chromosome was produced. Both approaches placed GRF closer to ORO and GGT with respect to the rest of the breeds (S2 Fig).

Number of times (%) each SNP was detected inside a run of homozygosity (ROH) in Garfagnina (GRF) goats.
Fig 2

Number of times (%) each SNP was detected inside a run of homozygosity (ROH) in Garfagnina (GRF) goats.

Table 3
Most common (45% in each breed) runs of homozygosity (ROH) detected per breed on Capra hircus chromosome 12, with the start-end regions and number of SNP per ROH.
BreedStart-SNPEnd-SNPNo. SNPStart-region (bp)End-region (bp)
ARGsnp30406-scaffold335-807385/ rs268262652snp30399-scaffold335-501928/ rs268262645634,949,98035,255,437
CCGsnp30421-scaffold335-1558038/ rs268262666snp30391-scaffold335-171300/ rs2682626372734,199,32735,586,065
DITsnp30413-scaffold335-1113038/ rs268262659snp30397-scaffold335-418126/ rs2682626431434,644,32735,339,239
GGTsnp30428-scaffold335-1839052/ rs268262673snp11142-scaffold140-760668/ rs2682439835333,918,31336,518,132
GGTsnp17454-scaffold1805-39262/ rs268250096snp55342-scaffold855-361968/ rs2682869362640,620,36442,257,561
GRFsnp30413-scaffold335-1113038/ rs268262659snp30397-scaffold335-418126/ rs2682626431434,644,32735,339,239
VALsnp50169-scaffold717-4207960/ rs268281880snp3193-scaffold1095-4352995/ rs2682362338724,592,90128,744,348

Genomic inbreeding coefficients (FROH) were found to be intermediate for GRF compared to the rest of the breeds analyzed, with a mean value of 0.069 (Fig 3a, S2 Table). The highest values were observed for GGT (0.143) and ORO (0.137). Moreover, the distribution of FROH calculated per CHR was similar, with some high values (>0.5) observed for CHI7, 9, 16, 22 and 25 (Fig 3b).

Summary of the genomic inbreeding coefficients (a) per breed and (b) of the Garfagnina breed per chromosome (Chr).
Fig 3

Summary of the genomic inbreeding coefficients (a) per breed and (b) of the Garfagnina breed per chromosome (Chr).

ARG: Argentata dell’Etna; BIO: Bionda dell’Adamello; CCG: Ciociara Grigia; DIT: Di Teramo; GAR: Garganica; GGT: Girgentana; GRF: Garfagnina; ORO: Orobica; VAL: Valdostana and VSS: Valpassiria.

Population stratification and ancestry

As a first step, a PCA was conducted on the complete data to visualize the general structure and relationships among breeds. The first axis distinguished GRF goats from ARG, CCG, DIT, GAR and GGT, while the second axis further separated GRF from the rest of the breeds (Fig 4). An inspection of all the pairwise comparison between the first 10 axes (PCs) was carried out. In general, GRF was clearly separated from the rest of the breeds. Interestingly, a small GRF subgroup consisting of six goats was observed on the 9th axis (S3 Fig).

Scatterplot of the first two principal components1.
Fig 4

Scatterplot of the first two principal components1.

1Singular value decomposition was applied on the matrix of genotypes. ARG: Argentata dell’Etna; BIO: Bionda dell’Adamello; CCG: Ciociara Grigia; DIT: Di Teramo; GAR: Garganica; GGT: Girgentana; GRF: Garfagnina; ORO: Orobica; VAL: Valdostana and VSS: Valpassiria.

An admixture analysis was conducted to complement the PCA results. A varying number of group ancestries was investigated, ranging from K = 2 up to 10. The model with the minimum CV error was the one with eight group ancestries (S4 Fig). In general, the admixture results were in agreement with PCA, depicting the uniqueness of the GRF gene pool (Fig 5a and 5b). At K = 8 (number of ancestries selected after CV) GRF had almost a breed-specific ancestry, with a small percentage of the GRF goats sharing co ancestry with i) BIO and VSS and ii) ARG and CCG and to a small extent with ORO, VAL, DIT and GGT. It should be noted that apart from GRF, group-specific ancestries, at least to a great extent, existed almost for all breeds except ARG, CCG, GAR and VSS.

Admixture analysis with K = 8 coancestry groups (a) per individual and (b) averaged per breed.
Fig 5

Admixture analysis with K = 8 coancestry groups (a) per individual and (b) averaged per breed.

ARG: Argentata dell’Etna; BIO: Bionda dell’Adamello; CCG: Ciociara Grigia; DIT: Di Teramo; GAR: Garganica; GGT: Girgentana; GRF: Garfagnina; ORO: Orobica; VAL: Valdostana and VSS: Valpassiria.

Discriminant analysis of principal components

In the first scenario of DAPC, all data were used. The first 40 PCs, explaining ~35.75% of the total variability in the SNP data (S5 Fig), were used in the final DAPC model, resulting in an assignment success rate of 100% for all GRF goats to their breed of origin. The pattern of the genetic diversity based on DAPC is presented in Fig 6, where a clear genetic distance of GRF from the rest of the breeds can be observed.

Scatterplot of the first two discriminant components of the DAPC.
Fig 6

Scatterplot of the first two discriminant components of the DAPC.

Breeds are presented by different colors and symbols. ARG: Argentata dell’Etna; BIO: Bionda dell’Adamello; CCG: Ciociara Grigia; DIT: Di Teramo; GAR: Garganica; GGT: Girgentana; GRF: Garfagnina; ORO: Orobica; VAL: Valdostana and VSS: Valpassiria.

An external validation scenario, which better reflects a practical application of the discriminant model, was further assessed. In the first analysis (CVSS scenario), GRF had representative animals in the TRN set where the DAPC model was developed. Also, in this case, a 100% correct classification of GRF goats was observed (S3 Table). Interestingly, the classification of GRF was invariant to the number of PCs selected (ranged between 10 to 70) in DAPC (S4 Table). In the second scenario (CVUS) there were no representative GRF samples in training the model of DAPC. In that case the majority of animals were classified as CCG while few were assigned to DIT in some of the CV replicates (S5 Table). Similar results were obtained with an increased size of TRN set. In the majority of the scenarios, GRF goats were classified either as CCG or DIT, where there were a few cases in which GRF goats were also assigned as VSS, GAR or BIO, but in none of the cases as ORO, GGT or VAL (Fig 7).

Percentage of assignment of the GRF goats in the CVUS scenario.
Fig 7

Percentage of assignment of the GRF goats in the CVUS scenario.

Results were averaged over 10 replicates (CV) in each data subset (from 20 to 90%). CVUS: unsupervised CV, where GRF breed had no representative goats in model training, hence GRF goats had to be classified in one of the other 8 breeds; ARG: Argentata dell’Etna; BIO: Bionda dell’Adamello; CCG: Ciociara Grigia; DIT: Di Teramo; GAR: Garganica; GGT: Girgentana; GRF: Garfagnina; ORO: Orobica; VAL: Valdostana and VSS: Valpassiria.

Discussion

There is increasing scientific evidence that genetic diversity of livestock populations is decreasing worldwide [29, 30]. While some domesticated species, such as chicken, sheep and cattle are uniformly distributed across the world, the vast majority of goats (~560 over a total of ~800 millions) are located in Asia, Near and Middle East, while Europe contributes approximately 5% [30], a quarter of which are reared in Greece (estimated data) [31]. On the total European goat population Italy counts of ~0.5% with ~1 million goats (official data) [31]. The alarming scientific evidence of loss of genetic diversity has led to the development of the Global Plan of Action for Animal Genetic Resources by member countries of the Food and Agriculture Organization (FAO) [32]. Moreover, the concentration of a large proportion of a population in a specific geographical region exposes the breed to the threat of a disease outbreak. This especially holds true for breeds consisting of small populations [30]. Hence, knowledge on the distribution of a breed within a country is critical for policy measures to be taken.

An important question for action measures to be taken to alleviate the problem of a breed extinction, is “what is a breed?”. As reported by Woolliams et al. [33], there is still no clear definition of what a breed is. An interesting definition was presented by Hammond (2007) [33]: “A breed is a breed if enough people say it is”. FAO further outlines the cultural component of a breed. In addition, local livestock populations, generally consisting of small populations and restricted to specific geographical regions, often face difficulties to obtain formal recognition of a breed, since local farmers may not be organized in a breeding association [34]. This difficulty poses further restrictions in policies and funding and, in turn, increases the risk of extinction. Three main pillars have been recently reported for such livestock populations [34]: “Discover, secure and sustain”. For this purpose, it is essential to conduct studies on quantification of genetic resources of the population in concert with phenotypic and genetic analysis.

GRF is an indigenous goat breed facing a high risk of extinction, with a total number of registered animals less than 1,500. Moreover, its farming is restricted to hills and mountains of the north-western Tuscan Apennine area (central Italy). Given the risk status of the breed, scientists have focused on a better characterization of the GRF population. Various zootechnical parameters of the breed, such as milk total and fatty acid composition, milk coagulation properties and casein genotypes have been previously investigated [7], and authors suggested the development of a purebred breeding scheme. However, no study to date has investigated the genetic diversity of GRF.

Runs of Ηomozygosity

As it has been highlighted by Bertolini et al. [35], crossbred goat populations tend to have smaller total ROH length and number compared to purebred populations. The same pattern has been observed when comparing unselected vs. selected populations undergoing breeding programs. Nevertheless, as it has been pointed out by different colleagues [35, 36] the 50k chip is not considered efficient for the accurate detection of small ROH regions, resulting in underestimation of small ROH hits. Despite this, our analysis was based on purebred goats, and thereby a smaller bias is expected. Results of ROH and FROH for the 9 Italian breeds of the AdaptMap project were in agreement with results previously reported [35].

The general pattern of ROH (i.e. in terms of total—and by chromosome—number and length of ROH) for GRF was similar to GGT and ORO. Moreover, common ROH were found for GRF (more than 45% in GRF samples analyzed) on CHR 12 at, roughly, between 34.6–35.3 Mbp (Table 3). The same region was also detected in DIT breed, while the broader region (~33,9–36.5 Mbp) was shared among ARG, CCG, and GGT. Further, a search of genes presented in the top ROH region identified for GRF (~50.25–50.94Mbp, updated on the ARS1 assembly) and 1Mbp up-downstream (~49.25–51.94Mbp, updated on the ARS1 assembly) was carried out. Interestingly, the region ~49-52Mbp has been previously reported in goats [35, 37]. It is worth noting that within this region lay the genes of the general gap junction protein family GJA3 (gap junction protein alpha 3; ~50.642–50.644Mbp), GJB2 (gap junction protein beta 2; ~50.675–50.676Mbp), GJB6 (gap junction protein beta 6; ~50.694–50.695Mbp). The GJB2 and GJB6 are associated with the nervous system, hearing functions and ectodermal processes [38, 39]. Moreover, the SAP18 (Sin3A associated protein 18; ~51.136–51.141Mbp) that is related to gonad development [40], was also mapped in this region.

The narrow region of the detected top ROH runs for GRF on CHI12 spanned the CENPJ (centromere protein J; 50.23–50.27Mb) and the IL17D (interleukin 17D; 50.91–50.93 Mb). More precisely, the snp30397-scaffold335-418126 was found to be an intron of the CENPJ gene, while the snp30413-scaffold335-1113038 was downstream from IL17D. There is a series of studies that have linked the CENPJ with primary microcephaly in humans and mice [4144]. Moreover, CENPJ has been found to regulate the neurogenesis and the cilia disassembly in the developing cortex in mice [45]. Also in mouse, disruption of the CENPJ can cause the Seckel Syndrome [46]. Possible associations with these physiological functions and phenotypic differences among the breeds studied are not obvious.

Apart from the CENPJ and IL17D genes, 9 more genes are found within this genomic area, namely PARP4 (poly(ADP-ribose) polymerase family member 4; ~50.29–50.35Mbp), MPHOSPH8 (M-phase phosphoprotein 8; ~50.36–50.41Mbp), PSPC1 (paraspeckle component 1; ~50.44–50.48Mbp), ZMYM2 (zinc finger MYM-type containing 2; ~50.56–50.63Mbp), as well as the CRYL1 (crystallin lambda 1; ~50.77–50.83Mbp) and the IFT88 (intraflagellar transport 88; ~ 50.84–50.91Mbp).

Downstream this region, in 1Mbp expansion, the genes ATP12A (ATPase H+/K+ transporting non-gastric alpha2 subunit; ~50.08–50.11Mbp) and RNF17 (ring finger protein 17; ~50.11–50.23Mbp) were located. Moreover, upstream the region there were also mapped the EEF1AKMT1 (EEF1A lysine methyltransferase 1; ~50.94–50.95Mbp), LATS2 (large tumor suppressor kinase 2; ~51.07–51.09Mbp), ZDHHC20 (zinc finger DHHC-type containing 20; ~51.16–51.22Mbp), MICU2 (mitochondrial calcium uptake 2; ~51.23–51.28Mbp), and the FGF9 (fibroblast growth factor 9; ~51.34–51.37Mbp).

Breed diversity, ancestry and discrimination

Two approaches, complementary to each other, have been used to infer the GRF relationships with 9 native Italian breeds, namely principal component and admixture analysis. Both analyses confirmed the distinct and unique genetic background of the GRF breed and results were in general agreement with each other. Overall, GRF was placed closer to VSS, BIO and CCG, with a small percentage of ancestries shared with all 9 breeds. Interestingly, both approaches identified a subgroup of 6 goats. In admixture, the six goats showed a unique ancestry (Fig 5a), while this subgroup was identified in PCA on the 9th axis (S3 Fig). Nevertheless, the percentage of variance explained by this axis was <1%.

The DAPC model was used to assess breed traceability based on SNP and was able to classify with 100% success GRF goats to their breed of origin (S3 Table). Moreover, an unsupervised learning was applied, where GRF had no representative samples in TRN set. Results were consistent with the PCA and admixture and assigned the majority of GRF goats to the CCG breed with a small number of goats (varied between 4 to 6) assigned as DIT (S5 Table).

As mentioned in the material and methods section, the primary step of the DAPC analysis is to select the number of PCs to be used in the discriminant model. Hence, a basic question emerged on how robust the DAPC could be considered relative to the number of PCs used. Our analysis showed that, although the assignment success was invariant to the number of PCs in the semi-supervised DAPC analysis (number of PCs varied between 10–70), a pattern was found in the case of the unsupervised model. More precisely, when the DAPC contained 40 PCs some of the goats were classified as DIT. In the rest of the cases, where 60 or 20 PCs were used, all GRF goats were assigned as CCG.

PCA analysis seems to detect common ancestries between GRF goats and Alpine Arc goat breeds whereas the DAPC approach identifies similarities between GRF and the goat breeds from Central Italy. Both hypotheses are consistent with the history of Tuscan goat populations that experienced migratory flows from both northern and central Italy.

Overall, the genomic analysis confirmed the hypothesis that GRF is a result of crosses among goats from the Alpine Arc and Tuscan-Emilian Apennines regions. Nevertheless, based on the genomic information analyzed here, GRF represents a unique genetic pool and was genetically distinguished from 9 different native Italian breeds. This, in turn, resulted in breed traceability with a 100% success rate after CV. To sum up, our analysis complements previous work on various zootechnical and adaptive characteristics [78] of GRF and provides with a more complete description of the breed.

Suggestions for monitoring and conservation

One of the strategic priorities of the Global Plan of Action for Animal Genetic Resources of FAO is breed conservation, either in vivo or in vitro or both [29, 32]. The population status of GRF could be described and summarized in the following points: GRF i) has a distinct genetic pool, ii) exist in a small number of farms and animals, iii) is concentrated in a small geographical region, iv) lacks a formal breeding strategy, v) is reared in semi-extensive and family type farming systems, vi) is not subject to reproductive technologies such as artificial insemination, vii) has low pedigree quality (or almost absent) and viii) lacks of an organized action plan for monitoring and conservation. These points contribute to the critical status of the breed and highlight the need of conservation policies. For conservation in farms, farmers need to be financially supported by local and international authorities. A direct link between the breed and the production of niche products could add an extra economic value. Moreover, the cultural and the environmental components should be further investigated and quantified in the near future.

In addition, storage of biological material in gene banks offers an extra level of security. As recently been reported by Zomerdijk et al [29], gene banks greatly vary among countries and species and goats come second in order in gene banks after cattle. Even though gene banks, are cost-efficient [47, 48], they face various challenges, such as economic losses, loss of genetic material due to lack of liquid nitrogen and risks from diseases and catastrophic events such as floods and earthquakes [29]. Hence, the storage of biological material in a gene bank should be seen as a complementary rather a substitution to in vivo conservation.

Moreover, the idea of the foundation of a GRF breeding association should be considered. Animals are used by farmers if they provide them with profit and profitability is strongly associated to productivity. In this regard, research institutes should provide GRF farmers and breeders with vital scientific support. For example, the development of new tools for breed traceability and recognition might increase farmers’ interest. A breeding nucleus, where phenotypic and genetic variability are studied, should be considered. The startup of a breeding scheme, however, should be carefully reflected, since in the Italian legislation, no financial support is provided for livestock conservation if the breed undergoes directional selection, even if the population is at risk of extinction.

Furthermore, genotyping of a large number of animals (males and females) is highly recommended. To reduce costs, genotyping could be targeted, at a first step, only to animals that will contribute to the next generation. In the near future, genomic information could be further utilized for the development of a genomic-based breeding program that would help to select animals early in life and boost the genetic progress. Besides, optimal contribution selection [49, 50], penalization of the number of offspring per male (in the absence of a breeding objective) and a rotational mating scheme, with a controlled buck exchange among farms [51], are some ideas to be considered.

In the three pillars for a successful conservation scheme described in [34] (“discover, secure and sustain”), GRF is still under investigation. Various studies have been carried out, while our work presented a first whole-genome scan on the breed’s genetic diversity.

Conclusions

Our genomic analysis suggests a distinct genetic pool of Garfagnina goat breed, with small parts of common ancestry shared with Bionda dell’Adamello, Valpassiria, Argentata dell’Etna and Ciociara Grigia. As a result, GRF can be successfully discriminated by the rest of the breeds analyzed using genomic information with a success rate of 100%. This could further help in a more detailed breed characterization at a genetic level, breed traceability and in controlling the amount of crossbreeding in the future. The above could further support a better monitoring of the breed status. A ROH on CHI12 associated with the CENPJ gene should be further investigated in the population. Given the distinct genetic pool, the small number of farms and goats and the restricted farming in a small geographical region, we suggest conservation (in vivo and in vitro) together with breeding measures to be taken for Garfagnina goat. We hope our work will add value to GRF farming and the local region where the breed is reared.

Acknowledgements

We thank the two anonymous reviewers for their constructive reviewing. We would also like to thank Dr. Paul Boettcher (FAO, Department of Animal Production and Health Division, Rome, Italy) and Dr. Anna Eleonora Karagianni (The Roslin Institute and the Royal (Dick) School of Veterinary Studies, University of Edinburgh, UK) for reading the manuscript and providing with fruitful comments that improved the readability and quality of this work.

References

GCurone, JFilipe, PCremonesi, ETrevisi, MAmadori, CPollera, et al What we have lost: Mastitis resistance in Holstein Friesians and in a local cattle breed. Res Vet Sci. 2018;116:8898. 10.1016/j.rvsc.2017.11.020

RLovreglio, OMeddour-Sahar, VLeone. Goat grazing as a wildfire prevention tool: a basic review. IForest—Biogeosciences For. 2014;7:260.

AStella, ELNicolazzi, CPVan Tassell, MFRothschild, LColli, BDRosen, et al AdaptMap: exploring goat diversity and adaptation. Genet Sel Evol. 2018;50:61 10.1186/s12711-018-0427-5

SNaderi, H-RRezaei, FPompanon, MGBBlum, RNegrini, H-RNaghas, et al The goat domestication process inferred from large-scale mitochondrial DNA analysis of wild and domestic individuals. Proc Natl Acad Sci. 2008;105:1765964. 10.1073/pnas.0804782105

MAZeder, BHesse. The initial domestication of goats (Capra hircus) in the Zagros mountains 10,000 years ago. Science. 2000;287:22547. 10.1126/science.287.5461.2254

Domestic Animal Diversity Information System (DAD-IS) | Food and Agriculture Organization of the United Nations [Internet]. 2019. http://www.fao.org/dad-is/en/

MMartini, FSalari, IAltomonte, DRignanese, SChessa, CGigliotti, et al The Garfagnina goat: A zootechnical overview of a local dairy population. J Dairy Sci. 93 10.3168/jds.2010-3207

FSalari, IAltomonte, NLRibeiro, MNRibeiro, RBozzi, MMartini. Effects of season on the quality of Garfagnina goat milk. Ital J Anim Sci. 2016;15:56875.

NLRibeiro, RGCosta, ECPimenta Filho, MNRibeiro, ACrovetti, EPSaraiva, et al Adaptive profile of Garfagnina goat breed assessed through physiological, haematological, biochemical and hormonal parameters. Small Rumin Res. 2016;144:23641.

10 

FCecchi, CRusso, FFratini, BTurchi, GPreziuso, CCantile. Preliminary association analysis of microsatellites and Mycobacterium avium subspecies paratuberculosis infection in the native Garfagnina goats. J Appl Anim Res. 2018;46:87982.

11 

FCecchi, CDadousis, RBozzi, FFratini, CRusso, PBandecchi, et al Genome scan for the possibility of identifying candidate resistance genes for goat lentiviral infections in the Italian Garfagnina goat breed. Trop Anim Health Prod. 2019;51:72933. 10.1007/s11250-018-1728-y

12 

GTosser-Klopp, PBardou, OBouchez, CCabau, RCrooijmans, YDong, et al Design and Characterization of a 52K SNP Chip for Goats. PLOS ONE. 2014;9:e86227 10.1371/journal.pone.0086227

13 

LGomez-Raya, CRodríguez, CBarragán, LSilió. Genomic inbreeding coefficients based on the distribution of the length of runs of homozygosity in a closed line of Iberian pigs. Genet Sel Evol. 2015;47:81 10.1186/s12711-015-0153-1

14 

SMastrangelo, MTolone, RDGerlando, LFontanesi, MTSardina, BPortolano. Genomic inbreeding estimation in small populations: evaluation of runs of homozygosity in three local dairy cattle breeds. animal. Cambridge University Press; 2016;10:74654.

15 

GSchiavo, SBovo, FBertolini, STinarelli, SDall’Olio, LNanni Costa, et al Comparative evaluation of genomic inbreeding parameters in seven commercial and autochthonous pig breeds. Anim Int J Anim Biosci. 2020;14:91020. 10.1017/S175173111900332X

16 

MAblondi, CDadousis, MVasini, SEriksson, SMikko, ASabbioni. Genetic Diversity and Signatures of Selection in a Native Italian Horse Breed Based on SNP Data. Animals. Multidisciplinary Digital Publishing Institute; 2020;10:1005.

17 

FCecchi, CRusso, DIamartino, AGaliero, BTurchi, FFratini, et al Identification of candidate genes for paratuberculosis resistance in the native Italian Garfagnina goat breed. Trop Anim Health Prod. 2017;49:113542. 10.1007/s11250-017-1306-8

18 

FBertolini, BServin, ATalenti, ERochat, ESKim, COget, et al Signatures of selection and environmental adaptation across the goat genome post-domestication. Genet Sel Evol. 2018;50:57 10.1186/s12711-018-0421-y

19 

SPurcell, BNeale, KTodd-Brown, LThomas, MARFerreira, DBender, et al PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am J Hum Genet. 2007;81:55975. 10.1086/519795

20 

Purcell S. PLINK v1.9 [Internet]. http://pngu.mgh.harvard.edu/purcell/plink/

21 

R Core Team. R: A language and environment for statistical computing. 2013.

22 

Biscarini F, Cozzi P, Gaspa G, Marras G. detectRUNS: an R package to detect runs of homozygosity and heterozygosity in diploid genomes [Internet]. [cited 2019 Oct 2]. https://cran.r-project.org/web/packages/detectRUNS/vignettes/detectRUNS.vignette.html

23 

GMarras, GGaspa, SSorbolini, CDimauro, PAjmone‐Marsan, AValentini, et al Analysis of runs of homozygosity and their relationship with inbreeding in five cattle breeds farmed in Italy. Anim Genet. 2015;46:11021. 10.1111/age.12259

24 

DHAlexander, JNovembre, KLange. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:165564. 10.1101/gr.094052.109

25 

Alexander D, Shringarpure S, Novembre J, Lange K. ADMIXTURE Version 1.3.0 [Internet]. www.genetics.ucla.edu/software/admixture

26 

TJombart, SDevillard, FBalloux. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 2010;11:94 10.1186/1471-2156-11-94

27 

TJombart. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24:14035. 10.1093/bioinformatics/btn129

28 

TJombart, IAhmed. adegenet 1.3–1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27:30701. 10.1093/bioinformatics/btr521

29 

FZomerdijk, S-JHiemstra, Md’Arbaumont, MTixier-Boichard, PBoettcher. Quality Management Practices of Gene Banks for Livestock: A Global Review. Biopreservation Biobanking [Internet]. Mary Ann Liebert, Inc., publishers 140 Huguenot Street, 3rd Floor New Rochelle, NY 10801 USA; 2020 [cited 2020 May 29]; https://www.liebertpub.com/doi/abs/10.1089/bio.2019.0128

30 

In vivo conservation of Animal Genetic Resources [Internet]. [cited 2020 May 29]. http://www.fao.org/3/i3327e/i3327e00.htm

31 

Food and Agriculture Organization of the United Nations. FAOSTAT [Internet]. [cited 2020 May 31]. http://www.fao.org/faostat/en/

32 

Food and Agriculture Organization of the United Nations. GLOBAL PLAN OF ACTION FOR ANIMAL GENETIC RESOURCES and the INTERLAKEN DECLARATION [Internet]. [cited 2020 May 31]. http://www.fao.org/3/a1404e/a1404e00.htm

33 

JAWoolliams, MToro. What is genetic diversity? Util Conserv Farm Anim Genet Resour. Wageningen, The Netherlands: Wageningen Academic Publishers, The Netherlands; 2007 p. 5574.

34 

DPSponenberg, AMartin, CCouch, JBeranger. Conservation Strategies for Local Breed Biodiversity. Diversity. Multidisciplinary Digital Publishing Institute; 2019;11:177.

35 

FBertolini, TFCardoso, GMarras, ELNicolazzi, MFRothschild, MAmills, et al Genome-wide patterns of homozygosity provide clues about the population history and adaptation of goats. Genet Sel Evol. 2018;50:59 10.1186/s12711-018-0424-8

36 

FBertolini, BGandolfi, ESKim, BHaase, LALyons, MFRothschild. Evidence of selection signatures that shape the Persian cat breed. Mamm Genome Off J Int Mamm Genome Soc. 2016;27:14455.

37 

E-SKim, ARElbeltagy, AMAboul-Naga, BRischkowsky, BSayre, JMMwacharo, et al Multiple genomic signatures of selection in goats and sheep indigenous to a hot arid environment. Heredity. 2016;116:25564. 10.1038/hdy.2015.94

38 

APandya, KSArnos, XJXia, KOWelch, SHBlanton, TBFriedman, et al Frequency and distribution of GJB2 (connexin 26) and GJB6 (connexin 30) mutations in a large North American repository of deaf probands. Genet Med. 2003;5:295303. 10.1097/01.GIM.0000078026.01140.68

39 

JLamartine, GMEssenfelder, ZKibar, ILanneluc, ECallouet, DLaoudj, et al Mutations in GJB6 cause hidrotic ectodermal dysplasia. Nat Genet. 2000;26:1424. 10.1038/79851

40 

YFeng, XPeng, SLi, YGong. Isolation and characterization of sexual dimorphism genes expressed in chicken embryonic gonads. Acta Biochim Biophys Sin. 2009;41:28594. 10.1093/abbs/gmp012

41 

MNaveed, SKKazmi, MAmin, ZAsif, UIslam, KShahid, et al Comprehensive review on the molecular genetics of autosomal recessive primary microcephaly (MCPH). Genet Res. 2018;100.

42 

MFaheem, MINaseer, MRasool, AGChaudhary, TAKumosani, AMIlyas, et al Molecular genetics of human primary microcephaly: an overview. BMC Med Genomics. 2015;8 Suppl 1:S4 10.1186/1755-8794-8-S1-S4

43 

SDuerinckx, MAbramowicz. The genetics of congenitally small brains. Semin Cell Dev Biol. 2018;76:7685. 10.1016/j.semcdb.2017.09.015

44 

JBond, ERoberts, KSpringell, SLizarraga, SScott, JHiggins, et al A centrosomal mechanism involving CDK5RAP2 and CENPJ controls brain size. Nat Genet. 2005;37:3535. 10.1038/ng1539

45 

WDing, QWu, LSun, NCPan, XWang. Cenpj Regulates Cilia Disassembly and Neurogenesis in the Developing Mouse Cortex. J Neurosci. 2019;39:19942010. 10.1523/JNEUROSCI.1849-18.2018

46 

REMcIntyre, PLChavali, OIsmail, DMCarragher, GSanchez-Andrade, JVForment, et al Disruption of Mouse Cenpj, a Regulator of Centriole Biogenesis, Phenocopies Seckel Syndrome. PLOS Genet. 2012;8:e1003022 10.1371/journal.pgen.1003022

47 

SRPaiva, CMMcManus, HBlackburn. Conservation of animal genetic resources–A new tact. Livest Sci. 2016;193:328.

48 

DFGSilversides, PHPurdy, HDBlackburn. Comparative costs of programmes to conserve chicken genetic variation based on maintaining living populations or storing cryopreserved material. Br Poult Sci. Taylor & Francis; 2012;53:599607. 10.1080/00071668.2012.727383

49 

JAWoolliams, PBerg, BSDagnachew, THEMeuwissen. Genetic contributions and their optimization. J Anim Breed Genet. 2015;132:8999. 10.1111/jbg.12148

50 

BVillanueva, RPong-Wong, JAWoolliams, SAvendaño. Managing genetic resources in selected and conserved populations. BSAP Occas Publ. Cambridge University Press; 2004;30:11332.

51 

JJWindig, LKaal. An effective rotational mating scheme for inbreeding reduction in captive populations illustrated by the rare sheep breed Kempisch Heideschaap. animal. Cambridge University Press; 2008;2:173341. 10.1017/S1751731108003029

52 

VWimmer, TAlbrecht, H-JAuinger, C-CSchoen. synbreed: a framework for the analysis of genomic prediction data using R. Bioinformatics. 2012;28:20867. 10.1093/bioinformatics/bts335