PLoS ONE
Home Genome-wide identification and analysis of highly specific CRISPR/Cas9 editing sites in pepper (Capsicum annuum L.)
Genome-wide identification and analysis of highly specific CRISPR/Cas9 editing sites in pepper (<i>Capsicum annuum</i> L.)
Genome-wide identification and analysis of highly specific CRISPR/Cas9 editing sites in pepper (Capsicum annuum L.)

Competing Interests: The authors have declared that no competing interests exist.

Article Type: research-article Article History
Abstract

The CRISPR/Cas9 system is an efficient genome editing tool that possesses the outstanding advantages of simplicity and high efficiency. Genome-wide identification and specificity analysis of editing sites is an effective approach for mitigating the risk of off-target effects of CRISPR/Cas9 and has been applied in several plant species but has not yet been reported in pepper. In present study, we first identified genome-wide CRISPR/Cas9 editing sites based on the ‘Zunla-1’ reference genome and then evaluated the specificity of CRISPR/Cas9 editing sites through whole-genome alignment. Results showed that a total of 603,202,314 CRISPR/Cas9 editing sites, including 229,909,837 (~38.11%) NGG-PAM sites and 373,292,477 (~61.89%) NAG-PAM sites, were detectable in the pepper genome, and the systematic characterization of their composition and distribution was performed. Furthermore, 29,623,855 highly specific NGG-PAM sites were identified through whole-genome alignment analysis. There were 26,699,38 (~90.13%) highly specific NGG-PAM sites located in intergenic regions, which was 9.13 times of the number in genic regions, but the average density in genic regions was higher than that in intergenic regions. More importantly, 34,251 (~96.93%) out of 35,336 annotated genes exhibited at least one highly specific NGG-PAM site in their exons, and 90.50% of the annotated genes exhibited at least 4 highly specific NGG- PAM sites, indicating that the set of highly specific CRISPR/Cas9 editing sites identified in this study was widely applicable and conducive to the minimization of the off-target effects of CRISPR/Cas9 in pepper.

Li,Zhou,Liang,Song,Hu,Cui,Chen,Hu,Cheng,and He: Genome-wide identification and analysis of highly specific CRISPR/Cas9 editing sites in pepper (Capsicum annuum L.)

Introduction

In mutants, which are of great significance for both gene function analysis and crop genetic improvement, allelic variation mainly results from naturally or artificially induced mutation. Compared to natural variation, the most prominent advantage of artificially induced mutation is the high mutation frequency achieved. The main methods currently used for achieving artificially induced mutation include physical mutagenesis, chemical mutagenesis, random transposon insertion, and target gene editing technologies. Among these approaches, target gene editing, in which nucleotide variation is introduced at an appointed site and the target mutations are obtained accurately and efficiently, thereby speeding up the functional identification of target genes and genetic breeding improvement, is an ideal method for artificially inducing mutations [1].

A variety of target gene editing techniques, including the use of zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and the CRISPR/Cas system, have been developed to date [2]. The CRISPR/Cas system, which has entered the mainstream in recent years and been widely used in humans [3], animals [4], microorganisms [5] and plants [6], possesses the outstanding advantages of high simplicity and efficiency in contrast to the other two techniques. According to the number and functional characteristics of the Cas gene, CRISPR/Cas systems can be divided into 2 categories, including 6 different types (I to VI) [79]. The first category of CRISPR/Cas systems, including types I, III and IV, requires multiple Cas proteins to collaboratively interfere with the target gene, while the second category requires only one Cas protein. The type II CRISPR/Cas system, namely CRISPR/Cas9 system belongs to the second category and is now the most widely used gene editing system.

The CRISPR/Cas9 gene editing system is mainly composed of one Cas9 protein and one small guide RNA (sgRNA). The Cas9 protein from Streptococcus pyogenes (SpCas9) was first applied for use in the CRISPR/Cas9 system [10]; SpCas9 recognizes the protospacer adjacent motif (PAM) sequence 5′-NGG-3′ (where “N” can be any nucleotide base) in the target DNA, then cleaves the target DNA at 3 nt upstream of the PAM site, generating a blunt end, and gene editing is finally achieved by nucleotide insertion, deletion and substitution at the cleavage site mediated by the receptor cellular DNA repair machinery, including the nonhomologous end joining (NHEJ) and homologous recombination repair (HDR) mechanisms [11]. The sgRNA of the CRISPR/Cas9 system, artificially designed based on crRNA (CRISPR RNA) and the core sequence of trans-acting crRNA (tracrRNA), is a short single-stranded RNA that guides the Cas9/sgRNA complex to perform cleavage at 3 nt upstream of the PAM site through complementary base pairing between the 5’ end (~20 bp) of the sgRNA and the protospacer sequence of the target DNA, which determines the specificity of gene editing [12].

Previous studies have found that even if the sgRNA imperfectly matches the protospacer, the Cas9 protein can still perform cleavage at 3 nt upstream of the PAM site, making gene editing possible in nontarget regions; thus, off-target effects can occur [1315]. To reduce or eliminate the risk of off-target effect, the identification of candidate editing sites with high specificity is a prerequisite for the application of the CRISPR/Cas9 system. To date, a variety of tools based on whole-genome sequence similarity analysis have been developed for target site design and off-target risk assessment, such as CrisprGE [16], Cas-OFFinder [17], Cas-Designer [18], CRISPRdirect [19] and CRISPOR [20]. However, the majority of those tools have been mainly applied in humans and animals. Based on whole-genome reference sequences, the distribution and specificity of genome-wide CRISPR/Cas9 editing sites in Arabidopsis thaliana, Medicago truncatula, soybean (Glycine max), tomato (Solanum lycopersicum), Brachypodium distachyon, rice (Oryza sativa), Sorghum bicolor, maize (Zea mays) and grape (Vitis vinifera) have been systematically analysed and compared [12, 21], providing an important reference for choosing highly specific editing sites of related species.

Pepper (Capsicum spp.) belongs to the family Solanaceae and has a cosmopolitan distribution and considerable economic importance [22]. The reference genome sequences of pepper were first released in 2014 [23, 24], marking the transition of pepper research from structural genomics to functional genomics. The identification and functional analysis of important genes controlling agronomic traits have become a significant direction in molecular genetics research in pepper. With the development and continuous improvement of technologies for pepper regeneration in vitro and its genetic transformation [25, 26], the CRISPR/Cas9 gene editing system will become a powerful tool and will be widely used for the functional analysis of pepper genes. In this study, we first identified CRISPR/Cas9 editing sites at the genome-wide level in pepper and then evaluated the obtained specificity through whole-genome sequence alignment. The purpose of this study was to provide a reference for the selection of highly specific CRISPR/Cas9 editing sites and facilitate the application of CRISPR/Cas9-mediated gene editing in pepper.

Materials and methods

Genomic data and CRISPR/Cas9 editing site identification

The ‘Zunla-1’ (v2.0) pepper reference genome sequence and related genome annotations [23] were used for CRISPR/Cas9 editing site identification. There were two PAM sites recognized by the CRISPR/Cas9 system: 5'-NGG-3' and 5'-NAG-3', which were identified by using EMBOSS software [27] in both the positive and reverse strands of the Zunla-1 reference genome sequence. The 20-nt sequences before all 5'-NGG-3' and 5'-NAG-3' sites were extracted to form two protospacer sets, referred to as the GG_spacer set and AG-spacer set, respectively.

Identification of highly specific CRISPR/Cas9 editing sites

Based on the method reported previously, the specificity of CRISPR/Cas9 editing sites in pepper was evaluated. Class 0.0 and Class 1.0 spacers were expected to provide high specificity in CRISPR/Cas9 gene editing [12] and were thus classified as highly specific sites in this study. Since the sgRNA/Cas9 complex showed much less affinity and tolerance toward mismatches at the NAG-PAM site [5], in this study, we only assessed the specificity of the GG_spacers, for which the possibility of off-target effects was evaluated by using the AG_spacer set. The method is outlined as follows:

    The hard-masking function of USEARCH [28] was used to mask and remove GG_spacers containing low-complexity sequences;

    GG_spacers with the same sequences at the 6~20-nt region were removed;

    GASSST [29] and UBLAST [28] were used to generate a pairwise alignment for the remaining GG_spacers. According to the GG_spacer position and the minimum number of mismatches (minMM_GG, including InDel and SNP) between each GG_spacer and other GG_spacers, the GG_spacers were graded into three classes: Class 0 spacers shared no significant matching sequence with other GG_spacers; Class 1 spacers showed no fewer than four mismatches (minMM_GG≥4) or three mismatches adjacent to PAM sites; Class 2 included the other GG_spacers;

    For Class 0 and Class 1 GG_spacers, pairwise alignments were performed with AG_spacers, which were further graded into four classes as follows according to their position and the minimum number of mismatches (minMM_GG, including InDel and SNP) between each GG_spacer and other AG-spacers: Class 0.0 spacers exhibited no fewer than three mismatches with AG_spacers (minMM_AG≥3) or shared no significant matching sequence with AG_spacers; Class 0.1 spacers exhibited fewer than three mismatches with AG_spacers; Class 1.0 spacers exhibited no fewer than three mismatches with AG_spacers (minMM_AG≥3) or shared no significant matching sequence with AG_spacers; Class 1.1 spacers exhibited fewer than three mismatches with AG_spacers.

PCR verification and sequence analysis

Primer pairs flanking the selected target sites were designed by using the Primer3web (version 4.1.0; http://primer3.ut.ee/) tool. PCR reaction was performed in a 20 μL mixture including 2.0 μL DNA template (50 ng/μL), 2.0 μL PCR buffer (10×), 2.0 μL Mg2+ (25 mM), 1.5 μL forward and reverse primer (1 μM), 0.2 μL dNTPs (10 mM), and 1U Taq DNA polymerase. PCR procedure was as follow: 94°C for 3 min, 32 cycles of 94°C for 30 s, 55°C for 30 s, and 1 min at 72°C; and a final extension at 72°C for 10 min. PCR amplication of each sites were repeated three times and then the products were directly sequenced and assembled. Alignment of each sequence to the reference genome was conducted by using the local blastn:2.9.0+.

Results and discussion

Content and composition of CRISPR/Cas9 editing sites in pepper genome

A total of 603,202,314 CRISPR/Cas9 editing sites, containing 229,909,837 (~38.11%) NGG-PAM sites and 373,292,477 (~61.89%) NAG-PAM sites, were detected in the pepper genome. This was approximately 4.63 times greater than the number identified in another Solanaceae species, tomato (130,302,150), conforming to the law that the larger the size of a genome, the greater the number of CRISPR/Cas9 editing sites it contains [12]. The average density of NGG-PAM and NAG-PAM in pepper was 69.75/Kb and 112.56/Kb (Table 1), respectively, which were similar to those in tomato (63.30/Kb and 103.43/Kb, respectively), but the density of NGG-PAM in pepper was much less than that in monocot species such as rice (101.69/Kb) and maize (119.22/Kb) [12].

Table 1
The number and density of NGG-PAM and NAG-PAM sites on pepper chromosomes.
Chr.NGGNAGSubtotal
No.DensityNo.Density
P122,489,57974.7133,673,230111.8656,162,809
P211,839,69572.2118,417,689112.3330,257,384
P317,618,43367.3729,560,783113.0447,179,216
P415,393,26171.3624,658,517114.3240,051,778
P515,303,39270.4324,853,200114.3940,156,592
P615,305,36469.7225,109,431114.3840,414,795
P715,309,15068.9324,810,273111.7040,119,423
P811,278,02473.5717,545,055114.4528,823,079
P916,539,87869.2627,445,080114.9343,984,958
P1014,506,12870.5123,477,296114.1137,983,424
P1115,159,28968.8024,592,215111.6139,751,504
P1215,974,35369.4726,241,216114.1242,215,569
P043,193,29160.4372,908,492102.00116,101,783
Total229,909,83769.75373,292,477112.56603,202,314

With respect to the composition of the PAM sites, the TGG and CGG types accounted for the highest (~38.88%) and lowest proportions (~7.44%) of total NGG-PAM sites, respectively (Fig 1A), similar to the composition pattern found in the grape genome [21]. For NAG-PAM sites, the AAG type was the most abundant, with a proportion of ~36.07%, followed by TAG, GAG and CAG, accounting for 29.55%, 19.54% and 14.84% of the total NAG-PAM sites, respectively (Fig 1B).

Composition of pepper PAM sites.
Fig 1

Composition of pepper PAM sites.

A, NGG-PAM; B, NAG-PAM.

Distribution characteristics of CRISPR/Cas9 editing sites in pepper genome

The CRISPR/Cas9 editing sites (NGG-PAM and NAG-PAM) were uniformly distributed on all 12 chromosomes (P1~P12) of pepper (Fig 2). With the exception of chromosome P0, P1 and P8 exhibited the most and least CRISPR/Cas9 editing sites, respectively (Table 1). The number of NGG-PAM and NAG-PAM sites on the pepper chromosomes was significantly positively correlated (R2 = 0.997) with chromosome length (Fig 3). The density of NGG-PAM sites on different chromosomes (not including P0) ranged from 67.37/Kb (chromosome P3) to 74.71/Kb (chromosome P1). The densities of NAG-PAM sites on different chromosomes (excluding P0) were relatively similar to each other, with the minimum and maximum densities of 111.61/Kb (P11) and 114.93/Kb (P9), respectively (Table 1).

Distribution of different kinds of CRISPR/Cas9 editing sites in the pepper genome.
Fig 2

Distribution of different kinds of CRISPR/Cas9 editing sites in the pepper genome.

A, NGG-PAM+NAG-PAM site; B, NGG-PAM site; C, NAG-PAM site; D, Class 0.0; E, Class 0.1; F, Class 1.0; G, Class 1.1; H, Class 2.

Correlation between the number of CRISPR/Cas9 editing sites and chromosome length in pepper.
Fig 3

Correlation between the number of CRISPR/Cas9 editing sites and chromosome length in pepper.

The vast majority of NGG-PAM (~94.41%) and NAG-PAM (~94.42%) sites were located in the intergenic regions of the pepper genome, while 8,661,656 (~3.77%) and 3,425,476 (~1.49%) NGG-PAM sites were located in intron and exon regions, respectively, and the rest (~0.32%) were located in UTRs and splicing regions (Table 2). Regarding the distribution pattern in different genomic regions, the pattern of NAG-PAM sites was similar to that of NGG-PAM sites (Table 2). The density of CRISPR/Cas9 editing sites in genic regions (including UTRs, exons, introns and splicing sites,) was lower than that in intergenic regions for NGG+NAG-PAM (159.03/Kb versus 180.68/Kb, Fig 4A), NGG-PAM (60.55/Kb versus 68.87/Kb, Fig 4B) and NAG-PAM (98.49/Kb versus 111.81/Kb, Fig 4C), which differs from the situation in grape [21].

Comparison of the number and density of CRISPR/Cas9 editing sites between genic and intergenic regions.
Fig 4

Comparison of the number and density of CRISPR/Cas9 editing sites between genic and intergenic regions.

A, NGG-PAM+NAG-PAM site; B, NGG-PAM site; C, NAG-PAM site; D, Class 0.0+Class 1.0; E, Class 0.0; F, Class 1.0.

Table 2
The number of CRISPR/Cas9 editing sites in different genomic regions.
Genomic RegionNGG+NAGNGGNAG
No.PercentageNo.PercentageNo.Percentage
Intergenic569,505,88194.41%217,081,03894.42%352,424,84394.41%
5'UTR975,5660.16%373,5640.16%602,0020.16%
3'UTR937,6450.16%340,1060.15%597,5390.16%
Exon8,487,4231.41%3,425,4761.49%5,061,9471.36%
Intron23,217,3933.85%8,661,6563.77%14,555,7373.90%
Splicing78,4060.01%27,9970.01%50,4090.01%
Total603,202,314100.00%229,909,837100.00%373,292,477100.00%

Content of highly specific NGG-PAM sites in pepper genome

Through filtering and alignment analysis, 30,402,397 (~13.22%) NGG-PAM sites were successfully graded based on their specificity (Table 3). The total number of highly specific NGG-PAM sites in pepper, including those belonging to Class 0.0 and Class 1.0, was 29,623,855, which was 4.50 times higher than that in tomato, accounting for ~12.88% of the total NGG-PAM sites (Table 3), which was in line with the general rule that the number of specific gRNA spacers is positively correlated with genome size in eudicot species [12]. On average, there were 8.81/Kb highly specific sites in the pepper genome, which is comparable to that in the tomato genome (8.42/Kb, Table 3).

Table 3
The number of NGG-PAM sites with differences in specificity on pepper chromosome.
Chr.Class 0.0Class 1.0Highly specific*Class 1.1Class 2Subtotal
No.Density
P12,7072,782,8782,785,5859.257,07064,0002,856,655
P21,7951,631,0971,632,8929.963,62533,5621,670,079
P32,6402,648,9542,651,59410.145,96857,2102,714,772
P41,9622,193,5022,195,46410.185,54251,5772,252,583
P51,9572,136,2912,138,2489.845,55052,7062,196,504
P62,0912,211,3722,213,46310.085,50951,7972,270,769
P71,7581,862,1071,863,8658.394,90944,9741,913,748
P81,4431,643,7651,645,20810.733,61636,1341,684,958
P92,1212,403,8632,405,98410.086,30859,7302,472,022
P101,9492,041,4542,043,4039.935,31649,3472,098,066
P111,7881,905,1551,906,9438.654,87146,1881,958,002
P122,1242,271,1332,273,2579.895,62155,5962,334,474
P05,0723,862,8773,867,9495.4111,075100,7413,979,765
Total29,40729,594,44829,623,8558.8174,980703,56230,402,397

*, equal to the sum of Class 0.0 and Class 1.0; the number of Class 0.1 spacers on all chromosomes is 0.

To validate the specificity of target sites belonging to the class 0.0 and class 1.0, a random set of 19 sites were chosen to be amplified by PCR, and then the PCR products were directly sequenced and assembled. After aligning them back to the Zunla-1 reference genome, all of the products were matched to one unique location in the genome (Fig 5, S1 Table and S1 Fig), indicating that the target sites of class 0.0 and class 1.0 had low risk of off-target.

PCR amplification of 19 highly-specific target sites.
Fig 5

PCR amplification of 19 highly-specific target sites.

M, DL2000 plus, 1 to 10 represent A1 to A10 belonging to class0.0; 11 to 19 represent B1 to B9 belonging to class 1.0 (S1 Table).

Characterization of highly specific NGG-PAM sites’ distribution in pepper genome

The highly specific NGG-PAM sites were evenly distributed on all 12 chromosomes (P1~P12) of pepper (Fig 2). With the exception of P0, chromosomes P1 and P2 contained the maximum and minimum number of highly specific NGG-PAM sites, respectively (Table 3). The number of highly specific NGG-PAM sites in different genomic regions is shown in Table 4. Similar to the distribution of all NGG-PAM sites, there were a total of 26,699,387 (~90.13%) highly specific NGG-PAM sites located in intergenic regions, which was 9.13 times greater than the number in genic regions (Fig 4D). However, the average density of highly specific NGG-PAM sites in genic regions was higher than that in intergenic regions on the whole (13.80/Kb versus 8.47/Kb, Fig 4D) for Class 0.0 (0.015/Kb versus 0.008/Kb, Fig 4E) and Class 1.0 (13.79/Kb versus 8.46/Kb, Fig 4F). The same phenomenon occurs in the grape genome [21].

Table 4
The number of highly specific NGG-PAM sites in different genomic regions.
Genomic RegionClass 0.0Class 1.0Total
No.PercentageNo.PercentageNo.Percentage
Intergenic26,23489.21%26,673,15390.13%26,699,38790.13%
5'UTR2550.87%91,4030.31%91,6580.31%
3'UTR1070.36%99,0990.33%99,2060.33%
Exon4501.53%939,7503.18%940,2003.17%
Intron2,3477.98%1,783,6966.03%1,786,0436.03%
Splicing140.05%7,3470.02%7,3610.02%
Total29,407100.00%29,594,448100.00%29,623,855100.00%

We calculated the percentage of annotated genes that contained highly specific NGG-PAM sites identified in this study and found that 34,251 (~96.93%) out of 35,336 annotated genes exhibited at least one highly specific NGG-PAM site in their exons, and 90.50% of annotated genes exhibited at least 4 highly specific NGG- PAM sites (Fig 6 and S2 Table), indicating that the set of highly specific CRISPR/Cas9 editing sites identified in this study was widely applicable and will contribute to the minimization of off-target effects of CRISPR/Cas9 in pepper.

Histogram plots of gene numbers according to the number of exon-targeted highly specific NGG-PAM sites.
Fig 6

Histogram plots of gene numbers according to the number of exon-targeted highly specific NGG-PAM sites.

Acknowledgements

We thank Guojun Ouyang for his technical assistance in the analysis of highly-specific editing sites.

References

YLiu, GLi, YZhang, LChen. Current advances on CRISPR/Cas genome editing technologies in plants. Journal of South China Agricultural University. 2019;40(5):3849.

TGaj, CAGersbach, CFBarbas. ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering. Trends in Biotechnology. 2013;31(7):397405. 10.1016/j.tibtech.2013.04.004

SWCho, SKim, JMKim, JSKim. Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease. Nature biotechnology. 2013;31(3):2302. Epub 2013/01/31. 10.1038/nbt.2507 .

WYHwang, YFu, DReyon, MLMaeder, SQTsai, JDSander, et al Efficient genome editing in zebrafish using a CRISPR-Cas system. Nature Biotechnology. 2013;31(3):2279. 10.1038/nbt.2501

WJiang, DBikard, DCox, FZhang, LAMarraffini. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nature biotechnology. 2013;31(3):2339. 10.1038/nbt.2508

MWang, YMao, YLu, ZWang, XTao, JKZhu. Multiplex gene editing in rice with simplified CRISPR-Cpf1 and CRISPR-Cas9 systems. J Integr Plant Biol. 2018;60(8):62631. Epub 2018/05/16. 10.1111/jipb.12667 .

SShmakov, ASmargon, DScott, DCox, NPyzocha, WYan, et al Diversity and evolution of class 2 CRISPR-Cas systems. Nat Rev Microbiol. 2017;15(3):16982. Epub 2017/01/24. 10.1038/nrmicro.2016.184 .

KSMakarova, EVKoonin. Annotation and Classification of CRISPR-Cas Systems. Methods Mol Biol. 2015;1311:4775. Epub 2015/05/20. 10.1007/978-1-4939-2687-9_4 .

WXYan, PHunnewell, LEAlfonse, JMCarte, EKeston-Smith, SSothiselvam, et al Functionally diverse type V CRISPR-Cas systems. Science. 2019;363(6422):8891. 10.1126/science.aav7271

10 

MJinek, KChylinski, IFonfara, MHauer, JADoudna, ECharpentier. A Programmable Dual-RNA–Guided DNA Endonuclease in Adaptive Bacterial Immunity. Science. 2012;337(6096):81621. 10.1126/science.1225829

11 

XYao, XWang, XHu, ZLiu, JLiu, HZhou, et al Homology-mediated end joining-based targeted integration using CRISPR/Cas9. Cell Research. 2017;27(6):80114. 10.1038/cr.2017.76

12 

KXie, JZhang, YYang. Genome-wide prediction of highly specific guide RNA spacers for CRISPR–Cas9-mediated genome editing in model plants and major crops. Molecular plant. 2014;7(5):9236. 10.1093/mp/ssu009

13 

PDHsu, DAScott, JAWeinstein, FARan, SKonermann, VAgarwala, et al DNA targeting specificity of RNA-guided Cas9 nucleases. Nature biotechnology. 2013;31(9):82732. 10.1038/nbt.2647

14 

YFu, JAFoden, CKhayter, MLMaeder, DReyon, JKJoung, et al High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nature biotechnology. 2013;31(9):8226. Epub 2013/06/25. 10.1038/nbt.2623 .

15 

VPattanayak, SLin, JPGuilinger, EMa, JADoudna, DRLiu. High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity. Nature biotechnology. 2013;31(9):83943. Epub 2013/08/13. 10.1038/nbt.2673 .

16 

KKaur, HTandon, AKGupta, MKumar. CrisprGE: a central hub of CRISPR/Cas-based genome editing. Database (Oxford). 2015;2015:bav055. Epub 2015/06/30. 10.1093/database/bav055 .

17 

SBae, JPark, J-SKim. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics (Oxford, England). 2014;30(10):14735. Epub 2014/01/24. 10.1093/bioinformatics/btu048 .

18 

JPark, SBae, J-SKim. Cas-Designer: a web-based tool for choice of CRISPR-Cas9 target sites. Bioinformatics. 2015;31(24):40146. 10.1093/bioinformatics/btv537

19 

YNaito, KHino, HBono, KUi-Tei. CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites. Bioinformatics. 2015;31(7):11203. Epub 2014/11/22. 10.1093/bioinformatics/btu743 .

20 

JPConcordet, MHaeussler. CRISPOR: intuitive guide selection for CRISPR/Cas9 genome editing experiments and screens. Nucleic acids research. 2018;46(W1):W242W5. Epub 2018/05/16. 10.1093/nar/gky354 .

21 

YWang, XLiu, CRen, G-YZhong, LYang, SLi, et al Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome. BMC plant biology. 2016;16(1):17. 10.1186/s12870-016-0787-3

22 

JCheng, YChen, YHu, ZZhou, FHu, JDong, et al Fine mapping of restorer-of-fertility gene based on high-density genetic mapping and collinearity analysis in pepper (Capsicum annuum L.). Theoretical and Applied Genetics. 2020;133(3):889902. Epub 2019/12/22. 10.1007/s00122-019-03513-y .

23 

CQin, CYu, YShen, XFang, LChen, JMin, et al Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization. Proc Natl Acad Sci U S A. 2014;111(14):513540. 10.1073/pnas.1400975111 .

24 

SKim, MPark, SIYeom, YMKim, JMLee, HALee, et al Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nature genetics. 2014;46(3):2708. 10.1038/ng.2877 .

25 

SLKothari, AJoshi, SKachhwaha, NOchoa-Alejo. Chilli peppers—A review on tissue culture and transgenesis. Biotechnology Advances. 2010;28(1):3548. 10.1016/j.biotechadv.2009.08.005

26 

JPozueta-Romero, GHoulne, LCanas, RSchantz, JChamarro. Enhanced regeneration of tomato and pepper seedling explants for Agrobacterium-mediated transformation. Plant Cell Tissue and Organ Culture. 2001;67(2):17380.

27 

PRice, ILongden, ABleasby. EMBOSS: the European Molecular Biology Open Software Suite. Trends in genetics: TIG. 2000;16(6):2767. Epub 2000/05/29. 10.1016/s0168-9525(00)02024-2 .

28 

RCEdgar. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26(19):24601. Epub 2010/08/17. 10.1093/bioinformatics/btq461 .

29 

GRizk, DLavenier. GASSST: global alignment short sequence search tool. Bioinformatics. 2010;26(20):253440. 10.1093/bioinformatics/btq485