Nucleic Acids Research
Home RMDisease: a database of genetic variants that affect RNA modifications, with implications for epitranscriptome pathogenesis
RMDisease: a database of genetic variants that affect RNA modifications, with implications for epitranscriptome pathogenesis
RMDisease: a database of genetic variants that affect RNA modifications, with implications for epitranscriptome pathogenesis

The authors wish it to be known that, in their opinion, the first three authors should be regarded as Joint First Authors.

Article Type: research-article Article History
Abstract

Deciphering the biological impacts of millions of single nucleotide variants remains a major challenge. Recent studies suggest that RNA modifications play versatile roles in essential biological mechanisms, and are closely related to the progression of various diseases including multiple cancers. To comprehensively unveil the association between disease-associated variants and their epitranscriptome disturbance, we built RMDisease, a database of genetic variants that can affect RNA modifications. By integrating the prediction results of 18 different RNA modification prediction tools and also 303,426 experimentally-validated RNA modification sites, RMDisease identified a total of 202,307 human SNPs that may affect (add or remove) sites of eight types of RNA modifications (m6A, m5C, m1A, m5U, Ψ, m6Am, m7G and Nm). These include 4,289 disease-associated variants that may imply disease pathogenesis functioning at the epitranscriptome layer. These SNPs were further annotated with essential information such as post-transcriptional regulations (sites for miRNA binding, interaction with RNA-binding proteins and alternative splicing) revealing putative regulatory circuits. A convenient graphical user interface was constructed to support the query, exploration and download of the relevant information. RMDisease should make a useful resource for studying the epitranscriptome impact of genetic variants via multiple RNA modifications with emphasis on their potential disease relevance. RMDisease is freely accessible at: www.xjtlu.edu.cn/biologicalsciences/rmd.

Chen,Song,Tang,Wei,Xu,Su,de Magalhães,Rigden,and Meng: RMDisease: a database of genetic variants that affect RNA modifications, with implications for epitranscriptome pathogenesis

INTRODUCTION

With the advances in the high-throughput sequencing technique, millions of single nucleotide polymorphisms (SNPs) have been identified from multiple species and in multiple human cancers, suggesting their critical roles in diverse biological functions and human health. However, deciphering if and how SNPs lead to functional changes is still a major challenge. Even synonymous SNPs, which do not change the amino acid sequence and so are sometimes considered ‘silent’ mutations, can still play critical roles during transcriptional and post-transcriptional regulation (1), such as changing splicing sites (2), influencing RNA-protein interactions (3) and alteration of RNA secondary structures.

Substantial efforts have been made to relate the genetic variants to their immediate biological consequences, including with regard to transcriptional regulation (4,5), post-transcriptional protein modification (6–12), RNA–protein interaction (13), calpain cleavage (14), ceRNA networks (15), polyadenylation (16) and RNA modifications (17,18). Most of these works were based on a widely adopted computational framework, i.e. a machine learning model is firstly trained to capture the characteristics of a specific type of epigenetic mark (or interaction) from gold standard experimental datasets. With the model, it is then possible to assess the probability of the mark being associated with a given (DNA, RNA or protein) sequence, and further, predicting whether a genetic mutation can affect the status of epigenetic mark by comparing the probabilities of the mark being associated with the original and the mutated sequences. Importantly, this analysis not only explains how a genetic variant regulates an epigenetic mark, but also helps explain the phenotypes associated with the SNP, as those identified from GWAS analysis. For example, if a SNP is known to increase the risk of a disease, and can also destroy a transcription factor binding site, it is often natural to speculate a transcription-related disease mechanism, even though the two may not be directly causal, and additional experimental validation is necessary.

The epitranscriptome has emerged as an important layer for gene expression regulation (19,20). Recent studies suggest that various RNA modifications occur widely in the transcriptome, play versatile roles in essential biological mechanisms, and are closely related to the progression of various diseases (21–24). For example, N6-methyladenosine (m6A) controls the speed of the circadian clock (25), affects RNA stability (26) and regulates the progression of multiple cancers (27–30). N4-acetylcytidine (ac4C) promotes translation efficiency (31), and 2′-O-methylation (Nm) of HIV transcripts help the virus to avoid innate immune sensing (32).

A number of high-throughput approaches have been developed for profiling the transcriptome-wide distribution of different types of RNA modifications, including m6A-seq (or MeRIP-seq) (33,34), PA-m6A-seq (35), miCLIP (36) and m6A-CLIP (37). These approaches have generated a large number of high-quality epitranscriptome datasets, on the basis of which the properties of the modification-carrying RNA sequences can be characterized. This enables the prediction of RNA modification sites from the primary sequences (38,39), and hence allows prediction of the impact of genetic variants on RNA modification by comparing the potential for modification of the original and the mutated sequences. iRNA-methyl (40) and SRAMP (41) are two of the earliest and most widely adopted approaches for predicting RNA methylation sites from the primary RNA sequences. We previously developed a high-accuracy predictor WHISTLE (42) for prediction of RNA methylation sites by taking advantage of conventional sequence features as well as 35 additional genomic features (43).

There exist several databases of RNA modifications with different focuses. The MODOMICS database concerns mainly RNA modification pathways. MeT-DB (44), CVm6A (45) and REPIC (46) collect and annotate the transcriptome m6A sites under different experimental conditions; while RMBase is currently the most comprehensive database containing well annotated sites of multiple types of RNA modifications identified in multiple species. m6AVar (17) and m7GDiseaseDB (18) contain disease-associated SNPs that can affect m6A and internal m7G RNA modification, respectively. Both were based on their respective customized RNA modification site prediction tools (m6AFinder and m7GFinder). These efforts together greatly facilitated research into RNA modifications. However, to the best of our knowledge, the impact of genetic mutation on most transcriptome modifications such as m5C, Ψ and Nm, has not been studied, and a centralized platform is not yet available for systematically deciphering the association of genetic variants with multiple RNA modification types as predicted by multiple independent tools.

To address this, we present here RMDisease, a database of genetic variants that affect RNA modifications with a focus on their potential disease association. By integrating 303,426 RNA modification sites, 40,915,548 somatic and germline SNPs and 18 prediction tools, RMDisease represents the most comprehensive available mapping from genome variants to their epitranscriptome disturbance. RMdisease contains a total of 202,307 human SNPs that can effect (add or remove) eight types of RNA modifications (m6A, m5C, m1A, m5U, Ψ, m6Am, m7G and Nm), including 4,289 disease-associated variants that may imply disease pathogenesis functioning at the epitranscriptome layer. Additionally, the RNA modification-affecting SNPs were further annotated with putative post-transcriptional machinery including RNA-binding protein (RBP) binding sites, miRNA targets and splicing sites. A graphical user interface was constructed to support the query, exploration and download of the database. The overall design of RMDisease is summarized in Figure 1.

The overall design of RMDisease. RMDisease integrates 303,426 high-confidence RNA modification sites experimentally detected by 18 base-resolution technologies and 18 independent in silico machine learning tools to evaluate systematically the potentials of somatic and germline variants to affect eight types of transcriptome modifications. Disease association and post-transcriptional regulations were further integrated to unveil potential epitranscriptome pathogenesis and putative regulatory machinery. A graphical user interface was constructed to support the query, exploration and download of the database.
Figure 1.

The overall design of RMDisease. RMDisease integrates 303,426 high-confidence RNA modification sites experimentally detected by 18 base-resolution technologies and 18 independent in silico machine learning tools to evaluate systematically the potentials of somatic and germline variants to affect eight types of transcriptome modifications. Disease association and post-transcriptional regulations were further integrated to unveil potential epitranscriptome pathogenesis and putative regulatory machinery. A graphical user interface was constructed to support the query, exploration and download of the database.

MATERIALS AND METHODS

Data resource

We considered in this study only the RNA modifications that widely occur in the transcriptome. Since there is not yet available a relatively complete high-confidence collection of such data, we manually collected from 32 studies the sites of eight types of RNA modifications, including m6A (178 049 sites), m5C (95 391 sites), m1A (16 346 sites), m5U (3696 sites), Ψ (3137 sites), m6Am (2447 sites), m7G (2525 sites) and Nm (1835 sites), respectively. These sites were reported from 68 high-throughput sequencing experiments generated by 18 base-resolution technologies, including m6A-REF-seq (47), MAZTER-seq (48), miCLIP (36), m6A-CLIP-seq (37), PA-m6A-seq (35), Ψ-seq (49), Pseudo-seq (50), CeU-Seq (51), RBS-Seq (52), m1A-MAP (53), m1A-seq (54), Aza-IP (55), RNA-BisSeq (56), FICC-Seq (57), Nm-seq (58), m7G-seq (59), m7G-miCLIP-Seq (60). Detailed information regarding these sequencing samples is provided in Supplementary Table S1.

We obtained 3 820 716 somatic variants and 37 094 832 germline variants from dbSNP (v151) and TCGA (v15.0), respectively (see Supplementary Table S3). Only the variants located within the mature transcripts were kept for further analysis.

Derivation of RNA modification-associated variants

The RNA modification-associated SNPs (RM-SNPs) are defined as SNPs that may lead to the gain or loss of an RNA modification site, as reported by the prediction tools via comparing the modification status of the original and the mutated sequences. RM-SNPs were further classified into 3 sub-groups based on their reliability, including: (i) high: a SNP directly alters the experimentally validated RNA modification site, leading to its loss; (ii) medium: a SNP alters a nucleotide within the 41 bp flanking window of an experimentally validated RNA modification site (but not directly the modifiable nucleotide itself), causing its loss as predicted by a machine learning model; (iii) Low: a SNP alters a nucleotide within the 41 bp flanking window of an RNA modification site (may or may not directly the modifiable nucleotide itself), causing significant increase or decrease in the probability of RNA modification. It may be worth noting that the identification of RM-SNPs of high or medium confidence level both require experimentally validated RNA modification sites in the very beginning.

We integrated 18 prediction tools developed for eight RNA modifications to perform comprehensive and independent evaluations, including five deep learning-based methods: DeepPromise (38), DeepM6Aseq (61), DeepMRMD (62), iPseU-CNN (63), Deep-2′-O-Me (64) and 13 machine learning-based methods: WHISTLE(65), SRAMP(66), iRNA-3type (67), iMRM (68), iRNA-2OM (69), iRNA-m5C (70), RAMPred (71), iRNA-PseColl (72), iRNA-m7G (73), m7GFinder (74), iRNA-PseU (75), PIANO (76), ISGm1A (77).

The association level (AL) between SNP and an RNA modification is calculated as follows:

where, and represent the probability of modification for the wild type and mutated sequences, respectively, as obtained from individual prediction tool (or experiment data if available). The association level (AL) ranges from 0 to 1, with 1 indicating the greatest impact on RNA modification. The statistical significance was assessed by comparing to the ALs of all mutations, with which the upper bound of the P-value can be calculated. Different prediction tools were employed for the same modification to obtain the association level for their corresponding target modification. As shown in previous studies, the RNA modification prediction tools that integrate both the sequence and genome-derived features outperform those based on sequence features only (38,65,74,76–77), and were thus used as the primary method. We retained the RM-SNPs with association level >0.4 and P-value <0.05 as predicted by approaches that took advantage of both sequence and genome-derived features. The results obtained from other methods (based on sequence only) were provided in RMDisease as well for reference purpose (see Supplementary Table S2).

Additional annotation

To annotate the basic genomic information of RNA modification-associated variants, the transcript structure from UCSC genome browser including CDS, 3′UTR, 5′UTR, start codon and stop codon, etc. were used, and the genomic conservation were annotated by the phastCons 60-way. In addition, the deleterious level of each RNA modification-associated variant was analyzed by SIFT (78), PolyPhen2 HVAR (79), PolyPhen2HDIV (79), LRT (80) and FATHMM (81) using the ANNOVAR package (82). Furthermore, to provide annotation of putative post-transcriptional regulatory machinery, information on RBP binding sites from POSTAR2 (83), miRNA-RNA interaction from miRanda (84) and startBase2 (85), and splicing sites from UCSC annotation with GT-AG role within 100 bp upstream and downstream of RNA modification-associated variants was integrated into the database. To unveil potentially epitranscriptome-related pathogenesis, the association between disease and RNA modification-associated variants was extracted from GWAS catalog (86), ClinVar (87) and Johnson and O’Donnel's data (88). Additionally, the RNA molecule–drug sensitivity associations from RNAactDrug (89) were obtained to provide a drug suggestion for each RNA modification-associated SNP.

Database and web interface implementation

MySQL tables were used for the storage and management of the metadata in RMDisease. Hyper Text Markup Language (HTML), Cascading Style Sheets (CSS) and Hypertext Preprocessor (PHP) were applied in the construction of web interfaces. The multiple statistical diagrams were created by EChars and the genome browser was implemented using Jbrowse (90) for the exploration of all the analysis results.

RESULTS

Database content

We firstly evaluated the potentials of genetic variants to add or remove an RNA modification site directly or indirectly. In the end, a total of 57 622, 23 463, 61 563, 23 875, 24 822, 640, 5047 and 5275 genetic variants were found to be associated with m6A, m1A, m5C, Ψ, m7G, Nm, m5U and m6Am, respectively, providing so far the most comprehensive map of genetic factors of epitranscriptome disturbance (Table 1).

Table 1.
RM-SNPs collected in RMDisease
Germline mutationSomatic mutationTotal
Modification typeConfidence levelLossGainAllLossGainAllLossGainAll
m6AHigh1405-14054276-42765681-5681
Medium13118-1311833666-3366646784-46784
Low386546921614304446519949585157
m1AHigh104-10461-61165-165
Medium856-8563134-31343990-3990
Low106626543720573098581558867961251219308
m5CHigh596-5961518-15182114-2114
Medium11350-1135028682-2868240032-40032
Low133838625200644877691421777861163119417
ΨHigh3-319-1922-22
Medium412-4121271-12711683-1683
Low1133272638594324139871831154571671322170
m7GHigh10-1082-8292-92
Medium253-253922-9221175-1175
Low1511745225696851161421299111961235923555
NmHigh4-421-2125-25
Medium85-85530-530615-615
Low000000000
m5UHigh12-120-012-12
Medium14-1438-3852-52
Low35070310534333497393078342004983
m6AmHigh12-122-214-14
Medium24-24132-132156-156
Low22015981818823205328730248035105

Note: RM-SNPs are further classified into two categories: (i) Direct: a SNP directly alters the modifiable nucleotide, leading to the loss of a known or predicted RNA modification site, or alters an non-modifiable nucleotide into one that can be modified. (ii) Indirect (within 41 bp): a SNP alters a nucleotide within the 41 bp flanking window of an RNA modification site (but not directly the modifiable nucleotide itself), causing significant increase or decrease in the probability of RNA modification. We considered only SNPs within the 41bp window for possible indirect effects. This is because most existing RNA modification prediction methods chose to be based on 41bp sequence or less (67–71,73,108). Increasing the length considered here may not help improve the completeness of the results but add substantially the computation load. Additionally, the Nm-SNPs of low confidence were not predicted due to the tremendous search space (Nm can happen to all nucleotide) and its relatively low abundance in the human transcriptome.

We then obtained disease annotations of SNPs from GWAS catalog, Johnson and O’Donnel's data, and ClinVar, and mapped them to RM-SNPs. These SNPs may link disease pathogenesis and clinical relevance to epitranscriptome regulations (Table 2). We summarized in Table 3 the diseases that are associated with the most RM-SNPs of a specific RNA modification type.

Table 2.
Disease-associated RM-SNPs collected in RMDisease
Disease-associated RM-SNPs
ClinVarGWAS
Modification typeSNP sourceTotal RM-SNPSNPDiseaseGeneSNPDiseaseGene
m6AdbSNP15115 21598940045314877117
TCGA42 407332187164000
m1AdbSNP1514680326208247292729
TCGA18 783217175139000
m5CdbSNP15117 1469943974501286794
TCGA44 417318140130000
ΨdbSNP1514274238208207353235
TCGA19 601516341000
m7GdbSNP1512519183141166252225
TCGA22 303416333000
NmdbSNP15189555000
TCGA5512182000
m5UdbSNP1511079404039888
TCGA3968324420000
m6AmdbSNP1511854837577444
TCGA3421311627000
Table 3.
Diseases associated with the most RM-SNPs
Disease nameClinVar study accessionMedGen identifierType#SNP
Hereditary cancer-predisposing syndromeRCV000129430.4C0027672m6A58
Hereditary cancer-predisposing syndromeRCV000129430.4C0027672m1A14
Hereditary pancreatitis (PCTT)RCV000468581.1C0238339m5C88
Hereditary cancer-predisposing syndromeRCV000129430.4C0027672Ψ12
Cardiovascular phenotypeRCV000249770.1CN230736m7G7
Adenocarcinoma of lungRCV000439229.1C0152013Nm1
Adenocarcinoma of lungRCV000439229.1C0152013m5U8
Leigh syndrome (LS)RCV000268982.1C0023264m6Am4

We then asked whether the RM-associated SNPs are more functional relevant to important biological events compared to non-associated SNPs, and used evolutionary conservation as an indicator. For this purpose, the phastCons 100-way conservation scores from UCSC was considered to evaluate the conservation of individual site, which was calculated for human genome derived from genome-wide multiple alignments with 99 other vertebrate species. Interestingly, we found that the RM-associated variants were more conserved than non-associated variants (Figure 2), suggesting that the RM-associated SNPs underwent stronger selection pressure than the other variants, and may be related to important biological events that can be regulated at the epitranscriptome layer.

Comparing the PhastCons scores of RNA modification-associated and non-associated SNPs. The sites where RNA modification-associated variants localized were more conserved than non-associated variants for all the eight transcriptome modifications considered in RMDisease.
Figure 2.

Comparing the PhastCons scores of RNA modification-associated and non-associated SNPs. The sites where RNA modification-associated variants localized were more conserved than non-associated variants for all the eight transcriptome modifications considered in RMDisease.

Website interface and usage

The user-friendly web interfaces provided in RMDisease enable the search, browse and download RNA modification associated-SNPs by modification type, gene, disease, chromosome region, RsID and post-transcriptional regulations. A genome browser was integrated for interactive exploration of genome regions of interest. All data provided in the RMDisease database can be freely downloaded. For the convenience of users, detailed instructions on how to use RMDisease were placed in the ‘help’ page. RMDisease is freely accessible at: www.xjtlu.edu.cn/biologicalsciences/rmd.

Case study: MATR3

MATR3 provides structural support for the nucleus and aids in several important nuclear functions. A mutation on MATR3 with Rs ID: rs185734839 at chr5:138 665 490 is known to be associated with ‘Distal myopathy’ from GWAS study according to the ClinVar database. As an anonymous SNP located on 3′ untranslated region, this mutation doesn’t affect the encoded protein sequence; however, it can directly destroy a known m6A RNA methylation site located at the same position, which was previously detected by two MAZTER-seq experiments in human ESC cell line (48). Post-transcriptional annotations suggest that, the m6A site eliminated by rs185734839 falls within the target regions of RNA binding protein CSTF2 and four microRNAs (miR-24, miR-1, miR-206 and miR-613), which provided potential functional circuits of the RNA methylation. It should be of interests to explore whether the methylation status of MATR3 can significantly affect its biological functionality, especially with respect to the disease, RBP and miRNAs mentioned previously. Additional case studies were provided in the Supplementary Materials and Supplementary Table S4.

CONCLUSIONS

An increasing number of biological mechanisms and disease mechanisms have been associated with the epitranscriptome, which consists of more than 100 different types of RNA modifications (91) at tens of thousands of locus in the human transcriptome. To systematically unveil the linkage between genetic factors and their respective epitranscriptome disturbance, we developed RMDisease, a database of genetic variants with potentials to alter eight types of widely spread transcriptome modifications, with emphasis on epitranscriptome disease pathogenesis. RMDisease revealed for the first time the impacts of genetic variants on six types of RNA modifications (m5C, m1A, m5U, Ψ, m6Am and Nm), and offered substantial improvements over existing works for m6A and m7G (17,18).

RMDisease and m7GDiseaseDB (92) used the same inference method. Both databases were based on m7GFinder (74), which integrates the sequence as well as the genomic features. The main different between them is that, a different scoring system was implemented in RMDisease (see Materials and Methods section), which penalizes direct mutation of a putative m6A site and makes the association level (AL) fall within the range of 0 to 1. Additionally, RMDisease integrated the results obtained from another m7G predictor iRNA-m7G (93), and use it as an independent reference. There exists major difference between m6AVar (17) and RMDisease for m6A-associated SNPs. Besides the aforementioned differences between m7GDiseaseDB and RMDisease, i.e. a different scoring framework and extra independent tools integrated, RMDisease also provides the statistical significance of the predicted associations, and was based on more accurate m6A predictor WHISTLE (42) and with more reliable epitranscriptome datasets integrated. A total of 12 325 m6A sites that can be affected by SNPs were found to be shared between RMDisease and m6AVar (see Supplementary Figure S1 for more details).

Previous studies have shown that there exist snoRNPs that can guide the formation of Nm and Psi with the base pairing mechanism (94–98). To the best of our knowledge, none of the existing prediction approaches for RNA modification explicitly considered this mechanism, which may undermine their prediction capability. Indeed, sequence-based predictors for pseudouridine sites without considering the base-paring mechanism between target RNAs and snoRNPs yielded very limited accuracy (lower than 80%) (61,99), and there exists speculation that the sequence features of pseudouridine sites may not exist at all (100). Incorporating base-pairing information into the prediction models are likely to further improve the prediction performance. Nevertheless, the primary approaches implemented in RMDisease are based on both sequence and genomic features. For pseudouridylation, the PIANO method (101), which was used in RMDisease as the primary prediction approach, has achieved substantially better performance than those based on sequence features only (92,101). One possible explanation is that, since many biological features are correlated, the base-pairing information may be indirectly and inexplicitly captured by the model after including additional genomic features, such as, secondary structure of RNA and genomic conservation. Additionally, our previous studies showed that, including additional genomic features can effectively improve the accuracy of a predictor, for example, for m6A on mRNAs (42), lncRNAs (102) and introns (103), as well as for m1A (77), Pseudouridine (101) and m7G (74) site prediction. It may be worth noting that, although existing methods did not explicitly model the base-pairing mechanisms between target RNAs and snoRNPs, they may still vaguely capture the relevant patterns. For example, a previous study showed that there exist snoRNAs that contain two conserved sequence motifs, namely box C (RUGAUGA) and box D (CUGA), and the 2′-O-methylation occurs on the target RNA precisely five nucleotides upstream of the box D. Sequence with the corresponding nucleotide contents should show higher probability for 2′-O-methylation. However, if there exists other more prevalent mechanisms, this relative weak pattern may not be detected, leading to false prediction related to the sites formed from this mechanism. Meanwhile, for snoRNP-guided RNA modification sites, changes due to mutation on these small RNAs cannot be captured by the analysis pipeline of RMDisease; similarly, changes due to mutation of key RNA modification enzyme genes, such as writers (e.g. METTL3 and METTL14) and erasers (e.g. FTO and ALKBH5), were not covered in this database, either. However, those mutations can potentially disturb the epitranscriptome at a much greater scale.

It is also worth noting that substantial discrepancy has been observed among different epitranscriptome profiling approaches, which can capture different bias (104–107). To minimize the impact of technology usage, special efforts have been made in this study to obtain the most comprehensive collection of the RNA modification sites including those generated from different technologies (Supplementary Table S1). Multiple machine learn models were trained with these datasets for each RNA modification to produce the most reliable results out of the data that is available.

In summary, RMDisease will serve as a useful resource for studies of genetic factors concerning the epitranscriptome regulatory circuits and their potential roles in pathogenesis.

ACKNOWLEDGEMENTS

Author's contribution: Z.W. conceived the idea; K.C. and B.S. collected and processed the data; Y.T. built the website; K.C. and B.S. drafted the manuscript. All authors read, critically revised and approved the final manuscript.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

National Natural Science Foundation of China [31671373]; XJTLU Key Program Special Fund [KSF-E-51]; AI University Research Centre through XJTLU Key Programme Special Fund [KSF-P-02]. Funding for open access charge: National Natural Science Foundation of China [31671373]; XJTLU Key Program Special Fund [KSF-E-51]; AI University Research Centre through XJTLU Key Programme Special Fund [KSF-P-02].

Conflict of interest statement. D.J.R. is Executive Editor of NAR.

REFERENCES

1. 

Sauna Z.E., Kimchi-Sarfaty C. Understanding the contribution of synonymous mutations to human disease. Nat. Rev. Genet.2011; 12:683691.

2. 

Wu X., Hurst L.D. Determinants of the usage of splice-associated cis-motifs predict the distribution of human pathogenic SNPs. Mol. Biol. Evol.2016; 33:518529.

3. 

Mao F., Xiao L., Li X., Liang J., Teng H., Cai W., Sun Z.S. RBP-Var: a database of functional variants involved in regulation mediated by RNA-binding proteins. Nucleic Acids Res.2015; 44:D154D163.

4. 

Andersen M.C., Engström P.G., Lithwick S., Arenillas D., Eriksson P., Lenhard B., Wasserman W.W., Odeberg J. In silico detection of sequence variations modifying transcriptional regulation. PLoS Comput. Biol.2008; 4:e5.

5. 

Riley T.R., Lazarovici A., Mann R.S., Bussemaker H.J. Building accurate sequence-to-affinity models from high-throughput in vitro protein-DNA binding data using FeatureREDUCE. Elife. 2015; 4:e06397.

6. 

Ryu G.M., Song P., Kim K.W., Oh K.S., Park K.J., Kim J.H. Genome-wide analysis to predict protein sequence variations that change phosphorylation sites or their corresponding kinases. Nucleic Acids Res.2009; 37:12971307.

7. 

Ren J., Jiang C., Gao X., Liu Z., Yuan Z., Jin C., Wen L., Zhang Z., Xue Y., Yao X. PhosSNP for systematic analysis of genetic polymorphisms that influence protein phosphorylation. Mol. Cell. Proteomics. 2010; 9:623634.

8. 

Kim Y., Kang C., Min B., Yi G.S. Detection and analysis of disease-associated single nucleotide polymorphism influencing post-translational modification. BMC Med Genomics. 2015; 8:S7.

9. 

Wagih O., Reimand J., Bader G.D. MIMP: predicting the impact of mutations on kinase-substrate phosphorylation. Nat. Methods. 2015; 12:531533.

10. 

Xu H.-D., Shi S.-P., Chen X., Qiu J.-D. Systematic analysis of the genetic variability that impacts SUMO conjugation and their involvement in human diseases. Sci. Rep.2015; 5:10900.

11. 

Krassowski M., Paczkowska M., Cullion K., Huang T., Dzneladze I., Ouellette B.F.F., Yamada J.T., Fradet-Turcotte A., Reimand J. ActiveDriverDB: human disease mutations and genome variation in post-translational modification sites of proteins. Nucleic Acids Res.2017; 46:D901D910.

12. 

Patrick R., Kobe B., Lê Cao K.-A., Bodén M. PhosphoPICK-SNP: quantifying the effect of amino acid variants on protein phosphorylation. Bioinformatics. 2017; 33:17731781.

13. 

Groenning A.G.B., Doktor T.K., Larsen S.J., Petersen U.S.S., Holm L.L., Bruun G.H., Hansen M.B., Hartung A.-M., Baumbach J., Andresen B.S. DeepCLIP: Predicting the effect of mutations on protein-RNA binding with deep learning. Nucleic Acids Res.2020; 48:70997118.

14. 

Liu Z.-X., Yu K., Dong J., Zhao L., Liu Z., Zhang Q., Li S., Du Y., Cheng H. Precise prediction of calpain cleavage sites and their aberrance caused by mutations in cancer. Front. Genet.2019; 10:715.

15. 

Wang P., Li X., Gao Y., Guo Q., Ning S., Zhang Y., Shang S., Wang J., Wang Y., Zhi H.et al. LnCeVar: a comprehensive database of genomic variations that disturb ceRNA network regulation. Nucleic Acids Res.2019; 48:D111D117.

16. 

Yang Y., Zhang Q., Miao Y.-R., Yang J., Yang W., Yu F., Wang D., Guo A.-Y., Gong J. SNP2APA: a database for evaluating effects of genetic variants on alternative polyadenylation in human cancers. Nucleic Acids Res.2019; 48:D226D232.

17. 

Zheng Y., Nie P., Peng D., He Z., Liu M., Xie Y., Miao Y., Zuo Z., Ren J. m6AVar: a database of functional variants involved in m6A modification. Nucleic Acids Res.2017; 46:D139D145.

18. 

Song B., Tang Y., Chen K., Wei Z., Rong R., Lu Z., Su J., de Magalhaes J.P., Rigden D.J., Meng J. m7GHub: deciphering the location, regulation and pathogenesis of internal mRNA N7-methylguanosine (m7G) sites in human. Bioinformatics. 2020; 36:35283536.

19. 

He C. Grand challenge commentary: RNA epigenetics. Nat. Chem. Biol.2010; 6:863865.

20. 

Saletore Y., Meyer K., Korlach J., Vilfan I.D., Jaffrey S., Mason C.E. The birth of the Epitranscriptome: deciphering the function of RNA modifications. Genome Biol.2012; 13:175.

21. 

McCown P.J., Ruszkowska A., Kunkler C.N., Breger K., Hulewicz J.P., Wang M.C., Springer N.A., Brown J.A. Naturally occurring modified ribonucleosides. WIREs RNA. 2020; n/a:e1595.

22. 

Jones J.D., Monroe J., Koutmou K.S. A molecular-level perspective on the frequency, distribution, and consequences of messenger RNA modifications. WIREs RNA. 2020; n/a:e1586.

23. 

Esteve-Puig R., Bueno-Costa A., Esteller M. Writers, readers and erasers of RNA modifications in cancer. Cancer Lett.2020; 474:127137.

24. 

Zhang Z., Luo K., Zou Z., Qiu M., Tian J., Sieh L., Shi H., Zou Y., Wang G., Morrison J.et al. Genetic analyses support the contribution of mRNA N6-methyladenosine (m6A) modification to human disease heritability. Nat. Genet.2020; 52:939949.

25. 

Fustin J.M., Doi M., Yamaguchi Y., Hida H., Nishimura S., Yoshida M., Isagawa T., Morioka M.S., Kakeya H., Manabe I.et al. RNA-methylation-dependent RNA processing controls the speed of the circadian clock. Cell. 2013; 155:793806.

26. 

Wang X., Lu Z., Gomez A., Hon G.C., Yue Y., Han D., Fu Y., Parisien M., Dai Q., Jia G.et al. N6-methyladenosine-dependent regulation of messenger RNA stability. Nature. 2014; 505:117120.

27. 

Yang S., Wei J., Cui Y.H., Park G., Shah P., Deng Y., Aplin A.E., Lu Z., Hwang S., He C.et al. m(6)A mRNA demethylase FTO regulates melanoma tumorigenicity and response to anti-PD-1 blockade. Nat. Commun.2019; 10:2782.

28. 

Niu Y., Lin Z., Wan A., Chen H., Liang H., Sun L., Wang Y., Li X., Xiong X.F., Wei B.et al. RNA N6-methyladenosine demethylase FTO promotes breast tumor progression through inhibiting BNIP3. Mol. Cancer. 2019; 18:46.

29. 

Lin X., Chai G., Wu Y., Li J., Chen F., Liu J., Luo G., Tauler J., Du J., Lin S.et al. RNA m(6)A methylation regulates the epithelial mesenchymal transition of cancer cells and translation of Snail. Nat. Commun.2019; 10:2065.

30. 

Lee H., Bao S., Qian Y., Geula S., Leslie J., Zhang C., Hanna J.H., Ding L. Stage-specific requirement for Mettl3-dependent m(6)A mRNA methylation during haematopoietic stem cell differentiation. Nat. Cell Biol.2019; 21:700709.

31. 

Arango D., Sturgill D., Alhusaini N., Dillman A.A., Sweet T.J., Hanson G., Hosogane M., Sinclair W.R., Nanan K.K., Mandler M.D.et al. Acetylation of Cytidine in mRNA promotes translation efficiency. Cell. 2018; 175:18721886.

32. 

Ringeard M., Marchand V., Decroly E., Motorin Y., Bennasser Y. FTSJ3 is an RNA 2′-O-methyltransferase recruited by HIV to avoid innate immune sensing. Nature. 2019; 565:500504.

33. 

Meyer K.D., Saletore Y., Zumbo P., Elemento O., Mason C.E., Jaffrey S.R. Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell. 2012; 149:16351646.

34. 

Dominissini D., Moshitch-Moshkovitz S., Schwartz S., Salmon-Divon M., Ungar L., Osenberg S., Cesarkas K., Jacob-Hirsch J., Amariglio N., Kupiec M.et al. Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq. Nature. 2012; 485:201206.

35. 

Chen K., Lu Z., Wang X., Fu Y., Luo G.Z., Liu N., Han D., Dominissini D., Dai Q., Pan T.et al. High-resolution N(6) -methyladenosine (m(6) A) map using photo-crosslinking-assisted m(6) A sequencing. Angew. Chem. Int. Ed. Engl.2015; 54:15871590.

36. 

Linder B., Grozhik A.V., Olarerin-George A.O., Meydan C., Mason C.E., Jaffrey S.R. Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome. Nat. Methods. 2015; 12:767772.

37. 

Gjoneska E., Pfenning A.R., Mathys H., Quon G., Kundaje A., Tsai L.-H., Kellis M. Conserved epigenomic signals in mice and humans reveal immune basis of Alzheimer/'s disease. Nature. 2015; 518:365369.

38. 

Chen Z., Zhao P., Li F., Wang Y., Smith A.I., Webb G.I., Akutsu T., Baggag A., Bensmail H., Song J. Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences. Brief. Bioinform.2019; bbz112.

39. 

Lv H., Zhang Z.-M., Li S.-H., Tan J.-X., Chen W., Lin H. Evaluation of different computational methods on 5-methylcytosine sites identification. Brief. Bioinform.2019; 21:982995.

40. 

Chen W., Feng P., Ding H., Lin H., Chou K.-C. iRNA-Methyl: identifying N 6-methyladenosine sites using pseudo nucleotide composition. Anal. Biochem.2015; 490:2633.

41. 

Zhou Y., Zeng P., Li Y.-H., Zhang Z., Cui Q. SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features. Nucleic Acids Res.2016; 44:e91.

42. 

Chen K., Wu X., Zhang Q., Wei Z., Rong R., Lu Z., Meng J., de Magalhães J.P., Su J., Rigden D.J. WHISTLE: a high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach. Nucleic Acids Res.2019; 47:e41.

43. 

Zhu X., He J., Zhao S., Tao W., Xiong Y., Bi S. A comprehensive comparison and analysis of computational predictors for RNA N6-methyladenosine sites of Saccharomyces cerevisiae. Brief. Funct. Genomics. 2019; 19:367376.

44. 

Liu H., Wang H., Wei Z., Zhang S., Hua G., Zhang S.W., Zhang L., Gao S.J., Meng J., Chen X.et al. MeT-DB V2.0: elucidating context-specific functions of N6-methyl-adenosine methyltranscriptome. Nucleic Acids Res.2018; 46:D281D287.

45. 

Han Y., Feng J., Xia L., Dong X., Zhang X., Zhang S., Miao Y., Xu Q., Xiao S., Zuo Z.et al. CVm6A: a visualization and exploration database for m(6)As in cell lines. Cells. 2019; 8:168.

46. 

Liu S., He C., Chen M. REPIC: a database for exploring <em>N</em><sup>6</sup>-methyladenosine methylome. Genome Biol.2019; 21:100.

47. 

Huang T., Chen W., Liu J., Gu N., Zhang R. Genome-wide identification of mRNA 5-methylcytosine in mammals. Nat. Struct. Mol. Biol.2019; 26:380388.

48. 

Garcia-Campos M.A., Edelheit S., Toth U., Safra M., Shachar R., Viukov S., Winkler R., Nir R., Lasman L., Brandis A.et al. Deciphering the “m(6)A Code" via antibody-independent quantitative profiling. Cell. 2019; 178:731747.

49. 

Schwartz S., Bernstein D.A., Mumbach M.R., Jovanovic M., Herbst R.H., Leon-Ricardo B.X., Engreitz J.M., Guttman M., Satija R., Lander E.S.et al. Transcriptome-wide mapping reveals widespread dynamic-regulated pseudouridylation of ncRNA and mRNA. Cell. 2014; 159:148162.

50. 

Carlile T.M., Rojas-Duran M.F., Zinshteyn B., Shin H., Bartoli K.M., Gilbert W.V. Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells. Nature. 2014; 515:143146.

51. 

Li X., Zhu P., Ma S., Song J., Bai J., Sun F., Yi C. Chemical pulldown reveals dynamic pseudouridylation of the mammalian transcriptome. Nat. Chem. Biol.2015; 11:592597.

52. 

Khoddami V., Yerra A., Mosbruger T.L., Fleming A.M., Burrows C.J., Cairns B.R. Transcriptome-wide profiling of multiple RNA modifications simultaneously at single-base resolution. PNAS. 2019; 116:67846789.

53. 

Li X., Xiong X., Zhang M., Wang K., Chen Y., Zhou J., Mao Y., Lv J., Yi D., Chen X.W.et al. Base-resolution mapping reveals distinct m(1)A methylome in nuclear- and mitochondrial-encoded transcripts. Mol. Cell. 2017; 68:9931005.

54. 

Safra M., Sas-Chen A., Nir R., Winkler R., Nachshon A., Bar-Yaacov D., Erlacher M., Rossmanith W., Stern-Ginossar N., Schwartz S. The m1A landscape on cytosolic and mitochondrial mRNA at single-base resolution. Nature. 2017; 551:251255.

55. 

Khoddami V., Cairns B.R. Identification of direct targets and modified bases of RNA cytosine methyltransferases. Nat. Biotechnol.2013; 31:458464.

56. 

Yang X., Yang Y., Sun B.F., Chen Y.S., Xu J.W., Lai W.Y., Li A., Wang X., Bhattarai D.P., Xiao W.et al. 5-methylcytosine promotes mRNA export - NSUN2 as the methyltransferase and ALYREF as an m(5)C reader. Cell Res.2017; 27:606625.

57. 

Carter J.M., Emmett W., Mozos I.R., Kotter A., Helm M., Ule J., Hussain S. FICC-Seq: a method for enzyme-specified profiling of methyl-5-uridine in cellular RNA. Nucleic Acids Res.2019; 47:e113.

58. 

Dai Q., Moshitch-Moshkovitz S., Han D., Kol N., Amariglio N., Rechavi G., Dominissini D., He C. Nm-seq maps 2′-O-methylation sites in human mRNA with base precision. Nat. Methods. 2017; 14:695698.

59. 

Zhang L.S., Liu C., Ma H., Dai Q., Sun H.L., Luo G., Zhang Z., Zhang L., Hu L., Dong X.et al. Transcriptome-wide mapping of internal N(7)-Methylguanosine methylome in mammalian mRNA. Mol. Cell. 2019; 74:13041316.

60. 

Malbec L., Zhang T., Chen Y.S., Zhang Y., Sun B.F., Shi B.Y., Zhao Y.L., Yang Y., Yang Y.G. Dynamic methylome of internal mRNA N(7)-methylguanosine and its regulatory role in translation. Cell Res.2019; 29:927941.

61. 

He J., Fang T., Zhang Z., Huang B., Zhu X., Xiong Y. PseUI: Pseudouridine sites identification based on RNA sequence information. BMC Bioinformatics. 2018; 19:306.

62. 

Sun P.P., Chen Y.B., Liu B., Gao Y.X., Han Y., He F., Ji J.C. DeepMRMP: A new predictor for multiple types of RNA modification sites using deep learning. Math. Biosci. Eng.: MBE. 2019; 16:62316241.

63. 

Tahir M., Tayara H., Chong K.T. iPseU-CNN: Identifying RNA pseudouridine sites using convolutional neural networks. Mol. Ther. Nucleic Acids. 2019; 16:463470.

64. 

Mostavi M., Salekin S., Huang Y. 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 2018; 23942397.

65. 

Wu X., Wei Z., Chen K., Zhang Q., Su J., Liu H., Zhang L., Meng J. m6Acomet: large-scale functional prediction of individual m(6)A RNA methylation sites from an RNA co-methylation network. BMC Bioinformatics. 2019; 20:223.

66. 

Zhou Y., Zeng P., Li Y.H., Zhang Z., Cui Q. SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features. Nucleic Acids Res.2016; 44:e91.

67. 

Chen W., Feng P., Yang H., Ding H., Lin H., Chou K.C. iRNA-3typeA: identifying three types of modification at RNA's adenosine sites. Mol. Ther. Nucleic Acids. 2018; 11:468474.

68. 

Liu K., Chen W. iMRM:a platform for simultaneously identifying multiple kinds of RNA modifications. Bioinformatics. 2020; 36:33363342.

69. 

Yang H., Lv H., Ding H., Chen W., Lin H. iRNA-2OM: A sequence-based predictor for identifying 2′-O-Methylation sites in homo sapiens. J. Comput. Biol.2018; 25:12661277.

70. 

Lv H., Zhang Z.M., Li S.H., Tan J.X., Chen W., Lin H. Evaluation of different computational methods on 5-methylcytosine sites identification. Brief. Bioinform.2019; 21:982995.

71. 

Chen W., Feng P., Tang H., Ding H., Lin H. RAMPred: identifying the N(1)-methyladenosine sites in eukaryotic transcriptomes. Sci. Rep.2016; 6:31080.

72. 

Feng P., Ding H., Yang H., Chen W., Lin H., Chou K.C. iRNA-PseColl: identifying the occurrence sites of different RNA modifications by incorporating collective effects of nucleotides into PseKNC. Mol. Ther. Nucleic Acids. 2017; 7:155163.

73. 

Chen W., Feng P., Song X., Lv H., Lin H. iRNA-m7G: identifying N(7)-methylguanosine sites by fusing multiple features. Mol. Ther. Nucleic Acids. 2019; 18:269274.

74. 

Song B., Tang Y., Chen K., Wei Z., Rong R., Lu Z., Su J., de Magalhães J.P., Rigden D.J., Meng J. m7GHub: deciphering the location, regulation and pathogenesis of internal mRNA N7-methylguanosine (m7G) sites in human. Bioinformatics. 2020; 36:35283536.

75. 

Chen W., Tang H., Ye J., Lin H., Chou K.C. iRNA-PseU: Identifying RNA pseudouridine sites. Mol. Ther. Nucleic Acids. 2016; 5:e332.

76. 

Song B., Tang Y., Wei Z., Liu G., Su J., Meng J., Chen K. PIANO: a web server for pseudouridine-site (Ψ) identification and functional annotation. Front. Genet.2020; 11:88.

77. 

Lian L., Lei X., Meng J., Wei Z. ISGm1A: integration of sequence features and genomic features to improve the prediction of human m1A RNA methylation sites. IEEE Access. 2020; 8:8197181977.

78. 

Kumar P., Henikoff S., Ng P.C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc.2009; 4:10731081.

79. 

Adzhubei I.A., Schmidt S., Peshkin L., Ramensky V.E., Gerasimova A., Bork P., Kondrashov A.S., Sunyaev S.R. A method and server for predicting damaging missense mutations. Nat. Methods. 2010; 7:248249.

80. 

Chun S., Fay J.C. Identification of deleterious mutations within three human genomes. Genome Res.2009; 19:15531561.

81. 

Shihab H.A., Gough J., Cooper D.N., Stenson P.D., Barker G.L., Edwards K.J., Day I.N., Gaunt T.R. Predicting the functional, molecular, and phenotypic consequences of amino acid substitutions using hidden Markov models. Hum. Mutat.2013; 34:5765.

82. 

Wang K., Li M., Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res.2010; 38:e164.

83. 

Zhu Y., Xu G., Yang Y.T., Xu Z., Chen X., Shi B., Xie D., Lu Z.J., Wang P. POSTAR2: deciphering the post-transcriptional regulatory logics. Nucleic Acids Res.2018; 47:D203D211.

84. 

Agarwal V., Bell G.W., Nam J.-W., Bartel D.P. Predicting effective microRNA target sites in mammalian mRNAs. eLife. 2015; 4:e05005.

85. 

Li J.-H., Liu S., Zhou H., Qu L.-H., Yang J.-H. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein–RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res.2013; 42:D92D97.

86. 

Buniello A., MacArthur J.A L., Cerezo M., Harris L.W., Hayhurst J., Malangone C., McMahon A., Morales J., Mountjoy E., Sollis E.et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res.2018; 47:D1005D1012.

87. 

Landrum M.J., Lee J.M., Benson M., Brown G., Chao C., Chitipiralla S., Gu B., Hart J., Hoffman D., Hoover J.et al. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res.2015; 44:D862D868.

88. 

Johnson A.D., O’Donnell C.J. An open access database of genome-wide association results. BMC Med. Genet.2009; 10:6.

89. 

Dong Q., Li F., Xu Y., Xiao J., Xu Y., Shang D., Zhang C., Yang H., Tian Z., Mi K.et al. RNAactDrug: a comprehensive database of RNAs associated with drug sensitivity from multi-omics data. Brief. Bioinform.2019; bbz142.

90. 

Buels R., Yao E., Diesh C.M., Hayes R.D., Holmes I.H. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol.2016; 17:66.

91. 

Boccaletto P., Machnicka M.A., Purta E., Piątkowski P., Bagiński B., Wirecki T.K., de Crécy-Lagard V., Ross R., Limbach P.A., Kotter A.et al. MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res.2017; 46:D303D307.

92. 

Song B., Chen K., Tang Y., Ma J., Meng J., Wei Z. PSI-MOUSE: predicting mouse pseudouridine sites from sequence and genome-derived features. Evolutionary Bioinformatics. 2020; 16:1176934320925752.

93. 

Chen W., Feng P., Song X., Lv H., Lin H. iRNA-m7G: identifying N7-methylguanosine sites by fusing multiple features. Mol/ Ther/ - Nucleic Acids. 2019; 18:269274.

94. 

Kiss-László Z., Henry Y., Bachellerie J.-P., Caizergues-Ferrer M., Kiss T. Site-specific ribose methylation of preribosomal RNA: a novel function for small nucleolar RNAs. Cell. 1996; 85:10771088.

95. 

Watkins N.J., Gottschalk A., Neubauer G., Kastner B., Fabrizio P., Mann M., Lührmann R. Cbf5p, a potential pseudouridine synthase, and Nhp2p, a putative RNA-binding protein, are present together with Gar1p in all H BOX/ACA-motif snoRNPs and constitute a common bipartite structure. RNA. 1998; 4:15491568.

96. 

Lowe T.M., Eddy S.R. A computational screen for methylation guide snoRNAs in yeast. Science. 1999; 283:11681171.

97. 

Badis G., Fromont-Racine M., Jacquier A. A snoRNA that guides the two most conserved pseudouridine modifications within rRNA confers a growth advantage in yeast. RNA. 2003; 9:771779.

98. 

Motorin Y., Helm M. RNA nucleotide methylation. Wiley Interdiscip/ Rev.: RNA. 2011; 2:611631.

99. 

Chen W., Feng P., Ding H., Lin H. PAI: predicting adenosine to inosine editing sites by using pseudo nucleotide compositions. Sci. Rep.2016; 6:3512335123.

100. 

Dou L., Li X., Ding H., Xu L., Xiang H. Is there any sequence feature in the RNA pseudouridine modification prediction problem. Mol. Ther. - Nucleic Acids. 2019; 19:

101. 

Song B., Tang Y., Wei Z., Liu G., Su J., Meng J., Chen K. PIANO: a web server for pseudouridine-site (Ψ) identification and functional annotation. Frontiers in Genetics. 2020; 11:88.

102. 

Liu L., Lei X., Fang Z., Tang Y., Meng J., Wei Z. LITHOPHONE: improving lncRNA methylation site prediction using an ensemble predictor. Frontiers in Genetics. 2020; 11:545.

103. 

Liu L., Lei X., Meng J., Wei Z. WITMSG: large-scale prediction of human intronic m6A RNA methylation sites from sequence and genomic features. Curr. Genomics. 2020; 21:6776.

104. 

Adachi H., De Zoysa M.D., Yu Y.-T. Post-transcriptional pseudouridylation in mRNA as well as in some major types of noncoding RNAs. Biochim. Biophys. Acta (BBA)-Gene Regul. Mech.2018; 1862:230239.

105. 

Zaringhalam M., Papavasiliou F.N. Pseudouridylation meets next-generation sequencing. Methods. 2016; 107:6372.

106. 

Hussain S., Aleksic J., Blanco S., Dietmann S., Frye M. Characterizing 5-methylcytosine in the mammalian epitranscriptome. Genome Biol.2013; 14:215.

107. 

Capitanchik C., Toolan-Kerr P., Luscombe N.M., Ule J. How do you identify m6 a methylation in transcriptomes at high resolution? a comparison of recent datasets. Front. Genet.2020; 11:398.

108. 

Chen W., Tang H., Lin H. MethyRNA: a web server for identification of N(6)-methyladenosine sites. J. Biomol. Struct. Dyn.2017; 35:683687.