Nucleic Acids Research
MNDR v3.0: mammal ncRNA–disease repository with increased coverage and annotation

The authors wish it to be known that, in their opinion, the first five authors should be regarded as Joint First Authors.

Article Type: research-article
Abstract

Many studies have indicated that non-coding RNA (ncRNA) dysfunction is closely related to numerous diseases. The recent accumulation of ncRNA–disease associations has left existing databases unable to meet the demands of biomedical research, so constant updating of ncRNA–disease resources has become essential. Here, we have updated the mammal ncRNA–disease repository (MNDR, http://www.rna-society.org/mndr/) to version 3.0, which contains more than one million entries, a four-fold increase in data compared with the previous version. Experimental and predicted circRNA–disease associations have been integrated, increasing the number of ncRNA categories to five and the number of mammalian species to 11. Moreover, ncRNA–disease related drug annotations and associations, as well as ncRNA subcellular localizations and interactions, were added. In addition, three ncRNA–disease (miRNA/lncRNA/circRNA) prediction tools are provided, and the website has been optimized to make it more practical and user-friendly. In summary, MNDR v3.0 will be a valuable resource for the investigation of disease mechanisms and clinical treatment strategies.

Ning, Cui, Zheng, Wang, Luo, Yang, Du, Cheng, Dou, and Wang: MNDR v3.0: mammal ncRNA–disease repository with increased coverage and annotation

INTRODUCTION

The associations between ncRNA dysfunction and diseases have been a focus of attention in recent decades (1–5). With the continuous advancement of sequencing technology and prediction algorithms, the number of experimentally validated and computationally predicted ncRNA–disease associations has increased explosively. Some ncRNAs whose functions were once unclear, such as circRNAs, have recently been found to be closely related to diseases (6–10). This growth requires that existing ncRNA–disease data resources be updated constantly to satisfy the needs of disease research and clinical applications. For example, to the best of our knowledge, large numbers of piRNAs were found to be dysregulated in Parkinson's disease (11) but have not been collected in any related database. In addition, research on ncRNAs and drugs has developed rapidly. For example, some long non-coding RNAs (lncRNAs) in cancer may represent potential therapeutic targets, and both microRNAs (miRNAs) and lncRNAs have been reported to play important roles in drug resistance (12,13). However, the collection and integration of related drugs remain insufficient, which limits research on the associations and mechanisms linking drugs, ncRNAs and diseases. Furthermore, some studies have indicated that the subcellular localization and interactions of ncRNAs can also affect diseases (14,15). Accordingly, it is essential to update the relevant database in a timely manner.

For these reasons, we developed this version of the mammal ncRNA–disease repository (MNDR, http://www.rna-society.org/mndr/). We integrated different kinds of ncRNA–disease associations, obtained through manual literature curation and prediction algorithms, with other resources under one common framework. Compared with the previous release, the update brings the following improvements: (i) more than one million entries, a four-fold increase in data, and an increase to 11 mammals; (ii) the addition of circRNA associations; (iii) the addition of drug-related information; (iv) the integration of ncRNA subcellular localizations and interactions; (v) support for three ncRNA–disease prediction tools; and (vi) a more user-friendly interface and web services. In summary, MNDR v3.0 provides comprehensive data on ncRNA–disease associations in mammals, helping researchers to better understand the mechanisms linking ncRNAs and diseases.

DATA COLLECTION AND ORGANIZATION

MNDR v3.0 contains experimentally validated and computationally predicted ncRNA–disease associations from the literature and other resources, respectively. We reviewed over 25 000 published studies and acquired >40 000 experimental ncRNA–disease associations. The diverse ncRNA–disease associations from 17 related databases of experimentally validated associations (16–32) and 14 computational prediction algorithms (31–44) were also integrated (Supplemental Table S1). In addition, drug-related information was obtained from four databases: ncDR (45), NoncoRNA (46), NRDTD (47) and RNAInter (48). ncRNA subcellular localizations were obtained from RNALocate (15) and interactions from RNAInter (48).

To unify the data from different sources against authoritative reference databases, lncRNA symbols were mapped to NCBI Gene and Ensembl (49), while miRNA, circRNA and piRNA symbols were mapped to miRBase (50), circBase (51) and piRBase (52), respectively. Sno/scaRNAbase (53) and snoRNA-LBME-db (54) were chosen for snoRNA symbols. Disease terms were mapped to the Disease Ontology (55) and MeSH vocabularies. Related drug annotations were taken from PubChem Compound.
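The mapping described above can be pictured as a lookup from ncRNA category to reference resource, followed by alias resolution. The following is only a minimal sketch under stated assumptions: the category-to-resource table comes from the text, but the alias table, its toy symbols and the function name `normalize_symbol` are hypothetical and do not reflect the actual MNDR pipeline.

```python
from typing import Optional

# Reference resources per ncRNA category, as named in the text.
REFERENCE_SOURCES: dict[str, tuple[str, ...]] = {
    "lncRNA": ("NCBI Gene", "Ensembl"),
    "miRNA": ("miRBase",),
    "circRNA": ("circBase",),
    "piRNA": ("piRBase",),
    "snoRNA": ("Sno/scaRNAbase", "snoRNA-LBME-db"),
}

# Hypothetical alias table with toy symbols (not real records); a real table
# would be built from the reference resources listed above.
TOY_ALIASES: dict[tuple[str, str], str] = {
    ("lncRNA", "toy-lnc-alias"): "TOYLNC1",
    ("miRNA", "toy-mir-1a"): "toy-miR-1",
}


def normalize_symbol(
    category: str, raw_symbol: str
) -> Optional[tuple[str, tuple[str, ...]]]:
    """Return (official symbol, reference resources), or None if unmapped."""
    key = (category, raw_symbol.strip().lower())
    official = TOY_ALIASES.get(key)
    if official is None:
        return None  # unmapped symbols would need manual curation
    return official, REFERENCE_SOURCES[category]


print(normalize_symbol("lncRNA", "  Toy-LNC-Alias "))
```

Keeping the resource table separate from the alias table means a new ncRNA category only needs one new entry in `REFERENCE_SOURCES`.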

RESULTS

MNDR v3.0 statistics

In total, MNDR v3.0 contains 1 007 831 ncRNA–disease associations across 11 mammals and cites 24 323 publications (Figure 1). Regarding prediction data, MNDR v3.0 includes 237 329 miRNA-associated, 252 144 lncRNA-associated and 296 910 circRNA-associated entries for Homo sapiens, as well as 2434 and 28 predicted lncRNA–disease associations for Mus musculus and Rattus norvegicus, respectively. Compared with the previous release, the number of species increased from 6 to 11 in MNDR v3.0 (Table 1). There are 6301 non-redundant miRNAs, together with 39 880 lncRNAs, 20 506 circRNAs, 10 894 piRNAs and 521 snoRNAs, and the number of disease types increased to 1614. In addition, related drug annotations and four types of ncRNA–drug associations (drug target, drug sensitive, drug resistant and drug interaction) were included.

Figure 1. Statistics on MNDR v3.0. (A) The distribution of experimental ncRNA–disease associations in five types of ncRNA (miRNA/lncRNA/circRNA/piRNA/snoRNA). (B) Number of experimental associations in 11 mammals.

Table 1. The features and development of MNDR

| Feature | MNDR v1.0 | MNDR v2.0 | MNDR v3.0 |
| --- | --- | --- | --- |
| Entry | 1149 | 261042 | 1007831 |
| RNA symbol | 369 | 23858 | 78102 |
| Disease | 175 | 1416 | 1614 |
| Species | 3 | 6 | 11 |
| Literature | 377 | 11504 | 24323 |
| RNA category | miRNA/lncRNA/piRNA/snoRNA | miRNA/lncRNA/piRNA/snoRNA | miRNA/lncRNA/circRNA/piRNA/snoRNA |
| Detailed information | Basic annotation; Evidence support; Reference | Basic annotation; DO/MeSH description; Evidence support; Reference | Basic annotation; DO/MeSH description; Drug information; RNA interaction; RNA localization; Evidence support; Reference |
| Web application | - | Browse; Advanced filter search | Browse; Exact search/Fuzzy search/Batch search; Three prediction tools: SPM, SIMCLDA, DeepDCR |
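As a quick consistency check on these figures, the per-category ncRNA counts given in the text can be summed and compared with the "RNA symbol" row of Table 1, and the growth in entries between releases can be computed. This is a small arithmetic sketch; the variable names are illustrative, and the v2.0 entry count is the value read from Table 1.

```python
# Non-redundant ncRNA counts reported in the text for MNDR v3.0.
ncrna_counts = {
    "miRNA": 6301,
    "lncRNA": 39880,
    "circRNA": 20506,
    "piRNA": 10894,
    "snoRNA": 521,
}
total_symbols = sum(ncrna_counts.values())

# Entry counts as read from Table 1.
entries_v2 = 261042
entries_v3 = 1007831
fold_change = entries_v3 / entries_v2

print(total_symbols)          # 78102, matching the "RNA symbol" row for v3.0
print(round(fold_change, 2))  # 3.86, i.e. close to the four-fold increase stated
```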

Database usage

To meet the varied needs of biomedical researchers, MNDR v3.0 provides a more user-friendly web interface with convenient search and browse functions, including new fuzzy and batch query options. ‘Fuzzy Search’ lets users search ncRNA–disease associations with a non-standard or uncertain ncRNA or disease name and then choose from a candidate list. ‘Batch Search’ accepts a list of official ncRNA symbols/IDs and disease names/IDs (DOID/MeSH ID), or an uploaded text file, and returns multiple ncRNA–disease associations. Users can thus select ‘Exact Search’ to filter the search results, ‘Fuzzy Search’ to focus on an ncRNA or disease of interest, or ‘Batch Search’ to submit customized queries in bulk. Search results can be downloaded by clicking the button above the result table. MNDR v3.0 also offers a download option on the ‘Browse’ page.
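To illustrate the idea behind the fuzzy and batch query modes, the sketch below matches an uncertain name against a list of known names and cleans a batch input of one term per line. It is an assumption-laden local example using Python's standard `difflib` module; the function names, the sample names and the matching cutoff are hypothetical and say nothing about how the MNDR website implements these searches.

```python
import difflib


def fuzzy_candidates(query, known_names, limit=3, cutoff=0.6):
    """Return known names that approximately match the query, best first."""
    # Compare case-insensitively but return names in their original casing.
    by_lower = {name.lower(): name for name in known_names}
    hits = difflib.get_close_matches(
        query.strip().lower(), list(by_lower), n=limit, cutoff=cutoff
    )
    return [by_lower[hit] for hit in hits]


def parse_batch(text):
    """Split a batch input (one term per line), dropping blanks and repeats."""
    seen = set()
    terms = []
    for line in text.splitlines():
        term = line.strip()
        if term and term not in seen:
            seen.add(term)
            terms.append(term)
    return terms


# Sample names and IDs are illustrative inputs only.
names = ["HOTAIR", "MALAT1", "H19", "MEG3"]
print(fuzzy_candidates("hotiar", names))
print(parse_batch("HOTAIR\n\nDOID:162\nHOTAIR\n"))
```

Returning a ranked candidate list rather than a single best guess mirrors the described workflow, in which the user picks the intended name from the candidates.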

Prediction tools

MNDR v3.0 provides three ncRNA–disease prediction tools on the website (Figure 2): SPM (structural perturbation method) for miRNA–disease prediction (56); SIMCLDA, which is based on inductive matrix completion, for lncRNA–disease prediction (57); and DeepDCR, a deep-forest algorithm combined with positive-unlabeled learning, for calculating associations between circRNAs and diseases (42).
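For readers unfamiliar with inductive matrix completion, its generic textbook form is sketched below. This is not taken from the SIMCLDA paper, and the symbols are assumptions introduced here: $A$ is a partially observed lncRNA–disease association matrix with observed index set $\Omega$, $x_i$ and $y_j$ are feature vectors for lncRNA $i$ and disease $j$, $W$ and $H$ are low-rank factor matrices, and $\lambda$ is a regularization weight.

```latex
\min_{W,\,H} \;
\sum_{(i,j)\in\Omega} \left( A_{ij} - x_i^{\top} W H^{\top} y_j \right)^{2}
\;+\; \frac{\lambda}{2} \left( \lVert W \rVert_F^{2} + \lVert H \rVert_F^{2} \right)
```

Under this formulation, the predicted score for an unobserved pair $(i, j)$ is $x_i^{\top} W H^{\top} y_j$, which is why the approach can score lncRNAs or diseases that have features but no known associations.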

Figure 2. Snapshot of three ncRNA–disease prediction tools in MNDR: SPM, SIMCLDA and DeepDCR (left: input option, right: the presentation of results).

CONCLUSIONS AND PERSPECTIVES

With the continuous development of high-throughput technologies and predictive algorithms, the evidence for ncRNA–disease associations has increased greatly in recent years. Meanwhile, research on the ternary relationships between related drugs, ncRNAs and diseases is receiving increasing attention, and ncRNA subcellular localizations and interactions have also been confirmed to be related to the regulation of diseases. To address these aspects, the MNDR database was updated with the latest data and with new and improved features. MNDR v3.0 contains over one million entries, including five types of ncRNA and covering 11 mammals. With the massive growth of associations, the diversification of annotations and the optimization of the website interface and functions, MNDR v3.0 depicts a system-level ncRNA–disease landscape, helping researchers obtain accurate and comprehensive data more conveniently for further exploration. In the future, we may optimize the evaluation algorithm to respond to accumulating ncRNA–disease associations, and we will continue to maintain and update the MNDR database to satisfy the growing requirements for the investigation of disease mechanisms and related clinical applications.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

National Key Research and Development Project of China [2019YFA0801800]; National Natural Science Foundation of China [81770104]; Basic and Applied Basic Research Fund of Guangdong Province [2019A1515010784, 2019A1515110701]. Funding for open access charge: National Key Research and Development Project of China [2019YFA0801800]; National Natural Science Foundation of China [81770104]; Basic and Applied Basic Research Fund of Guangdong Province [2019A1515010784, 2019A1515110701].

Conflict of interest statement. None declared.

REFERENCES

1. Esteller M. Non-coding RNAs in human disease. Nat. Rev. Genet. 2011; 12:861–874.
2. Harries L.W. Long non-coding RNAs and human disease. Biochem. Soc. Trans. 2012; 40:902–906.
3. Rogoyski O.M., Pueyo J.I., Couso J.P., Newbury S.F. Functions of long non-coding RNAs in human disease and their conservation in Drosophila development. Biochem. Soc. Trans. 2017; 45:895–904.
4. Liu T.Y., Zhang Y.C., Lin Y.Q., Hu Y.F., Zhang Y., Wang D., Wang Y., Ning L. Exploration of invasive mechanisms via global ncRNA-associated virus-host crosstalk. Genomics. 2020; 112:1643–1650.
5. Jiang Q., Wang Y., Hao Y., Juan L., Teng M., Zhang X., Li M., Wang G., Liu Y. miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 2009; 37:D98–104.
6. Idda M.L., Munk R., Abdelmohsen K., Gorospe M. Noncoding RNAs in Alzheimer's disease. Wiley Interdiscipl. Rev. RNA. 2018; 9:doi:10.1002/wrna.1463.
7. Panir K., Schjenken J.E., Robertson S.A., Hull M.L. Non-coding RNAs in endometriosis: a narrative review. Hum. Reprod. Update. 2018; 24:497–515.
8. Verdier J., Breunig I.R., Ohse M.C., Roubrocks S., Kleinfeld S., Roy S., Streetz K., Trautwein C., Roderburg C., Sellge G. Faecal Micro-RNAs in inflammatory bowel diseases. J. Crohn's & Colitis. 2020; 14:110–117.
9. Zhang Y., Liu T., Chen L., Yang J., Yin J., Zhang Y., Yun Z., Xu H., Ning L., Guo F. et al. RIscoper: a tool for RNA-RNA interaction extraction from the literature. Bioinformatics. 2019; 35:3199–3202.
10. Chen X., Yang T., Wang W., Xi W., Zhang T., Li Q., Yang A., Wang T. Circular RNAs in immune responses and immune diseases. Theranostics. 2019; 9:588–607.
11. Schulze M., Sommer A., Plotz S., Farrell M., Winner B., Grosch J., Winkler J., Riemenschneider M.J. Sporadic Parkinson's disease derived neuronal cells show disease-specific mRNA and small RNA signatures with abundant deregulation of piRNAs. Acta Neuropathol. Commun. 2018; 6:58.
12. Crooke S.T., Witztum J.L., Bennett C.F., Baker B.F. RNA-Targeted therapeutics. Cell Metab. 2018; 27:714–739.
13. Donlic A., Hargrove A.E. Targeting RNA in mammalian systems with small molecules. Wiley Interdiscipl. Rev. RNA. 2018; 9:e1477.
14. Li Y., Wang C., Miao Z., Bi X., Wu D., Jin N., Wang L., Wu H., Qian K., Li C. et al. ViRBase: a resource for virus-host ncRNA-associated interactions. Nucleic Acids Res. 2015; 43:D578–D582.
15. Zhang T., Tan P., Wang L., Jin N., Li Y., Zhang L., Yang H., Hu Z., Zhang L., Hu C. et al. RNALocate: a resource for RNA subcellular localizations. Nucleic Acids Res. 2017; 45:D135–D138.
16. Ruepp A., Kowarsch A., Schmidl D., Buggenthin F., Brauner B., Dunger I., Fobo G., Frishman G., Montrone C., Theis F.J. PhenomiR: a knowledgebase for microRNA expression in diseases and biological processes. Genome Biol. 2010; 11:R6.
17. Xie B., Ding Q., Han H., Wu D. miRCancer: a microRNA-cancer association database constructed by text mining on literature. Bioinformatics. 2013; 29:638–644.
18. Wang D., Gu J., Wang T., Ding Z. OncomiRDB: a database for the experimentally verified oncogenic and tumor-suppressive microRNAs. Bioinformatics. 2014; 30:2237–2238.
19. Chung I.F., Chang S.J., Chen C.Y., Liu S.H., Li C.Y., Chan C.H., Shih C.C., Cheng W.C. YM500v3: a database for small RNA sequencing in human cancer research. Nucleic Acids Res. 2017; 45:D925–D931.
20. Wang J., Cao Y., Zhang H., Wang T., Tian Q., Lu X., Lu X., Kong X., Liu Z., Wang N. et al. NSDNA: a manually curated database of experimentally supported ncRNAs associated with nervous system diseases. Nucleic Acids Res. 2017; 45:D902–D907.
21. Yang Z., Wu L., Wang A., Tang W., Zhao Y., Zhao H., Teschendorff A.E. dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers. Nucleic Acids Res. 2017; 45:D812–D818.
22. Cui T., Zhang L., Huang Y., Yi Y., Tan P., Zhao Y., Hu Y., Xu L., Li E., Wang D. MNDR v2.0: an updated resource of ncRNA–disease associations in mammals. Nucleic Acids Res. 2018; 46:D371–D374.
23. Fan C., Lei X., Fang Z., Jiang Q., Wu F.X. CircR2Disease: a manually curated database for experimentally supported circular RNAs associated with various diseases. Database (Oxford). 2018; 2018:bay044.
24. Yao D., Zhang L., Zheng M., Sun X., Lu Y., Liu P. Circ2Disease: a manually curated database of experimentally validated circRNAs in human disease. Sci. Rep. 2018; 8:11018.
25. Zhao Z., Wang K., Wu F., Wang W., Zhang K., Hu H., Liu Y., Jiang T. circRNA disease: a manually curated database of experimentally supported circRNA–disease associations. Cell Death Dis. 2018; 9:475.
26. Bao Z., Yang Z., Huang Z., Zhou Y., Cui Q., Dong D. LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases. Nucleic Acids Res. 2019; 47:D1034–D1037.
27. Gao Y., Wang P., Wang Y., Ma X., Zhi H., Zhou D., Li X., Fang Y., Shen W., Xu Y. et al. Lnc2Cancer v2.0: updated database of experimentally supported long non-coding RNAs in human cancers. Nucleic Acids Res. 2019; 47:D1028–D1033.
28. Huang Z., Shi J., Gao Y., Cui C., Zhang S., Li J., Zhou Y., Cui Q. HMDD v3.0: a database for experimentally supported human microRNA–disease associations. Nucleic Acids Res. 2019; 47:D1013–D1017.
29. Zhang W., Yao G., Wang J., Yang M., Wang J., Zhang H., Li W. ncRPheno: a comprehensive database platform for identification and validation of disease related noncoding RNAs. RNA Biol. 2020; 17:943–955.
30. Muhammad A., Waheed R., Khan N.A., Jiang H., Song X. piRDisease v1.0: a manually curated database for piRNA associated diseases. Database (Oxford). 2019; 2019:baz052.
31. Wang W.J., Wang Y.M., Hu Y., Lin Q., Chen R., Liu H., Cao W.Z., Zhu H.F., Tong C., Li L. et al. HDncRNA: a comprehensive database of non-coding RNAs associated with heart diseases. Database (Oxford). 2018; 2018:bay067.
32. Zhao H., Shi J., Zhang Y., Xie A., Yu L., Zhang C., Lei J., Xu H., Leng Z., Li T. et al. LncTarD: a manually-curated database of experimentally-supported functional lncRNA-target regulations in human diseases. Nucleic Acids Res. 2020; 48:D118–D126.
33. Cheng L., Hu Y., Sun J., Zhou M., Jiang Q. DincRNA: a comprehensive web-based bioinformatics toolkit for exploring disease associations and ncRNA function. Bioinformatics. 2018; 34:1953–1956.
34. Chen X., Yin J., Qu J., Huang L. MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA–disease association prediction. PLoS Comput. Biol. 2018; 14:e1006418.
35. You Z.H., Huang Z.A., Zhu Z., Yan G.Y., Li Z.W., Wen Z., Chen X. PBMDA: A novel and effective path-based computational model for miRNA–disease association prediction. PLoS Comput. Biol. 2017; 13:e1005455.
36. Mork S., Pletscher-Frankild S., Palleja Caro A., Gorodkin J., Jensen L.J. Protein-driven inference of miRNA–disease associations. Bioinformatics. 2014; 30:392–397.
37. Yu S.P., Liang C., Xiao Q., Li G.H., Ding P.J., Luo J.W. MCLPMDA: A novel method for miRNA–disease association prediction based on matrix completion and label propagation. J. Cell. Mol. Med. 2019; 23:1427–1438.
38. Lan W., Li M., Zhao K., Liu J., Wu F.X., Pan Y., Wang J. LDAP: a web server for lncRNA–disease association prediction. Bioinformatics. 2017; 33:458–460.
39. Wang J., Ma R., Ma W., Chen J., Yang J., Xi Y., Cui Q. LncDisease: a sequence based bioinformatics tool for predicting lncRNA–disease associations. Nucleic Acids Res. 2016; 44:e90.
40. Li J., Han X., Wan Y., Zhang S., Zhao Y., Fan R., Cui Q., Zhou Y. TAM 2.0: tool for MicroRNA set analysis. Nucleic Acids Res. 2018; 46:W180–W185.
41. Sun J., Shi H., Wang Z., Zhang C., Liu L., Wang L., He W., Hao D., Liu S., Zhou M. Inferring novel lncRNA–disease associations based on a random walk model of a lncRNA functional similarity network. Mol. Biosyst. 2014; 10:2074–2081.
42. Zeng X., Zhong Y., Lin W., Zou Q. Predicting disease-associated circular RNAs using deep forests combined with positive-unlabeled learning methods. Brief. Bioinform. 2019; 21:1425–1436.
43. Wang Y., Nie C., Zang T., Wang Y. Predicting circRNA-Disease Associations Based on circRNA Expression Similarity and Functional Similarity. Front. Genet. 2019; 10:832.
44. Chen X., Yan C.C., Luo C., Ji W., Zhang Y., Dai Q. Constructing lncRNA functional similarity network based on lncRNA–disease associations and disease semantic similarity. Sci. Rep. 2015; 5:11338.
45. Dai E., Yang F., Wang J., Zhou X., Song Q., An W., Wang L., Jiang W. ncDR: a comprehensive resource of non-coding RNAs involved in drug resistance. Bioinformatics. 2017; 33:4010–4011.
46. Li L., Wu P., Wang Z., Meng X., Zha C., Li Z., Qi T., Zhang Y., Han B., Li S. et al. NoncoRNA: a database of experimentally supported non-coding RNAs and drug targets in cancer. J. Hematol. Oncol. 2020; 13:15.
47. Chen X., Sun Y.Z., Zhang D.H., Li J.Q., Yan G.Y., An J.Y., You Z.H. NRDTD: a database for clinically or experimentally supported non-coding RNAs and drug targets associations. Database (Oxford). 2017; 2017:bax057.
48. Lin Y., Liu T., Cui T., Wang Z., Zhang Y., Tan P., Huang Y., Yu J., Wang D. RNAInter in 2020: RNA interactome repository with increased coverage and annotation. Nucleic Acids Res. 2020; 48:D189–D197.
49. Yates A.D., Achuthan P., Akanni W., Allen J., Allen J., Alvarez-Jarreta J., Amode M.R., Armean I.M., Azov A.G., Bennett R. et al. Ensembl 2020. Nucleic Acids Res. 2020; 48:D682–D688.
50. Kozomara A., Birgaoanu M., Griffiths-Jones S. miRBase: from microRNA sequences to function. Nucleic Acids Res. 2019; 47:D155–D162.
51. Glazar P., Papavasileiou P., Rajewsky N. circBase: a database for circular RNAs. RNA. 2014; 20:1666–1670.
52. Wang J., Zhang P., Lu Y., Li Y., Zheng Y., Kan Y., Chen R., He S. piRBase: a comprehensive database of piRNA sequences. Nucleic Acids Res. 2019; 47:D175–D180.
53. Xie J., Zhang M., Zhou T., Hua X., Tang L., Wu W. Sno/scaRNAbase: a curated database for small nucleolar RNAs and cajal body-specific RNAs. Nucleic Acids Res. 2007; 35:D183–D187.
54. Lestrade L., Weber M.J. snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs. Nucleic Acids Res. 2006; 34:D158–D162.
55. Schriml L.M., Mitraka E., Munro J., Tauber B., Schor M., Nickle L., Felix V., Jeng L., Bearer C., Lichenstein R. et al. Human Disease Ontology 2018 update: classification, content and workflow expansion. Nucleic Acids Res. 2019; 47:D955–D962.
56. Zeng X., Liu L., Lu L., Zou Q. Prediction of potential disease-associated microRNAs using structural perturbation method. Bioinformatics. 2018; 34:2425–2432.
57. Lu C., Yang M., Luo F., Wu F.X., Li M., Pan Y., Li Y., Wang J. Prediction of lncRNA–disease associations based on inductive matrix completion. Bioinformatics. 2018; 34:3357–3364.