Nucleic Acids Research
Home GIMICA: host genetic and immune factors shaping human microbiota
GIMICA: host genetic and immune factors shaping human microbiota
GIMICA: host genetic and immune factors shaping human microbiota

The authors wish it to be known that, in their opinion, the first three authors should be regarded as Joint First Authors.

Article Type: research-article Article History
Abstract

Besides the environmental factors having tremendous impacts on the composition of microbial community, the host factors have recently gained extensive attentions on their roles in shaping human microbiota. There are two major types of host factors: host genetic factors (HGFs) and host immune factors (HIFs). These factors of each type are essential for defining the chemical and physical landscapes inhabited by microbiota, and the collective consideration of both types have great implication to serve comprehensive health management. However, no database was available to provide the comprehensive factors of both types. Herein, a database entitled ‘Host Genetic and Immune Factors Shaping Human Microbiota (GIMICA)’ was constructed. Based on the 4257 microbes confirmed to inhabit nine sites of human body, 2851 HGFs (1368 single nucleotide polymorphisms (SNPs), 186 copy number variations (CNVs), and 1297 non-coding ribonucleic acids (RNAs)) modulating the expression of 370 microbes were collected, and 549 HIFs (126 lymphocytes and phagocytes, 387 immune proteins, and 36 immune pathways) regulating the abundance of 455 microbes were also provided. All in all, GIMICA enabled the collective consideration not only between different types of host factor but also between the host and environmental ones, which is freely accessible without login requirement at: https://idrblab.org/gimica/

GIMICA is unique in: (i) providing both host genetic and immune factors shaping human microbiota and (ii) enabling collective consideration not only among various host factors but also between the host and environmental ones.
Graphical Abstract
GIMICA is unique in: (i) providing both host genetic and immune factors shaping human microbiota and (ii) enabling collective consideration not only among various host factors but also between the host and environmental ones.

Tang,Wu,Mou,Wang,Wang,Li,Guo,Yin,Xie,Wang,Wang,Ding,Xue,and Zhu: GIMICA: host genetic and immune factors shaping human microbiota

INTRODUCTION

The microbial communities (MCs) in different sites of human body are known to be instrumental in host physiology and health (1–4). The composition of these MCs has been generally considered as susceptible to a variety of environmental factors (5–7) (including: diet, lifestyle, disease, etc.), which thus mediates changes in host metabolic status (5) and regulates the development of chronic disease (6–9). Besides these environmental ones, the host factor has recently emerged as another crucial determinant that shapes human microbiota (10–12). Particularly, different from environmental ones, host factors are able to define the chemical and physical landscape (including nutrient availability and activity threshold for immune system) inhabited by host microbiota (13–18), which therefore determine the heritable nature of microbiome (12) and immune system activity (13,19).

There are two popular host factors that are frequently considered in current MC studies (19–23): (i) host genetic factors (HGFs, including: single nucleotide polymorphism (SNP), gene copy number variation (CNV) and non-coding ribonucleic acid (RNA) regulation) are crucial in exploring the heritability of microbiota compositions (24), revealing the mechanisms underlying the adaptation to environmental fluctuations (11), and discovering promising drug target (25) and targeted therapeutics (26,27); (ii) host immune factors (HIFs, including: checkpoint molecule, cytokine, antibody and lymphocyte) are critical for maintaining the symbiotic balance between host and microbiota (28), promoting the activities of immune system (13,20), and boosting the efficacy of immunotherapy (29). Since the composition of human MCs is greatly affected by their inhabited chemical/physical landscapes (13), the accumulation of human factors that define such landscapes can give significant insights into current MC studies.

So far, a variety of human MCs-related databases have been constructed and currently active, the majority of which focus on providing sequencing/metagenomic data (dbCAN-seq (30), eHOMD (31), gcMeta (32), GMrepo (33), HPMCD (34), MAHMI (35), MBGD (36), MetaRef (37), MDB (38)), while some others are specialized in containing the data of either metabolites/metabolic reactions (HMDB (39), mVOC (40), and VMH (41)) or taxonomic profiling (MMDB (42)). Only three databases are available now to describe the environmental factors affecting the compositions of MCs: Disbiome (43) linking the microbiome to disease indication, gutMDisorder (44) collecting the immense data of diet, drug and lifestyle that lead to microbial dysbiosis and aBiofilm (45) providing some environmental substances that inhibit bacteria. However, no database has yet been designed to provide the host factors shaping human microbiota.

Herein, a new database of host genetic and immune factors shaping human microbiota (GIMICA) was therefore introduced. First, a comprehensive literature review on human MC was conducted. Thousands of microbe species were discovered as inhabiting human blood, gut, lung, mucosa, nose, ocular surface, oral cavity, skin, and vagina. These microbe species were very diverse and belonged to >50 taxonomic phyla (such as: Firmicutes, Actinobacteria and Ascomycota). Second, based on these microbes, the HGFs and HIFs for each species were collected via literature review, and the host data of both genetic and immune factors were then provided in the GIMICA database. Third, the crosslinks of each microbe to other MCs-related knowledge bases, especially the ones describing the environmental factors affecting the composition of MCs, were systematically discovered. These crosslinks were essential because they allowed, for the first time, the collective considerations not only among different host factors but also between the host and environmental ones. All in all, the GIMICA comprehensively provided the host genetic and immune factors that shape human microbiota (as illustrated in Figure 1), which allowed the collective considerations among the different types of factor (between HGF and HIF; between host and environmental factors). Since the composition of MCs should be considered as a complex shaped by the combination of both host and environmental factors (46–48), the GIMICA is expected to have implication for the future practice of MCs-related studies on human physiology and health (1–3,49–53).

GIMICA comprehensively described host genetic and immune factors shaping human microbiota, which allowed the collective consideration among different types of factors (between host genetic factors and host immune factors). By providing the crosslinks to available databases (Disbiome (43), gutMDisorder (44) and aBiofilm (45)) with environmental factors information, GIMICA further enabled the collective consideration between the host and environmental factors.
Figure 1.

GIMICA comprehensively described host genetic and immune factors shaping human microbiota, which allowed the collective consideration among different types of factors (between host genetic factors and host immune factors). By providing the crosslinks to available databases (Disbiome (43), gutMDisorder (44) and aBiofilm (45)) with environmental factors information, GIMICA further enabled the collective consideration between the host and environmental factors.

FACTUAL CONTENT AND DATA RETRIEVAL

Collecting and confirming microbes inhabiting human

The data of microbe species inhabiting human body were first collected by literature reviews in PubMed (54) using the keyword combinations of: ‘microbial community + human’, ‘microbe + human’, ‘microbiota + human’, etc. Second, several MCs-related databases (such as: eHOMD (31), GMrepo (33), VMH (41) and gutMDisorder (44)) were reviewed, and hundreds of microbes in these databases were collected for further assessing their suitability to be included in GIMICA. In total, 6438 microbes were identified based on the researches in these steps above. Third, the names of all these identified microbes were standardized by searching NCBI Taxonomy database (55), which resulted in 5386 microbe species with well-defined Taxonomy ID. A repository that contained the diverse synonyms of each species were further constructed using NCBI Taxonomy (55) data. Fourth, for those microbes collected from MCs-related databases, literature search in PubMed (54) was performed based on the keyword combinations of ‘'Microbe Name’ + human’, ‘‘Microbe Synonym’ + human’, etc. During this literature review and the review in the first step, the body sites in human, where each microbe species inhabited, were carefully identified and systematically recorded. Consequently, 4257 microbes were confirmed to inhabit nine sites of human body. Particularly, 281, 2436, 513, 225, 197, 164, 468, 1012 and 276 microbe species (bacteria, fungi, metamonada, amoebozoa and archaea (56–58)) were found to colonize in human blood, gut, lung, mucosa, nose, ocular surface, oral cavity, skin and vagina, respectively (59–63). These microbes are diverse and belong to 58 taxonomic phyla (such as Firmicutes, Actinobacteria, Ascomycota, Euryarchaeota, Bacteroidete, Proteobacteria and Tenericutes). Finally, the affiliated information for each human microbe were systematically collected using literature review, which contained taxonomic lineage (kingdom, phylum, class, order, family, genus, and species), oxygen dependency/tolerance (aerobe, anaerobe, aerotolerant and aerophilic), mechanisms of microbial metabolism (such as catalytic, fermentative, oxidative, proteolytic, respiratory and saccharolytic), sites of human body inhabited (such as gut, oral cavity and skin), Gram-staining classifications (negative, positive and variable), impact on/relation to host (probiotic, commensal, opportunistic and pathogen) and other general descriptions (disease-relevance, metabolic product, mechanism of symbiosis, the number of genes/transcripts and genome size).

Host genetic factors shaping human microbiota

The host genetic factors that shape human microbiota are characterized by a variety of regulatory types, which contained single nucleotide polymorphisms (upstream or downstream gene variants, untranslated region variants, intron variants, etc.), copy number variations (duplication, deletion, etc.), and non-coding RNA (lncRNA, rRNA, miRNA, miscRNA, sRNA, pseudogene, etc.). Such factors were essential in exploring the heritability of microbiota compositions (24), revealing the mechanism underlying the adaptation to environmental fluctuations (11), and discovering the drug target and targeted therapeutics (26). Because the HGFs data were largely dispersed in literatures, the PubMed (54) was systematically searched to discover the regulation of human microbiota by HGFs. Particularly, keyword combinations of ‘genetic variant + ‘Microbe Name’’, ‘host genetic factor + ‘Microbe Name’’, ‘genetic variation + ‘Microbe Name’’, ‘gene copy number variations + ‘Microbe Name’’, ‘CNV + ‘Microbe Name’’, ‘nucleotide polymorphism + ‘Microbe Name’’, and ‘SNP + ‘Microbe Name’’ were adopted for literature reviews, and the resulting publications were manually assessed for retrieving any HGFs-related information.

As a result, the collected data included the various regulatory types of host genetic factors (single nucleotide polymorphisms, copy number variations and non-coding RNAs), the detail sub-types of each HGF type (such as upstream variant, gene duplication and lncRNA), the genetic sequence, genome location and strand type for each HGF, and the impacts of each HGF on studied microbe species (up/down-regulation of microbe abundance, promotion/inhibition of the gene expression in microbes). As a result, current version of GIMICA had 1,368 single nucleotide polymorphisms (590 intron, 74 missense, 54 synonymous variants, 43 upstream gene, 26 non coding transcript variant, 17 UTR, 7 downstream gene, etc.) regulating the relative abundances of 267 microbes that primarily inhabit host gut, mucosa, and vagina; 186 copy number variations (37 duplications and 149 deletions) modulating the expressions of 156 microbes that live mainly in human oral cavity, gut, lung, skin, blood, and mucosa; 1297 non-coding RNA regulations (514 miRNAs, 683 lncRNAs, 14 sRNAs, 9 pseudogenes, etc.) altering the gene expression of 251 microbe species that reside mostly in host lung, oral cavity, gut and skin. All these HGFs data can be assessed and retrieved using various search strategies in both the Home page and the subpage (which is entitled as ‘Host Genetic Factors’) of the GIMICA database. The example host genetic factors in GIMICA were illustrated in the Figure 2.

Example host genetic factors in GIMICA. 1368 single nucleotide polymorphisms regulating the relative abundance of 267 microbes that primarily inhabit human gut, mucosa and vagina; 186 gene copy number variations modulating the expression of 156 microbes that live mainly in human oral cavity, gut, lung, skin, blood and mucosa; 1297 non-coding RNA regulations altering the expression of 251 microbes that reside mostly in host lung, oral cavity, gut and skin.
Figure 2.

Example host genetic factors in GIMICA. 1368 single nucleotide polymorphisms regulating the relative abundance of 267 microbes that primarily inhabit human gut, mucosa and vagina; 186 gene copy number variations modulating the expression of 156 microbes that live mainly in human oral cavity, gut, lung, skin, blood and mucosa; 1297 non-coding RNA regulations altering the expression of 251 microbes that reside mostly in host lung, oral cavity, gut and skin.

Host immune factors shaping human microbiota

The host immune factors that shape human microbiota contained a number of immune molecules and cells regulating microbiota compositions (such as checkpoint molecule, cytokine, complement component, eosinophils, immunoglobulin, innate lymphoid cell, macrophage, monocyte, natural killer cell, retinoic acid receptor, toll-like receptor, etc.). These factors are critical for maintaining symbiotic balance between host and microbiota (28), promoting the activities of immune system (13), and boosting the efficacy of immunotherapy (29). Similar to the collection of the HGF data, the HIF data were identified by literature search based on different keyword combinations, which included ‘host immune factors + ‘Microbe Name’’, ‘immune cell + ‘Microbe Name’’, ‘immune system + ‘Microbe Name’’, ‘immunity + ‘Microbe Name’’, ‘checkpoint molecules + ‘Microbe Name’’, ‘complement component + ‘Microbe Name’’, etc. The resulting publications were also manually assessed for retrieving any HIF-related information.

Consequently, the collected data included various regulatory types of host immune factors (HIF), such as checkpoint molecules, cytokines, complement component, eosinophils, immunoglobulin, etc. Moreover, other critical data were also collected and provided, which included the names of immune-related molecules/cells, the sequence, protein family, signaling pathway, and molecular function for each HIF, and the impacts of each HIF on a studied microbe (up/down-regulation of microbe abundance, promotion/inhibition of the gene expression in microbe). All in all, GIMICA contained 126 lymphocytes (including helper/cytotoxic/regulatory T-cells, plasma/regulatory B-cells, natural killer cells, etc.) and phagocytes (including macrophages, neutrophils, dendritic cells, monocytes, eosinophils, mast cells, etc.) affecting the expression of 258 microbes that are mainly located in human blood, gut, skin, oral cavity and ocular surface; 387 immune proteins (cytokines, immune receptors, checkpoint proteins, complement components, etc.) altering the compositions of 235 microbes that primarily reside in human gut, oral cavity, blood, ocular surface, and vagina; 36 immune-related signaling pathways (toll-like receptor, interleukin, etc.) modulating the gene expressions of 65 microbe species that live in human blood, gut and skin. All these HIF data can be assessed and retrieved using various search strategies in both the Home page and the subpage entitled ‘Host Immune Factors’ of GIMICA. The example host immune factors in GIMICA were illustrated in the Figure 3. Figure 4 shows the body sites distribution of microbes inhabiting human (Figure 4A) and the statistics of host genetic factors and immune factors shaping human microbiota in GIMICA (Figure 4B).

Example host immune factors in GIMICA. 126 lymphocytes/phagocytes affecting the expressions of 258 microbes that are mainly located in human blood, gut, oral cavity and ocular surface; 387 immune proteins altering the compositions of 235 microbes that primarily reside in human gut, oral cavity, blood and vagina; 36 immune-relevant signaling pathways modulating the expressions of 65 microbes that live in human blood, gut and skin.
Figure 3.

Example host immune factors in GIMICA. 126 lymphocytes/phagocytes affecting the expressions of 258 microbes that are mainly located in human blood, gut, oral cavity and ocular surface; 387 immune proteins altering the compositions of 235 microbes that primarily reside in human gut, oral cavity, blood and vagina; 36 immune-relevant signaling pathways modulating the expressions of 65 microbes that live in human blood, gut and skin.

The statistic of microbes and host factors in GIMICA. (A) The body sites distribution of microbes inhabiting human; (B) The statistics of host genetic factors (HGFs) and host immune factors (HIFs) shaping human microbiota in GIMICA.
Figure 4.

The statistic of microbes and host factors in GIMICA. (A) The body sites distribution of microbes inhabiting human; (B) The statistics of host genetic factors (HGFs) and host immune factors (HIFs) shaping human microbiota in GIMICA.

Collective consideration among different factors

Human microbiota composition was frequently discovered to be collectively shaped by multiple host factors (46,47). For example, genetic variation in host protein NOD2 (HGF) and the immune response stimulated by the NF-kappa B pathway (HIF) could collectively increase the abundance of microbes in the Enterobacteriaceae family, and this microbiota dysbiosis had been discovered as a leading cause of neonatal meningitis (13). In other words, when associating the alteration of microbiome composition with a studied disease, both host genetic and immune factors should be assessed. To make this collective consideration possible, both types of host factor (HGF and HIF) were systematically reviewed and collected in GIMICA for each microbe. As a result, 36.4% of all microbes in GIMICA database were identified with both HGF and HIF reported.

Moreover, human microbiota composition was also found susceptible to both environmental and host factors (46,64). For example, a genetic polymorphism in host interlukin-5 gene (host factor) and the environmental factor (such as high-fat diets) could collectively elevate the concentration of Prevotella melaninogenica in vaginal microbiota, which in turn led to a chronic inflammation (65). Since the environmental factors had been long studied, the collective considerations of both host and environmental factors were of great interests and significant importance. To enable this collective consideration, each microbe in GIMICA was cross-referenced and linked to available databases (Disbiome (43), gutMDisorder (44) and aBiofilm (45)) that provided environmental factor. As a result, 88.4% of all microbes in GIMICA were identified with both host and environmental factors.

Data standardization, access and retrieval

To make the access and analysis of GIMICA data convenient for all users, the collected raw data were carefully cleaned up and then systematically standardized. These standardizations included: (i) all diseases in GIMICA were standardized by the latest version of International Classification of Diseases (ICD-11, officially released by World Health Organization (66)), which was expected to serve comprehensive health management (67); (ii) all microbe species were standardized using NCBI Taxonomy database (55) and (iii) the extend data of each microbe could be accessed using crosslinks to ICD-11 (66), UniProt (68), KEGG (69), NCBI Taxonomy (55), gutMDisorder (44), Disbiome (43), etc. GIMICA has been smoothly running for months and tested from various sites around the world. All data can be viewed, assessed and downloaded. GIMICA is freely assessable without login requirement by all users at: https://idrblab.org/gimica/

CONCLUSION

In sum, GIMICA is unique in: (i) providing both host genetic and immune factors shaping human microbiota and (ii) enabling collective consideration not only among various host factors but also between the host and environmental ones. With the extensive efforts made on describing the host factors that are capable of shaping human microbiota (10–12) and discussing the collective determination by both host and environmental factors (46,47,64), those immense, connected and structuralized data provided in GIMICA are expected to have implication for the future practice of MCs-related study on human physiology and health (1–3,70–73).

FUNDING

National Key R&D Program of China [2018YFC0910500]; National Natural Science Foundation of China [U1909208, 81872798]; Project supported by the China Knowledge Centre for Engineering Sciences and Technology [No. CKCEST-2019-1-12]; Leading Talent of the ‘Ten Thousand Plan’ – National High-Level Talents Special Support Plan, Key R&D Program of Zhejiang Province [2020C03010]; Technology Innovation and Application Demonstration Project of Chongqing [cstc2018jscx-msybX0287]; Fundamental Research Funds for the Central Universities [2018QNA7023, 10611CDJXZ238826, 2018CDQYSG0007, CDJZR14468801, 2019CDYGYB005]; Information Technology Center, Zhejiang University. Funding for open access charge: National Natural Science Foundation of China [U1909208, 81872798].

Conflict of interest statement. None declared.

REFERENCES

1. 

Ang Q.Y., Alexander M., Newman J.C., Tian Y., Cai J., Upadhyay V., Turnbaugh J.A., Verdin E., Hall K.D., Leibel R.L.et al. Ketogenic diets alter the gut microbiome resulting in decreased intestinal Th17 cells. Cell. 2020; 181:12631275.

2. 

Liu J., Lian Q., Chen Y., Qi J. Amino acid based de bruijn graph algorithm for identifying complete coding genes from metagenomic and metatranscriptomic short reads. Nucleic Acids Res.2019; 47:e30.

3. 

Brown J.M., Hazen S.L. Microbial modulation of cardiovascular disease. Nat. Rev. Microbiol.2018; 16:171181.

4. 

Hong J., Luo Y., Mou M., Fu J., Zhang Y., Xue W., Xie T., Tao L., Lou Y., Zhu F. Convolutional neural network-based annotation of bacterial type IV secretion system effectors with enhanced accuracy and reduced false discovery. Brief. Bioinform.2019; 21:1825–1836.

5. 

Sonnenburg J.L., Backhed F. Diet-microbiota interactions as moderators of human metabolism. Nature. 2016; 535:5664.

6. 

Schloss P.D., Iverson K.D., Petrosino J.F., Schloss S.J. The dynamics of a family's gut microbiota reveal variations on a theme. Microbiome. 2014; 2:25.

7. 

Spor A., Koren O., Ley R. Unravelling the effects of the environment and host genotype on the gut microbiome. Nat. Rev. Microbiol.2011; 9:279290.

8. 

Tang J., Fu J., Wang Y., Li B., Li Y., Yang Q., Cui X., Hong J., Li X., Chen Y.et al. ANPELA: analysis and performance assessment of the label-free quantification workflow for metaproteomic studies. Brief. Bioinform.2020; 21:621636.

9. 

Tang J., Mou M., Wang Y., Luo Y., Zhu F. MetaFS: performance assessment of biomarker discovery in metaproteomics. Brief. Bioinform.2020; doi:10.1093/bib/bbaa105.

10. 

Deshpande N.P., Riordan S.M., Castano-Rodriguez N., Wilkins M.R., Kaakoush N.O. Signatures within the esophageal microbiome are associated with host genetics, age, and disease. Microbiome. 2018; 6:227.

11. 

Poole A.C., Goodrich J.K., Youngblut N.D., Luque G.G., Ruaud A., Sutter J.L., Waters J.L., Shi Q., El-Hadidi M., Johnson L.M.et al. Human salivary amylase gene copy number impacts oral and gut microbiomes. Cell Host Microbe. 2019; 25:553564.

12. 

Kurilshikov A., Wijmenga C., Fu J., Zhernakova A. Host genetics and gut microbiome: challenges and perspectives. Trends Immunol.2017; 38:633647.

13. 

Hall A.B., Tolonen A.C., Xavier R.J. Human genetic variation and the gut microbiome in disease. Nat. Rev. Genet.2017; 18:690699.

14. 

Wang L., Ping P., Kuang L., Ye S., Lqbal F.M.B., Pei T. A novel approach based on bipartite network to predict human microbe-disease associations. Curr. Bioinformatics. 2018; 13:141148.

15. 

Munir A., Malik S.I., Malik K.A. Proteome mining for the identification of putative drug targets for human pathogen Clostridium tetani. Curr. Bioinformatics. 2019; 14:532540.

16. 

An Y., Wang Y., Zhan J., Tang X., Shen K., Shen F., Wang C., Luan W., Wang X., Wang X.et al. Fosfomycin protects mice from Staphylococcus aureus pneumonia caused by alpha-hemolysin in extracellular vesicles by inhibiting MAPK-regulated NLRP3 inflammasomes. Front. Cell. Infect. Microbiol.2019; 9:253.

17. 

Alkema W., Boekhorst J., Wels M., van Hijum S.A. Microbial bioinformatics for food safety and production. Brief. Bioinform.2016; 17:283292.

18. 

Tang J., Wang Y., Fu J., Zhou Y., Luo Y., Zhang Y., Li B., Yang Q., Xue W., Lou Y.et al. A critical assessment of the feature selection methods used for biomarker discovery in current metaproteomics studies. Brief. Bioinform.2020; 21:13781390.

19. 

Ahern P.P., Maloy K.J. Understanding immune-microbiota interactions in the intestine. Immunology. 2020; 159:414.

20. 

Leshem A., Liwinski T., Elinav E. Immune-microbiota Interplay and colonization resistance in infection. Mol. Cell. 2020; 78:597613.

21. 

Guzman-Castaneda S.J., Ortega-Vega E.L., de la Cuesta-Zuluaga J., Velasquez-Mejia E.P., Rojas W., Bedoya G., Escobar J.S. Gut microbiota composition explains more variance in the host cardiometabolic risk than genetic ancestry. Gut Microbes. 2020; 11:191204.

22. 

Palm N.W., de Zoete M.R., Flavell R.A. Immune-microbiota interactions in health and disease. Clin. Immunol.2015; 159:122127.

23. 

Ortega-Vega E.L., Guzman-Castaneda S.J., Campo O., Velasquez-Mejia E.P., de la Cuesta-Zuluaga J., Bedoya G., Escobar J.S. Variants in genes of innate immunity, appetite control and energy metabolism are associated with host cardiometabolic health and gut microbiota composition. Gut Microbes. 2020; 11:556568.

24. 

Wen C., Yan W., Sun C., Ji C., Zhou Q., Zhang D., Zheng J., Yang N. The gut microbiota is largely independent of host genetics in regulating fat deposition in chickens. ISME J.2019; 13:14221436.

25. 

Li Y.H., Yu C.Y., Li X.X., Zhang P., Tang J., Yang Q., Fu T., Zhang X., Cui X., Tu G.et al. Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic Acids Res.2018; 46:D1121D1127.

26. 

Yuan C., Burns M.B., Subramanian S., Blekhman R. Interaction between host MicroRNAs and the gut microbiota in colorectal cancer. mSystems. 2018; 3:e00205-17.

27. 

Wang Y., Zhang S., Li F., Zhou Y., Zhang Y., Wang Z., Zhang R., Zhu J., Ren Y., Tan Y.et al. Therapeutic target database 2020: enriched resource for facilitating research and early development of targeted therapeutics. Nucleic Acids Res.2020; 48:D1031D1041.

28. 

Zheng D., Liwinski T., Elinav E. Interaction between microbiota and immunity in health and disease. Cell Res.2020; 30:492506.

29. 

Honda K., Littman D.R. The microbiota in adaptive immune homeostasis and disease. Nature. 2016; 535:7584.

30. 

Huang L., Zhang H., Wu P., Entwistle S., Li X., Yohe T., Yi H., Yang Z., Yin Y. dbCAN-seq: a database of carbohydrate-active enzyme (CAZyme) sequence and annotation. Nucleic Acids Res.2018; 46:D516D521.

31. 

Escapa I.F., Chen T., Huang Y., Gajare P., Dewhirst F.E., Lemon K.P. New insights into human nostril microbiome from the expanded human oral microbiome database (eHOMD): a resource for the microbiome of the human aerodigestive tract. mSystems. 2018; 3:e00187-18.

32. 

Shi W., Qi H., Sun Q., Fan G., Liu S., Wang J., Zhu B., Liu H., Zhao F., Wang X.et al. gcMeta: a global catalogue of metagenomics platform to support the archiving, standardization and analysis of microbiome data. Nucleic Acids Res.2019; 47:D637D648.

33. 

Wu S., Sun C., Li Y., Wang T., Jia L., Lai S., Yang Y., Luo P., Dai D., Yang Y.Q.et al. GMrepo: a database of curated and consistently annotated human gut metagenomes. Nucleic Acids Res.2020; 48:D545D553.

34. 

Forster S.C., Browne H.P., Kumar N., Hunt M., Denise H., Mitchell A., Finn R.D., Lawley T.D. HPMCD: the database of human microbial communities from metagenomic datasets and microbial reference genomes. Nucleic Acids Res.2016; 44:D604D609.

35. 

Blanco-Miguez A., Gutierrez-Jacome A., Fdez-Riverola F., Lourenco A., Sanchez B. MAHMI database: a comprehensive MetaHit-based resource for the study of the mechanism of action of the human microbiota. Database. 2017; 2017:baw157.

36. 

Uchiyama I., Mihara M., Nishide H., Chiba H., Kato M. MBGD update 2018: microbial genome database based on hierarchical orthology relations covering closely related and distantly related comparisons. Nucleic Acids Res.2019; 47:D382D389.

37. 

Huang K., Brady A., Mahurkar A., White O., Gevers D., Huttenhower C., Segata N. MetaRef: a pan-genomic database for comparative and community microbial genomics. Nucleic Acids Res.2014; 42:D617D624.

38. 

Li J., Jia H., Cai X., Zhong H., Feng Q., Sunagawa S., Arumugam M., Kultima J.R., Prifti E., Nielsen T.et al. An integrated catalog of reference genes in the human gut microbiome. Nat. Biotechnol.2014; 32:834841.

39. 

Wishart D.S., Feunang Y.D., Marcu A., Guo A.C., Liang K., Vazquez-Fresno R., Sajed T., Johnson D., Li C., Karu N.et al. HMDB 4.0: the human metabolome database for 2018. Nucleic Acids Res.2018; 46:D608D617.

40. 

Lemfack M.C., Gohlke B.O., Toguem S.M.T., Preissner S., Piechulla B., Preissner R. mVOC 2.0: a database of microbial volatiles. Nucleic Acids Res.2018; 46:D1261D1265.

41. 

Noronha A., Modamio J., Jarosz Y., Guerard E., Sompairac N., Preciat G., Danielsdottir A.D., Krecke M., Merten D., Haraldsdottir H.S.et al. The virtual metabolic human database: integrating human and gut microbiome metabolism with nutrition and disease. Nucleic Acids Res.2019; 47:D614D624.

42. 

Yang J., Park J., Park S., Baek I., Chun J. Introducing murine microbiome database (MMDB): a curated database with taxonomic profiling of the healthy mouse gastrointestinal microbiome. Microorganisms. 2019; 7:480.

43. 

Janssens Y., Nielandt J., Bronselaer A., Debunne N., Verbeke F., Wynendaele E., Van Immerseel F., Vandewynckel Y.P., De Tre G., De Spiegeleer B. Disbiome database: linking the microbiome to disease. BMC Microbiol.2018; 18:50.

44. 

Cheng L., Qi C., Zhuang H., Fu T., Zhang X. gutMDisorder: a comprehensive database for dysbiosis of the gut microbiota in disorders and interventions. Nucleic Acids Res.2020; 48:D554D560.

45. 

Rajput A., Thakur A., Sharma S., Kumar M. aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res.2018; 46:D894D900.

46. 

Fan P., Bian B., Teng L., Nelson C.D., Driver J., Elzo M.A., Jeong K.C. Host genetic effects upon the early gut microbiota in a bovine model with graduated spectrum of genetic variation. ISME J.2020; 14:302317.

47. 

Benson A.K., Kelly S.A., Legge R., Ma F., Low S.J., Kim J., Zhang M., Oh P.L., Nehrenberg D., Hua K.et al. Individuality in gut microbiota composition is a complex polygenic trait shaped by multiple environmental and host genetic factors. PNAS. 2010; 107:1893318938.

48. 

Yang Q., Wang Y., Zhang Y., Li F., Xia W., Zhou Y., Qiu Y., Li H., Zhu F. NOREVA: enhanced normalization and evaluation of time-course and multi-class metabolomic data. Nucleic Acids Res.2020; 48:W436W448.

49. 

Zhao T., Hu Y., Zang T., Cheng L. MRTFB regulates the expression of NOMO1 in colon. PNAS. 2020; 117:75687569.

50. 

Wang Y., Wu Q., Zhang W., Zhang N. Tuberculosis and HIV coinfection-the challenge in the prevention, detection and treatment of tuberculosis. Curr. Bioinformatics. 2019; 14:9199.

51. 

Narang P., Dangi M., Sharma D., Khichi A., Chhillar A.K. An integrated chikungunya virus database to facilitate therapeutic analysis: ChkVDb. Curr. Bioinformatics. 2019; 14:323332.

52. 

Mobegi F.M., Zomer A., de Jonge M.I., van Hijum S.A. Advances and perspectives in computational prediction of microbial gene essentiality. Brief. Funct. Genomics. 2017; 16:7079.

53. 

Suhr M.J., Hallen-Adams H.E. The human gut mycobiome: pitfalls and potentials–a mycologist's perspective. Mycologia. 2015; 107:10571073.

54. 

Sayers E.W., Beck J., Brister J.R., Bolton E.E., Canese K., Comeau D.C., Funk K., Ketter A., Kim S., Kimchi A.et al. Database resources of the national center for biotechnology information. Nucleic Acids Res.2020; 48:D9D16.

55. 

Federhen S. Type material in the NCBI taxonomy database. Nucleic Acids Res.2015; 43:D1086D1098.

56. 

Almeida A., Mitchell A.L., Boland M., Forster S.C., Gloor G.B., Tarkowska A., Lawley T.D., Finn R.D. A new genomic blueprint of the human gut microbiota. Nature. 2019; 568:499504.

57. 

Hallen-Adams H.E., Suhr M.J. Fungi in the healthy human gastrointestinal tract. Virulence. 2017; 8:352358.

58. 

Kwa M., Plottel C.S., Blaser M.J., Adams S. The intestinal microbiome and estrogen receptor-positive female breast cancer. J. Natl. Cancer Inst.2016; 108:djw029.

59. 

Nichols R.G., Peters J.M., Patterson A.D. Interplay between the host, the human microbiome, and drug metabolism. Hum. Genomics. 2019; 13:27.

60. 

Ozkan J., Willcox M., Wemheuer B., Wilcsek G., Coroneo M., Thomas T. Biogeography of the human ocular microbiota. Ocular Surf.2019; 17:111118.

61. 

Simon C. Introduction: do microbes in the female reproductive function matter. Fertil. Steril.2018; 110:325326.

62. 

Ozkan J., Willcox M.D. The ocular microbiome: molecular characterisation of a unique and low microbial environment. Curr. Eye Res.2019; 44:685694.

63. 

Ozkan J., Willcox M.D., de la Jara P.L., Mandathara P.S., Rathi V.M., Thomas V., Holden B.A. The effect of daily lens replacement during overnight wear on ocular adverse events. Optom. Vis. Sci.2012; 89:16741681.

64. 

Zuo T., Ng S.C. The gut microbiota in the pathogenesis and therapeutics of inflammatory bowel disease. Front. Microbiol.2018; 9:2247.

65. 

Larsen J.M. The immune response to Prevotella bacteria in chronic inflammatory disease. Immunology. 2017; 151:363374.

66. 

Lancet T. ICD-11. Lancet. 2019; 393:2275.

67. 

Lancet Infectious Diseases, T ICD-11: in praise of good data. Lancet. Infect. Dis.2018; 18:813.

68. 

UniProt C. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res.2019; 47:D506D515.

69. 

Kanehisa M., Sato Y., Furumichi M., Morishima K., Tanabe M. New approach for understanding genome variations in KEGG. Nucleic Acids Res.2019; 47:D590D595.

70. 

Rao R.T., Jayakumar K. Genomic islands of Mannheimia haemolytica - in silico analysis. Curr. Bioinformatics. 2017; 12:369377.

71. 

Zhuang H., Cheng L., Wang Y., Zhang Y.K., Zhao M.F., Liang G.D., Zhang M.C., Li Y.G., Zhao J.B., Gao Y.N.et al. Dysbiosis of the gut microbiome in lung cancer. Front. Cell. Infect. Microbiol.2019; 9:112.

72. 

Bayjanov J.R., Starrenburg M.J., van der Sijde M.R., Siezen R.J., van Hijum S.A. Genotype-phenotype matching analysis of 38 Lactococcus lactis strains using random forest methods. BMC Microbiol.2013; 13:68.

73. 

Banjara N., Nickerson K.W., Suhr M.J., Hallen-Adams H.E. Killer toxin from several food-derived Debaryomyces hansenii strains effective against pathogenic Candida yeasts. Int. J. Food Microbiol.2016; 222:2329.