PLoS ONE
Home Immune gene expression networks in sepsis: A network biology approach
Immune gene expression networks in sepsis: A network biology approach
Immune gene expression networks in sepsis: A network biology approach

Competing Interests: The authors have declared that no competing interests exist.

Article Type: research-article Article History
Abstract

To study the dysregulated host immune response to infection in sepsis, gene expression profiles from the Gene Expression Omnibus (GEO) datasets GSE54514, GSE57065, GSE64456, GSE95233, GSE66099 and GSE72829 were selected. From the Kyoto Encyclopedia of Genes and Genomes (KEGG) immune system pathways, 998 unique genes were selected, and genes were classified as follows based on gene annotation from KEGG, Gene Ontology, and Reactome: adaptive immunity, antigen presentation, cytokines and chemokines, complement, hematopoiesis, innate immunity, leukocyte migration, NK cell activity, platelet activity, and signaling. After correlation matrix formation, correlation coefficient of 0.8 was selected for network generation and network analysis. Total transcriptome was analyzed for differentially expressed genes (DEG), followed by gene set enrichment analysis. The network topological structure revealed that adaptive immunity tended to form a prominent and isolated cluster in sepsis. Common genes within the cluster from the 6 datasets included CD247, CD8A, ITK, LAT, and LCK. The clustering coefficient and modularity parameters were increased in 5/6 and 4/6 datasets in the sepsis group that seemed to be associated with functional aspect of the network. GSE95233 revealed that the nonsurvivor group showed a prominent and isolated adaptive immunity cluster, whereas the survivor group had isolated complement-coagulation and platelet-related clusters. T cell receptor signaling (TCR) pathway and antigen processing and presentation pathway were down-regulated in 5/6 and 4/6 datasets, respectively. Complement and coagulation, Fc gamma, epsilon related signaling pathways were up-regulated in 5/6 datasets. Altogether, network and gene set enrichment analysis showed that adaptive-immunity-related genes along with TCR pathway were down-regulated and isolated from immune the network that seemed to be associated with unfavorable prognosis. Prominence of platelet and complement-coagulation-related genes in the immune network was associated with survival in sepsis. Complement-coagulation pathway was up-regulated in the sepsis group that was associated with favorable prognosis. Network and gene set enrichment analysis supported elucidation of sepsis pathogenesis.

Kim,Jekarl,Yoo,Lee,Kim,Kim,and Das: Immune gene expression networks in sepsis: A network biology approach

Introduction

Sepsis is defined as a documented infection with immune dysfunction or dysregulated host response with organ dysfunction [13]. Sepsis cases are composed of a heterogeneous subset of patients with various biological and clinical characteristics, including age, medication, underlying disease, causative microbes, and microbes with different antibiotic resistances [4, 5]. These features might act as confounding factors or cause in patients various immune responses or immune dysfunction.

Immune dysfunction or dysregulated host response in sepsis is associated with defective antigen presentation, defective adaptive immunity, defective NK cell activity, decreased immunoglobulin levels, neutrophil abnormalities, hypercytokinemia, complement consumption, and defective bacterial removal [68]. These features occur in the time course of disease by themselves or as a combination of more than two features. Measurement of immune-related molecules such as procalcitonin, C-reactive protein, interleukins, chemokines, and other immune-related molecules showed that they were associated with these features [9, 10]. Gene expression profiles from sepsis patients provided comprehensive data that could minimize selection bias and exhibit a wide range of biological responses [1117]. Up- or down-regulated genes and pathways related to sepsis support the understanding of sepsis [1113].

The immune response in healthy and diseased states is rarely attributed to single molecules, but rather to complex inter-related molecules. As sepsis-related immune response molecules are complex, a network approach might enhance our understanding of the equilibrium of the immune response in sepsis [18, 19]. A network approach analysis of a small group of cytokines revealed that the network constructed from sepsis patients’ data had a smaller network diameter and shorter shortest path and characteristic path than did the control network [20]. These topologic features were related to decreased network function or modularity of the network. In addition, compared to that of day 1, the day 4 cytokine network was decreased in the sepsis group, which was in line with previous literature [21]. These studies constructed networks based on measured immune-associated molecules that could be regarded as actual observable characteristics of sepsis. As molecules could be analyzed independently, analysis of multiple molecules as a system was required. Analysis of these molecules via network analysis including topologic parameters and visualization resulted in an informative outcome. However, previous studies constructed networks based on cytokine measurements in small groups, which resulted in a limited number of recruited molecules. Measuring molecules in multiplex methods is relatively unfeasible compared to measuring gene expression.

In this study, gene expression signatures from a public database, Gene Expression Omnibus (GEO), were analyzed. Total transcriptome was analyzed for differentially expressed genes (DEGs) between healthy control and sepsis groups, followed by gene set enrichment analysis for DEGs, which revealed up- or down-regulated pathways. We selected 998 immune-related genes included in immune response pathways using gene expression profiles to construct a network for the healthy control and sepsis groups.

Methods

Cases in the cohort

This was a retrospective study, and our protocol was approved by the Institutional Review Board (IRB) of Incheon St. Mary’s Hospital and Seoul St. Mary’s Hospital in accordance with the Declaration of Helsinki. As public datasets were used, informed consent was waived by the IRB. The public datasets were downloaded from the Gene Express Omnibus (GEO) website (http://www.ncbi.nlm.nih.gov/geo/). The mentioned databases were searched using the following common terminology: sepsis, human, GPL570 (Affymetrix Human Genome U133 Plus 2.0 Array or hgu133plus2.db), GPL6947 (Illumina HumanHT-12 V3.0 expression beadchip), and GPL10558 (Illumina HumanHT-12 V4.0 expression beadchip) [22, 23]. These platforms were selected because they allow the study of genes related to the immune response. GPL570, GPL6947, and GPL10558 consisted of 58075, 48803, and 47231 probes, respectively. Entering GPL570, GPL6947, and GPL10558 with other terminology, 74 datasets or series were found. Among them, the GSE54514 [11], GSE57065 [12], GSE64456 [13], GSE66099 [14], GSE72829 [15], and GSE95233 [16] datasets were selected because their study designs were suitable to the purpose of this study [19], which included a healthy control group and a sepsis group, with more than 10 samples in each group. Datasets with drug treatment, studied tissue other than whole blood, and known sepsis etiology were excluded (S1 Fig in S1 File). All the datasets used transcriptomes derived from whole blood. Data regarding the time point of gene expression were derived from the first sample after admission. GSE66099 and GSE72829 were composed of several unique datasets, and healthy control and septic shock data from GSE66099 were selected. Healthy control cases and initial sepsis cases were used for each dataset. In the case of GSE72829, a healthy control and a definite bacterial infection group were selected. Randomized selection of sepsis cases without replacement was performed to match the number of healthy control cases and form a balanced correlation matrix. GSE95233 was additionally analyzed for survivor and nonsurvivor sepsis cases (GSE95233-1). Baseline characteristics of the datasets are listed in Table 1.

Table 1
Baseline characteristics of recruited datasets for adult and paediatric patient.
 GSE54514GSE57065GSE95233GSE64456GSE66099GSE72829
Patient typeAdultAdultAdultPediatricPediatricPediatric
Age, mean (yr) a,b≥18, 60.3≥18, 62≥18, 65≤2 mo, 29 d≤10, 3.7≤17, 1.9 mo
Sex (male %)b4067.965525862
Nonsurvivor (%)25.717.9 (28 d)NANA16.319.2
Cohort descriptionICU with sepsis orICU withICU withED with sepsisPICU with sepsis orDiscovery group
septic shockseptic shockseptic shock(PECARN study)septic shock(IRIS study)
Severity evaluationAPACHEIISOFASOFAYOSbPRISMNA
Control (n)182522194952
Sepsis or septic shock (n)3528518919894
Randomized selection without replacement (septic shock)182522---
Randomized selection without replacement (sepsis)---184952
Measured frequency, frequency (time points)5 (1, 2, 3, 4, 5 d)3 (0, 24, 28 hr)3 (1–2, 3–4, 7–10 d)111
Recruited time pointsc0–24 hr0–24 hr1–2 dNA0–24 hr5 d
PlatformIllumina, GPL6947Affymetrix, GPL570Affymetrix, GPL570Illumina, GPL10558Affymetrix, GPL570Illumina, GPL10558
Reference[11][12][16][13][14][15]

ED, emergency department; ICU, intensive care unit; NA, data not available; hr, hour; d, day; APACHEII, acute physiology and chronic health evaluation II; SOFA, sequential (sepsis related) organ failure assessment.

aGSE57065 and GSE72829 were summarized using median age.

b Percent of male and age are summarized data derived from the previous studies, which could be different from the analysis performed in this study. The sepsis patient was randomly selected corresponding to the number of the control cases.

cYOS, Yale observation score (6–30).

Sepsis diagnosis

Employed gene expression datasets used sepsis diagnosis based on sepsis criteria [1]. The patients were classified as having systemic inflammatory response syndrome if they met two or more of the following criteria: WBC count ≥ 12.0 × 109/L or ≤ 4.0 × 109/L, or presenting more than 10% immature cells; body temperature ≥ 38° C or ≤ 36° C; respiratory rate ≥ 20 breaths per minute, or heart rate ≥ 90 beats per minute. Sepsis was diagnosed with SIRS and documented bacterial infection, and septic shock was diagnosed with sepsis and hypotension not responding to fluid resuscitation [1].

Bioinformatics

For differentially expressed genes (DEG) analysis between control and patient groups based on total transcriptome, gene set enrichment for pathway analysis and network construction, several appropriate R software packages were used [24], including Bioconductor [25], affy [26], GEOquery [27], lumi [28], and limma [29]. The data were downloaded from the GEO database, and the data matrix of gene expression values was formatted using the GEOquery package [27]. In the case of Affymetrix gene expression data sets, the raw microarray data (CEL files) were normalized using the Robust Multichip Averaging (RMA) method, and log2 transformation was performed [2527]. In the case of Illumina gene expression data sets, the arrays were normalized using the quantile method [27, 29]. The data were annotated with corresponding gene names using R packages hgu133plus2.db, illuminaHumanv3.db, and illumina-Humanv4.db [30, 31].

Gene set enrichment for pathway analysis

Analysis of DEG was performed for total transcriptome. After a t-test using basic functions in R software, genes were selected based on false discovery rate of less than 0.01 (FDR < 0.01) [32]. Based on DEG analysis, gene set enrichment for pathway analysis using GAGE package, based on R language, was performed [33]. GAGE package applies input data to the Kyoto Encylopedia of Genes and Genomes (KEGG) pathways. Significant pathways were plotted for heatmap that was selected based on global P-value of less than 0.05.

Immune response related genes

To analyze the function of the immune response in the control and sepsis groups, KEGG immune system pathways were used [34]. Twenty-one pathways of the immune system were included, as follows: hematopoietic cell lineage (has04640); complement and coagulation cascade (has04610); platelet activation (hsa04611); Toll-like receptor signaling (hsa04620); Toll and Imd signaling (hsa04624); NOD-like receptor signaling pathway (hsa04621); RIG-I-like receptor signaling (hsa04622); cytosolic DNA-sensing (hsa04623); C-type lectin receptor signaling (hsa04625); natural killer cell-mediated cytotoxicity (hsa04650); antigen processing and presentation (hsa04612); T cell receptor signaling (hsa04660); Th1 and Th2 cell differentiation (04658); Th17 cell differentiation (hsa04659); B cell receptor signaling (hsa04662); Fc epsilon RI signaling (hsa04664); Fc gamma R-mediated phagocytosis (hsa04666); leukocyte transendothelial migration (hsa04670); intestinal immune network for IgA production (hsa04672), and chemokine signaling (hsa04062). The 21 pathways were assigned to adaptive immunity, innate immunity, NK cell activity, leukocyte migration, antigen presentation, complement-coagulation, platelet activity, cytokines-chemokines, signaling, and hematopoiesis. From them, 1908 genes in these pathways and 998 unique genes were extracted (S1 and S2 Tables in S1 File).

As genes were included in multiple pathways, we utilized Gene Ontology (GO) and Reactome pathways for additional information [35, 36]. From the GO pathway, we included activation of the innate immune response, complement activation, and immune response-activating signal transduction. From the immune response pathway, the adaptive immune response and the innate immune response were included. Leukocyte homeostasis, leukocyte activation, and leukocyte migration were also collected. From the Reactome database, we included the hemostasis pathway; platelet homeostasis; platelet adhesion to exposed collagen; platelet activation, signaling, and aggregation; formation of fibrin clots; dissolution of fibrin clots, and cell surface interactions at the vascular wall. In the immune system pathway, adaptive immune system, innate immune system, and cytokine signaling in the immune system were included (S3 Table in S1 File).

Genes included in multiple pathways were assigned to one pathway with their most frequent function among the KEGG, GO, and Reactome pathways. If the frequency of the gene that was included in the pathway was equal, undetermined (U) value was assigned. If the frequency was equal and included chemokines-cytokines, signaling, or hematopoiesis pathways, pathways other than these ones were assigned (S3 Table in S1 File).

Network topology analysis

The network was constructed using Spearman’s correlations equal to or greater than 0.8 and a P-value of less than 0.05 for each set of molecules. Each gene was regarded as a node, and the correlation pairs between the genes were regarded as links [37]. Links were created between the nodes with significant correlation coefficients. Network analysis results were plotted, and the topologic parameters were calculated using Cytoscape and an additional application, NetworkAnalyzer [3739]. The clustering coefficient is defined as the number of links between associated nodes. The degree of a node is defined as the number of links connected to a node (gene), and a hub is defined as a node with a relatively high degree. Network density is the ratio of links in the network to the maximum possible number of links. The length of a path is the number of unique links between the nodes that are present in the path. Distance is the shortest path length between two nodes. The network diameter is the maximum length of the shortest path. The characteristic path length is the average shortest path length and provides the expected distance between two nodes. The average number of neighbors is the number of identical nodes connected to a node. The normalized version of the average number of neighbors is the network density. Network heterogeneity is the variance in the number of links of a node divided by the number of mean links. The average number of neighbors indicates the average connectivity of the nodes. The topological coefficient is a relative measure of the extent to which a node shares neighbors with other nodes. Network centralization is close to one when the network resembles a star-like figure, while uniformly connected networks have a centralization value close to zero. The stress centrality of a node is the number of shortest paths passing through that node. The betweenness centrality of a node is the number of shortest paths between two other nodes that pass through that node, divided by the total number of shortest paths between the two nodes; betweenness centrality reflects the amount of control that this node exerts over other nodes. Closeness centrality reflects how close nodes are, which implies that nodes with high closeness centrality values have the shortest distance to other nodes on average. Eccentricity reflects the maximum distance of a node from other nodes [38, 39]; a high eccentricity value means that all other nodes are in proximity, a low eccentricity value means that there are at least one node and its neighbors that are far from that node. Degree correlation defines the relationship between the degree of a node and that of connected nodes [40]. Modularity is defined as the tendency of nodes to cluster and to interact, coordinating biological processes [41, 42]. The parameters were analyzed using comparison of proportions (MedCalc Software version 18.2.1, Mariakerke, Belgium).

Results

Gene set enrichment for pathway analysis

Total transcriptome analysis of healthy control and sepsis group revealed that 176 to 8229 genes showed expression differences in datasets. Gene set enrichment for pathway analysis was performed based on KEGG pathways (S4 Table in S1 File). Among up-regulated or down-regulated pathways with statistical significance, we searched for pathways associated with immune-related pathway (S2–S6 Figs in S1 File). We found that complement and coagulation cascade pathway and Fc epsilon RI signaling and Fc gamma R-mediated phagocytosis pathways were up-regulated in 2/3 of datasets from adult patients and in 3/3 of those from pediatric patients. T cell receptor signaling pathway was down-regulated in 2/3 and 3/3 of datasets from adult and pediatric patients, respectively. Antigen presentation and processing were down-regulated in 2/3 of datasets from adult and pediatric patients (Table 2). Altogether, these findings were interpreted along with the network analysis results. In network analysis, adaptive immunity genes were prominent and isolated from network in sepsis patient group (Fig 1). T cell receptor signaling (TCR) pathway was down-regulated in 5/6 of the datasets. These data imply that adaptive immunity related with T cells might be impaired or dysregulated in sepsis. In addition, adaptive immunity was associated with unfavorable prognosis in sepsis group. Complement and coagulation cascade was prominent in sepsis survivors of GSE95233 datasets (Fig 2). Complement and coagulation cascade was increased in sepsis in 5/6 of datasets and these data imply that these pathways may be involved in convalescence process.

Network structure of GSE57065 of the (A) control group and (B) sepsis group. In the sepsis group, the adaptive immunity network was prominent and isolated (green).
Fig 1

Network structure of GSE57065 of the (A) control group and (B) sepsis group. In the sepsis group, the adaptive immunity network was prominent and isolated (green).

Network structure of GSE95233 of (A) sepsis survivors and (B) sepsis nonsurvivors. In the survivor group, the platelet (olive green) and coagulation cascade network (yellow) were isolated and prominent. In the nonsurvivor group, the adaptive immunity (green) and innate immunity networks (blue) were prominent and isolated.
Fig 2

Network structure of GSE95233 of (A) sepsis survivors and (B) sepsis nonsurvivors. In the survivor group, the platelet (olive green) and coagulation cascade network (yellow) were isolated and prominent. In the nonsurvivor group, the adaptive immunity (green) and innate immunity networks (blue) were prominent and isolated.

Table 2
Differentially expressed gene analysis between control and sepsis followed by gene set pathway analysisa.
 GSE54514GSE57065GSE95233GSE64456GSE66099GSE72829
Patient typeAdultAdultAdultPediatricPediatricPediatric
Control cases (n)182522194952
Sequential selection of sepsis cases (n)182522194952
Differentially expressed Genes (n)17678856784312382297463
Up regulated pathway (n)06044137851
Down regulated pathway (n)03333163315
Up regulated immune pathwayC, Fc, RIGC, FcC, Fc, NK, K, TLRAPC, C, CXC & etc, Fc, IgA,BCR, C, Fc, CXC & etc, LK, NK, TLR
Down regulated immune pathwayAPC, TCR, H, KAPC, BCR, CXC & etc, IgA, TCRAPC, IgA, TCRH, TCRAPC, IgA, TCR

aDifferentially expressed gene was selected based on FDR (P<0.01).

APC, Antigen processing and presentation; BCR, B cell receptor signaling; C, Complement and coagulation; CXC & etc, chemokine signaling; Fc, Fc gamma R mediated phagocytosis or Fc epsilon RI signaling; H, hematopoietic cell lineage; IgA, intestinal immune network for IgA production; LM, leukocyte endothelial migration; NK, NK cell mediated cytotoxicity; RIG, RIG-I like receptor signaling; TLR, Toll like receptor signaling; TCR, T cell receptor signaling.

Network structure and immune function

The network topologic structure revealed that, in sepsis, adaptive immunity genes (green colored) formed a prominent cluster, isolated from the rest of the network (Fig 1 and S7–S18 Figs in S1 File). In that isolated cluster, CD247 (CD3 Zeta chain), CD8A, ITK (tyrosine protein kinase ITK / TSK), LAT (linker for activation of T cells), LCK (leukocyte C-terminal Src kinase) were found in all 6 datasets. CD2, FYN (Src family tyrosine kinase), GATA3 (GATA-binding protein 3), IL7R, RASGRP1 (RAS guanyl-releasing protein 1) were all found in adult datasets, but 2/3were found in the pediatric group. CBLB (E2 ubiquitin protein ligase CBL-B), CD3D, CD3G, ZAP70 (Zeta-chain-associated protein kinase 70) were found in 2/3 of adult datasets, but all were found in pediatric datasets. Other genes are listed in S5 Table in S1 File. In addition, NK cell activity (light blue) tended to be included in the network in the sepsis group, whereas it was isolated in the healthy control group. The nonsurvivor network from GSE95233 revealed that adaptive immunity was isolated and prominent from the rest of the network, whereas platelet activity (olive green) and the compliment and coagulation cascade pathway (yellow) were isolated and prominent in the survivor group (Fig 2). Platelet, complement and coagulation-related genes were prominent in survivors; C6, C7, F11, COL1A1, COL3A1, ADCY6, GP5, FGG, MYLK4, SERPINE1, SERPIND1, CR2, etc. were included. CD8B, IL1A, IL3, IL12B, IL21, IL23R, and other genes were also included. The isolated gene cluster from nonsurvivors included CD2, CD3D, CD3G, CD8A, CD247, CBLB, GATA3, ITK, LCK, CTLA4, IL10, ZAP70, and other genes.

Network parameter analysis

Comparison of topologic parameters revealed that the clustering coefficient in sepsis was increased in 2/3 of adult datasets and in 3/3 of pediatric datasets (Table 3). Network heterogeneity was increased in 3/3 of adult datasets, but was decreased in 3/3 of pediatric datasets. Modularity value was increased in 1/3 of adult datasets, whereas modularity value was increased in 3/3 of pediatric datasets in the sepsis group. Shortest path was decreased in 2/3 of adult and pediatric datasets, whereas average degree was increased in 2/3 of adult and pediatric datasets (Table 3). For GSE95233-1 analysis, difference between network parameters was calculated as follows: (case—control / control) x 100. The difference of clustering coefficient between survivor (0.390) and nonsurvivor (0.337) was -13.5%. The difference of modularity between survivor (0.581) and nonsurvivor (0.694) was 19.4%.

Table 3
Network topological parameters from gene expression data from GEO dataset for adult and paediatric patient.
GSE54514GSE57065GSE95233GSE95233-1GSE64456GSE66099GSE72829
Patient typeAdultAdultAdultAdultPediatricPediatricPediatric
Disease typeControlseptic shockcontrolseptic shockcontrolseptic shocksurvivornon survivorControlsepsisControlsepsisControlsepsis
Clustering coefficient0.3190.3230.3280.2800.3280.3670.3900.3370.3670.3900.3370.3880.3810.435
Network density0.0130.0160.0120.0100.0120.0390.0260.0140.0390.0260.0140.0120.0300.034
Network heterogeneity1.0391.1421.1511.1721.1511.5041.3581.1931.5041.3581.1931.0911.2491.129
Connected components2730426142343347343347321625
Network diameter191714151412191112191114139
Network centralization0.0590.0790.1030.0480.1030.1960.1130.0750.1960.1130.0750.0780.1480.121
Shortest path160514805985963817222596381426401211521173014264012115211730369023809222532
Characteristic path length7.4185.2675.2765.0885.2763.8627.2113.4823.8627.2113.4824.7053.922.824
Average degree, <k>5.937.425.303.645.3017.8711.714.0417.8711.714.044.508.4310.27
Number of nodes, N464453426367426457448297457448297380284299
Degree correlation, μ0.4890.5080.4220.8100.4220.4030.7150.5460.4030.7150.5460.2100.4750.346
Modularity, Mc0.7430.6620.6440.8060.6440.1980.5810.6940.1980.5810.6940.7450.4340.603

GSE95233-1 indicates survivors (n = 16) and non-survivors (n = 16) among sepsis patients.

We searched for nodes with such a high degree that they could be considered as hub nodes. In the GSE54514 control and sepsis groups, CD3D and ITPR3 (inositol 1,4,5 trisphosphate receptor type 3) had degree 33 and 43, respectively. In the GSE57065 control and sepsis groups, ATG5 (autophagy-related 5), and GBP5 (guanylate-binding protein 5) had degree 19 and 25, respectively. In the GSE95233 control and sepsis groups, IFNGR1 (interferon gamma receptor 1), and ESAM (endothelial cell selective adhesion molecule) had degree 49 and 21, respectively.

In the GSE64456 control and sepsis groups, LYN (tyrosine protein kinase Lyn), and IFI16 (interferon gamma-inducible protein 16) had degree 56 and 70, respectively. In the GSE66099 control and sepsis groups, NCF2 (neutrophil cytosolic factor 2), NLRP12 (NLR family pyrin domain-containing 12), and NCR2 (natural cytotoxicity-triggering receptor 2) had degree 26, 26, and 34, respectively. In the GSE72829 control and sepsis groups, C5AR1 (complement C5a receptor 1) and CSF3R (colony-stimulating factor 3 receptor) had degree 50 and 46, respectively. These network parameters are listed in S6–S17 Tables in S1 File.

Discussion

The features of immune dysfunction or immune response in sepsis include defective antigen presentation and adaptive immunity (T and B cell abnormalities), abnormalities in NK cell activity, diminished immunoglobulin level, alteration in neutrophils, abnormal cytokine levels, unbalanced pro- and anti-inflammatory cytokines, complement consumption, and defective bacterial killing [4, 5]. This complexity hampers understanding of immunologic features in sepsis. Immune suppression, immune paralysis and hypo-inflammatory response are reported in sepsis that is intermingled with hyperinflammatory state [68].

This study showed that in the adult sepsis group, there were more down-regulated pathways, whereas in the pediatric group, there were more up-regulated pathways. These data suggest that immune suppression might be more dominant in the adult sepsis group, whereas hyper-inflammatory states might be dominant in the pediatric sepsis group. The mechanism of immune response was indeed different between adult and pediatric patients in sepsis. However, down-regulation of TCR and APC pathway and up-regulation of complement-coagulation cascade pathway was found in most of adult and pediatric sepsis patients. These findings are congruent with previous reports that, in sepsis, CD4+ T cell showed cell exhaustion, increased apoptosis and decreased adhesion molecules, and CD28 and TCR diversity. In sepsis, CD8+ T cell also showed cell exhaustion, increased apoptosis, and decreased cytotoxic function, cytokine secretion, and TCR diversity [8]. Antigen presentation by follicular dendritic cell or dendritic cells is decreased in sepsis [8]. In the present study, isolation and prominent adaptive immune genes from the rest of the immune network were noted, and these data imply that part of the adaptive immune function was not in equilibrium or in a regulated state. As this isolated adaptive immune network was found in nonsurvivors, initial network state along with down-regulated pathways related with adaptive immunity resulted in unfavorable prognosis.

Complement and coagulation pathway was up-regulated in sepsis patients. However, network analysis showed that platelet and complement-and-coagulation-related genes were prominent and isolated in survivors, and that this initial state resulted in a favorable prognosis. On the contrary, it is reported that persistently activated complement system, including C3a and C5a, is found in sepsis, which is associated with unfavorable prognosis [68]. Excessive activation of complement system potentiated activated innate immune system in sepsis, and this also resulted in an unfavorable prognosis in the sepsis group [6]. These findings imply that initial inactivation of complement-coagulation pathway or persistent, uncontrolled activation of these pathways might be associated with unfavorable prognosis.

Network analysis along with gene set enrichment analysis from gene expression profiles may have elucidated sepsis pathogenesis and host immune response in sepsis [1117]. In our study, the interrelations among immune genes were analyzed using a network analysis approach that might reflect the comprehensive aspects of the dynamics of gene expression. Analysis of the topologic structure of the network in sepsis revealed that adaptive-immunity-related genes formed a cluster in sepsis patients (S7–S18 Figs in S1 File), which tended to be isolated.

Formation of a cluster or module with adaptive immune genes might be a normal process to respond to sepsis or might be a pathologic process associated with immune dysregulation. To elucidate the nature of the cluster, we analyzed survivors and nonsurvivors from the sepsis group in GSE95233. The survivor group showed a prominent and isolated cluster with complement-coagulation and platelet activity genes, whereas the nonsurvivor group showed a cluster with prominent and isolated adaptive immunity genes (Fig 2). Although this analysis included only one dataset, as a cluster of adaptive immunity was observed among the nonsurvivor group, these isolated and prominent genes seemed to be related to adverse effects. Platelet and complement cascade activation might be related to clearance of microbes, which supports the adaptive immune response [43, 44]. Activation of adaptive immunity is required to resolve sepsis, but the isolated and prominent adaptive immune response in this study seemed to be dysregulated or impaired. Additional data from gene set enrichment for pathway analysis showed that TCR pathway was down regulated in all the datasets. As the isolated cluster in network analysis included genes related to TCR pathway, adaptive immunity seemed to be suppressed or dysregulated, which resulted in an unfavorable prognosis [7, 45].

Clustering coefficient is one of the network topologic parameters that reflects the number of links between neighboring nodes, and its increased values denote an increased probability of similar biological function [45, 46]. In this study, the clustering coefficient was increased in 2/3 of adult datasets and 3/3 of pediatric datasets in the sepsis group, which implies increased function of the immune network. Previous studies revealed that an increased clustering coefficient correlated with bloodstream C-reactive protein concentration or inflammation in hepatocellular carcinoma patients before and after transarterial chemoembolization [47]. Network analysis revealed that inflammatory process seemed to be activated in sepsis group; further studies are required to understand the relation between the clustering coefficient and the inflammatory response in the immune network.

Modularity of a network is related to subgroups or a clusters within that network [48, 49]. High modularity implies dense connections between nodes within the same cluster but sparse connections between nodes in different clusters [50, 51]. Modularity is expected to promote evolvability and multifunctionality, and functions as a driving biological process [52, 53]. In this study, the sepsis group showed increased modularity values in 3/3 of the studied pediatric datasets. On the contrary, only 1/3 of the adult datasets in the sepsis group showed increased modularity. These results are consistent with the finding that, in the sepsis group, the pediatric datasets showed more significant pathways compared to those of the adult datasets (Table 2).

Extreme age groups (patients younger than 1 year or older than 50 years) are related with increases incidence and mortality of sepsis [54, 55]. Although organ development, immunity, immune cell subsets, cytokines response to antigen, and hormonal effects might be different between the 2 extreme age groups, we found a common pattern in this study. In particular, CD8A-related genes were isolated and TCR pathway was down-regulated, and antigen-presenting cells were isolated from rest of the immune network. CD8A molecule is related with CD8+ T cells, which is known to be quantitatively and qualitatively different in extreme age groups. CD8+ T cells are decreased in older age group, and CD8+ T cell in neonates are biased toward an innate immune response, unlike in adults, which functions as a cellular immune response [55, 56]. These findings could be part of prior information or knowledge that could help, directly or indirectly, elucidate pathobiology in sepsis, which might be associated with management of the sepsis. Further studies are required in regard to the functional aspect of the isolated cluster of adaptive immunity from network analysis and function of CD8+ T cell in sepsis. On the other hand, network analysis showed that network heterogeneity was higher in adult sepsis, whereas it was lower in pediatric sepsis. Gene set enrichment analysis showed that the adult sepsis group had more down-regulated pathways compared to the pediatric group, which had more up-regulated pathways. High network heterogeneity values imply that the nodes are connected with other nodes of different types, while lower values denote that the nodes are connected with similar other nodes. Altogether, these data suggest that immune senescence or immune hyperactivation are a driving force of immune response in adult and pediatric sepsis, respectively.

Among prominent clusters from survivors, IL1A, IL12, IL21, and IL23 genes were also included. Although statistically insignificant, previous studies showed that the mean cytokines levels (pg/mL) for survivors (n = 63) and nonsurvivors (n = 17) were as follows [49]: IL1A, 24.8, 7.4; IL12, 15.7, 0; and IL21, 138.9, 10.1, IL23, 75.4, 16.7, respectively. Mean cytokine level of IL1A, IL12, IL21, IL23 was increased in survivors of sepsis. These data suggest that platelet, complement and coagulation related molecules and IL1A, IL12, IL21, IL23 might be closely related with convalescence process in sepsis.

Altogether, network analysis showed isolated components or isolated clusters of genes related to adaptive immune response. The genes in the cluster were related to adaptive immune response or T cells and the gene set enrichment analysis showed that the T cell signaling pathway was decreased in 2/3 of adult and 3/3 of pediatric sepsis datasets. From these findings, T cell signaling or T cell related functions seemed to be impaired or decreased in sepsis cases. Among survivors of sepsis, complement and coagulation cascade and platelet-related genes were prominent in network analysis. Gene set enrichment analysis showed that complement and coagulation cascade pathway was up-regulated in 2/3 of adult and 3/3 of pediatric sepsis cases. From these findings, complement and coagulation pathway seemed to be associated with convalescence process during sepsis process. Clustering coefficient that was related with inflammatory process was increased in 2/3 of adult and 3/3 in pediatric datasets.

A limitation of this study is that the use of various platforms might have increased variations among datasets. The cases recruited for normal control and sepsis was relatively small because of the small sample size of the normal control group. Survivors and nonsurvivors networks were analyzed in only one dataset; thus further studies are required for more robust results. In addition, only genes in the immune pathways were analyzed; though, other genes related to metabolism and the endocrine system might have an effect on the sepsis network. Multiple gene functional annotations were reduced to a gene function that included the most frequent function from the KEGG, GO and Reactome pathways, which might have caused some bias. As this study was a retrospective study using public datasets, sex and age was not matched for control and sepsis cases. The results might have been affected by these parameters.

Conclusion

Immune dysfunction might be caused by prominence and isolation of the adaptive immune response from the rest of the immune network. The isolated cluster included T cell receptor signaling gene and that pathway was down-regulated as resulting from gene set enrichment for pathway analysis. The increased clustering coefficient and modularity implied that an inflammatory response was activated in the sepsis group. Survivors of sepsis showed a prominent cluster of genes that was related to platelet and complement and coagulation cascade pathways. Up-regulated complement and coagulation cascade pathway in sepsis seemed to be related with convalescence process or with favorable prognosis. Network and gene set enrichment analysis supported elucidation of sepsis pathogenesis.

Acknowledgements

The authors thank the researchers who provided data in the GEO public database.

References

RCBone, RABalk, FBCerra, RPDellinger, AMFein, WAKnaus, et al. Definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. Chest. 1992; 101:16441655. 10.1378/chest.101.6.1644

CWSeymour, VXLiu, TJIwashyna, FMBrunkhorst, TDRea, AScherag, et al. Assessment of clinical criteria for sepsis for the third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. 2016; 315:762774. https://doi.org/10.100.jama. 2016.0288. 10.1001/jama.2016.0288

MSinger, CDeutschman, CWSeymour, MShankar-hari, DAnnane, MBauer, et al. The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. 2016; 315:801810. 10.1001/jama.2016.0287

JBermejo-Martin, DAndaluz-Ojeda, RAlmansa, FGandia, JGomez-Herreras, EGomez-Sanchez, et al. Defining immunological dysfunction in sepsis: A requisite tool for precision medicine. J Infect. 2016; 72:525536. 10.1016/j.jinf.2016.01.010

ALeligdowicz, MAMatthay. Heterogeneity in sepsis: new biological evidence with clinical applications. Crit Care. 2019; 23:8088. 10.1186/s13054-019-2372-2

JBoomer, KTo, KChang, OTakasu, DOsborne, AWalton, et al. Immunosuppression in patients who die of sepsis and multiple organ failure. JAMA. 2011; 306:25942605. 10.1001/jama.2011.1829

MDelano, PWard. The immune system’s role in sepsis progression, resolution, and long term outcome. Immunol Rev. 2016; 274:330353. 10.1111/imr.12499

RHotchkiss, GMonneret, DPayen. Sepsis-induced immunosuppression: from cellular dysfunction to immunotherapy. Nat Rev Immunol. 2013; 13:862874. 10.1038/nri3552

DWJekarl, SLee, JLee, YJPark, YKim, JHPark, et al. Procalcitonin as a diagnostic marker and IL-6 as a prognostic marker for sepsis. Diagn Microbiol Infect Dis. 2013; 75:342347. 10.1016/j.diagmicrobio.2012.12.011

10 

DWJekarl, JYKim, SLee, MKim, YKim, KHan, et al. Diagnosis and evaluation of severity of sepsis via the use of biomarkers and profiles of 13 cytokines: a multiplex analysis. Clin Chem Lab Med. 2015; 53:575581. 10.1515/cclm-2014-0607

11 

GParnell, BTang, MNalos, NArmstrong, SHuang, DBooth, et al. Identifying key regulatory genes in the whole blood of septic patient to monitor underlying immune dysfunctions. Shock. 2013; 40:166174. 10.1097/SHK.0b013e31829ee604

12 

MACazalis, ALepape, FVenet, FFrager, BMougin, HVallin, et al. Early and dynamic changes in gene expression in septic shock patients: a genome-wide approach. Intensive Care Med Exp. 2014; 2:2037. 10.1186/s40635-014-0020-3

13 

PMahajan, NKuppermann, AMejias, NSuarez, DChaussabel, CCasper, et al. Association of RNA biosignatures with bacterial infections in febrile infants aged 60 days or younger. JAMA. 2016; 316:846857. 10.1001/jama.2016.9207

14 

TSweeney, AShidham, HWong, PKhatri. A comprehensive time-course-based multicohort analysis of sepsis and sterile inflammation reveals a robust diagnostic gene set. Sci Transl Med. 2015; 7:287ra71. 10.1126/scitranslmed.aaa5993

15 

JHerberg, MKaforou, VWright, HShailes, HEleftherohorinou, CHoggart, et al. Diagnostic test accuracy of a 2-transcript host RNA signature for discriminating bacterial vs viral infection in febrile children. JAMA. 2016; 316:835845. 10.1001/jama.2016.11236

16 

FVenet, JSchilling, MCazalis, JDemaret, FPoujol, TGirardot, et al. Modulation of LILRB2 and mRNA expression in septic shock patients and after ex vivo lipopolysaccharide stimulation. Hum Immunol. 2017; 78:441450. 10.1016/j.humimm.2017.03.010

17 

TSweeney, TPerumal, RHenao, MNichols, JHowrylak, AChoi, et al. A community approach to mortality prediction in sepsis via gene expression analysis. Nat Comm. 2018; 9:664673. 10.1038/s41467-018-03078-2

18 

ALBarabasi, ZOltvai. Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004; 5:101113. 10.1038/nrg1272

19 

LBarabasi A, NGulbahce, JLoscalzo. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011; 12:5668. 10.1038/nrg2918

20 

DWJekarl, KSKim, SLee, MKim, YKim. Cytokine and molecular networks in sepsis cases: a network biology approach. Eur Cytokine Netw. 2018; 29:103111. 10.1684/ecn.2018.0414

21 

HMatsumoto, HOgura, KShimizu, MIkeda, THirose, HMatsuura, et al. The clinical importance of a cytokine network in the acute phase of sepsis. Sci Rep. 2018; 8:13995. 10.1038/s41598-018-32275-8

22 

MRitchie, MDunning, MSmith, WShi, ALynch. BeadArray expression analysis using bioconductor. PLoS Comp Biol. 2011; 7:e1002276. 10.1371/journal.pcbi.1002276.

23 

RJaksik, MIwanaszko, RRzeszowska-Wolny, MKimmel. Microarray experiments and factors which affect their reliability. Biol Direct. 2015; 10:46. 10.1186/s13062-015-0077-2

24 

R Developoment Core Team. R: A language and environment for statistical computing. Vienna, Austiria: R Foundation for Statistical Computing 2017. http://cran.r-project.org/.

25 

SFalcon, MMorgan, RGentleman. An introduction to Bioconductor’s expression set class. R package. 2007. http://www.bioconductor.org/packages/release/bioc/vignettes/Biobase/inst /doc/ExpressionSetIntroduction.pdf.

26 

LGautier, LCope, BMBolstad, RAIrizarry. Affy—analysis of Affymetrix genechip data at the probe level. Bioinformatics. 2004; 20:307315. 10.1093/bioinformatics/btg405

27 

SDavis, PMeltzer. GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics. 2007; 23: 18461847. 10.1093/bioinformatics/btm254

28 

PDu, WAKibbe, SMLin. Lumi: a pipeline for processing Illumina microarray. Bioinformatics. 2008; 24:15471548. 10.1093/bioinformatics/btn224

29 

MRitchie, BPhipson, DWu, YHu, CWLaw, WShi, et al. Limma powers differential expression analysies for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015; 43:e47e60. 10.1093/nar/gkv007

30 

M.Carlson hgu133plus2.db: Affymetrix human genome U133 plus 2.0 array annoatation data (chip hgu133plus2). R package. 2016. http://www.bioconductor.org/packages/release/data/annotation/html/hgu133plus2.db.html.

31 

M.Carlson lumiHumanAll.db: illumina human illumina expression annotation data (chip lumiHumanAll). R package. 2013. http://www.bioconductor.org/packages/release/data/annotation/html/lumiHumanAll.db.html.

32 

SDraghici. Statitics and data analysis for microarrays using R and Bioconductor: CRC Press; 2012.

33 

WLuo, MFriedman, KShedden, KHankenson, PWoolf. GAGE: generally applicable gene set enrichment for pathway analysis. BMC Bioinformatics. 2009; 10:161, 10.1186/1471-2105-10-161

34 

MKanehisa, MFurumichi, MTanabe, YSato, KMorishima. KEGG: new perspective on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017; 45:D353D361. 10.1093/nar/gkw1092

35 

The Gene Ontology Consortium. Gene ontology consortium: going forward. Nucleic Acids Res. 2014; 43:D1049D1056. 10.1093/nar/gku1179

36 

AFabregat, KSidiropoulos, PGarapati, MGillespie, KHausmann, RHaw, et al. The Reactome pathway knowledgebase. Nucleic Acids Res. 2016; 44:D481D487. 10.1093/nar/gkv1351

37 

NTDoncheva, YAssenov, FSDomingues, MAlbrecht. Topological analysis and interactive visualization of biological networks and protein structures. Nat Protoc. 2012; 7:670685. 10.1038/nprot.2012.004

38 

RSaito, MESmoot, KOno, JRuscheinski, PWang, SLotia, et al. A travel guide to cytoscape plugins. Nat Method. 2012; 9:10691076. 10.1038/nmeth.2212

39 

Netanalyzer. Germany: Max Plank Institute for Informatics. 2017. https://med.bioinf.mpi-inf.mpg.de/netanalyzer/.

40 

A.Barabasi Network science: Cambridge University Press; 2016. 10.1017/nws.2016.2

41 

RPAlexander, PMKim, TEmonet, MGerstein. Understanding modularity in molecular networks requires dynamic. Sci Sig. 2009; 2:4448. 10.1126/scisignal.281pe44.

42 

GWagner, MPavlicev, JCheverud. The road to modularity. Nat Rev Genet. 2007; 8:921931. 10.1038/nrg2267

43 

ADewitte, SLepreux, JVilleneuve, CRigothier, CCombe, AOuattara, et al. Blood platelets and sepsis pathophysiology: A new therapeutic prospect in critical ill patients? Ann Intensive Care. 2017; 7:115133. 10.1186/s13613-017-0337-7

44 

TJStocker, HIshikawa-Ankerhold, SMassberg. CSchulz. Small but mighty: platelets as central effector of host defense. Thromb Haemost. 2017; 117:651661. 10.1160/TH16-12-0921

45 

LRogge, EBianchi, MBiffi, EBono, SChang, HAlexander, et al. Transcript imaging of the development of human T helper cells using oligonucleotide arrays. Nature Genet. 2000; 25:96100. 10.1038/75671

46 

GAPavlopoulos, MSecrier, CNMoschopoulos, TGSoldatos, SKossida, JAerts, et al. Using graph theory to analyze biological networks. BioData Mining. 2011; 4:1037. 10.1186/1756-0381-4-10

47 

DWJekarl, SLee, JHKwon, SWNam, MKim, YKim, et al. Complex interaction networks of cytokines after transarterial chemotherapy in patients with hepatocellular carcinoma. PLoS One. 2019; 14: e0224318. 10.1371/journal.pone.0224318

48 

ZZhu, MGerstein, MSnyder. Getting connected: analysis and principles of biological networks. Genes Dev. 2017; 21:10101024. 10.1101/gad.1528707.

49 

DWJekarl, JYKim, JWHa, SLee, JYoo, MKim, et al. Diagnosis and prognosis of sepsis based on use of cytokines, chemokines and growth factors. Dis Markers. 2019; 111. 10.1155/2019/1089107

50 

JDong, SHorvath. Understanding network concepts in modules. BMC Syst Biol. 2007; 1:120. 10.1186/1752-0509-1-1

51 

KMitra, ACarvunis, SRamesh, TIdeker. Integrative approaches for finding modular structure in biological networks. Nat Rev Genet. 2013; 10:719732. 10.1038/nrg3552

52 

MEJNewman. Networks an introduction: Oxford; 2010.

53 

ZZhang, JZhan. A big world inside small-world networks. PLoS One. 2009; 4:e5686. 10.1371/journal.pone.0005686

54 

KRudd, SJohnson, KAgesa, KShackelford, DTsoi, DKievlan, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the global burden of disease study. Lancet. 2020; 395:200211. 10.1016/S0140-6736(19)32989-7

55 

AAlpert, YPickman, MLeipold, YRosenberg-Hasson, XJi, RGaujoux, et al. A clinically meaningful metric of immune age derived from high-dimentional longitudianl monitoring. Nat Med. 2019; 25:487495. 10.1038/s41591-019-0381-y

56 

AGalindo-Albarran, OLopez-Portales, DGutierrez-Reyna, ORodriguez-Jorge, JSanchez-Villanueva, ORamirez-Pliego, et al. CD8+ T cells from human neonates are biased toward an innate immune response. Cell Rep. 2016; 15:21512160. 10.1016/j.celrep.2016.10.056