PLoS ONE
Home Hypomethylation mediates genetic association with the major histocompatibility complex genes in Sjögren’s syndrome
Hypomethylation mediates genetic association with the major histocompatibility complex genes in Sjögren’s syndrome
Hypomethylation mediates genetic association with the major histocompatibility complex genes in Sjögren’s syndrome

Competing Interests: The authors have declared that no competing interests exist.

Article Type: Research Article Article History
Abstract

Differential methylation of immune genes has been a consistent theme observed in Sjögren’s syndrome (SS) in CD4+ T cells, CD19+ B cells, whole blood, and labial salivary glands (LSGs). Multiple studies have found associations supporting genetic control of DNA methylation in SS, which in the absence of reverse causation, has positive implications for the potential of epigenetic therapy. However, a formal study of the causal relationship between genetic variation, DNA methylation, and disease status is lacking. We performed a causal mediation analysis of DNA methylation as a mediator of nearby genetic association with SS using LSGs and genotype data collected from 131 female members of the Sjögren’s International Collaborative Clinical Alliance registry, comprising of 64 SS cases and 67 non-cases. Bumphunter was used to first identify differentially-methylated regions (DMRs), then the causal inference test (CIT) was applied to identify DMRs mediating the association of nearby methylation quantitative trait loci (MeQTL) with SS. Bumphunter discovered 215 DMRs, with the majority located in the major histocompatibility complex (MHC) on chromosome 6p21.3. Consistent with previous findings, regions hypomethylated in SS cases were enriched for gene sets associated with immune processes. Using the CIT, we observed a total of 19 DMR-MeQTL pairs that exhibited strong evidence for a causal mediation relationship. Close to half of these DMRs reside in the MHC and their corresponding meQTLs are in the region spanning the HLA-DQA1, HLA-DQB1, and HLA-DQA2 loci. The risk of SS conferred by these corresponding MeQTLs in the MHC was further substantiated by previous genome-wide association study results, with modest evidence for independent effects. By validating the presence of causal mediation, our findings suggest both genetic and epigenetic factors contribute to disease susceptibility, and inform the development of targeted epigenetic modification as a therapeutic approach for SS.

Chi,Taylor,Quach,Quach,Criswell,Barcellos,and Di Ruscio: Hypomethylation mediates genetic association with the major histocompatibility complex genes in Sjögren’s syndrome

Introduction

Sjögren’s syndrome (SS) is an autoimmune disease characterized by the lymphocytic infiltration of salivary and lacrimal glands, resulting in dryness of the mouth and eyes, fatigue, and joint pain. The prevalence of SS is estimated to be 3% in individuals aged 50 years or older and 0.6% overall, with a 9:1 female-to-male predominance [1]. When SS occurs in isolation, it is referred to as primary SS; secondary SS co-occurs with other systemic autoimmune diseases [2]. Environmental factors including infectious agents, stress, air pollution, and silicone are implicated in disease pathogenesis [36]. Genetic association studies have established genetic loci both within and outside the major histocompatibility complex (MHC) [79].

Differential methylation has been reported by multiple studies of CD4+ T cells, CD19+ B cells, whole blood, and labial salivary glands (LSGs) in SS [1021]. Specifically, hypomethylation of immune-related genes has been observed, along with implications for altered gene expression. Some of these studies found evidence supporting genetic control of DNA methylation. Miceli-Richard et al. reported an overlap of differentially methylated probes with established genetic risk loci, suggesting both genetic and epigenetic abnormalities in the same genes [18]. Imgenberg-Kreuz et al. identified methylation quantitative trait loci (meQTL), or loci where genetic variation is associated with DNA methylation, in whole blood [19]. However, this association analysis was performed based on the whole blood of healthy controls only, instead of based on both pSS cases and controls. These association results alone are not sufficient to support the causal mediation of DNA methylation for the genetic association with SS (e.g. ruling out reverse causation). Distinguishing differential methylation that is a cause of, rather than a consequence of, disease is essential for further consideration of epigenetic modification as a therapeutic approach to SS [22].

We investigated evidence for genetic control of DNA methylation for SS risk using LSGs from 64 primary SS cases and 67 symptomatic non-cases from the Sjögren’s International Collaborative Clinical Alliance (SICCA) registry. Our overall approach first used bumphunter to identify differentially-methylated regions (DMRs), or regions where contiguous CpG sites are differentially methylated in the same direction. Then, for each DMR, we identified its corresponding meQTLs as SNPs within ±250 kb that are associated with its DNA methylation levels. These meQTLs are considered cis-meQTLs since meQTL effects spanning multiple megabase pairs at the MHC have been observed [23]. Finally, we performed the causal inference test (CIT) developed by Millstein et al. to find DMR-meQTL pairs where the DMR shows strong evidence of mediating the risk of surrounding meQTLs on SS [24]. By extension, this also suggested CpG sites whose methylation levels could be independent of neighboring genetic variation and CpG sites whose methylation levels may be influenced by disease status. These findings significantly expand what is known about potential targets of epigenetic-modifying agents within the human genome. Although cancer has been the most common application for epigenetic therapies [2528], it is believed that knowledge of effective target biomarkers as well as the development of high-specificity epigenetic-modifying agents could lead to similar successes for non-cancerous conditions such as SS [22, 29, 30].

Materials and methods

Study subjects and clinical evaluation

A total of 131 female, non-Hispanic white individuals were selected from SICCA for this study. Multidimensional scaling (MDS) of genotype data confirms their non-Hispanic white ancestry and suggests that the majority of individuals are predominantly of French or Orcadian ancestry (S1 Fig). All individuals from the SICCA registry exhibited at least one symptom related to SS, specifically symptoms of dry eyes or dry mouth, prior suspicion/diagnosis of SS, positive serum anti-SSA, anti-SSB, rheumatoid factor or antinuclear antibody results, increase in dental caries, bilateral parotid gland enlargement, or a possible diagnosis of secondary SS [31]. Table 1 summarizes the SS phenotypes, potential confounders, and co-morbidities of these study subjects. Case status was determined according to the 2016 American College of Rheumatology/European League Against Rheumatism (ACR/EULAR) criteria for SS [32]. Non-cases from the SICCA registry with at least one, but not all, SS symptoms or signs were also included. More specifically, non-cases did not meet ACR/EULAR for SS but were enrolled in SICCA due to the presence of 1 or more symptoms or signs suggesting possible SS. Based on these criteria, we studied 64 SS cases and 67 non-cases.

Table 1
Summary statistics of SS phenotypes, potential confounders, and co-morbidities.
cases (n = 64)non-cases (n = 67)p-value
Focus score3.39 (1.83)0.89 (0.67)6.80E-6
Left ocular staining score7.46 (2.91)3.19 (2.75)8.54E-12
Right ocular staining score7.19 (3.17)3.25 (2.74)4.21E-10
SSA seropositive (indicator)0.6303.61E-14
SSB seropositive (indicator)0.5506.26E-12
Unstimulated whole salivary flow rate0.34 (0.39)0.70 (0.54)8.20E-6
Schirmer ≤ 5 mm/5min on at least one eye0.230.072.16E-2
Self-reported age of SS onset at screening49.12 (10.80)46.10 (8.86)2.04E-1
Censored age at study visit54.69 (11.94)53.46 (10.82)4.53E-1
Current smoker0.010.063.90E-1
Anticholinergic drug use0.400.512.76E-1
SLE suspected00NA
SLE physician confirmed0.050.041.00
RA suspected00.011.00
RA physician confirmed0.060.036.34E-1

Means and corresponding standard deviations (in parenthesis) are reported for continuous variables, and proportions are reported for binary variables. The p-value reports significance of difference between cases and non-cases for a given variable, determined either with Wilcoxon’s rank sum test for continuous variables or chi-square test of independence for binary variables. Missing values are excluded from summary statistics. NA = not available; SLE = systemic lupus erythematosus; RA = rheumatoid arthritis.

Methylotyping and data processing

DNA was extracted from the LSG tissue collected from each study subject as previously described [20]. DNA methylation was measured for each subject using the Illumina 450K Infinium Methylation BeadChip (450K) platform for 28 subjects and the Infinium MethylationEPIC (EPIC) platform for 103 subjects. The 450K and EPIC chips allow for high-throughput interrogation of more than 450,000 and 850,000 highly informative CpGs sites respectively, spanning ~22,000 genes across the genome.

Methylation data processing was performed using Minfi, a Bioconductor package for the analysis of Infinium DNA methylation microarrays [33]. Background subtraction with dye-bias normalization was performed on methylated and unmethylated signals with the noob procedure, followed by quantile normalization with preprocessQuantile [34, 35].

For joint analysis of all 131 samples, the intersection of CpGs from 450K and EPIC chips was selected for analysis, resulting in a starting number of 452,832 CpGs. Probes where more than 5% of samples had a detection p-value > 0.01 were removed, to retain probes where signal is distinguishable from negative control probes. To remove probes with ambiguous methylation measurements due to incomplete binding between the DNA strand of interest and probe strand DNA, probes with SNPs with minor allele frequency greater than 0% at either the probe site, CpG interrogation site, or single nucleotide extension were removed. Finally, probes identified with probe-binding specificity and polymorphic targets problems, or cross-reactive probes, were removed [36, 37]. The final processed dataset consisted of 336,040 CpG sites. Since no subject had more than 5% of probes with detection p-value > 0.01, all 131 subjects were retained. Both M-values and β-values were used in subsequent analyses (see S1 Text).

Removing unwanted DNA methylation variation

We identified array type (450K or EPIC), genetic ancestry, self-reported age of SS syndrome onset, collection phase, smoker status, anticholinergic drug use, and co-morbidities as potential confounders (Table 1). Of these, array type and genetic ancestry were found to be strongly associated with DNA methylation and case status respectively (p ≤ 0.05) (S1 and S2 Figs), and analytical models were adjusted accordingly. However, case status was not associated with array type, because the distribution of cases and non-cases were similar between 450K and EPIC with 46.4% cases and 50.0% non-cases respectively (S2 Fig). Wilcoxon’s rank sum test of difference in ancestry MDS component values between cases and non-cases revealed a significant association at p-value ≤ 0.05 for components 2–4 and at p-value ≤ 0.10 for component 1. Unwanted methylation variation due to array type and genetic ancestry (batch effects) were removed from β-values and M-values using ComBat from the SVA package, which applies an empirical Bayes, model-based location/scale batch adjustment [38, 39]. See S1 Text for details of Combat usage.

Genotyping and quality control

The subject genotypes were taken from the genotypes of the larger SICCA cohort, which was genotyped on the Illumina HumanOmni2.5-4v1 or Illumina HumanOmni25M-8v1-1 arrays from DNA extracted from whole blood. All quality control steps performed have been previously described [7]. The final genotype dataset consisted of 1,392,448 SNPs.

Dimensionality reduction

Principal component analysis (PCA) was performed on the centered and scaled β-value matrix XRn×p, where n and p are the number of subjects and CpG sites, respectively. PCA was performed on methylation data prior and after batch correction with ComBat.

Multidimensional scaling (MDS) was performed to detect population structure using lower dimensions that explain observed genetic distance. With genotype data as reference allele counts, pairwise genotype dissimilarity is summarized by the distance matrix D=J-IBSRn×n, where IBSRn×n is the identity-by-state similarity matrix and JRn×n is the all-ones matrix. MDS of genotypes from the 131 subjects and reference European subpopulations from the Human Genome Diversity Project (HGDP) [40] was performed using PLINK 1.9 to assess association between genetic ancestry and case-control status [41].

Identification of differentially methylated regions

Differentially-methylated regions (DMRs) were identified using bumphunter, which searches for bumps, or contiguous CpG sites consistently hypermethylated or hypomethylated in one group of subjects compared to the other [42]. The linear regression specified for bumphunter was

which controlled for array type and genetic ancestry. Here, “M” is the M-value without batch correction with Combat, outcome is SS case status, array type indicates array (450K or EPIC), and C1 − C5 indicate the first five MDS components of genotype data. The number of bootstrap resampling B was set to 1,000 for generating null distribution of candidate DMRs for establishing significance. Significant SS DMRs were stringently selected as those with, fwerArea ≤ 0.05 defined as proportion of bootstraps with maximum bump area greater than observed DMR area, and consists of at least two CpG sites. See S1 Text for details on choice of bumphunter hyperparameters and annotation of DMRs.

Gene set enrichment analysis

Since methylation at transcription start sites and gene bodies has been shown to regulate gene expression [43], we restricted gene set enrichment analysis (GSEA) to genes differentially methylated at the promoter or gene body. DMR genes were tested for enrichment of gene ontology (GO) gene sets from the Molecular Signatures Database [44] combined with SS-related gene sets from past studies using the hypergeometric test (see S1 Text for gene set details). False discovery rate was controlled with the Benjamini-Hochberg procedure [45]. Since genes in the same pathway tend to be up or down-regulated together, GSEA was performed separately for hypermethylated and hypomethylated DMR genes in cases compared to non-cases [46].

Identification of DNA methylation quantitative trait loci

Methylation quantitative trait loci (meQTLs) are loci whose genotypes are associated with DNA methylation. We test for short-range cis-meQTLs, defined as SNPs in the ±250 kb genomic region from the DMR start and end positions. This window size was chosen based on previous meQTL studies of similar sample sizes to roughly ensure adequate power [19, 4750]. Although long-range meQTL effects spanning several megabase pairs (mb) has been observed at the MHC [23], McRae et al. observed most significant meQTLs are within 100 kb of target CpGs in their study involving a window size of ±2 mb [50]. Thus, we do not expect many such meQTLs to be missed if they exist. SNPs in approximate linkage equilibrium were selected using PLINK as those satisfying pairwise correlation R2 ≤ 0.5 in a 250,000 bp window, with a window stride of 25,000 bp [41]. The association between a candidate meQTL and DMR was established by regressing the M-value, averaged across CpG sites of the DMR, against genotype encoded as 0, 1, or 2 copies of the reference allele, from all 131 subjects. The DNA methylation values used for identifying meQTLs were batch-corrected for array type and genetic ancestry. Significance of association was evaluated using t-test from linear regression. False discovery rate was controlled with the Benjamini–Hochberg procedure [45].

Mediation analysis with causal inference test

We used the causal inference test (CIT) to determine whether DNA methylation mediates genetic risk by evaluating statistical evidence for a causal mediation model [24, 51]. Specifically, the CIT evaluates a set of statistical tests of the necessary and sufficient conditions for the causal mediation relationship involving genotype “G”, DNA methylation “M”, and case status “S”. In the causal graph of this causal mediation model, the directed edge travels from “G” to “S” through “M”. The conditions are:

    S ~ G

    S ~ M | S

    M ~ S | G

    S ⊥ G | M,

where “~” denotes associated with and “⊥” denotes independent of. In the event of reverse causation, where the disease condition induces differential methylation, a spurious association will instead be observed between genotype and SS, failing condition four. The maximum p-value from these four statistical tests is the CIT p-value. See Millstein et al. for additional details on the CIT [24]. The CIT was performed for the identified meQTL-DMR pairs using genotype, DNA methylation, and SS case status from all 131 subjects. The genotype and DNA methylation data are encoded the same way as for the identification of meQTLs. The CIT genotype is encoded as 0, 1, or 2 copies of the reference allele, DNA methylation value is the batch-adjusted M-value, and SS is binary case status. False discovery rate was controlled at or under 5% using the permutation-based q-value developed and implemented by Millstein et al. [51, 52]. See S1 Text for usage details of the CIT.

Ethics statement

This study was approved by the Institutional Review Board of the Human Research Protection Program at the University of California, San Francisco (approval number: 10–02551).

Results

Characterization of SS cases and non-cases

We start by characterizing the clinical and global DNA methylation profiles of SS cases and non-cases. Although all non-cases exhibit at least one SS-related phenotype, cases have significantly higher focus scores, ocular staining scores, SSA and SSB seropositivity, Schirmer test positivity rate, and lower unstimulated whole salivary flow rates (Table 1). This is expected, since severity in these phenotypes is the basis upon which the 2016 ACR/EULAR criteria classifies SS [32]. From Table 1, there are no significant differences in the potential confounders of age-related variables, smoking habits, and anticholinergic drug use. Around 5% of cases and non-cases have physician confirmed co-morbidities of systemic lupus erythematosus or rheumatoid arthritis, without significant differences in occurrence between the groups. Thus, the presence of co-morbidities is unlikely to significantly influence our differential methylation analysis results. PCA of adjusted DNA methylation data shows clear global differences between cases and non-cases (S3 Fig). This difference is immediately seen in the first principal component, which explains the most variance of the projected methylation data. This highlights the relevance of DNA methylation differences in the context of SS and LSG.

Hypomethylation of genes involved in immune response

Analysis with Bumphunter identified 215 significant DMRs from 2,747 candidate “bumps” (S1 Table). Of the 215 DMRs, 169 were hypermethylated regions and 46 were hypomethylated regions, in cases relative to non-cases. Approximately 84% of DMRs were located in either promoters or gene bodies (Fig 1A), locations where differential methylation tends to influence transcription [43]. The top three DMR-contributing chromosomes were chromosomes 1, 6, and 17, and a majority of DMRs on chromosome 6 overlapped or surrounded the MHC (Fig 1B). Detailed annotation of significant DMRs are in S1 Table. We found no overlap between these DMRs and gene regions with established or suggestive association with SS, even at the MHC [7, 8, 53]. We define an overlap to occur when the genetic coordinate range (start to end) of a gene overlaps with that of the DMR. S2 Table lists the set of genes with which we examined overlap with DMRs. Although Miceli-Richard et al. observed an overlap between genetic risk loci for SS with differentially-methylated DNA regions, our studies differ in the target tissue involved and definition of differential-methylated regions (i.e. region vs single CpG site) [18].

DMR characteristics.
Fig 1

DMR characteristics.

(A) Proportion of SS DMR locations relative to closest gene, and CpG type proportions at each DMR location; most DMRs are located either in the gene body (inside) or promoter, and most DMR CpG sites are either in the CpG island or the open sea. (B) Density plot of SS DMR locations on chromosome 6, where a DMR’s location is represented by GRCh37 genetic coordinates of its first CpG site to last CpG site. The shaded red region denotes the MHC region (28,477,797 bp—33,448,354 bp on chromosome 6). mb = megabase pairs.

Genes near hypomethylated regions in cases were enriched for gene sets associated with immune function (Table 2), with the top gene sets almost exclusively related to immune response. This was expected given many DMRs were concentrated at the MHC. IRF5, which resides on chromosome 7 and is the strongest genetic risk factor for SS outside the MHC [8], was not the nearest gene for any DMRs. Of the 131 individuals in our study, 26 were in a previous LSG study by Cole et al., which identified 57 genes whose promoters were hypomethylated in SS relative to controls [20]. From GSEA, these 57 genes (SS DMP genes) form the top enriched gene set with an adjusted p-value of 1.71E-4 (Table 2). Finally, the DMR gene PSMB9 was one of the 45 genes that previously demonstrated differential expression between SS cases and non-cases [54].

Table 2
Top gene sets enriched for hypomethylated genes in SS.
gene setnoverlap genesp-valueadj. p-value
SS DMP genes8TAP1, LTA, PSMB8, AIM2, NCKAP1L, LINC00426, LCP2, ARHGAP253.80E-181.71E-14
Antigen processing and presentation of endogenous peptide antigen4HLA-E, HLA-B, TAP1, ABCB11.60E-123.59E-9
Antigen processing and presentation of peptide antigen via MHC class I6PSMB9, HLA-E, PSMB8, HLA-B, TAP1, ABCB14.74E-125.53E-9
Antigen processing and presentation of endogenous antigen4HLA-E, HLA-B, TAP1, ABCB14.92E-125.53E-9
Negative regulation of innate immune response4HLA-E, HLA-B, TAP1, NLRC53.93E-103.24E-7
Negative regulation of natural killer cell mediated immunity3HLA-E, HLA-B, TAP14.32E-103.24E-7
Antigen processing and presentation via MHC class IB3HLA-E, TAP1, ABCB11.19E-107.64E-7
Positive regulation of antigen processing and presentation3ABCB1, CCR7, TAP11.58E-97.92E-7
Positive regulation of humoral immune response3LTA, TNF, CCR71.58E-97.92E-7
Negative regulation of cell killing3HLA-B, HLA-E, TAP12.66E-91.20E-6

Candidate gene sets include GO gene sets from the Molecular Signatures Database [44], a set of genes previously reported to harbor differentially methylated CpG sites between SS cases and non-cases (SS DMP genes) [20], and a set of genes previously reported to be differentially expressed between SS cases and healthy controls (SS DE genes) [54]. n = number of overlapping genes; adj. p-value = Benjamini-Hochberg adjusted p-value.

In contrast to hypomethylated regions, genes near hypermethylated regions were enriched for gene sets with several functions; therefore, the overall picture for hypermethylation in cases was less clear. Table 3 shows that the top gene sets were associated with nervous system development and cellular transport and signaling.

Table 3
Top gene sets enriched for hypermethylated genes in SS.
gene setnoverlap genesp-valueadj. p-value
Positive regulation of transporter activity6WNK4, ATP1B2, RELN, HAP1, CACNB2, TRPC61.36E-86.12E-5
Diencephalon development5ETS1, GSX1, GLI2, HAP1, SLC6A44.17E-79.38E-4
Hypothalamus development3ETS1, GSX1, HAP11.73E-62.59E-3
Vasoconstriction3EDN3, HTR1A, SLC6A43.29E-63.42E-3
Modulation of excitatory postsynaptic potential3ZMYND8, CELF4, RELN4.38E-63.42E-3
Somatic stem cell population maintenance4WNT98, LRP5, PBX1, BCL94.59E-63.42E-3
Nerve development4HOXB3, COL25A1, TFAP2A, SLITRK65.32E-63.42E-3
Peptide Transport4EDN3, SLC15A2, FAM3B, TAPBP7.06E-63.97E-3
Anatomical structure regression2LRP5, GLI21.03E-54.86E-3
ERBB2 signaling pathway3ERBB2, GRB7, SHC11.28E-54.86E-3

Candidate gene sets include GO gene sets from the Molecular Signatures Database [44], a set of genes previously reported to harbor differentially methylated CpG sites between SS cases and non-cases (SS DMP genes) [20], and a set of genes previously reported to be differentially expressed between SS cases and healthy controls (SS DE genes) [54]. n = number of overlapping genes; adj. p-value = Benjamini-Hochberg adjusted p-value.

DNA methylation mediates the effect of meQTLs on SS at the MHC

We tested for association between average DMR methylation M-values and SNPs in approximate linkage equilibrium in a ±250kb neighborhood of each DMR, which yielded 20,754 unique DMR-SNP candidate pairs to test. A total of 26 meQTL-DMR associations were identified under the Benjamini-Hochberg adjusted p-value cutoff of 0.05, with one each from chromosomes 3, 11, 12, 16, and two from chromosome 4; the rest were located within the MHC region on chromosome 6 (S3 Table). Fig 2A shows how methylation levels vary by genotype for example meQTL rs9275224. Note that a meQTL can be associated with multiple DMRs, and a DMR can be associated with multiple meQTLs. Down-sampling SNPs at the MHC to achieve comparable SNP densities to that of non-MHC regions still resulted in a higher meQTL discovery rate at the MHC relative to non-MHC regions (see Supplementary Results in S1 Text for more details). Thus, the higher discovery rate at the MHC cannot be explained by higher SNP densities. The distribution of meQTL-DMR distances is concentrated around 160 kb, with an average of 153 kb, which is well within the limit of 250 kb (Fig 2B). Thus, the window size of ±250kb appears sufficient for identifying most cis-meQTLs. While the density plot of the meQTL-DMR distances appears somewhat bimodal, the smaller peak at around 60 kb is most likely an artifact due to small sample size and the smoothing process of a density plot. From S3 Table, there are only 3 meQTL-DMR distances ranging from 75 kb to 80 kb.

MeQTLs associated with SS DMR methylation M-values.
Fig 2

MeQTLs associated with SS DMR methylation M-values.

(A) SNP rs9275224 is a meQTL associated with average M-value of the DMR at genetic positions 32,810,706–32,810,742 (GRCh37) on chromosome 6. See S3 Table for the remaining meQTL-DMR pairs. (B) Density plot of associated and unassociated SNP-DMR pairs by absolute distance. The significance criteria for association is having a Benjamini-Hochberg adjusted p-value (p) ≤ 0.05. While distance is approximately uniformly distributed for unassociated SNP-DMR pairs, the distances of associated SNP-DMR pairs is concentrated around 153 kb. (C) MHC region spanning the HLA-DQA1, HLA-DQB1, and HLA-DQA2 loci with high density of the meQTL-DMR pairs. Each DMR is specified by its chromosome, starting position, and ending position, in GRCh37 genetic coordinates.

Of these 26 meQTL-DMR pairs, the CIT identified 19 with significant evidence supporting the causal mediation model (q-value ≤ 0.05); one pair each was from chromosomes 3, 12, and 16, and the rest were from chromosome 6 (Table 4). At the MHC, the region spanning the HLA-DQA1, HLA-DQB1, and HLA-DQA2 loci contained a high density of DMR-meQTL pairs, with five DMRs and four meQTLs (Fig 2C). In total, the meQTL-DMR pairs from Table 4 represent 12 unique DMRs and 9 unique SNPs. The remainder of the 26 associated meQTL-DMR pairs did not support the causal mediation model, with the three unique DMRs potentially consequences of reverse causation (S3 Table). The remaining 200 of the 215 DMRs discovered were not associated with any nearby SNPs (S3 Table); thus, no evidence of nearby genetic control was detected, and it is still unknown which ones represent potential cases of reverse causation.

Table 4
Top causal inference test results for meQTLs of SS DMRs.
SNP rs IDSNP positionA1A2SS DMRdistancep.citq.cit
rs927522432659878GAchr6:32810706–328107421508281.00E-32.11E-3
rs927522432659878GAchr6:32819921–328201021600431.00E-32.11E-3
rs927522432659878GAchr6:32822911–328231161630331.00E-32.11E-3
rs927522432659878GAchr6:32813084–328133371532061.00E-32.11E-3
rs927522432659878GAchr6:32813448–328135311535701.00E-32.11E-3
rs226103331603591GAchr6:31544694–31544931586601.17E-32.11E-3
rs226103331603591GAchr6:31527920–31528239753521.89E-32.11E-3
rs273498529818662GAchr6:30042980–300429852243181.99E-32.11E-3
rs927537432668526AGchr6:32810706–328107421421803.99E-33.47E-3
rs226103331603591GAchr6:31539973–31539998635935.25E-34.17E-3
rs1333520987860446ACchr16:87636539–876365942238525.78E-34.30E-3
rs302130232623150GAchr6:32810706–328107421875567.84E-34.89E-3
rs302130232623150GAchr6:32819921–328201021967711.47E-29.29E-3
rs285833232681161CAchr6:32819921–328201021387601.63E-21.05E-2
rs1740765924238010AGchr12:24104007–241041151338951.74E-21.35E-2
rs302130232623150GAchr6:32813084–328133371899342.49E-21.64E-2
rs302130232623150GAchr6:32822911–328231161997612.69E-21.74E-2
rs285833232681161CAchr6:32810706–328107421295453.36E-22.14E-2
rs76027985112439220GAchr3:112359488–112359557796633.65E-22.44E-2

All genetic positions are based on GRCh37 coordinates, and DMRs are denoted by the chromosome, start position, and end position. Distance refers to base pair distance between DMR and meQTL. A1 = allele 1; A2 = allele 2; SS DMR = differentially-methylated regions for Sjögren’s syndrome; p.cit = causal inference test p-value; q.cit = permutation-based q-values from the causal inference test.

Utilizing data from a previous genome-wide association study (GWAS) of SS involving 2,131 European individuals [7], we tested the association with SS for all meQTLs supporting the causal mediation model (Table 4), using the updated 2016 ACR/EULAR classification criteria to define cases and controls [32]. European ancestry, sex, and smoking status were adjusted as described in Taylor et al. [7]. Table 5 shows these association results. Of these, five meQTLs at the MHC from 31,603 kb to 32,681 kb reached genome-wide significance (Fig 3 and Table 5). MeQTLs supporting the causal mediation model from chromosomes 3, 12, and 16 are not significantly associated with SS, with p-values not even satisfying the significance level of a single hypothesis test (p-value ≤ 0.05).

Manhattan plot of SS GWAS results at the MHC.
Fig 3

Manhattan plot of SS GWAS results at the MHC.

European SS GWAS results at the MHC from Taylor et al. [7], with mediating meQTL p-values from this study colored in yellow. SS case status was determined based on the 2016 ACR/EULAR classification criteria [32]. The red horizontal line indicates genome-wide significance level of p-value < 5 × 10−8.

Table 5
Association of meQTLs with SS in European GWAS.
SNP rs IDCHRgenetic positionp-valueOR (95% CI)
rs7602798531124392209.02E-11.041 (0.549–1.973)
rs27349856298186621.17E-31.274 (1.101–1.474)
rs22610336316035912.62E-120.617 (0.539–0.707)
rs30213026326231502.21E-292.475 (2.114–2.898)
rs92752246326598785.01E-211.937 (1.688–2.224)
rs92753746326685261.62E-90.598 (0.506–0.707)
rs28583326326811619.02E-252.071 (1.803–2.380)
rs1740765912242380102.63E-10.889 (0.723–1.092)
rs1333520916878604465.63E-11.039 (0.912–1.185)

Association results of meQTLs that support causal mediation model in previous European GWAS for SS [7]. SS case status was determined based on the 2016 ACR/EULAR classification criteria [32]. The genome-wide significance threshold is p-value < 5 × 10−8. CI = confidence interval; CHR = chromosome.

We next examined the extent to which linkage disequilibrium (LD) can explain the association of meQTLs with SS at the MHC. We obtained squared coefficient of correlation statistics (R2) as a measure of LD based on genotypes of European populations from the 1000 Genomes Project (S4 Table) [55]. Fig 4A shows the LD heatmap among the six meQTLs at the MHC, and Fig 4B shows the LD heatmap between the six meQTLs and MHC SNPs that previously demonstrated association with SS in Europeans [7, 8]. These meQTLs are in mild LD with each other (Fig 4A), with a maximum R2 of 0.357 (S4 Table). This is expected, since we pre-selected SNPs in approximate linkage equilibrium before searching for meQTLs. Using multivariate logistic regression modeling and adjusting for European ancestry, sex, and smoking status as described in Taylor et al. [7], we found modest evidence that the meQTLs rs3021302, rs9275224, and rs2858332 exhibit independent effects (p-value ≤ 0.05; Table 6).

LD heatmap for MHC meQTLs supporting the causal mediation model.
Fig 4

LD heatmap for MHC meQTLs supporting the causal mediation model.

Heatmap of the LD measure of R2 statistics, based on European populations from the 1000 Genomes Project [55]. (A) LD among meQTLs and (B) LD between meQTLs and established SS-associated SNPs at the MHC, for Europeans [7, 8].

Table 6
Multivariate logistic regression of SS status against MHC meQTLs.
SNP rs IDCHRgenetic positionp-valueOR (95% CI)
rs27349856298186620.5050.952 (0.823–1.101)
rs22610336316035910.0720.875 (0.756–1.010)
rs30213026326231500.0001.688 (1.413–2.015)
rs92752246326598780.0141.257 (1.048–1.509)
rs92753746326685260.5191.066 (0.878–1.294)
rs28583326326811610.0021.304 (1.102–1.544)

Multivariate logistic regression of SS case status again all MHC MeQTLs supporting the causal mediation model based on genotypes from previous European GWAS [7]. The logistic regression adjusted for European ancestry, sex, and smoking status, following Taylor et al. [7], and SS case status was determined based on the 2016 ACR/EULAR classification criteria [32]. CHR = chromosome; OR = odds ratio; CI = confidence interval.

However, Fig 4B shows that these meQTLs exhibiting independent effects are in stronger LD with some SS SNPs. Considering R2 > 0.50 as reflecting at least modest LD, the meQTL rs2858332 is in relatively strong LD with rs9275572 (R2 = 0.741), which is in the gene regions HLA-DQB1 and HLA-DQA2 [7]. Although meQTL rs9275224 is not in as strong LD with rs9275572 as meQTL rs2858332 (R2 = 0.446), rs9275572 is still the SNP that rs9275224 is in strongest LD with. Lastly, meQTL rs3021302 is in modest LD with rs115575857 and rs3129716 (both R2 = 0.572), which are in the gene region HLA-DQB1 [8]. Based on the LD statistics, these meQTLs likely do not tag the SS SNPs we compared with, but may reflect association of different HLA alleles with SS given modest evidence of independent effects.

Discussion

We investigated the relationship between genetic variation, DNA methylation, and SS in the largest study of LSG, to date. We compared SS cases against symptomatic non-cases, and results show that significant differential methylation in LSG exists and is primarily driven by case status. Results from DMR analysis of LSG are consistent with the general theme of hypomethylation previously reported in a much smaller sample [20], providing strong support for these findings. We applied the CIT to genotype and DNA methylation data from the same individuals, and conclude that genetic control of differential methylation is a risk factor for SS, especially at the MHC.

General hypomethylation of genomic regions involved in the immune response in LSG remains one of the most significant findings (Table 2), with many DMRs located in the MHC region. Many of these hypomethylated genes have biological roles closely related to SS pathophysiology. For example, dendritic cells in the glands produce high levels of interferons [1], and PSMB8 and PSMB9, whose expressions are induced by gamma interferon, were both hypomethylated in SS cases compared to non-cases. Genes PSMB8 and PSMB9 encode catalytic subunits of the immunoproteasome that is involved in peptide presentation on the surface of antigen-presenting cells [56]. Hypomethylation of PSMB9 may have a causal role in increasing expression levels in SS [54]. Previous studies have suggested that differential DNA methylation in SS could be controlled by B cells infiltrating the LSG, which in turn may affect the expression of inflammatory genes [13, 14].

Although the overall picture for hypermethylated regions in cases is less clear than that for hypomethylated regions, gene set enrichment analysis (GSEA) suggested some degree of neurological involvement in SS (Table 3). Peripheral neuropathy is the most common neurological complication in SS, but involvement of the central nervous system has also been observed, including cognitive disorder meningitis and optic neuritis [57]. The pathological mechanism by which SS leads to damage of the nervous system is not well-established, but it is thought to involve inflammatory infiltration of the dorsal root ganglia [1, 57]. Although dryness of mouth resulting from reduced saliva flow is negatively correlated with glandular innervation in general [58], another study found no differences in innervation pattern between SS cases and healthy controls, and found both groups to have functional acinar receptor systems [59].

Evidence of allele-specific methylation over extended genomic regions has been previously reported and can vary by tissue, developmental stage, and ancestry [47]. Here, we identified DMRs in SS whose methylation levels appear to be under genetic control using the CIT. Twelve of the 215 DMRs demonstrated evidence of causal dependence on neighboring genotypes, with the majority residing in the MHC. Furthermore, 9 of the 16 DMRs in the MHC region showed evidence of mediation, supporting a general theme of genetic control of DNA methylation at the MHC. Majority of these MHC meQTLs involved in this causal mediation relationship are significantly associated with SS based on a previous GWAS for Europeans [7]. Our analysis shows modest evidence that some of these meQTLs have independent effects on SS risk, and that these meQTLs are in modest LD with some, but not all, established risk alleles in the HLA gene regions [7, 8]. However, larger studies are likely needed to determine whether the association of HLA alleles with SS is also mediated by DNA methylation, due to the polymorphic nature of HLA alleles. Using a combined genetic and epigenetics approach, our results support a role for functional relevance of previously established SS-associated SNPs at the MHC.

Findings that DNA methylation can mediate genetic risk conferred by the MHC, has been identified in a number of other autoimmune diseases. Differential methylation encompassing exon 2 of HLA-DRB1*15:01 has been shown in monocytes to the mediate effect of the HLA-DRB1*15:01 allele on its expression and risk of multiple sclerosis [60]. In psoriasis, the majority of reported meQTLs also reside in the MHC, although target CpG loci were located more than 500 kb away from their corresponding meQTLs. Using the CIT, 11 SNP-CpG pairs were found to exhibit a methylation-mediated relationship with psoriasis in skin tissue [61]. In rheumatoid arthritis, DNA methylation levels were found to mediate genetic risk within the MHC in whole blood [62]. Our results add to the growing evidence that the MHC likely confers genetic risk of disease in a more complex way than previously understood.

DNA methylation is currently thought to be influenced by genetic factors, age, environment and lifestyle, and tissue-type [6366]. By identifying CpG sites that mediate nearby genetic risk for SS, CpG sites whose methylation levels may be altered by disease status, and CpG sites showing no evidence of nearby genetic control, we provide information that could be relevant for the potential therapeutic application of site-specific epigenetic editing for SS [67]. For example, it may be important to avoid targeting CpG sites whose methylation levels are altered by disease status. Currently, epigenetic therapy has been most effective for hematological malignancies but not in solid tumors [26, 28]. Epigenetic therapeutic approaches for other disease conditions remain in development, facing challenges such as lack of knowledge of effective target biomarkers, insufficient drug specificity, and dose-limiting toxicities [22, 2830]. Nevertheless, autoimmune diseases have been cited as a promising area for the application of epigenetic therapies [22].

Since the LSG consists of a mixture of epithelial and inflammatory cells, a limitation of our study of LSG tissue is that it is unclear to what extent the observed methylation differences are explained by differences in cellular composition [30]. Without a reference dataset of methylation measurements on separated cell types from LSG, it is difficult to adjust for cell type heterogeneity using reference-based methods, which has been shown to perform better than reference-free methods [68]. Reference-free correction methods have been shown to vary widely in performance and lead to false positives in epigenome-wide association studies. A similar study of LSG has observed differentially-methylated cell differentiation markers as evidence for an increased proportion of immune cells [20], although we did not replicate these findings in our DMR analysis. Evidence of cell-specific differential methylation has been observed for salivary gland epithelial cells in SS [21]. Further investigation is needed to establish the relative contributions of cell-specific differential methylation and cellular heterogeneity to differential methylation in LSG tissue.

In conclusion, we report evidence of genetic control of differential DNA methylation in SS by performing a formal CIT on genotype and DNA methylation datasets obtained from 131 individuals with LSG tissue and genotype data. We extended and replicated previous hypomethylation findings observed in many immune-related genes in SS cases, particularly those at the MHC. Our results also support the potential involvement of neurological processes in SS. By performing CIT on DMRs and their nearby meQTLs, we found that many DMRs associated with nearby risk alleles at the MHC were also mediators of SS risk. Interestingly, we did not observe as strong an evidence of mediation for SS DMRs at non-MHC locations. Through a formal study of the causal mediation relationship between genetic variation, DNA methylation, and SS case status, our findings provide essential information for the development of site-specific methylation-modifying therapies for SS.

References

XMariette, LACriswell. Primary Sjögren’s Syndrome. CGSolomon, editor. N Engl J Med. 2018 Mar 8;378(10):9319. doi: 10.1056/NEJMcp1702514

JJNair, TPSingh. Sjogren’s syndrome: Review of the aetiology, Pathophysiology & Potential therapeutic interventions. J Clin Exp Dent. 2017 Apr;9(4):e5849. doi: 10.4317/jced.53605

AIgoe, RHScofield. Autoimmunity and infection in Sjögren’s syndrome. Curr Opin Rheumatol. 2013 Jul;25(4):4807. doi: 10.1097/BOR.0b013e32836200d2

DKaraiskos, CPMavragani, SMakaroni, EZinzaras, MVoulgarelis, ARabavilas, et al. Stress, coping strategies and social support in patients with primary Sjögren’s syndrome prior to disease onset: a retrospective case-control study. Ann Rheum Dis. 2009 Jan;68(1):406. doi: 10.1136/ard.2007.084152

SFerraro, NOrona, LVillalón, PHNSaldiva, DRTasat, ABerra. Air particulate matter exacerbates lung response on Sjögren’s Syndrome animals. Exp Toxicol Pathol. 2015 Feb;67(2):12531. doi: 10.1016/j.etp.2014.10.007

BFreundlich, CAltman, NSnadorfi, MGreenberg, JTomaszewski. A profile of symptomatic patients with silicone breast implants: a Sjögrens-like syndrome. Semin Arthritis Rheum. 1994 Aug;24(1 Suppl 1):4453. doi: 10.1016/0049-0172(94)90109-0

KETaylor, QWong, DMLevine, CMcHugh, CLaurie, KDoheny, et al. Genome-Wide Association Analysis Reveals Genetic Heterogeneity of Sjögren’s Syndrome According to Ancestry. Arthritis Rheumatol (Hoboken, NJ). 2017;69(6):1294305. doi: 10.1002/art.40040

CJLessard, HLi, IAdrianto, JAIce, ARasmussen, KMGrundahl, et al. Variants at multiple loci implicated in both innate and adaptive immune responses are associated with Sjögren’s syndrome. Nat Genet. 2013 Nov;45(11):128494. doi: 10.1038/ng.2792

FZhang, YLi, KZhang, HChen, FSun, JXu, et al. A genome-wide association study in Han Chinese identifies a susceptibility locus for primary Sjögren’s syndrome at 7q11.23. Vol. 45, Nature Genetics. 2013. p. 13617. doi: 10.1038/ng.2779

10 

HYin, MZhao, XWu, FGao, YLuo, LMa, et al. Hypomethylation and overexpression of CD70 (TNFSF7) in CD4+ T cells of patients with primary Sjögren’s syndrome. J Dermatol Sci. 2010 Sep 1;59(3):198203. doi: 10.1016/j.jdermsci.2010.06.011

11 

XYu, GLiang, HYin, ONgalamika, FLi, MZhao, et al. DNA hypermethylation leads to lower FOXP3 expression in CD4+ T cells of patients with primary Sjögren’s syndrome. Vol. 148, Clinical Immunology. Clin Immunol; 2013. p. 2547. doi: 10.1016/j.clim.2013.05.005

12 

NGestermann, MKoutero, RBelkhir, JTost, XMariette, CMiceli-Richard. Methylation profile of the promoter region of IRF5 in primary Sjögren’s syndrome. Eur Cytokine Netw. 2012 Oct;23(4):16672. doi: 10.1684/ecn.2012.0316

13 

YThabet, CLe Dantec, IGhedira, VDevauchelle, DCornec, JOPers, et al. Epigenetic dysregulation in salivary glands from patients with primary Sjögren’s syndrome may be ascribed to infiltrating B cells. J Autoimmun. 2013 Mar 1;41:17581. doi: 10.1016/j.jaut.2013.02.002

14 

ODKonsta, CLe Dantec, ACharras, DCornec, EKKapsogeorgou, AGTzioufas, et al. Defective DNA methylation in salivary gland epithelial acini from patients with Sjögren’s syndrome is associated with SSB gene expression, anti-SSB/LA detection, and lymphocyte infiltration. J Autoimmun. 2016 Apr 1;68:308. doi: 10.1016/j.jaut.2015.12.002

15 

CPMavragani, ANezos, ISagalovskiy, SSeshan, KAKirou, MKCrow. Defective regulation of L1 endogenous retroelements in primary Sjogren’s syndrome and systemic lupus erythematosus: Role of methylating enzymes. J Autoimmun. 2018 Mar 1;88:7582. doi: 10.1016/j.jaut.2017.10.004

16 

SGonzález, SAguilera, CAlliende, UUrzúa, AFGQuest, LHerrera, et al. Alterations in type I hemidesmosome components suggestive of epigenetic control in the salivary glands of patients with Sjögren’s syndrome. Arthritis Rheum. 2011 Apr 1;63(4):110615. doi: 10.1002/art.30212

17 

NAltorok, PCoit, THughes, KAKoelsch, DUStone, ARasmussen, et al. Genome-wide DNA methylation patterns in naive cd4+ t cells from patients with primary sjögren’s syndrome. Arthritis Rheumatol. 2014;66(3):7319. doi: 10.1002/art.38264

18 

CMiceli-Richard, SFWang-Renault, SBoudaoud, FBusato, CLallemand, KBethune, et al. Overlap between differentially methylated DNA regions in blood B lymphocytes and genetic at-risk loci in primary Sjögren’s syndrome. Ann Rheum Dis. 2016 May 1;75(5):93340. doi: 10.1136/annrheumdis-2014-206998

19 

JImgenberg-Kreuz, JKSandling, JCAlmlöf, JNordlund, LSignér, KBNorheim, et al. Genome-wide DNA methylation analysis in multiple tissues in primary Sjögren’s syndrome reveals regulatory effects at interferon-induced genes. Ann Rheum Dis. 2016 Nov 1;75(11):202936. doi: 10.1136/annrheumdis-2015-208659

20 

MBCole, HQuach, DQuach, ABaker, KETaylor, LFBarcellos, et al. Epigenetic Signatures of Salivary Gland Inflammation in Sjögren’s Syndrome. Arthritis Rheumatol (Hoboken, NJ). 2016;68(12):293644. doi: 10.1002/art.39792

21 

ACharras, ODKonsta, CLe Dantec, CBagacean, EKKapsogeorgou, AGTzioufas, et al. Cell-specific epigenome-wide DNA methylation profile in long-term cultured minor salivary gland epithelial cells from patients with Sjögren’s syndrome. Ann Rheum Dis. 2017 Mar 1;76(3):6258. doi: 10.1136/annrheumdis-2016-210167

22 

RYung, TMau. Potential of epigenetic therapies in non-cancerous conditions. Vol. 5, Frontiers in Genetics. Frontiers Media S.A.; 2014. doi: 10.3389/fgene.2014.00438

23 

AFMcRae, JEPowell, AKHenders, LBowdler, GHemani, SShah, et al. Contribution of genetic variation to transgenerational inheritance of DNA methylation. Genome Biol. 2014 May 29;15(5):R73. doi: 10.1186/gb-2014-15-5-r73

24 

JMillstein, BZhang, JZhu, EESchadt. Disentangling molecular relationships with a causal inference test. BMC Genet. 2009 Dec 27;10(1):23. doi: 10.1186/1471-2156-10-23

25 

PAJones, JPJIssa, SBaylin. Targeting the cancer epigenome for therapy. Vol. 17, Nature Reviews Genetics. Nature Publishing Group; 2016. p. 63041. doi: 10.1038/nrg.2016.93

26 

PAJones, HOhtani, AChakravarthy, DDDe Carvalho. Epigenetic therapy in immune-oncology. Vol. 19, Nature Reviews Cancer. Nature Publishing Group; 2019. p. 15161. doi: 10.1038/s41568-019-0109-9

27 

NAhuja, ARSharma, SBBaylin. Epigenetic therapeutics: A new weapon in the war against cancer. Annu Rev Med. 2016 Jan 14;67:7389. doi: 10.1146/annurev-med-111314-035900

28 

YCheng, CHe, MWang, XMa, FMo, SYang, et al. Targeting epigenetic regulators for cancer therapy: Mechanisms and advances in clinical trials. Signal Transduct Target Ther. 2019 Dec 17;4(1):139. doi: 10.1038/s41392-019-0095-0

29 

AMajchrzak-Celińska, WBaer-Dubowska. Pharmacoepigenetics: Basic Principles for Personalized Medicine. In: Pharmacoepigenetics. Elsevier; 2019. p. 10112.

30 

JImgenberg-Kreuz, JKSandling, GNordmark. Epigenetic alterations in primary Sjögren’s syndrome—an overview. Clin Immunol. 2018 Nov 1;196:1220. doi: 10.1016/j.clim.2018.04.004

31 

ASMalladi, KESack, SCShiboski, CHShiboski, ANBaer, RBanushree, et al. Primary Sjögren’s syndrome as a systemic disease: a study of participants enrolled in an international Sjögren’s syndrome registry. Arthritis Care Res (Hoboken). 2012 Jun;64(6):9118.

32 

CHShiboski, SCShiboski, RSeror, LACriswell, MLabetoulle, TMLietman, et al. 2016 American College of Rheumatology/European League Against Rheumatism Classification Criteria for Primary Sjögren’s Syndrome: A Consensus and Data-Driven Methodology Involving Three International Patient Cohorts. Arthritis Rheumatol (Hoboken, NJ). 2017;69(1):3545.

33 

MJAryee, AEJaffe, HCorrada-Bravo, CLadd-Acosta, APFeinberg, KDHansen, et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014 May 15;30(10):13639. doi: 10.1093/bioinformatics/btu049

34 

TJTriche, DJWeisenberger, DVan Den Berg, PWLaird, KDSiegmund, KDSiegmund. Low-level processing of Illumina Infinium DNA Methylation BeadArrays. Nucleic Acids Res. 2013 Apr;41(7):e90. doi: 10.1093/nar/gkt090

35 

NTouleimat, JTost. Complete pipeline for Infinium ® Human Methylation 450K BeadChip data processing using subset quantile normalization for accurate DNA methylation estimation. Epigenomics. 2012 Jun;4(3):32541. doi: 10.2217/epi.12.21

36 

DLMcCartney, RMWalker, SWMorris, AMMcIntosh, DJPorteous, KLEvans. Identification of polymorphic and off-target probe binding sites on the Illumina Infinium MethylationEPIC BeadChip. Genomics data. 2016 Sep;9:224. doi: 10.1016/j.gdata.2016.05.012

37 

YChen, MLemire, SChoufani, DTButcher, DGrafodatskaya, BWZanke, et al. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013 Feb 27;8(2):2039. doi: 10.4161/epi.23470

38 

WEJohnson, CLi, ARabinovic. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007 Jan 1;8(1):11827. doi: 10.1093/biostatistics/kxj037

39 

JTLeek, WEJohnson, HSParker, AEJaffe, JDStorey. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012 Mar 15;28(6):8823. doi: 10.1093/bioinformatics/bts034

40 

HMCann. A Human Genome Diversity Cell Line Panel. Science (80-). 2002 Apr 12;296(5566):261b262. doi: 10.1126/science.296.5566.261b

41 

CCChang, CCChow, LCTellier, SVattikuti, SMPurcell, JJLee. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015 Dec 25;4(1):7. doi: 10.1186/s13742-015-0047-8

42 

AEJaffe, PMurakami, HLee, JTLeek, MDFallin, APFeinberg, et al. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int J Epidemiol. 2012 Feb;41(1):2009. doi: 10.1093/ije/dyr238

43 

PAJones. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012 Jul 29;13(7):48492. doi: 10.1038/nrg3230

44 

ALiberzon, CBirger, HThorvaldsdóttir, MGhandi, JPMesirov, PTamayo. The Molecular Signatures Database Hallmark Gene Set Collection. Cell Syst. 2015 Dec 23;1(6):41725. doi: 10.1016/j.cels.2015.12.004

45 

YBenjamini, YHochberg. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Vol. 57, Journal of the Royal Statistical Society. Series B (Methodological). WileyRoyal Statistical Society; 1995. p. 289300.

46 

GHong, WZhang, HLi, XShen, ZGuo. Separate enrichment analysis of pathways for up- and downregulated genes. J R Soc Interface. 2014 Mar 6;11(92):20130950. doi: 10.1098/rsif.2013.0950

47 

AKSmith, VKilaru, MKocak, LMAlmli, KBMercer, KJRessler, et al. Methylation quantitative trait loci (meQTLs) are consistently detected across ancestry, developmental stage, and tissue type. BMC Genomics. 2014 Feb 21;15(1):145.

48 

JRWagner, SBusche, BGe, TKwan, TPastinen, MBlanchette. The relationship between DNA methylation, genetic and expression inter-individual variation in untransformed human fibroblasts. Genome Biol. 2014 Feb 20;15(2):R37. doi: 10.1186/gb-2014-15-2-r37

49 

JTBell, AAPai, JKPickrell, DJGaffney, RPique-Regi, JFDegner, et al. DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines. Genome Biol. 2011 Jan 20;12(1):R10. doi: 10.1186/gb-2011-12-1-r10

50 

AFMcRae, REMarioni, SShah, JYang, JEPowell, SEHarris, et al. Identification of 55,000 Replicated DNA Methylation QTL. Sci Rep. 2018 Dec 1;8(1). doi: 10.1038/s41598-018-35871-w

51 

JMillstein, GKChen, VBreton C. cit: hypothesis testing software for mediation analysis in genomic applications. Bioinformatics. 2016 Aug 1;32(15):23645. doi: 10.1093/bioinformatics/btw135

52 

JMillstein, DVolfson. Computationally efficient permutation-based confidence interval estimation for tail-area FDR. Front Genet. 2013;4:179. doi: 10.3389/fgene.2013.00179

53 

PCruz-Tapias, ARojas-Villarraga, SMaier-Moore, J-MAnaya. HLA and Sjögren’s syndrome susceptibility. A meta-analysis of worldwide studies. Autoimmun Rev. 2012 Feb;11(4):2817. doi: 10.1016/j.autrev.2011.10.002

54 

TORHjelmervik, KPetersen, IJonassen, RJonsson, AIBolstad. Gene expression profiling of minor salivary glands clearly distinguishes primary Sjögren’s syndrome patients from healthy control subjects. Arthritis Rheum. 2005 May;52(5):153444. doi: 10.1002/art.21006

55 

MJMachiela, SJChanock. LDlink: A web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics. 2015 Dec 18;31(21):35557. doi: 10.1093/bioinformatics/btv402

56 

DAFerrington, DSGregerson. Immunoproteasomes. In: Progress in molecular biology and translational science. 2012. p. 75112.

57 

JPerzyńska-Mazan, MMaślińska, RGasik. Neurological manifestations of primary Sjögren’s syndrome. Reumatologia. 2018;56(2):99105. doi: 10.5114/reum.2018.75521

58 

CESørensen, JOLarsen, JReibel, MLauritzen, ELMortensen, MOsler, et al. Associations between xerostomia, histopathological alterations, and autonomic innervation of labial salivary glands in men in late midlife. Exp Gerontol. 2014 Sep 1;57:2117. doi: 10.1016/j.exger.2014.06.004

59 

AMPedersen, SDissing, JFahrenkrug, JHannibal, JReibel, BNauntofte. Innervation pattern and Ca2+ signalling in labial salivary glands of healthy individuals and patients with primary Sjogren’s syndrome (pSS). J Oral Pathol Med. 2000 Mar 1;29(3):97109. doi: 10.1034/j.1600-0714.2000.290301.x

60 

LKular, YLiu, SRuhrmann, GZheleznyakova, FMarabita, DGomez-Cabrero, et al. DNA methylation as a mediator of HLA-DRB1*15:01 and a protective variant in multiple sclerosis. Nat Commun. 2018 Dec 19;9(1):2397. doi: 10.1038/s41467-018-04732-5

61 

FZhou, CShen, JXu, JGao, XZheng, RKo, et al. Epigenome-wide association data implicates DNA methylation-mediated genetic risk in psoriasis. Clin Epigenetics. 2016 Dec 5;8(1):131. doi: 10.1186/s13148-016-0297-z

62 

YLiu, MJAryee, LPadyukov, MDFallin, EHesselberg, ARunarsson, et al. Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis. Nat Biotechnol. 2013 Feb;31(2):1427. doi: 10.1038/nbt.2487

63 

CLing, TRönn. Epigenetic adaptation to regular exercise in humans. Drug Discov Today. 2014 Jul;19(7):10158. doi: 10.1016/j.drudis.2014.03.006

64 

CHVinkers, ALKalafateli, BPRutten, MJKas, ZKaminsky, JDTurner, et al. Traumatic stress and human DNA methylation: a critical review. Epigenomics. 2015 Jun;7(4):593608. doi: 10.2217/epi.15.11

65 

SHorvath. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14(10):R115. doi: 10.1186/gb-2013-14-10-r115

66 

S-KChu, H-CYang. Interethnic DNA methylation difference and its implications in pharmacoepigenetics. Epigenomics. 2017 Nov;9(11):143754. doi: 10.2217/epi-2017-0046

67 

SHStricker, AKöferle, SBeck. From profiles to function in epigenomics. Vol. 18, Nature Reviews Genetics. Nature Publishing Group; 2016. p. 5166. doi: 10.1038/nrg.2016.138

68 

KMcGregor, SBernatsky, IColmegna, MHudson, TPastinen, ALabbe, et al. An evaluation of methods correcting for cell-type heterogeneity in DNA methylation studies. Genome Biol. 2016 Dec 3;17(1):84. doi: 10.1186/s13059-016-0935-y