PLoS ONE
Home DACH1 mutation frequency in endometrial cancer is associated with high tumor mutation burden
<i>DACH1</i> mutation frequency in endometrial cancer is associated with high tumor mutation burden
DACH1 mutation frequency in endometrial cancer is associated with high tumor mutation burden

Competing Interests: The employment of Dr. Oliver Hampton by M2GEN does not alter our adherence to PLOS ONE policies on sharing data and materials.

‡ These authors also contributed equally to this work.

Article Type: research-article Article History
Abstract

Objective

DACH1 is a transcriptional repressor and tumor suppressor gene frequently mutated in melanoma, bladder, and prostate cancer. Loss of DACH1 expression is associated with poor prognostic features and reduced overall survival in uterine cancer. In this study, we utilized the Oncology Research Information Exchange Network (ORIEN) Avatar database to determine the frequency of DACH1 mutations in patients with endometrial cancer in our Kentucky population.

Methods

We obtained clinical and genomic data for 65 patients with endometrial cancer from the Markey Cancer Center (MCC). We examined the clinical attributes of the cancers by DACH1 status by comparing whole-exome sequencing (WES), RNA Sequencing (RNASeq), microsatellite instability (MSI), and tumor mutational burden (TMB).

Results

Kentucky women with endometrial cancer had an increased frequency of DACH1 mutations (12/65 patients, 18.5%) compared to The Cancer Genome Atlas (TCGA) endometrial cancer population (25/586 patients, 3.8%) with p-value = 1.04E-05. DACH1 mutations were associated with increased tumor mutation count in both TCGA (median 65 vs. 8972, p-value = 7.35E-09) and our Kentucky population (490 vs. 2160, p-value = 6.0E-04). DACH1 mutated patients have a higher tumor mutation burden compared to DACH1 wild-type (24 vs. 6.02, p-value = 4.29E-05). DACH1 mutations showed significant gene co-occurrence patterns with POLE, MLH1, and PMS2. DACH1 mutations were not associated with an increase in microsatellite instability at MCC (MSI-H) (p-value = 0.1342).

Conclusions

DACH1 mutations are prevalent in Kentucky patients with endometrial cancer. These mutations are associated with high tumor mutational burden and co-occur with genome destabilizing gene mutations. These findings suggest DACH1 may be a candidate biomarker for future trials with immunotherapy, particularly in endometrial cancers.

Riggs,Lin,Wang,Piecoro,Miller,Hampton,Rao,Ueland,Kolesar,and Deb: DACH1 mutation frequency in endometrial cancer is associated with high tumor mutation burden

Introduction

Uterine cancer is increasing in incidence and mortality in the United States. In 2020, an estimated 65,620 women will be diagnosed with endometrial cancer, making it the fourth most common female cancer, with an estimated 12,590 deaths [1]. Kentucky is an above-average risk region, with 29 new cases per 100,000 women compared to 27.6 per 100,000 women nationally [2]. The mean five-year survival rate for endometrial cancer is 81.2%, with more than 67% of patients diagnosed at an early stage. The survival rate decreases to 69% for locally metastatic and 16% for widely metastatic disease [3]. The treatment paradigm for endometrial cancer has been unchanged for some time. Current first-line therapy includes a combination of surgery, carboplatin and paclitaxel chemotherapy, and radiation depending on the stage and risk.

To molecularly categorize endometrial cancers, Kandoth and colleagues performed an integrated genomic, transcriptomic, and proteomic analysis of 373 endometrial carcinomas. They were able to classify these cancers into POLE ultramutated, microsatellite instability hypermutated (MSI-H), copy number low, and copy number high [4]. Uterine serous cancers and approximately 25% of high-grade endometrioid tumors were in the copy number high group and had frequent TP53 mutations and poor prognosis. The majority of endometrioid cancers were in the copy number low group and were TP53 wild-type with PTEN and PIK3CA commonly mutated. This group included low-grade endometrioid (60%), high-grade endometrioid endometrial cancers (8.7%), serous carcinomas (2.3%), and mixed-histology carcinomas (25%). The MSI-H group accounted for 28.6% of low-grade and 54.3% of high-grade endometrioid cancers studied. Frequently co-mutated genes included PTEN and PIK3CA. The POLE ultramutated groups accounted for 6.4% of low-grade and 17.4% of high-grade cancers and had improved progression-free survival. In addition, POLE-mutant microsatellite stable (MSS) tumors have been associated with high tumor mutation burden (TMB) in endometrial cancer [4].

The Drosophila dachshund (dac) gene (DACH1) was initially identified as critical to Drosophila eye development and is an essential member of the retinal determination gene network, responsible for normal organogenesis [5]. DACH1 is a known tumor suppressor gene in breast, colon, and renal cancer and frequently mutated in melanoma, bladder, and prostate cancer. Most well-studied in breast carcinoma, DACH1 is expressed in normal mammary epithelium with significantly reduced expression found in mastopathy, ductal carcinoma, and lobular carcinoma in situ [6]. DACH1 expression was reduced or lost in invasive breast cancer patients with a poor prognosis [7], with its expression inversely related to tumor diameter, stage, and nodal metastasis, and directly associated with increased survival time [6]. While less studied in uterine cancer, nearly all normal endometrial samples show nuclear expression of DACH1, with DACH1 expression lost in more than half of endometrial cancers. Loss of DACH1 expression is associated with poor prognostic factors, including higher FIGO surgical stage, positive peritoneal cytology, and lymph node positivity in endometrial cancer [8].

DACH1 is a transcriptional co-repressor that functions as part of a DNA binding complex and regulates gene transcription. DACH1 is also an endogenous regulator of cyclin D1, with loss of DACH1 resulting in increased cyclin D1, which is required for the G1/S transition. Nuclear expression of cyclin D1 is rarely observed in healthy endometrial tissue, while the majority of uterine cancers express cyclin D1, and cyclin D1 expression predicts poor survival [8, 9].

Our primary objective was to determine the frequency of DACH1 mutations in our population and their association with other tumor suppressor genes, tumor mutation burden, microsatellite instability, and clinical risk factors.

Methods

Study design

ORIEN is a cancer precision medicine initiative initially developed by the Moffitt Cancer Center [10, 11]. It has evolved into a consortium research network of nineteen U.S. cancer centers, including the MCC, the only NCI designated cancer center in Kentucky, who joined the alliance in December 2017. All ORIEN alliance members utilize a standard protocol: Total Cancer Care (TCC)®. TCC is a prospective cohort study with whole-exome tumor sequencing, RNA sequencing, germline sequencing, and lifetime follow up. Nationally, over 250,000 participants have enrolled. As part of this study, participants agree to have their clinical data followed over time, to undergo germline and tumor somatic sequencing, and to be contacted in the future if an appropriate clinical trial becomes available [12]. At MCC, a buccal swab is used at enrollment for germline testing.

The Kentucky Cancer Registry (KCR) is a population-based central cancer registry for the Commonwealth of Kentucky. All cases of cancer diagnosed and/or treated in Kentucky are required to be reported to the KCR by state statute (KRS 214.556). Data elements reported to the registry consist of demographic and clinical information including genetic data. The final data set consolidated the linked demographic data with the genomic data from the enrolled patients pulled for the study. The Cancer Research Informatics Shared Resource Facility (CRI SRF) served as the honest broker and assisted with the distribution of clinical and genetic data stored in the KCR. A contractual agreement was previously established through M2GEN ORIEN/Total Cancer Care and the Kentucky Cancer Registry to allow data sharing. At the time of data receipt, all data was fully anonymized prior to analysis and the authors did not receive any special privileges in accessing the data.

Study population

Patients presenting to Markey Cancer Center between December 1, 2018, and May 31, 2019, were invited to enroll in the parent trial, Total Cancer Care prospective cohort study. The study was offered to all eligible patients, and subjects were recruited during routine clinic visits. Treating physicians informed the patient about the study, and designated study coordinators assisted with enrollment and the formal consent process. Eligible patients were 18 years of age or older and had a diagnosis of cancer. A total of 65 patients with endometrial cancer enrolled at MCC were included in the analysis. To be included, each patient had to be ≥18 years of age, enrolled in TCC, and have both somatic and germline tumor whole exome sequencing results available. Patients were assigned a TCC ID number and otherwise de-identified by the Kentucky Cancer Registry prior to analysis. The TCC ID number allowed the linkage of clinical data with genomic data through the CRI SRF honest broker. The study was conducted in accordance with the U.S. Common Rule, approved by the University of Kentucky Institutional Review Board (IRB #50767), and the investigators had obtained informed written consent from all subjects enrolling in TCC. Demographic variables, such as age at diagnosis, body mass index, race, and geographic location, were extracted from the linked KCR data and included in the analysis. Age at diagnosis was dichotomized to less than or equal to 64 and 65 and older. Clinical variables included cancer type, AJCC stage at diagnosis, and clinical comorbidities. Based on the frequency of cancer cases, types of cancer at diagnosis were grouped into several broader categories. Cancer stages were dichotomized to early (stage I-II) and late-stage (stage III-IV). The county of current residence and patient zip code were used to define Appalachia status as Appalachian or non-Appalachian. RNA sequencing was available for 52 patients. Tumor mutation burden and microsatellite instability data were available for 55 patients. The TCGA PanCancer Atlas dataset was used for comparison through cBioPortal.org, utilizing endometrial cancer and carcinosarcoma subgroups for analysis totaling 586 patients, which can be accessed here: https://www.cbioportal.org/results/cancerTypesSummary?cancer_study_list=ucec_tcga_pan_can_atlas_2018%2Cucs_tcga_pan_can_atlas_2018&Z_SCORE_THRESHOLD=2.0&RPPA_SCORE_THRESHOLD=2.0&data_priority=0&profileFilter=0&case_set_id=all&gene_list=DACH1&geneset_list=%20&tab_index=tab_visualize&Action=Submit.

Sequencing methods (RUO)

ORIEN Avatar specimens undergo DNA and RNA extraction. For frozen and optimal cutting temperature (OCT) tissue DNA extraction, Qiagen QIASymphony DNA purification is performed, generating 213 bp average insert size. For frozen and OCT tissue RNA extraction, Qiagen RNAeasy plus mini kit is performed, generating 216 base pair (bp) average insert size. For formalin-fixed paraffin-embedded (FFPE) tissue, Covaris Ultrasonication FFPE DNA/RNA kit is utilized to extract both DNA and RNA, generating 165 bp average insert size. Preparation of M2GEN Whole Exome Sequencing (WES) libraries involves hybrid capture using an enhanced Integrated DNA Technology (IDT) WES kit (38.7 Mb) with additional custom-designed probes for double coverage of 440 cancer genes. Library hybridization is performed at either single or 8-plex and sequenced on an Illumina NovaSeq 6000 instrument generating 100 bp paired reads. WES is performed on tumor/normal matched samples with the normal covered at 100X and the tumor covered at 300X (additional 440 cancer genes covered at 600X) depth. We performed both tumor/normal concordance and gender identity quality control checks. The minimum threshold for hybrid selection is >80% of bases with >20X fold coverage; M2GEN WES libraries typically meet or exceed 90% of bases with >50X fold coverage for tumor and 90% of bases with >30X fold coverage for normal samples.M2GEN RNA sequencing (RNAseq) is performed using the Illumina TruSeq RNA Exome with single library hybridization, cDNA synthesis, library preparation, sequencing (100 bp paired reads at Hudson Alpha, 150 bp paired reads at Fulgent) to a coverage of 100M total reads / 50M paired reads.

Bioinformatics

The bioinformatics pipeline was developed by M2Gen (Fig 1). The raw reads of WES and RNAseq data were saved in a fastq format. The adapter sequences were first trimmed by Bbduk software using paired-end read option. Reads were then mapped to the human genome using BWA-MEM with paired-end read option, which follows an alignment algorithm that aligns sequence reads or long query sequences against a large reference genome. GRCh38/hg38 human genome reference sequencing and GenCode build version 32 were used as the reference genome. We performed normalization, expression modeling, and difference testing using edgeR [13, 14]. The normalization of RNAseq counts was conducted with calcNormFactors function in edgeR, which normalizes the library sizes by finding a set of scaling factors to minimize the log-fold changes between the samples for most genes. The default method, a trimmed mean of M-values (TMM), was utilized to compute the scale factors between each pair of samples to provide the effective library size in the downstream RNAseq analysis. Next, the common dispersion was estimated from the housekeeping genes and libraries as a single group with function estimateDisp, which was controlled in differential analysis. Further, the expression model and differential expression analysis were completed using the functions glmFit, and statistics were controlled for multiple comparison using false discovery rate, which is defined as the expected proportion of false positives among all significant tests.

Flow diagram illustrating the bioinformatics pipeline used by M2Gen.
Fig 1

Flow diagram illustrating the bioinformatics pipeline used by M2Gen.

Republished from M2Gen (https://www.m2gen.com/) under a CC BY license, with permission from Oliver A. Hampton, original copyright 2019.

Statistics and analysis

We performed a descriptive analysis of clinical variables and disease-related prognostic factors, including age, BMI, tumor grade, tumor stage, recurrence, tobacco usage, Appalachian status, and histology tumor subtypes. We compared categorical and continuous variables using the Chi-square test or Fisher’s exact test and Student T-test, respectively. We performed a comparative analysis between ORIEN and TCGA datasets utilizing the Fisher’s exact test. We calculated co-occurrence using the Fisher’s exact test. Tumor mutation burden (TMB) and microsatellite instability (MSI) were calculated with the Wilcoxon rank sum test. Of the 65 patients, 55 had microsatellite instability and TMB data available. TMB was calculated using the count of non-synonymous somatic mutations (single nucleotide variants and small insertions/deletions, including missense, stop gain, stop loss and start loss mutations) per mega-case in the coding region of the specific capture kit [15]. Percent of MSI was calculated using MSISensor2 (https://github.com/niu-lab/msisensor2, [16]) and dichotomized to MSI-H versus MSS with a threshold of MSI-H ≥ 20% [17]. We used the Cox model for survival analyses [18], and corrected for dichotomized stage (high stage- III/IV, low stage- I/II), grade, and curves compared via the log-rank test. We performed all statistical analyses with R 3.6.3. The network analysis was performed using Qiagen’s Ingenuity Pathway Analysis (IPA) system for core analysis of the RNA sequencing data and overlaid with the Global Molecular Network Overlay in the IPA knowledge base. Using IPA, canonical pathways, disease and functions, and gene networks were categorized based on differential gene expression.

Results

Out of 65 patients, 12 had DACH1 gene mutations as shown in Table 1. The mean age and BMI were 62 years and 36.4 kg/m2, respectively. The majority of patients were stage I (47.7%), but a significant portion were stage III and IV (40%). Grade 3 disease (50.8%) was common, and 30.7% had grade 1 disease. Disease recurrence was present in 24.6% of the patients. Approximately 60% of the patients were from the Appalachian region, which mirrors the percentage of patients treated from the Appalachian region by MCC as a whole. The cell types were distributed as follows: 57% endometrioid, 23.1% high grade serous, 9.2% carcinosarcoma, and 7.7% mixed cell adenocarcinoma. One patient each had clear cell carcinoma and malignant mesonephroma. Twelve of the 65 patients had at least one deleterious mutation in the DACH1 gene by whole exome sequencing. Eleven patients had point mutations, and six patients had more than a single mutation in the gene (range 1–7 point mutations). Two patients had a one-base-pair insertion, and one of these patients had a total of seven mutations with six point mutations and one base-pair-insertion (Fig 2 and Table 2).

Lollipop plot of DACH1 gene mutations in the 65 patients in the Markey Cancer Center.
Fig 2

Lollipop plot of DACH1 gene mutations in the 65 patients in the Markey Cancer Center.

Missense mutations are green. Truncating mutations are black and include nonsense, nonstop, frameshift deletions, frameshift insertions, and splice site mutations. All other types of mutations are included as pink (excluding fusion and inframe deletion or insertions).

Table 1
Demographics of the Markey Cancer Center population.
Patient Demographics
Age62.14 ± 10.79
BMI36.38 ± 10.22
Race
 Caucasian59 (90.8%)
 African American5 (7.7%)
 Asian1 (1.5%)
Grade
 120 (30.7%)
 212 (18.5%)
 333 (50.8%)
Clinical stage
 I31 (47.7%)
 II5 (7.7%)
 III14 (21.5%)
 IV12 (18.5%)
 Unknown3 (4.6%)
Clinical stage, early vs. late
 Early stage (I-II)36 (55.4%)
 Late stage (III-IV)26 (40.0%)
 Unknown3 (4.6%)
Tobacco use (smoking)
 No43 (66.2%)
 Yes21 (32.3%)
 Unknown1 (1.5%)
Documented recurrence
 No42 (64.6%)
 Yes16 (24.6%)
 Unknown7 (10.8%)
Appalachian status
 Non-Appalachian25 (38.5%)
 Appalachian40 (61.5%)
Histologic subtype
 Endometrioid37 (57.0%)
 Mixed cell adenocarcinoma5 (7.7%)
 Carcinosarcoma6 (9.2%)
 Serous15 (23.1%)
 Other: Clear cell, malignant mesonephroma2 (3.1%)
DACH1
 Mutated12 (18.5%)
 Wild-type53 (81.5%)
Table 2
Description of DACH1 gene mutations in the 12 DACH1 mutated patients at Markey Cancer Center.
PatientProtein ChangeMutation TypeVariant TypeStart PosEnd PosRefVar
13'UTRSNP7143989671439896AC
23'UTRSNP7143971071439710CA
2A624TMissense_MutationSNP7147916971479169CT
2T429PMissense_MutationSNP7157285471572854TG
2A6TMissense_MutationSNP7186675471866754CT
3*479*IntronSNP7155980471559804GA
4*672*IntronSNP7147564171475641TA
4E619*Nonsense_MutationSNP7147918471479184CA
4*479*IntronSNP7155720971557209AT
5*575*IntronSNP7147938671479386AT
63'UTRSNP7143855971438559GT
6N368KMissense_MutationSNP7163057871630578GC
6*322*IntronSNP7167493771674937TG
7P502LMissense_MutationSNP7155708971557089GA
7P500 =SilentSNP7155709471557094AG
83'UTRSNP7143849371438493GT
83'UTRSNP7143924171439241CT
8I20TMissense_MutationSNP7186671171866711AG
93'UTRSNP7143812771438127CT
93'UTRSNP7143847971438479AC
93'UTRSNP7143889771438897CT
9*695*IntronSNP7146467471464674GT
9P479LMissense_MutationSNP7155715871557158GA
9*433*IntronSNP7157276671572766GT
9*376*IntronINS7157337571573375TTA
103'UTRSNP7143931571439315GA
11*575*IntronSNP7147937171479371TA
12X376_spliceSplice_RegionINS7157301671573016TTA

There were no significant associations between DACH1 mutation and clinical covariates, including grade, stage, or histology. Age is approaching significance with a p-value of 0.053, with DACH1 mutations trending towards occurring more frequently in older patients, as shown in Table 3. Though not reaching statistical significance, 7/12 (58%) of the patients with DACH1 mutations also had high-grade disease, compared to 26/53 (49%) of those who were wild-type. There was no statistical difference seen in Appalachian versus non-Appalachian patients nor the histologic subtype. DACH1 gene mutations were not statistically associated with a microsatellite unstable genome in either the MCC cohort (p-value = 0.1342) or in the TCGA analysis through cBioPortal using the MSIsensor Score (p-value = 0.142) as shown in Table 4. Other commonly occurring driver mutations associated with microsatellite instability and genome instability were frequent in the DACH1 patients.

Table 3
Covariate analysis of DACH1 mutated patients compared to wild-type.
CovariatesDACH1
WT N = 53M N = 12P-valuea
Age61.15 ± 11.2566.50 ± 7.350.05267
BMI36.91±10.6534.03±8.030.3047
Race0.3746
 Caucasian49 (75.4%)10 (15.4%)
 African American3 (4.6%)2 (3.1%)
 Asian1 (1.5%)0 (0.0%)
Grade0.9121
 117 (32%)3 (25%)
 210 (18.8%)2 (16.7%)
 326 (49%)7 (58.3%)
Clinical stage0.4144
 I23 (43.4%)8 (66.7%)
 II5 (9.4%)0 (0.0%)
 III13 (24.5%)1 (8.3%)
 IV10 (18.9%)2 (16.7%)
 Unknown2 (3.8%)1 (8.3%)
Clinical stage, early vs. late0.3316
 Early stage (I-II)28 (52.8%)8 (66.7%)
 Late stage (III-IV)23 (43.4%)3 (25%)
 Unknown2 (3.8%)1 (8.3%)
Tobacco use (smoking)1
 No35 (66%)8 (66%)
 Yes17 (32%)4 (6.2%)
 Unknown1 (1.9%)0 (0%)
Documented recurrence1
 No33 (50.8%)9 (13.8%)
 Yes13 (20.0%)3 (4.6%)
 Unknown7 (13.2%)0 (0%)
Appalachian status1
 Non-Appalachian20 (30.8%)5 (7.7%)
 Appalachian33 (50.8%)7 (10.8%)
Histologic subtypeb0.9473
 Endometrioid31 (47.7%)6 (9.23%)
 Mixed cell adenocarcinoma4 (6.2%)1 (1.5%)
 Carcinosarcoma5 (7.7%)1 (1.5%)
 Serous12 (18.5%)3 (4.6%)

a P-values were calculated using the Fisher’s exact test for categorical variables and using the student t-test for continuous variables.

b Single cases each of malignant mesonephroma (DACH1 mutated) and clear cell carcinoma (DACH1 wild-type) excluded from covariate analysis.

Table 4
Genomic covariate analysis of DACH1 mutated patients compared to wild-type at MCC (n = 65 patients).
CovariatesDACH1
WT N = 53M N = 12Co-Occurrence P-valuea
PTEN (n = 40)32/408/400.94
PIK3CA (n = 35)26/359/350.191
TP53 (n = 31)23/318/310.255
POLE Mutation (n = 28)17/2811/285.78E-04
MLH1 Mutation (n = 22)12/2210/222.39E-04
MSH2 Mutation (n = 25)17/258/250.046
MSH6 Mutation (n = 28)19/289/280.264
PMS2 Mutation (n = 12)8/124/123.67E-07
Microsatellite Instability (n = 55)0.1342b
 MSI-High (n = 7)4/73/7
 Microsatellite Stable (n = 48)40/488/48
Tumor Mutation Burden6.0224.04.29E-05c

a P-values were calculated using the Fisher’s exact test for categorical variables and using the student t-test for continuous variables. The co-occurrence analysis using Fisher’s exact test was performed to determine whether DACH1 mutations are mutually exclusive or tend to co-occur with other gene mutations.

b MSI data was only available for 11/12 DACH1 mutated patients and 44/53 of the DACH1 wild type patients.

c Tumor mutation burden p-value was calculated using the Wilcoxon rank sum test.

We compared frequencies of commonly found driver mutations including PTEN, PIK3CA, TP53, POLE, and the Lynch Syndrome-associated genes (MLH1, MSH2, MSH6, PMS2) using the enrichment test to determine whether the frequency of gene mutations in the MCC cohort was similar to that of the TCGA PanCancer Atlas (PCA) endometrial carcinoma and carcinosarcoma datasets. We identified 3.8% (25/586 patients) DACH1 gene mutations in uterine cancer somatic samples in the TCGA PCA of endometrial and carcinosarcoma patients, which was significantly lower than the 18.5% (12/65, p = 1.05E-05) seen in the MCC patient cohort. In addition, MLH1 (22/65, 33.85%, p = 2.63E-13), MSH2 (25/65, 38.46%, p = 2.87E-07), MSH6 (28/65, 43.1%, p = 1.01E-11), PMS2 (12/65, 18.5%, p = 3.79E-03), and POLE (28/65, 43.08%, 5.39E-08) mutations were more common in the MCC cohort than the TCGA PCA, while mutation frequency in PTEN, PIK3CA, and TP53 were not statistically difference in the two datasets. We performed a co-occurrence analysis using the Fisher’s exact test to determine whether DACH1 mutations are mutually exclusive or tend to co-occur with other gene mutations, with a significant co-occurrence pattern noted between DACH1 and two of the four Lynch Syndrome associated genes, MLH1 (p = 2.39E-04) and PMS2 (p = 3.67E-07) as well as DACH1 and POLE (p = 5.78E-08), shown in Table 5. Neither MSH2 (p = 0.0628) nor MSH6 (p = 0.264) co-occurred with DACH1 in the MCC cohort, although this may be related to small sample size. The co-occurrence of DACH1 with MLH1, PMS2, and POLE were replicated in the TCGA PCA dataset adjusting for FDR using the Benjamini-Hockberg procedure through cBioPortal, with a significant co-occurrence pattern also found with DACH1 and MSH2, and DACH1 and MSH6 (q-value <0.001).

Table 5
Comparison of mutation frequency in endometrial cancer and carcinosarcoma between MCC and TCGA PanCancer Atlas (PCA).
TCGA FrequencyMCC FrequencyP-valuea
MCC vs TCGA
PTEN62%62%1
363/58640/65
PIK3CA52%53.9%0.885
305/58635/65
TP5342%47.7%0.452
246/58631/65
DACH13.8%18.5%1.05E-05
25/58612/65
MLH16%33.9%2.63E-13
35/58622/65
MSH28.9%38.5%2.87E-07
52/58625/65
MSH611.1%43.1%1.01E-11
65/58628/65
PMS27.2%18.5%3.79E-03
42/58612/65
POLE14.7%43.1%5.39E-08
86/58628/65
MSI-H14.5%12.7%0.719b
85/5867/55
MSS85.5%87.3%0.719c
501/58648/55

a A comparison analysis between the MCC and TCGA PCA datasets was conducted utilizing the Fisher’s exact test to determine significance.

b Fisher’s exact test was used to compare MCC and TCGA PCA datasets for microsatellite status with no difference found.

Given that DACH1 plays a complex role in transcriptional repression, we performed a gene expression analysis using the RNA sequencing data in Qiagen’s Ingenuity Pathway Analysis (IPA) to evaluate differences in expression and pathways to better understand the mechanism of action of DACH1. Of the 65 patients, 52 had RNA sequencing data available. A total of 2,599 genes were significantly differentially expressed (FDR values < 0.05) between the DACH1 mutated patients and wild-type (Fig 3), with a large proportion of these being upregulated in the setting of mutated DACH1. The top ten upregulated and downregulated differentially expressed genes comparing DACH1 mutated patients to wild-type are displayed in Table 6a and 6b. In the top ten upregulated genes, many are involved in transcription regulation and cell signaling. In contrast, the top downregulated genes were part of the immune system response and the transcription of ER/PR receptors. We performed an in-depth pathway analysis utilizing the RNA sequencing data to determine cell-specific pathways impacted by DACH1 mutations, shown in Table 7 and Fig 4A and 4B. Of note, the most significant pathways involved were the breast cancer development pathway by a log value of 3.45, and catecholamine and transcriptional regulation pathways each by a log value of 3.19.

Differential gene expression between DACH1 mutated patients and wild-type.
Fig 3

Differential gene expression between DACH1 mutated patients and wild-type.

A marked increase in significantly upregulated genes is noted.

Differential expression analysis of DACH1 mutated versus wild-type patients.
Fig 4

Differential expression analysis of DACH1 mutated versus wild-type patients.

(A) Pathway analysis of genes differentially expressed between DACH1 mutated patients and wild-type. (B) Network analysis of the pathways differentially expressed between DACH1 mutated patients and wild-type.

Table 6
(A) Upregulated pathways in DACH1 mutated patients compared to wild-type. (B) Downregulated pathways in DACH1 mutated patients compared with wild-type.
Top significant differentially expressed genes comparing DACH1 mutated patients to wild-type and their associated pathways.
Upregulated
GeneslogFCPValueQvalue (FDR)Function
1. F8A214.384821.50E-060.000206Vesicle trafficking
2. CRH10.605653.31E-135.69E-10Cell-signaling
3. HOXD127.8614971.45E-096.52E-07Transcription regulation
4. GFRA47.2811256.03E-081.43E-05Growth factor signaling
5. ELK2AP7.1468136.04E-092.14E-06Transcription regulation
6. CTAG26.9929952.35E-060.00029Testis antigen
7. DPP46.6325472.66E-158.76E-12T-cell activation
8. MAGEB16.6163773.95E-050.00278Testis antigen
9. EVX26.1739531.83E-050.001525Transcription regulation
10. PNMA55.8119263.29E-181.95E-14Immune response
Downregulated
GeneslogFCP-ValueQ-value (FDR)Function
1. CST4-9.666493.78E-050.002704Protease inhibitor
2. PAGE1-8.333770.0016110.040795Tumor antigen
3. DEFA1-7.278660.0003710.014545Immune System
4. GP2-7.190040.00130.035124Immune System
5. INSL4-6.877980.0012150.033501Cell-signaling
6. KRTAP4-4-6.828820.0299180.25257Development
7. MYBPC1-6.126660.0017530.043191Cell Structure
8. SRARP-6.123160.000630.021442Transcription of E2/PR receptors
9. DMBT1-6.001113.99E-060.000454Immune System
10. LFT-5.977980.0003340.013478Immune System
Table 7
Pathway analysis of genes differentially expressed between DACH1 mutated patients and wild-type.
Ingenuity Canonical Pathways-log(p-value)
1. Breast Cancer Regulation by STMN13.45
2. Catecholamine Biosynthesis3.19
3. Transcriptional Regulatory Network in Embryonic Stem Cells3.19
4. Serotonin and Melatonin Biosynthesis2.53
5. Methionine Salvage II (Mammalian)2.06
6. FXR/RXR Activation2.01
7. LPS/IL-1 Mediated Inhibition of RXR Function1.9
8. Stearate Biosynthesis I (Animals)1.86
9. Complement System1.83
10. Thyroid Hormone Metabolism II (via Conjugation and/or Degradation)1.77

To better assess clinical applicability, we converted the pathway analysis to a heat map with analysis by disease and organ system (Fig 5A). The size of the box denotes the -log(p-value). The color of the boxes correlates with the z-score with the intensity of blue representing z ≤ 0 and orange z ≥ 0. Pathways related to cellular injury and cancer predominated, suggesting DACH1 mutations lead to disease processes resulting in cellular injury and cancer (Fig 5A). Specific to cancer, several statistically significant p-values with z-score ≥ 0 indicated overexpression, including pelvic cancer (-log[p-value] = 5.439, z-score = 0.391), genital cancer (-log[p-value] = 3.967, z-score = 0.391), and quantity of malignant tumor (-log[p-value] = 2.502, z-score = 1.254) (Fig 5B). DACH1 was most associated with the cancer forming pathway followed closely by organismal injury and abnormalities, diseases of the endocrine system, and the gastrointestinal system (Fig 5C). Multiple pathways involved in cell cycle control, signaling, and development were also significantly differentially expressed between DACH1 mutated and wild-type as assessed by RNA sequencing.

Gene network analysis between DACH1 mutated and wild-type patients.
Fig 5

Gene network analysis between DACH1 mutated and wild-type patients.

(A) A heatmap of the network analysis of genes differentially expressed between DACH1 mutated patients and wild-type by organ and disease system pathways is shown. The size of the box denotes the -log(p-value). The color of the boxes correlates with the z-score with the intensity of blue representing z ≤ 0 and orange z ≥ 0. Those with the highest z-scores and the greatest p-values include head and neck cancer, head and neck tumor, cancer of secretory structure, and neoplasia of cells. (B) Heatmap of network analysis separated by cancer disease process is shown. This shows an increased z-score in secretory cancers (-log[p-value] = 31.281, z-score = 0.547), head and neck cancers (-log[p-value] = 33.233, z-score = 1.463), abdominal adenocarcinoma (-log[p-value] = 16.306, z-score = 0.328), pelvic cancer (-log[p-value] = 5.439, z-score = 0.391), hyperplasia of the intestinal tract (-log[p-value] = 2.818, z-score = 0.239), prostate cancer (-log[p-value] = 4.883, z-score = 0.291), genital cancer (-log[p-value] = 3.967, z-score = 0.391), and quantity of malignant tumor (-log[p-value] = 2.502, z-score = 1.254). (C) Disease system pathways involved with DACH1 mutations are shown through network analysis of genes differentially expressed between DACH1 mutated patients and wild-type.

We performed network mapping using IPA with Global Network Overlay to determine the interplay of DACH1 on genes found to be significantly altered between DACH1 mutated patients and wild-type (Fig 6A). We note upregulated expression in red, with color intensity corresponding to increased significance. Downregulated expression is notated in green with color intensity again corresponding to increased significance. Network mapping results were filtered by statistically significant p-values with expression fold changes ≥ 0 (Fig 6B). This revealed an interplay between three genes including ASCL1 (3.070 expression fold change, p-value = 2.08E-03), SOX2 (expression fold change 3.470, p-value = 1.84E-02), and LHX1 (4.090 expression fold change, p-value = 1.58E-02) when comparing DACH1 mutated patients to wild-type. Each of these three genes is involved in transcription regulation and cell cycle control. The top five network functions included differentiation of chromaffin cells (p-value = 3.97E-05), activation of DNA endogenous promoter (p-value = 6.05E-05), transcription of DNA (p-value = 2.64E-04), fusion of bone (p-value = 2.74E-04), and formation of the forebrain n (p-value = 5.0E-04).

Network analysis of genes differentially expressed between DACH1 mutated patients and wild-type.
Fig 6

Network analysis of genes differentially expressed between DACH1 mutated patients and wild-type.

(A) Network mapping by Qiagen IPA with Global Network Overlay is shown to compare DACH1 mutated patients versus wild-type. (B) Network mapping with statistically significant different gene expression was shown using global network overlay with significance seen in SOX2, ASCL1, and LHX1. Upregulated expression is shown in red, with color intensity corresponding to increased significance. Downregulated expression is notated in green with color intensity again corresponding to increased significance.

Given DACH1’s control of transcription regulation and the cell cycle, we then evaluated its effect on tumor mutation burden and microsatellite instability. We first compared tumor mutation counts in the TCGA PCA endometrial cancer and carcinosarcoma cohorts between DACH1 mutated patients and wild-type and found clinically significant differences with a median of 8972 in DACH1 mutants vs. 65 in DACH1 wild-type (p-value = 7.35e-09). We repeated this analysis in our MCC population with a median of 2160 in DACH1 mutated patients vs. 490 in DACH1 wild-type (p-value = 6E-04). We then compared TMB between DACH1 wild-type and mutated patients in the MCC cohort. Of the 65 patients, 55 had microsatellite instability and TMB data available. As expected, given the marked difference in tumor mutation counts in both TCGA PCA and MCC, DACH1 wild-type patients had a median TMB of 6.02 and DACH1 mutated patients had a significantly higher median TMB of 24.0 (p-value = 4.29E-05) compared by the Wilcoxon rank sum test (Fig 7). Given the co-occurrence of mutations found in DACH1 with MLH1, POLE, and PMS2, we then compared microsatellite instability between DACH1 mutated patients and DACH1 wild-type in the MCC cohort using the chi-square test, and no significance was found between the two groups (p-value = 0.2659) as shown in Table 8, with 3/12 DACH1 mutated patients being MSI-H, and 8/12 being MSS, and one patient with MSI unavailable.

Tumor mutation burden in DACH1 mutated patients compared to DACH1 wild-type as a continuous variable.
Fig 7

Tumor mutation burden in DACH1 mutated patients compared to DACH1 wild-type as a continuous variable.

The median is statistically different between the two (p-value = 4.288 E-05). TMB high is defined as ≥ 20 mutations per megabase.

Table 8
Relationship of gene mutations with microsatellite instability at MCC.
GenesMSI StatusP-valuea
HighLow
DACH10.1342
Mutant38
Wild-Type440
POLE0.001208
Mutant716
Wild-Type032
MLH10.04116
Mutant514
Wild-Type224
MSH21
Mutant319
Wild-Type429
MSH60.1025
Mutant517
Wild-Type231
PMS20.01596
Mutant46
Wild-Type342

Microsatellite instability in DACH1 mutated patients compared to wild-type. MSI was calculated by MSISensor2, which assumes MSI-H ≥ 20%. Correlation of DACH1 mutations with microsatellite instability status was not significant (p-value = 0.1342).

a P-values calculated using the Fisher’s exact test.

We assessed overall survival analysis between MCC DACH1 mutated patients and wild-type given prior studies suggesting an increase in stage, lymph node status, and metastasis with reduced DACH1 expression using the Cox model, correcting for stage and grade. No significant difference was noted between DACH1 wild-type and mutated patients (p-value = 0.803), with 80% of patients still alive at five years in both groups (Fig 8A). The result was similar in the TCGA PCA dataset with no difference in overall survival (p-value = 0.196) (Fig 8B). In TCGA PCA, at five years, 90.48% of the patients with the DACH1 mutation were still alive, while 70.47% of patients who were DACH1 wild-type group were still alive.

Overall survival (months) between DACH1 mutated patients versus wild-type.
Fig 8

Overall survival (months) between DACH1 mutated patients versus wild-type.

(A). Overall survival (months) between DACH1 mutated patients and wild-type at MCC, corrected for stage and grade, were found to be similar withno significant difference (p-value = 0.803) though limited outcome data in DACH1 mutated patients. (B). Overall survival (months) between DACH1 mutated patients and wild-type was also evaluated in the TCGA PCA (p-value = 0.196) patients with no significant difference.

Discussion

DACH1 plays a critical role in cell cycle control and acts as a tumor suppressor gene in breast cancer [6]. Our network analysis further supports this role in endometrial cancer by revealing three essential upregulated genes and their pathways with significant differences in expression between DACH1 mutated patients and wild-type, ASCL1, SOX2, LHX1. ASCL1 and SOX2 are important transcription factors involved in cell cycle regulation via interaction with Cyclin D [1921], and LHX1 is a DNA-binding transcription factor. We anticipate that DACH1 mutations result in loss of transcriptional repression of these regulators resulting in uncontrolled cell cycle progression in endometrial cancer, similar to DACH1’s control of the cell cycle via cyclin D1 in breast cancer [8, 22].

POLE is a tumor suppressor gene involved in nucleotide excision repair, which is mutated in 7–15% of endometrial cancers [19] and is associated with a good prognosis and a high TMB. In our population, we identified a high frequency of both DACH1 and POLE mutations when compared to the TCGA PCA, and that POLE and DACH1 are significantly co-mutated. Notably, mutation frequencies of other common driver genes, PTEN, PIK3CA, and TP53, were consistent between the two datasets. In population studies, approximately 25% of endometrial tumors exhibit MSI-H status by IHC. Of these, the majority (~85%) are explained by hypermethylation of the MLH1 promoter, approximately 5% by germline mutations in Lynch-associated genes (MLH1, MSH2, MSH6, PMS2) and the remaining 10% by somatic mutations, unusual germline mutations not covered by clinical panels, or POLE mutations [23]. In our Kentucky population, mutation frequency in MLH1, MSH2, MSH6, and PMS2 was approximately 34%, 38%, 43%, and 19%, respectively, all significantly higher than reported by TCGA and with significant co-occurrence between DACH1 and MLH1 and PMS2, despite similar rates seen in TP53, PTEN, and PIK3CA. Since co-occurrence of mutations typically occurs among functionally related genes that work together to promote tumorigenesis [24], we hypothesize that DACH1 and DNA repair genes like POLE partner to halt the cell cycle and repair DNA and that concurrent mutation of these tumor suppressors drives oncogenesis in a subset of patients with endometrial cancer. We also suggest that this sub-type of endometrial cancer, while present in the TCGA PCA, is significantly overrepresented in Kentucky patients with endometrial cancer, likely related to the high prevalence of Lynch syndrome associated with colon cancer in the region [25].

TMB ≥ 10 received accelerated FDA approval as an indication for treatment with pembrolizumab in malignant solid tumors as studies indicate an improvement in progression-free and overall survival rates with increasing TMB, independent of PD-L1 status [26]. At MCC, DACH1 mutated patients had a median TMB of 24.0, significantly higher than DACH1 wild-type patients. In a subgroup analysis of the KEYNOTE-158 trial, patients with TMB ≥ 13 had an objective response rate of 37% with an ongoing response ≥ 12 months in 58% and ≥ 24 months in 50% [27, 28]. With TMB ≥ 10, the objective response rate was 29% with the same duration of response. This subgroup included patients both with intact and deficient MMR mechanisms, suggesting TMB alone as an additional indication for treatment with pembrolizumab [28]. Given the median TMB of 24.0 in the DACH1 population, DACH1 could serve as a future biomarker for increased TMB and possible treatment indication with checkpoint immunotherapy such as pembrolizumab.

A strength of this investigation is that a significant proportion of our study population has high grade or recurrent disease, which is often a limitation in prior genomic and proteomic analyses of endometrial cancer patients. In addition, the availability of paired RNA Seq and whole exome sequencing allowed for an extensive assessment of differentially altered pathways in those with DACH1 mutations. Finally, to our knowledge, we are the first to identify the significant co-occurrence of DACH1 mutations with both POLE and Lynch associated genes and an over-representation of these mutations in our Kentucky population. There are also several study limitations to consider. The study sample size is small, with 65 patients total and 12 with DACH1 mutations, and may be too small to detect possible clinical variables associated with DACH1 gene mutations. We also compared our cohort to the TCGA PCA, potentially introducing inconsistencies in sequencing and bioinformatics processing. However, given our conservative variant calling and including only known deleterious mutations, we are biasing towards under-calling variants. Clinical characteristics between the TCGA and our population also varied, with more patients with recurrent endometrial cancer in TCGA, making up only 24.6% of the MCC cohort. Nevertheless, the mutation frequency of PTEN, PIK3CA, and TP53 was similar between the TCGA and MCC cohorts. In addition, approximately half of included patients had high-grade disease, making them less representative of the uterine cancer population as a whole, but similar to those evaluated by TCGA. Finally, the majority of literature related to DACH1 in endometrial cancer is at the protein expression level, and the relationship to DACH1 mutation and protein expression is currently unknown.

Conclusion

Kentucky has both a high incidence and mortality from endometrial cancer. Compared to the rest of the U.S., Kentucky’s population is unique in its genomic and socioeconomic make-up. In part, DACH1 mutations and enrichments in other co-occurring pathogenic genes may explain these differences. DACH1 could provide a novel therapeutic target for immunotherapy in this ultrasensitive group of endometrial cancers with increased tumor mutation burden.

References

National Cancer Institute: Surveillance, Epidemiology, and End Results Program (SEER). SEER Cancer Stat Facts: Uterine Cancer. [cited 2020 July 7]. https://seer.cancer.gov/statfacts/html/corp.html.

U.S. Cancer Statistics Working Group. U.S. Cancer Statistics Data Visualizations Tool, based on 2019 submission data (1999–2017): U.S. Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute. [cited 2020 June]. www.cdc.gov/cancer/dataviz.

DALevine; The Cancer Genome Atlas Research Network Group. Integrated genomic characterization of endometrial carcinoma. Nature. 2013;497(7447):6773. 10.1038/nature12113

GMardon, NMSolomon, GMRubin. Dachshund encodes a nuclear protein required for normal eye and leg development in Drosophila. Development. 1994;120(12):347386.

KWu, ALi, MRao, MLiu, VDailey, YYang, et al DACH1 is a cell fate determination factor that inhibits cyclin D1 and breast tumor growth. Mol Cell Biol. 2006;26(19):711629. 10.1128/MCB.00268-06

FZhao, MWang, SLi, XBai, HBi, YLiu, et al DACH1 inhibits SNAI1-mediated epithelial-mesenchymal transition and represses breast carcinoma metastasis. Oncogenesis. 2015;4:e143 10.1038/oncsis.2015.3

FNan, QLu, JZhou, LCheng, VMPopov, SWei, et al Altered expression of DACH1 and cyclin D1 in endometrial cancer. Cancer Biol Ther. 2009;8(16):15349. 10.4161/cbt.8.16.8963

MNKhabaz, ASAbdelrahman, NSButt, BAl-Maghrabi, JAl-Maghrabi. Cyclin D1 is significantly associated with stage of tumor and predicts poor survival in endometrial carcinoma patients. Ann Diagn Pathol. 2017;30:4751. 10.1016/j.anndiagpath.2017.04.006

10 

MACaligiuri, WSDalton, LRodriguez, TSellers, CLWillman. Orien: reshaping cancer research and treatment. Oncol Issues. 2016; 31(3):6266.

11 

DAFenstermacher, RMWenham, DERollison, WSDalton. Implementing personalized medicine in a cancer center. Cancer J. 2011;17(6):52836. 10.1097/PPO.0b013e318238216e

12 

WSDalton, DSullivan, JEcsedy, MACaligiuri. Patient Enrichment for Precision-Based Cancer Clinical Trials: Using Prospective Cohort Surveillance as an Approach to Improve Clinical Trials. Clin Pharmacol Ther. 2018;104(1):236. 10.1002/cpt.1051

13 

MDRobinson, DJMcCarthy, GKSmyth. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):13940. 10.1093/bioinformatics/btp616

14 

DJMcCarthy, YChen, GKSmyth. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012;40(10):428897. 10.1093/nar/gks042

15 

BMelendez, CVan Campenhout, SRorive, MRemmelink, ISalmon, ND’Haene. Methods of measurement for tumor mutational burden in tumor tissue. Transl Lung Cancer Res. 2018;7(6):6617. 10.21037/tlcr.2018.08.02

16 

BNiu, KYe, QZhang, CLu, MXie, MDMcLellan, et al MSIsensor: microsatellite instability detection using paired tumor-normal sequence data. Bioinformatics. 2014;30(7):10156. 10.1093/bioinformatics/btt755

17 

EAKautto, RBonneville, JMiya, LYu, MAKrook, JWReeser, et al Performance evaluation for rapid detection of pan-cancer microsatellite instability with MANTIS. Oncotarget. 2017;8(5):745263. 10.18632/oncotarget.13918

18 

ELKaplan, PMeier. Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association. 1958;53(282):45781.

19 

RBHufnagel, ANRiesenberg, MQuinn, JAtBrzezinski, TGlaser, NLBrown. Heterochronic misexpression of Ascl1 in the Atoh7 retinal cell lineage blocks cell cycle exit. Mol Cell Neurosci. 2013;54:10820. 10.1016/j.mcn.2013.02.004

20 

EPacary, RAzzarelli, FGuillemot. Rnd3 coordinates early steps of cortical neurogenesis through actin-dependent and -independent mechanisms. Nat Commun. 2013;4:1635 10.1038/ncomms2614

21 

MSwistowska, PGil-Kulik, AKrzyzanowski, TBielecki, MCzop, AKwasniewska, et al Potential Effect of SOX2 on the Cell Cycle of Wharton’s Jelly Stem Cells (WJSCs). Oxid Med Cell Longev. 2019;2019:5084689 10.1155/2019/5084689

22 

VMPopov, JZhou, LAShirley, JQuong, WSYeow, JAWright, et al The cell fate determination factor DACH1 is expressed in estrogen receptor-alpha-positive breast cancer and represses estrogen receptor-alpha signaling. Cancer Res. 2009;69(14):575260. 10.1158/0008-5472.CAN-08-3992

23 

JLDillon, JLGonzalez, LDeMars, KJBloch, LJTafe. Universal screening for Lynch syndrome in endometrial cancers: frequency of germline mutations and identification of patients with Lynch-like syndrome. Hum Pathol. 2017;70:1218. 10.1016/j.humpath.2017.10.022

24 

QCui. A network of cancer genes with co-occurring and anti-co-occurring mutations. PLoS One. 2010;5(10). 10.1371/journal.pone.0013180

25 

AShankar, MDignan, LSelby, UShankar. Higher Incidence of Early-Onset Colorectal Cancer in Southeastern Appalachian Kentucky Due to Genetic and Epigenetic Characteristics: 277. American Journal of Gastroenterology. 2016;111:S130.

26 

NARizvi, MDHellmann, ASnyder, PKvistborg, VMakarov, JJHavel, et al Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science. 2015;348(6230):1248.

27 

PAOtt, YJBang, DBerton-Rigaud, EElez, MJPishvaian, HSRugo, et al Safety and antitumor activity of pembrolizumab in advanced programmed death ligand 1-positive endometrial cancer: Results from the KEYNOTE-028 study. J Clin Oncol. 2017 8 1;35(22):25352541. 10.1200/JCO.2017.72.5952 Epub 2017 May 10. .

28 

PAOtt, YJBang, DBerton-Rigaud, EElez, MJPishvaian, HSRugo, et al Safety and Antitumor Activity of Pembrolizumab in Advanced Programmed Death Ligand 1–Positive Endometrial Cancer: Results From the KEYNOTE-028 Study, Obstetrical & Gynecological Survey: 1 2018 73 (1): p 2627. 10.1097/01.ogx.0000527579.58363.20