PLoS ONE
Home Prevalence and cardiometabolic correlates of ketohexokinase gene variants among UK Biobank participants
Prevalence and cardiometabolic correlates of ketohexokinase gene variants among UK Biobank participants
Prevalence and cardiometabolic correlates of ketohexokinase gene variants among UK Biobank participants

Competing Interests: The authors have read the journal’s policy and have the following conflicts: JJ, DN, PB, SC, YC and JM are employees of Eli Lilly and own stock in Eli Lilly. Eli Lilly holds a patent (“Disubstituted Pyrazole Compounds as Ketohexokinase Inhibitors”; WO 2020/257171 A1) related to inhibition of the ketohexokinase enzyme which is pertinent to the content of this article and is developing molecules that act by this mechanism. This does not alter our adherence to the PLOS ONE policies on sharing data and materials.

Article Type: research-article Article History
Abstract

Essential fructosuria (EF) is a benign, asymptomatic, autosomal recessive condition caused by loss-of-function variants in the ketohexokinase gene and characterized by intermittent appearance of fructose in the urine. Despite a basic understanding of the genetic and molecular basis of EF, relatively little is known about the long-term clinical consequences of ketohexokinase gene variants. We examined the frequency of ketohexokinase variants in the UK Biobank sample and compared the cardiometabolic profiles of groups of individuals with and without these variants alone or in combination. Study cohorts consisted of groups of participants defined based on the presence of one or more of the five ketohexokinase gene variants tested for in the Affymetrix assays used by the UK Biobank. The rs2304681:G>A (p.Val49Ile) variant was present on more than one-third (36.8%) of chromosomes; other variant alleles were rare (<1%). No participants with the compound heterozygous genotype present in subjects exhibiting the EF phenotype in the literature (Gly40Arg/Ala43Thr) were identified. The rs2304681:G>A (p.Val49Ile), rs41288797 (p.Val188Met), and rs114353144 (p.Val264Ile) variants were more common in white versus non-white participants. Otherwise, few statistically or clinically significant differences were observed after adjustment for multiple comparisons. These findings reinforce the current understanding of EF as a rare, benign, autosomal recessive condition.

Johnston,Nelson,Bhatnagar,Curtis,Chen,MacKrell,and Kuivaniemi: Prevalence and cardiometabolic correlates of ketohexokinase gene variants among UK Biobank participants

Introduction

Essential fructosuria (EF) (OMIM #229800) is an autosomal recessive condition caused by loss-of-function variants in the ketohexokinase (KHK) gene (HGNC:6315) and characterized by intermittent appearance of fructose in the urine [1]. Its prevalence has been estimated to be 1:130,000 [2]. However, given that EF is asymptomatic, presumed benign, and does not require treatment, it may be more prevalent than reported. In seminal work on EF in the 1960’s, Laron identified 50 published cases of EF and noted that 18 of those were of Jewish decent [3], and Froesch commented that the anomaly was “encountered almost exclusively in Jews.” [4] Not surprisingly, given the benign nature of the EF, limited additional population characterization of EF has been performed since that time.

Most of what is known clinically about EF stems from the study of a single family in which variant analysis was performed and affected individuals were found to be compound heterozygotes for two variants resulting in amino acid substitutions of Gly40Arg and Ala43Thr in highly conserved regions of the KHK protein [1, 5, 6]. In contrast, Val49Ile, another common variant in normal Europeans, does not produce fructosuria when present in homozygous state [1]. Two KHK protein isotypes result from alternative splicing of the KHK gene, one with a “central,” predominantly hepatic distribution (i.e., KHK-C) and a second that is more widely distributed (i.e., KHK-A) [7]. Gly40Arg has been shown to result in no KHK activity in either isotype, while Ala43Thr leads to reduced thermostability of both isotypes [7]. The importance of additional variants (e.g., Val188Met and Val264Ile) is unknown.

Despite a basic understanding of the genetic and molecular basis of EF, relatively little is known about the clinical consequences, if any, of carrying KHK gene variants. Given the high concentration of fructose in the typical Western diet, inhibiting the function of KHK and the normal processing of fructose is an intriguing potential target for drug development, and understanding the cardiometabolic characteristics of humans with KHK gene variants may provide insight into the viability of this potential therapeutic approach. Our goal was to examine the frequency of KHK variants in the UK Biobank sample, including those implicated in EF, and to compare the cardiometabolic profiles of groups of individuals with and without these variants alone or in combination.

Materials and methods

Data sources

We used data from the UK Biobank, which recruited more than 500,000 participants out of 9.2 million eligible adults between the ages of 40 and 70 years in the UK who were invited to participate (5.5% response rate) [8]. The study protocol is available online [9] and more details are published elsewhere [10]. During the baseline assessment, participants provided informed consent and completed a questionnaire and interview covering a broad range of sociodemographic characteristics, health-related risk factors and behaviors, and past medical history. Assessment centre staff conducted a variety of physical measurements and collected blood and urine samples that were subsequently used to generate biomarker and genotype information. Information on subsequent death, disease occurrence and other health-related information are collected during follow-up from a variety of sources and systems, including death and cancer registries and general practice and hospital activity records.

This research was conducted using the UK Biobank Resource under Application Number 43833 and is covered by the ethics approval for UK Biobank studies from the NHS National Research Ethics Service (16/NW/0274). Each person who agreed to join the UK Biobank project received a clear assurance that all of their personal information would be held in strict confidence with careful controls, and that identifiable information about them would not be available to anyone outside the UK Biobank.

Study populations

Study cohorts consisted of groups of participants defined based on the presence of one or more of the five KHK gene variants tested for in the Affymetrix assays used by the UK Biobank: rs104893643:G>A (p.Gly40Arg), rs104893644:G>A (p.Ala43Thr), rs2304681:G>A (p.Val49Ile), rs41288797:G>A (p.Val188Met) and rs114353144:G>A (p.Val264Ile). We chose to focus on only directly genotyped, non-synonymous, exonic variants as we felt this would ensure the highest level of confidence in terms of both data quality and pathogenicity. Participants (n = 14,241 [2.8%]) with missing data for all five of these loci we excluded. The corresponding Affymetrix identifiers, reference sequences and allele frequencies are shown in Table 1. Genomic coordinates are in reference to NCBI build GRCh38.p12.

Table 1
Frequency of ketohexokinase variants in UK Biobank.
dbSNP rs IDAffy IDGene Variant*Protein VariantAllele Frequency
rs104893643Affx-80216441NC_000002.12:g.27092357G>Ap.Gly40Arg248 (0.03%)
rs104893644Affx-80216442NC_000002.12:g.27092366G>Ap.Ala43Thr50 (0.01%)
rs2304681Affx-19853162NC_000002.12:g.27092384G>Ap.Val49Ile358,641 (36.78%)
rs41288797Affx-19853206NC_000002.12:g.27097647G>Ap.Val188Met895 (0.09%)
rs114353144Affx-36290456NC_000002.12:g.27099556G>Ap.Val264Ile1,914 (0.20%)

Affy ID = unique Affymetrix SNP identifier; Ala = alanine; Arg = arginine; dbSNP rs ID = SNP identifier from dbSNP; Gly = glycine; Ile = isoleucine; Met = methionine; Thr = threonine; UK = United Kingdom; Val = valine.

*Genomic coordinates are in reference to NCBI build GRCh38.p12.

Denominators range from 482,957 to 487,956 participants and 965,914 to 975,912 alleles.

Given the low prevalence of the variants other than p.Val49Ile, we compared subgroups of participants with each rare variant to those without the corresponding variant as a reference group. Given the high prevalence of p.Val49Ile, we compared groups with no, one and two such variants. Finally, as an exploratory analysis, we compared participants with a p.Val49Ile variant in combination with other variants to those with no variants. This was done to determine, given the autosomal recessive character of EF, whether participants with compound heterozygosity for Val49Ile and other KHK variants might manifest higher rates of cardiometabolic complications. In doing so, we examined separately those variants with known (i.e., Gly40Arg or Ala43Thr) and unknown (i.e., Val188Met or Val264Ile) associations with EF in the literature.

Variables and measures

The UK Biobank contains genotypes for 488,377 participants who were successfully genotyped, 49,950 using the Applied Biosystems UK Biobank Lung Exome Variant Evaluation (BiLEVE) Axiom Array by Affymetrix (now part of Thermo Fisher Scientific) and 438,427 using the closely related Applied Biosystems UK Biobank Axiom Array [11]. Pre-imputation quality control, phasing and imputation of the UK Biobank genetic data have been described elsewhere [11].

We requested and analyzed all variables potentially associated with the cardiometabolic status of participants. These included age, gender, race and ethnicity, and select baseline participant-reported cardiovascular, gastrointestinal, renal, and endocrine medical conditions. We also examined use of certain medications (i.e., cholesterol lowering agents, antihypertensives, insulin), age at myocardial infarction, stroke, and death, and primary and contributory causes of death. We analyzed physical exam and laboratory features related to cardiometabolic health, including body mass index, lipid biomarkers, HbA1c, serum metabolic panel, and liver function tests. For analysis of lipid biomarkers and HbA1c, we excluded participants taking cholesterol-lowering medications and those with diabetes, respectively, as determined by participant self-report on the baseline assessment questionnaire. Finally, abdominal subcutaneous adipose tissue volume, total trunk fat volume and visceral adipose tissue volume, as determined using magnetic resonance imaging, were examined for the subset of participants in the UK Biobank multi-modal imaging substudy [12]. The full list of variables used is shown in Table 3.

Statistical analyses

Pairwise comparisons of characteristics were made between groups with the following variant genotypes and those with corresponding reference sequences: rs104893643:G>A (p.Gly40Arg); rs104893644:G>A (p.Ala43Thr); rs41288797:G>A (p.Val188Met); rs114353144:G>A (p.Val264Ile); [rs2304681:G>A];[rs2304681: = ] (i.e., heterozygous Val49Ile); [rs2304681:G>A];[rs2304681:G>A] (i.e., homozygous Val49Ile); all compound heterozygotes with at least one Gly40Arg or Ala43Thr variant; and all compound heterozygotes with any two variants other than Gly40Arg and Ala43Thr. Demographic comparisons were made using Chi-square tests for categorical variables, (i.e., sex and ethnicity) and Student’s t-tests for continuous variables (i.e., age). All other comparisons were made using separate age- and sex-adjusted logistic/linear regression models for each binary/continuous trait of interest. We were unable to adjust for ethnicity directly due to small sample sizes. Instead, we conducted a sensitivity analysis by repeating all analyses among white participants only and noted instances where results from the sensitivity analysis were discrepant from those from the full cohort in table footnotes.

All p-values for the regression coefficients corresponding to the genetic variant groups were adjusted using the FDR method of Benjamini and Hochberg [13], with the exception of exploratory p-values in the S1 Table, which are not adjusted for multiplicity. The p-values for the sensitivity analysis were adjusted for multiplicity separately. Continuous variables are summarized as mean (standard deviation). All statistical analyses were conducted using SAS 9.4 software (SAS Institute Inc., Cary, NC, USA).

Results

The frequency of KHK missense variant alleles in the UK Biobank sample is shown in Table 1. The Val49Ile variant was observed with a 36.8% frequency, and more than half of participants (60.0%) had at least one such variant. Other variant alleles were rare (<1%) in the studied sample. The joint frequency of KHK missense variants is shown in Table 2. Approximately 13% of participants carried Val49Ile variants on both KHK alleles, resulting in exclusively variant KHK protein (i.e., p.Val49Ile). Other variants in combination occurred in <1% (Val49Ile plus other) and <0.01% (any two other) of participants. No participants with the compound heterozygous Gly40Arg/Ala43Thr genotype (i.e., the combination present in subjects exhibiting the EF phenotype in the literature) were identified. All variants were in Hardy-Weinberg equilibrium, as evidenced by non-significant deviations of observed from expected genotype distributions for each of the variants (data not shown).

Table 2
Frequency of ketohexokinase haplotype combinations in UK Biobank*.
Gly40ArgAla43ThrVal49IleVal188MetVal264Ile
Gly40Arg00207 (0.04%)01 (<0.01%)
Ala43Thr044 (0.01%)01 (<0.01%)
Val49Ile66,155 (13.57%)878 (0.18%)734 (0.15%)
Val188Met1 (<0.01%)2 (<0.01%)
Val264Ile0

Ala = alanine; Arg = arginine; Gly = glycine; Ile = isoleucine; Met = methionine; Thr = threonine; UK = United Kingdom; Val = valine.

*Denominators range from 481,611 to 487,943 participants.

The comparison of demographic characteristics and cardiometabolic risk factors and outcomes for UK Biobank participants with and without rare KHK variants is shown in Table 3. We observed statistical significance between Val188Met (false discovery rate [FDR] p<0.0001) and Val264Ile (FDR p<0.0001) and white ethnicity as compared to participants without those variants. No other statistically significant differences were observed between the cohorts with any of the individual rare variants and those without, nor were any differences observed in the sensitivity analyses including only participants of white ethnicity.

Table 3
Comparison of cardiometabolic profile of UK Biobank participants with and without rare ketohexokinase variants*.
CharacteristicNo Rare Variants (N = 477,686)Gly40Arg Variant (n = 248)Ala43Thr Variant (n = 50)Val188Met Variant (n = 894)Val264Ile Variant (n = 1,914)
Age at recruitment, mean (SD)56.5 (8.1)56.7 (7.7)57.6 (8.3)56.1 (8.2)56.3 (8.1)
Female sex, %54.256.566.056.055.2
Ethnic background, %
 White94.298.898.098.597.7
 Black or black British1.6000.20.6
 Asian, Asian British, or Chinese2.3000.20.5
 Mixed0.60.800.20.6
 Other0.90.42.00.50.4
 Unknown0.10000
Medical conditions (Cardiovascular), %39.438.344.038.938.3
 Hypertension27.926.232.025.626.5
 Angina4.73.62.04.94.7
 Myocardial infarction3.82.42.03.63.8
 Heart failure/pulmonary edema1.60.401.11.5
 Heart arrhythmia5.45.26.04.55.9
 Heart valve problem/heart murmur5.34.44.04.85.6
 Cardiomyopathy3.02.002.12.5
 Pericardial problem3.01.62.02.22.6
 Cerebrovascular disease6.33.28.05.76.4
 Peripheral vascular disease4.92.84.04.45.0
 Venous thromboembolic disease4.12.42.02.84.0
 High cholesterol15.012.516.013.614.8
Medical conditions (Gastrointestinal), %19.821.018.018.118.9
 Esophageal disorder10.09.34.09.110.0
 Stomach disorder4.04.02.03.53.9
 Bowel problem (any)10.98.116.010.311.1
  Irritable bowel syndrome3.92.84.03.44.0
  Malabsorption/celiac disease3.61.64.02.84.1
  Constipation1.91.201.51.4
 Liver/biliary/pancreas problem7.47.76.07.77.3
Medical conditions (Renal), %5.53.24.05.96.1
 Renal/kidney failure (any)1.80.82.01.62.2
  Requiring dialysis1.70.82.01.62.1
 Other renal/kidney problem (any)4.82.44.05.25.2
  Diabetic nephropathy1.10.801.71.4
Medical conditions (Endocrine/Diabetes), %14.914.914.015.814.0
 Diabetes (any)8.25.74.08.27.0
  Type 2 diabetes3.10.82.02.83.1
Other endocrine condition6.79.310.07.67.0
Medical conditions (Other), %
 Diabetic neuropathy/ulcers3.21.24.02.63.9
Age at event, mean (SD)
 Myocardial infarction58.2 (13.7)60.9 (7.5)28.8 (59.2)58.0 (8.9)59.0 (8.7)
 Stroke57.4 (15.9)56.2 (3.2)63.4 (11.0)48.9 (32.1)58.3 (8.9)
Medication use, %
 Cholesterol lowering medication11.28.96.012.311.2
 Blood pressure medication11.910.112.011.211.7
 Insulin0.70.800.80.8
Mortality, %4.02.44.03.54.1
Age at death, mean (SD)67.3 (7.1)69.4 (5.8)69.1 (2.2)68.3 (7.1)66.3 (7.0)
Primary cause of death, %
 Diseases of the circulatory system0.80.400.61.0
 Diseases of the digestive system0.2000.20.1
Contributory cause of death, %
 Diseases of the circulatory system1.10.801.21.1
 Diseases of the digestive system0.2000.30.2
Morphometrics, mean (SD) §
 Body mass index, kg/m227.4 (4.8)27.0 (4.6)26.8 (5.9)27.5 (4.7)27.1 (4.9)
 Abdominal subcutaneous adipose tissue volume, L6.5 (3.2)--5.0 (1.1)5.7 (2.5)
 Total trunk fat volume, L10.3 (4.6)--7.9 (2.6)8.9 (3.2)
 Visceral adipose tissue volume, L 3.2 (2.3)--2.3 (2.1)2.8 (1.9)
Biomarkers (Cardiovascular), mean (SD)
 Cholesterol (serum), mmol/L5.8 (1.1)5.8 (1.2)5.6 (0.9)5.8 (1.1)5.8 (1.1)
 LDL (serum), mmol/L3.6 (0.8)3.6 (0.9)3.5 (0.7)3.6 (0.8)3.6 (0.8)
 HDL (serum), mmol/L1.5 (0.4)1.5 (0.4)1.6 (0.3)1.5 (0.4)1.5 (0.4)
 Triglyceride (serum), mmol/L1.7 (1.0)1.7 (1.1)1.5 (0.7)1.7 (1.0)1.7 (1.0)
 Apolipoprotein A (serum), g/L1.6 (0.3)1.5 (0.3)1.6 (0.2)1.6 (0.3)1.6 (0.3)
 Apolipoprotein B (serum), g/L1.1 (0.2)1.0 (0.2)1.0 (0.2)1.0 (0.2)1.0 (0.2)
 C-reactive protein (serum), mg/L2.6 (4.4)2.9 (5.9)1.9 (2.0)2.6 (4.4)2.4 (4.3)
 Lipoprotein (a) (serum), nmol/L44.3 (48.9)49.6 (52.1)41.3 (47.4)42.8 (47.2)44.0 (49.4)
Biomarkers (Diabetes), mean (SD) #
 HbA1c (RBC), mmol/mol36.0 (6.5)35.5 (5.0)35.7 (4.2)35.9 (7.0)35.6 (5.9)
 Glucose (serum), mmol/L5.1 (1.2)5.1 (1.0)5.1 (1.2)5.2 (1.3)5.1 (1.1)
Biomarkers (Renal), mean (SD)
 Cystatin C (serum), mg/L0.9 (0.2)0.9 (0.2)0.9 (0.1)0.9 (0.1)0.9 (0.2)
 Creatinine (serum), μmol/L72.3 (18.7)72.4 (15.7)68.5 (10.9)71.1 (14.4)72.5 (18.6)
 Total protein (serum), g/L72.5 (4.1)72.2 (4.0)71.7 (3.6)72.5 (4.0)72.4 (4.1)
 Urea (serum), mmol/L5.4 (1.4)5.5 (1.4)5.2 (1.2)5.4 (1.3)5.4 (1.5)
 Phosphate (serum), mmol/L1.2 (0.2)1.2 (0.2)1.2 (0.1)1.2 (0.2)1.2 (0.2)
 Urate (serum), μmol/L309.2 (80.4)303.3 (71.4)294.7 (67.9)309.6 (84.2)306.1 (81.4)
Biomarkers (Liver), mean (SD)
 Albumin (serum), g/L45.2 (2.6)45.1 (2.6)44.5 (2.8)45.2 (2.5)45.2 (2.7)
 Direct bilirubin (serum), μmol/L1.8 (0.9)2.0 (0.9)1.9 (0.8)1.8 (0.8)1.8 (0.8)
 Gamma glutamyltransferase (serum), U/L37.4 (42.2)36.6 (42.6)35.8 (32.2)39.2 (52.3)35.5 (34.3)
 Alanine aminotransferase (serum), U/L23.6 (14.2)22.7 (11.6)25.2 (12.7)23.7 (12.8)23.5 (16.3)
 Aspartate aminotransferase (serum), U/L26.2 (10.7)25.8 (8.3)27.4 (6.7)26.3 (8.7)26.3 (12.9)

Ala = alanine; Arg = arginine; Gly = glycine; HbA1c = hemoglobin A1c; HDL = high-density lipoprotein; Ile = isoleucine; LDL = low-density lipoprotein; Met = methionine; SD = standard deviation; Thr = threonine; Val = valine; RBC = red blood cell.

*For descriptive purposes, the “no rare variants” group includes only participants with reference sequences at all four loci. However, for statistical comparison between groups with and without individual variants, only those without the variant of interest were excluded from the reference group.

P < 0.05 for comparison between participants with and without the corresponding KHK variant using false discovery rate adjustment.

‡Other endocrine conditions include those in ICD-10 category E34, including “endocrine disorder, unspecified” (E34.9) and “carcinoid syndrome” (E34.0).

§Adipose tissue volumes were determined using abdominal magnetic resonance imaging and were only available for a small subset of participants (n = 5,994 [1.2%] overall, ranging from 0 [0%] to 20 [1.0%] within the four rare variant groups).

‖Visceral adipose tissue volume is the volume of adipose tissue within the abdominal cavity, excluding adipose tissue outside the abdominal skeletal muscles and adipose tissue and lipids within and posterior of the spine and posterior of the back muscles.

¶Analysis of lipid biomarkers excludes participants taking cholesterol-lowering medications.

#Analysis of diabetes biomarkers (i.e., HbA1c and glucose) excludes participants diagnosed with type 2 diabetes.

The comparison of demographic characteristics and cardiometabolic risk factors and outcomes for UK Biobank participants with and without Val49Ile KHK variants is shown in Table 4. Overall, few statistically significant differences were observed after adjustment for multiple comparisons. Participants with single Val49Ile variants were more frequently of white ethnicity (94.7% vs. 93.4%; FDR p<0.001) than those with no Val49Ile variants. They also had lower total trunk fat volume (10.1 ± 4.6L vs. 10.5 ± 4.6L; FDR p = 0.01) and visceral adipose tissue volume (3.1 ± 2.2L vs. 3.4 ± 2.3L; FDR p<0.001). These differences were also statistically significant among only participants of white ethnic background. Those with double Val49Ile variants were also more frequently of white ethnicity (95.3% vs. 93.4%; FDR p<0.001), less frequently diagnosed with hypertension (27.4% vs. 28.1%; FDR p = 0.001), more frequently diagnosed with endocrine conditions other than diabetes (7.0% vs. 6.6%; FDR p = 0.02), and had lower hemoglobin A1c (HbA1c) values (35.9 ± 6.6 mmol/mol vs. 36.0 ± 6.5 mmol/mol; FDR p = 0.003) than those with no Val49Ile variants; none of these differences were statistically significant among only participants of white ethnic background.

Table 4
Comparison of cardiometabolic profile of UK Biobank participants with and without Val49Ile ketohexokinase variants*.
CharacteristicNo Val49Ile Variant (n = 195,103)Single Val49Ile Variant (n = 226,319)Double Val49Ile Variant (n = 66,155)
Age at recruitment, mean (SD)56.5 (8.1)56.5 (8.1)56.6 (8.1)
Female sex, %54.254.254.3
Ethnic background, %
 White93.494.795.3
 Black or black British1.81.51.2
 Asian, Asian British or Chinese2.72.01.6
 Mixed0.60.60.5
 Other1.00.90.8
 Unknown0.10.10.1
Medical conditions (Cardiovascular), %39.539.339.2
 Hypertension28.127.827.4
 Angina4.74.74.6
 Myocardial infarction3.93.83.8
 Heart failure/pulmonary edema1.61.61.6
 Heart arrhythmia5.35.45.4
 Heart valve problem/heart murmur5.35.35.4
 Cardiomyopathy3.03.03.1
 Pericardial problem3.03.03.1
 Cerebrovascular disease6.26.26.4
 Peripheral vascular disease4.84.95.0
 Venous thromboembolic disease4.04.14.1
 High cholesterol15.014.915.0
Medical conditions (Gastrointestinal), %19.719.820.2
 Esophageal disorder10.010.010.2
 Stomach disorder4.04.04.0
 Bowel problem (any)10.810.911.1
  Irritable bowel syndrome3.93.94.0
  Malabsorption/celiac disease3.53.63.6
  Constipation1.81.81.9
 Liver/biliary/pancreas problem7.37.47.5
Medical conditions (Renal), %5.55.55.6
 Renal/kidney failure (any)1.81.81.8
  Requiring dialysis1.71.71.7
 Other renal/kidney problem (any)4.84.84.9
  Diabetic nephropathy1.11.11.2
Medical conditions (Endocrine/Diabetes), %14.914.915.1
 Diabetes (any)8.48.28.1
  Type 2 diabetes3.03.03.1
 Other endocrine condition §6.66.77.0
Medical conditions (Other), %
 Diabetic neuropathy/ulcers3.23.23.3
Age at event, mean (SD)
 Myocardial infarction57.9 (14.6)58.4 (13.0)58.1 (13.2)
 Stroke57.4 (15.8)57.5 (15.9)56.8 (16.6)
Medication use, %
 Cholesterol lowering medication11.311.211.1
 Blood pressure medication11.911.811.7
 Insulin0.70.70.6
Mortality, %4.04.04.0
Age at death, mean (SD)67.3 (7.1)67.3 (7.0)67.2 (7.2)
Primary cause of death, %
 Diseases of the circulatory system0.91.10.8
 Diseases of the digestive system0.20.20.2
Contributory cause of death, %
 Diseases of the circulatory system1.11.11.1
 Diseases of the digestive system0.30.20.3
Morphometrics, mean (SD)
 Body mass index, kg/m227.4 (4.8)27.4 (4.8)27.4 (4.8)
 Abdominal subcutaneous adipose tissue volume, L6.6 (3.3)6.5 (3.2)6.5 (3.1)
 Total trunk fat volume, L10.5 (4.6)10.1 (4.6)10.1 (4.3)
 Visceral adipose tissue volume, L 3.4 (2.3)3.1 (2.2)3.1 (2.2)
Biomarkers (Cardiovascular), mean (SD) #
 Cholesterol (serum), mmol/L5.8 (1.1)5.8 (1.1)5.9 (1.1)
 LDL (serum), mmol/L3.6 (0.8)3.6 (0.8)3.7 (0.8)
 HDL (serum), mmol/L1.5 (0.4)1.5 (0.4)1.5 (0.4)
 Triglyceride (serum), mmol/L1.7 (1.0)1.7 (1.0)1.7 (1.0)
 Apolipoprotein A (serum), g/L1.6 (0.3)1.6 (0.3)1.6 (0.3)
 Apolipoprotein B (serum), g/L1.0 (0.2)1.1 (0.2)1.1 (0.2)
 C-reactive protein (serum), mg/L2.6 (4.4)2.6 (4.3)2.6 (4.3)
 Lipoprotein (a) (serum), nmol/L44.5 (48.8)44.2 (48.9)44.3 (49.9)
Biomarkers (Diabetes), mean (SD) **
 HbA1c (RBC), mmol/mol36.0 (6.5)36.0 (6.4)35.9 (6.6)
 Glucose (serum), mmol/L5.1 (1.2)5.1 (1.2)5.1 (1.1)
Biomarkers (Renal), mean (SD)
 Cystatin C (serum), mg/L0.9 (0.2)0.9 (0.2)0.9 (0.2)
 Creatinine (serum), μmol/L72.2 (17.9)72.4 (19.0)72.4 (19.5)
 Total protein (serum), g/L72.5 (4.1)72.5 (4.1)72.5 (4.1)
 Urea (serum), mmol/L5.4 (1.4)5.4 (1.4)5.4 (1.4)
 Phosphate (serum), mmol/L1.2 (0.2)1.2 (0.2)1.2 (0.2)
 Urate (serum), μmol/L309.6 (80.5)309.0 (80.3)309.2 (80.5)
Biomarkers (Liver), mean (SD)
 Albumin (serum), g/L45.2 (2.6)45.2 (2.6)45.2 (2.6)
 Direct bilirubin (serum), μmol/L1.8 (0.9)1.8 (0.8)1.8 (0.8)
 Gamma glutamyltransferase (serum), U/L37.4 (42.5)37.3 (41.7)37.5 (42.3)
 Alanine aminotransferase (serum), U/L23.6 (14.2)23.5 (14.2)23.5 (14.3)
 Aspartate aminotransferase (serum), U/L26.2 (10.7)26.2 (10.8)26.2 (10.2)

Abbreviations are explained in the first footnote to Table 3.

*Val49Ile genotypes were unavailable for 675 (0.14%) individuals.

P < 0.05 for unadjusted (age, sex, ethnicity) or age- and sex-adjusted (all others) comparison to participants with no Val49Ile variants (GG) using false discovery rate adjustment.

P < 0.05 in sensitivity analysis including only participants of white ethnic background.

§Other endocrine conditions include those in ICD-10 category E34, including “endocrine disorder, unspecified” (E34.9) and “carcinoid syndrome” (E34.0).

‖Adipose tissue volumes were determined using abdominal magnetic resonance imaging and were only available for a small subset of participants (n = 5,994 [1.2%] overall, ranging from 822 [1.3%] to 2,667 [1.3%] within the three variant subgroups).

¶Visceral adipose tissue volume is the volume of adipose tissue within the abdominal cavity, excluding adipose tissue outside the abdominal skeletal muscles and adipose tissue and lipids within and posterior of the spine and posterior of the back muscles.

#Analysis of lipid biomarkers excludes participants taking cholesterol-lowering medications.

**Analysis of diabetes biomarkers (i.e., HbA1c and glucose) excludes participants diagnosed with type 2 diabetes.

Given the autosomal recessive inheritance nature of EF, we were also interested in participants with multiple KHK variants. Differences between participants with two KHK variants and those with none are shown in S1 Table. Note that, while a number of statistically significant differences were observed, only one, an increased direct bilirubin in the group with two KHK variants including Gly40Arg or Ala43Thr (1.9 [0.9] μmol/L vs. 1.8 [0.9] μmol/L), was significant in both age- and sex-adjusted analyses and in the sensitivity analysis among only white participants. These findings should be considered exploratory only as these p-values were not adjusted for multiple comparisons and the variation was within the normal range.

Discussion

Our failure to identify even a single individual with the Gly40Arg/Ala43Thr genotype among UK Biobank participants suggests either that the prevalence of EF is much lower than previously estimated or that there may be important differences in prevalence by race and ethnicity. Based on the carrier frequency of these two variants in the UK Biobank, and the fact that they are in Hardy-Weinberg equilibrium, we can estimate the prevalence of EF to be approximately 0.37:1,000,000, considerably lower than published estimates. To contextualize the variant allele frequencies observed in the UK Biobank sample, we queried these five KHK variants in the Genome Aggregation Database (gnomAD https://gnomad.broadinstitute.org/), a public resource seeking to aggregate and harmonize exome and genome sequencing data from a variety of large-scale sequencing projects [14]. This resource (gnomAD v2) contains data from 125,748 exomes and 15,708 whole genomes, and S2 Table shows the variant allele frequency distributions for different global populations. In general, allele frequencies in the UK Biobank sample were similar to those in the European non-Finnish population from gnomAD. However, it should be noted that, while we found that UK Biobank participants with Val49Ile, Val188Met, and Val264Ile variants were more frequently of white ethnicity than those with no variants, the overwhelming majority of UK Biobank participants (94.1%) are in fact white, and these variants are equally and in some cases more prevalent in other ethnic groups, in particular Latinos (S2 Table). Further insights into the geographic and ethnic distribution of KHK variants could be obtained by replicating this study in other large biobank cohorts from China or Scandinavia [1517].

Previous literature has suggested that compound heterozygosity is required for the EF phenotype based on serum and urinary fructose analysis. Since the UK Biobank biomarker panels did not include serum or urinary fructose, we could not confirm this; however, our findings do suggest that neither individual KHK variants nor compound heterozygosity for the variant combinations we identified appear to confer a decreased risk of fructose-mediated adverse cardiometabolic events. In particular, the fact that the few statistically significant differences observed between individuals with a single Val49Ile variant and those with none were not also observed in those with two Val49Ile variants, and that the differences observed in those with two variants were not replicated among white participants only suggests that the Val49Ile KHK protein is likely a normally functioning variant. Alternatively, it is possible that the Val49Ile variant does indeed have cardiometabolic consequences but that they are of small magnitude and that we are thus unable to be demonstrate statistical significance despite the large numbers of participants with this common variant.

To date, it remains unclear from the published literature whether the variants studied, other than Gly40Arg and Ala43Thr, are functional. Unfortunately, using the UK Biobank data we were unable to provide any further evidence regarding cardiometabolic outcomes associated with the Gly40Arg/Ala43Thr genotype, or to characterize sufficient participants with multiple non-Val49Ile variants, to derive meaningful conclusions about the clinical significance of the Val188Met and Val264Ile variants. While the decreased prevalence of diabetes and slightly lower BMIs in participants with two KHK variants including Gly40Arg or Ala43Thr is intriguing, it should be noted these findings were not replicated in the sensitivity analysis including only white participants. In this case, the small numbers of participants with rare variants in combination limit our power to establish meaningful differences.

Preclinical data in mice have shown that full genetic knockout of KHK (KHK-KO) provides cardiometabolic protection/benefits under various metabolic stressors [18]. While we hoped to be able to determine whether the EF genotype (akin to preclinical KHK-KO) conferred any clinical benefit in humans, we were barred from doing so by an absence of identifiable EF subjects in the UK Biobank. On the other hand, this dataset did allow us to identify participants with single variants (e.g., Gly40Arg or Ala43Thr). In these individuals, partial loss in protein function appeared neither to confer any clinical benefit, nor to result in any negative, detrimental phenotypes.

Interestingly, participants homozygous for the relatively common Val49Ile were marginally less frequently diagnosed with hypertension. Compelling evidence from basic science, population studies, and clinical trials implicate fructose as playing a major role in the development of hypertension, among an array of additional metabolic complications [1929]. In line with these findings, removing dietary fructose has resulted in reversal of adverse metabolic effects, including hypertension. The mechanism connecting fructose to hypertension is unclear and may be an indirect product of worsened metabolic phenotype. However, there is emerging evidence that fructose metabolism in specific cell types (e.g., kidney/proximal tubule and vascular endothelium) directly promote pathways involved in increased blood pressure, including renal NHE3 upregulation [3034]. While the decreased incidence of hypertension in participants with two Val49Ile variants in this dataset is encouraging, the lack of additional clinical benefits (e.g., decreased glucose, insulin, and triglycerides, as seen in fructose restriction and/or clinical KHK inhibition), as well as the fact that this association was not statistically significant among only white participants, complicates interpretation and application of the data. In addition, the Val49Ile variant has previously been suggested to have no functional importance [1, 7]. Nevertheless, these data do confirm that variants within KHK do not appear to result in detrimental phenotypes and suggest additional biological exploration into this finding (e.g., preclinical Val49Ile characterization) is warranted.

Recent clinical data also suggest that submaximal pharmacological KHK inhibition may provide clinically meaningful benefits. In a phase 2a study (NCT03256526), administration of Pfizer’s KHK inhibitor compound (PF06835919) at doses of 75 mg or 300 mg for 6 weeks was evaluated in subjects with non-alcoholic fatty liver disease (NAFLD) [35]. A significant reduction in whole liver fat, as measured by magnetic resonance imaging proton density fat fraction in subjects with NAFLD, was observed at the 300 mg per day dose. At both the 75 mg and 300 mg doses, significant improvements in a marker of insulin sensitivity (i.e., Homeostatic Model Assessment of Insulin Resistance) were observed. Noteworthy was the dose-response in efficacy observed from 75 mg to 300 mg on high-sensitivity C-reactive protein and adiponectin, markers of inflammation and insulin sensitivity, respectively, suggesting submaximal pharmacological KHK inhibition may result in clinical benefit. However, without knowing the target engagement profile of PF06835919 and lacking any EF subject data to characterize, it remains to be determined if pharmacological KHK inhibition is comparable to genetic (partial or full) loss of KHK function.

While no participants with EF were identified in this dataset, other investigators [36] have derived relevant insights from a model simulating fructose fate in the liver compartment of normal and EF individuals. Their model predicted that in EF (vs. normal/baseline), increased urinary fructose is a result of decreased reabsorption by the kidneys; i.e., the rate of fructose reabsorption in normal subjects would be higher than individuals with EF even when exposed to similar fructose levels in the kidney. The model suggests that, in the liver of EF subjects, non-KHK metabolism increases while peripheral fructose metabolism slows. The authors conclude that EF adaptations may be a compensatory mechanism to avoid malabsorption of fructose and maximize non-KHK metabolism. Whether these predicted adaptations would mask any potential clinical benefit of genetic KHK loss-of-function (partial or full) relative to those observed with clinical pharmacological KHK inhibition will be revealed upon more robust characterization of EF subjects. Nonetheless, the clinical benefit observed with pharmacological inhibition suggests that KHK may be a promising therapeutic target for cardiometabolic diseases including NAFLD and type 2 diabetes mellitus.

Several limitations deserve mention. The UK Biobank cohort, while large and well characterized, includes only UK citizens between the ages of 40 and 70, is not a representative sample of all UK citizens and is known to be biased towards healthier individuals [37]. It is unknown how well our findings regarding the prevalence of KHK variants in this sample would translate to other geographically diverse populations. Another limitation pertains to available data elements. Despite the richness of the UK Biobank dataset, several key biomarkers that could have provided meaningful insights into KHK functioning, including urinary fructose excretion, serum fructose after a fructose bolus, and liver fat, were unavailable [3, 38, 39].

Despite these limitations, we believe our findings, based on a sample of nearly half a million participants, add to what is known about the prevalence and the clinical consequences of KHK variants singly and in combination. These findings reinforce the current understanding of EF as a rare, benign, autosomal recessive condition. While we observed no apparent clinical benefit in participants with compound heterozygous KHK variants, we cannot on that basis alone exclude the possibility that certain variants might produce the EF phenotype and result in meaningful cardiometabolic benefits.

Acknowledgements

The authors would like to acknowledge Craig Hammond for his helpful comments on an earlier draft of this manuscript, and Antonia Baldo for editorial assistance.

References

DTBonthron, NBrady, IADonaldson, BSteinmann. Molecular basis of essential fructosuria: molecular cloning and mutational analysis of human ketohexokinase (fructokinase). Hum Mol Genet. 1994;3: 16271631. 10.1093/hmg/3.9.1627

BSteinmann, RSanter, Gvan den Berghe. Disorders of fructose metabolism In: JFernandes, JMSaudubray, Gvan den Berghe, JHWalter, editors. Inborn Metabolic Diseases. 4th ed. Heidelberg: Springer; 2006 pp 135142.

ZLaron. Essential benign fructosuria. Arch Dis Child. 1961;36: 273277. 10.1136/adc.36.187.273

ERFroesch. Disorders of fructose metabolism. J Clin Pathol Suppl (Assoc Clin Pathol). 1969;2: 712.

PBoesiger, RBuchli, DMeier, BSteinmann, RGitzelmann. Changes of liver metabolite concentrations in adults with disorders of fructose metabolism after intravenous fructose by 31P magnetic resonance spectroscopy. Pediat Res. 1994;36: 436440. 10.1203/00006450-199410000-00004

RGitzelmann, BSteinmann, Gvan den Berghe. Disorders of fructose metabolism In: CRScriver, ALBeaudet, WSSly, DValle, editors. The Metabolic Basis of Inherited Disease. 6th ed. New York: McGraw-Hill; 1994 pp. 399424.

AAsipu, BEHayward, JO’Reilly, DTBonthron. Properties of normal and mutant recombinant human ketohexokinases and implications for the pathogenesis of essential fructosuria. Diabetes. 2003;52: 24262432. 10.2337/diabetes.52.9.2426

AFry, TJLittlejohns, CSudlow, NDoherty, LAdamska, TSprosen, et al Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol. 2017;186: 10261034. 10.1093/aje/kwx246

The UK Biobank. UK Biobank: protocol for a large-scale prospective epidemiological resource. March 21, 2007. Available at: http://www.ukbiobank.ac.uk/wp-content/uploads/2011/11/UK-Biobank-Protocol.pdf. Available August 17, 2020.

10 

CSudlow, JGallacher, NAllen, VBeral, PBurton, JDanesh, et al UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12: e1001779 10.1371/journal.pmed.1001779

11 

CBycroft, CFreeman, DPetkova, GBand, LTElliott, KSharp, et al The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562: 203209. 10.1038/s41586-018-0579-z

12 

TJLittlejohns, JHolliday, LMGibson, SGarratt, NOesingmann, FAlfaro-Almagro, et al The UK Biobank imaging enhancement of 100,000 participants: rationale, data collection, management and future directions. Nat Commun. 2020;11: 2624 10.1038/s41467-020-15948-9

13 

YBenjamini, YHochberg. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol. 1995;57: 289300.

14 

KJKarczewski, LCFrancioli, GTiao, BBCummings, JAlfoldi, QWang, et al The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581: 434443. 10.1038/s41586-020-2308-7

15 

ZChen, JChen, RCollins, YGuo, RPeto, FWu, et al China Kadoorie Biobank (CKB) collaborative group. China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol. 2011;40: 16521666. 10.1093/ije/dyr120

16 

SKrokstad, ALanghammer, KHveem, TLHolmen, KMidthjell, TRStene, et al Cohort profile: the HUNT study, Norway. Int J Epidemiol. 2013;42: 968977. 10.1093/ije/dys095

17 

LLeitsalu, THaller, TEsko, MLTammesoo, HAlavere, HSnieder, et al Cohort profile: Estonian Biobank of the Estonian Genome Center, University of Tartu. Int J Epidemiol. 2015;44: 113747. 10.1093/ije/dyt268

18 

COMiller, XYang, KLu, JCao, KHerath, TWRosahl, et al Ketohexokinase knockout mice, a model for essential fructosuria, exhibit altered fructose metabolism and are protected from diet-induced metabolic defects. Am J Physiol Endocrinol Metab. 2018;315: E386E393. 10.1152/ajpendo.00027.2018

19 

VHJayalath, JLSievenpiper, RJde Souza, VHa, AMirrahimi, IDSantaren, et al Total fructose intake and risk of hypertension: a systematic review and meta-analysis of prospective cohorts. J Am Coll Nutr. 2014;33: 32839. 10.1080/07315724.2014.916237

20 

DIJalal, GSmits, RJJohnson, MChonchol. Increased fructose associates with elevated blood pressure. J Am Soc Nephrol. 2010;21: 15439. 10.1681/ASN.2009111111

21 

ISHwang, HHo, BBHoffman, GMReaven. Fructose-induced insulin resistance and hypertension in rats. Hypertension. 1987;10: 5126. 10.1161/01.hyp.10.5.512

22 

HGPreuss, RDFournier. Effects of sucrose ingestion on blood pressure. Life Sci. 1982;30: 87986. 10.1016/0024-3205(82)90615-4

23 

SVasdev, VMPrabhakaran, MWhelan, CAFord, LLongerich, SParai. Fructose-induced hypertension, hypertriglyceridemia and elevated cytosolic calcium in rats: prevention by deuterium oxide. Artery. 1994;21: 12447.

24 

QYang, ZZhang, EWGregg, WDFlanders, RMerritt, FBHu. Added sugar intake and cardiovascular diseases mortality among US adults. JAMA Intern Med. 2014;174: 51624. 10.1001/jamainternmed.2013.13563

25 

VSMalik, BMPopkin, GABray, JPDespres, WCWillett, FBHu. Sugar-sweetened beverages and risk of metabolic syndrome and type 2 diabetes: a meta-analysis. Diabetes Care. 2010;33: 247783. 10.2337/dc10-1079

26 

TRebello, REHodges, JLSmith. Short-term effects of various sugars on antinatriuresis and blood pressure changes in normotensive young men. Am J Clin Nutr. 1983;38: 8494. 10.1093/ajcn/38.1.84

27 

JJRumessen, EGudmand-Hoyer. Absorption capacity of fructose in healthy adults. Comparison with sucrose and its constituent monosaccharides. Gut. 1986;27: 11618. 10.1136/gut.27.10.1161

28 

MTLe, RFFrye, CJRivard, JCheng, KKMcFann, MSSegal, et al Effects of high-fructose corn syrup and sucrose on the pharmacokinetics of fructose and acute metabolic and hemodynamic responses in healthy subjects. Metabolism. 2012;61: 64151. 10.1016/j.metabol.2011.09.013

29 

CMBrown, AGDulloo, GYepuri, JPMontani. Fructose ingestion acutely elevates blood pressure in healthy young humans. Am J Physiol Resul Integr Comp Physiol. 2008;294: R7307. 10.1152/ajpregu.00680.2007

30 

ABrymora, MFlisinski, RJJohnson, GGoszka, AStefańska, JManitius. Low-fructose diet lowers blood pressure and inflammation in patients with chronic kidney disease. Nephrol Dial Transplant. 2012;27: 60812. 10.1093/ndt/gfr223

31 

DAShoham, RDurazo-Arvizu, HKramer, ALuke, SVupputuri, AKshirsagar, et al Sugary soda consumption and albuminuria: results from the National Health and Nutrition Examination Survey, 1999–2004. PLoS One. 2008;3: e3431 10.1371/journal.pone.0003431

32 

PDCabral, NJHong, MAHye Khan, PAOrtiz, WHBeierwaltes, JDImig, et al Fructose stimulates Na/H exchange activity and sensitizes the proximal tubule to angiotensin II. Hypertension. 2014;63: e6873. 10.1161/HYPERTENSIONAHA.113.02564

33 

THayasaki, TIshimoto, TDoke, AHirayama, TSoga, KFuruhashi, et al Fructose increases the activity of sodium hydrogen exchanger in renal proximal tubules that is dependent on ketohexokinase. J Nutr Biochem. 2019;71: 5462. 10.1016/j.jnutbio.2019.05.017

34 

NSimons, FGDebray, NCSchaper, EJMFeskens, CEMHollak, JAPBons, et al Kidney and vascular function in adult patients with hereditary fructose intolerance. Mol Genet Metab Rep. 2020;11: 100600 10.1016/j.ymgmr.2020.100600

35 

Calle R, Bergman A, Somayaji V, Chidsey K, Kazierad D. Ketohexokinase inhibitor PF-06835919 administered for 6 weeks reduces whole liver fat as measured by magnetic resonance imaging-proton density fat fraction in subjects with nonalcoholic fatty liver disease. Paper presented at: Annual Meeting of the European Associate for the Study of the Liver International Liver Conference; April 12, 2019; Vienna, Austria.

36 

RJAllen, CJMusante. A mathematical analysis of adaptations to the metabolic fate of fructose in essential fructosuria subjects. Am J Physiol Endocrinol Metab. 2018;315: E394E403. 10.1152/ajpendo.00317.2017

37 

JMSwanson. The UK Biobank and selection bias. Lancet. 2012;380: 110 10.1016/S0140-6736(12)61179-9

38 

SSilver, MReiner. Essential fructosuria: report of three cases with metabolic studies. Arch Intern Med. 1934;54: 412426.

39 

AKKhachadurian. Nonalimentary fructosuria. Pediatrics. 1963;32: 455457.