Diabetologia
Home The suitability of patient-reported outcome measures used to assess the impact of hypoglycaemia on quality of life in people with diabetes: a systematic review using COSMIN methods
The suitability of patient-reported outcome measures used to assess the impact of hypoglycaemia on quality of life in people with diabetes: a systematic review using COSMIN methods
The suitability of patient-reported outcome measures used to assess the impact of hypoglycaemia on quality of life in people with diabetes: a systematic review using COSMIN methods

Article Type: Review Article Article History
Abstract

Aims/hypothesis

It is generally accepted that hypoglycaemia can negatively impact the quality of life (QoL) of people living with diabetes. However, the suitability of patient-reported outcome measures (PROMs) used to assess this impact is unclear. The aim of this systematic review was to identify PROMs used to assess the impact of hypoglycaemia on QoL and examine their quality and psychometric properties.

Methods

Systematic searches (MEDLINE, EMBASE, PsycINFO, CINAHL and The Cochrane Library databases) were undertaken to identify published articles reporting on the development or validation of hypoglycaemia-specific PROMs used to assess the impact of hypoglycaemia on QoL (or domains of QoL) in adults with diabetes. A protocol was developed and registered with PROSPERO (registration no. CRD42019125153). Studies were assessed for inclusion at title/abstract stage by one reviewer. Full-text articles were scrutinised where considered relevant or potentially relevant or where doubt existed. Twenty per cent of articles were assessed by a second reviewer. PROMS were evaluated, according to COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guidelines, and data were extracted independently by two reviewers against COSMIN criteria. Assessment of each PROM’s content validity included reviewer ratings (N = 16) of relevance, comprehensiveness and comprehensibility: by researchers (n = 6); clinicians (n = 6); and adults with diabetes (n = 4).

Results

Of the 214 PROMs used to assess the impact of hypoglycaemia on QoL (or domains of QoL), eight hypoglycaemia-specific PROMS were identified and subjected to full evaluation: the Fear of Hypoglycemia 15-item scale; the Hypoglycemia Fear Survey; the Hypoglycemia Fear Survey version II; the Hypoglycemia Fear Survey-II short-form; the Hypoglycemic Attitudes and Behavior Scale; the Hypoglycemic Confidence Scale; the QoLHYPO questionnaire and the Treatment-Related Impact Measure-Non-severe Hypoglycemic Events (TRIM-HYPO) questionnaire. Content validity was rated as ‘inconsistent’, with most as ‘(very) low’ quality, while structural validity was deemed ‘unsatisfactory’ or 'indeterminate'. Other measurement properties (e.g. reliability) varied, and evidence gaps were apparent across all PROMs. None of the identified studies addressed cross-cultural validity or measurement error. Criterion validity and responsiveness were not assessed due to the lack of a ‘gold standard’ measure of the impact of hypoglycaemia on QoL against which to compare the PROMS.

Conclusions/interpretation

None of the hypoglycaemia-specific PROMs identified had sufficient evidence to demonstrate satisfactory validity, reliability and responsiveness. All were limited in terms of content and structural validity, which restricts their utility for assessing the impact of hypoglycaemia on QoL in the clinic or research setting. Further research is needed to address the content validity of existing PROMs, or the development of new PROM(s), for the purpose of assessing the impact of hypoglycaemia on QoL.

Prospero registration

CRD42019125153

Graphical abstract

Supplementary Information

The online version contains peer-reviewed but unedited supplementary material available at 10.1007/s00125-021-05382-x.

Keywords
Carlton,Leaviss,Pouwer,Hendrieckx,Broadley,Clowes,McCrimmon,Heller,and Speight: The suitability of patient-reported outcome measures used to assess the impact of hypoglycaemia on quality of life in people with diabetes: a systematic review using COSMIN methods

Introduction

Both the experience and the risk of hypoglycaemia can have a serious negative impact on the quality of life (QoL) of adults with diabetes [18]. Living a life of quality is perhaps the ultimate goal, so protecting QoL is a daily burden for people experiencing or at risk of hypoglycaemia, and one that can be contradictory to the goals of medical therapy [8]. This may particularly be the case in those who aim for very tight glucose targets. The extent of this impact on QoL can be assessed using patient-reported outcome measures (PROMs). PROMs are questionnaires that can be used in both research and/or clinical care. PROMs complement objective data (e.g. actual blood glucose levels) by capturing the individual’s experiences in a quantifiable and standardised manner, across a range of concepts, e.g. health-related QoL, satisfaction with treatment or emotional well-being [9, 10]. When applied to the study of hypoglycaemia in diabetes, PROMs can facilitate an assessment of the psychological and economic burden of hypoglycaemia, which can be used to determine the value of therapeutic approaches to reducing hypoglycaemia frequency and severity.

Given the large number of PROMs available, it can be challenging to determine which PROM(s) to select for a given clinical or research purpose. Factors such as response burden (e.g. mode of administration, number of items [questions]), type of PROM (generic or condition-specific) and the purpose of the data collection will influence choice. However, a more fundamental issue is whether the PROM has been evaluated as ‘fit for purpose’. This evaluation should include assessment of three overall domains (validity, reliability and responsiveness), for which consensus-based standards (COnsensus-based Standards for the selection of health Measurement Instruments [COSMIN]) can be applied [11]. The COSMIN methodology and standards derive from widespread international expert consensus [11, 12] and have been applied to other PROM measures [1317], but not yet to the assessment of the impact of hypoglycaemia on QoL.

QoL is highly subjective and has been defined in many ways and most people, intuitively, have an understanding of what it means to them [18]. Perhaps the simplest definition is that QoL is a personal evaluation of how good or bad one’s life is [19]. For the purpose of this review, and consistent with the general consensus [9], we operationalised QoL as: (1) a multidimensional construct including components such as physical well-being (e.g. pain/discomfort, mobility, fatigue), psychological well-being (e.g. mood, fear, confidence) and social well-being (e.g. stigma, participation) [20]; (2) a subjective construct based on feelings, values, experiences and priorities (therefore, we do not include objective measures, or purely functional performance or assessment instruments); and (3) a dynamic construct, which changes over time according to the person’s priorities, experiences and situation.

The objectives of this review were to: (1) identify PROMs used to assess the impact of hypoglycaemia on QoL in adults with diabetes; and (2) formally evaluate their content validity, structural validity and other measurement properties. Our intention was to provide researchers and clinicians with a robust evidence base to assist them when selecting PROMs for this purpose. The review was undertaken as part of the Hypoglycaemia REdefining SOLutions for better liVEs (Hypo-RESOLVE) project, an international collaboration of clinicians, scientists, industry partners and people with diabetes [21].

Methods

We used the updated COSMIN guidance [12, 2224].

Data sources and searches

A protocol was developed and registered with PROSPERO [25]. A systematic literature search was conducted during 26–28 November 2018 to identify published evidence around the four concepts of: (diabetes) and (hypoglycaemia) and (psychosocial outcomes) and (measurement properties of measurement instruments). Databases searched include MEDLINE, EMBASE, PsycINFO, CINAHL and The Cochrane Library. Terms for psychosocial outcomes were chosen to include both generic, ‘umbrella’ terms for ‘quality and life’ and ‘well-being’ (sourced from published search filters) and specific psychosocial outcomes of diabetes known to the Hypo-RESOLVE team (e.g. fear of hypoglycaemia). In order to identify studies for the present systematic review, a validated search filter devised for retrieving studies on measurement properties of instruments in PubMed was used [26]. An example search strategy is shown in the electronic supplementary material (ESM) Methods.

Study selection

Inclusion criteria consisted of any study design that included the primary development and/or validation of a hypoglycaemia-specific PROM used to assess the impact of hypoglycaemia on QoL in adults diagnosed with diabetes with any type, e.g. type 1, type 2 and gestational, and who have experienced hypoglycaemia. Studies of hypoglycaemia/hypoglycaemic episodes not associated with diabetes were excluded. Commentaries, reviews, opinion pieces and any other non-empirical work were also excluded. Studies were assessed for inclusion at title and abstract stage by one reviewer (JL). Full-text articles were scrutinised where considered as relevant or potentially relevant or where doubt existed. Twenty per cent of studies were assessed by a second reviewer (JC) to check for consistency. Disagreements were resolved through discussion.

Data extraction

Data extraction included study characteristics (e.g. language; participant characteristics; recall period; analysis model), a brief summary of results and measurement properties of the PROMs. Primary outcomes included measurement properties of identified PROMs, consistent with the COSMIN checklist: PROM development; content validity; structural validity; internal consistency; cross-cultural validity/measurement invariance; reliability; measurement error; criterion validity; hypothesis testing for construct validity; and responsiveness. Definitions of the measurement properties are detailed in Table 1. In accordance with COSMIN guidelines, all data relating to PROM measurement properties were extracted independently by two reviewers (JL and JC) against the respective COSMIN criteria. Discrepancies were resolved through discussion.

Table 1
Definitions of measurement properties
Measurement propertyDefinition
Content validityThe extent to which the items in a PROM are representative of the construct they are intended to measure
Structural validityThe extent to which the items in a PROM reflect the dimensionality of the construct (i.e. the items form a single [unidimensional] scale or multiple subscales [a multidimensional scale])
Reliability: internal consistencyThe extent to which there is consistency of results across items in the PROM (i.e. within a specified scale or subscale)
Reliability: test–retestThe extent to which the PROM yields scores that are reproducible (stable) over time when there has been no change in the concept being assessed
Measurement errorThe systematic and random error of a person’s score on the PROM that is not attributed to changes in the construct to be measured
Criterion validityThe extent to which the scores of a PROM reflect the scores of a test or measure considered to be the ‘gold standard’
Hypothesis testing for construct validityThe extent to which the scores of a PROM are consistent with hypotheses. For example, with regard to internal relationships, relationships to scores of other instruments or differences between relevant groups. It is based on the assumption that the PROM is a valid measure of the construct
ResponsivenessThe ability of a PROM to detect change, as expected, over time in the construct to be measured when there is a true change in a person’s condition or treatment
Cross-cultural validityThe extent to which the measurement properties of the translated or culturally adapted PROM reflect the performance of the original version of the PROM

Content validity assessment

Content validity is the extent to which a PROM is deemed to reflect the construct of interest and, arguably, the most fundamental aspect of scale selection [27]. The methodological quality of the PROM development studies and other studies supplementing content validity were assessed using COSMIN standards [28]. The assessment involves three steps (see Fig. 1): (1) evaluation of the quality of the PROM development; (2) evaluation of the quality of any additional content validity studies on the PROM (if available); and (3) evaluation of the content validity of the PROM based on the quality and results of the available studies and the PROM itself. Steps 1 and 2 result in a rating of each COSMIN standard ranked on a four-point scale: ‘very good’, ‘adequate’, ‘doubtful’ and ‘inadequate’. Total ratings are then determined using the lowest rating for any item for that study (i.e. worst score counts) [22].

COSMIN assessment of content validity
Fig. 1

COSMIN assessment of content validity

Step 3 consists of three sub-stages. Step 3a incorporates reviewer ratings of the identified PROMs whereby reviewers consider relevance, comprehensiveness and comprehensibility. We sought ratings from three key stakeholder groups: (1) researchers (including those with expertise in systematic reviewing, QoL research and psychological aspects of diabetes) (n = 6); (2) clinicians (n = 6); and (3) adults with diabetes (n = 4), including two representatives of the Hypo-RESOLVE Patient Advisory Committee (PAC). All reviewers provided independent ratings of the PROMs based on several criteria: (1) the construct of interest (i.e. does the PROM include items that are relevant in measuring the impact of hypoglycaemia on QoL?); (2) the population of interest; (3) the context of use of interest (i.e. is the PROM suitable for use in research and/or clinical practice?); (4) the appropriateness of response options; (5) the appropriateness of the recall period; (6) the comprehensiveness (i.e. does the PROM assess the impact of hypoglycaemia on QoL as a whole, or only on select domains of QoL?); (7) the suitability/clarity of the PROM instructions; (8) whether PROM items and response options are understandable; (9) the appropriateness of PROM item wording; and (10) the extent to which response options are appropriate to the question being asked. A majority rating was determined for each group (researcher, clinician and PAC). The group ratings were then consolidated to produce an overall reviewer rating for each PROM. Table 2 details how relevance, comprehensiveness and comprehensibility were assessed.

Table 2
COSMIN criteria and rating system for evaluating the content validity of the PROMs (adapted from Terwee et al [28]), with an example shown in italics
Name of PROM (or subscale)PROM development study

(+/−/±/?)
Content validity study 1

(+/−/±/?)
Content validity study 2a

(+/−/±/?)
Rating of reviewers

(+/−/±/?)
Overall ratings per PROM

(+/−/±/?)
Quality of evidence

(High, moderate, low, very low)
The ABC-QoLJones et al, 2015Smith et al, 2016
Relevance
 1. Are the included items relevant for the construct of interest?b+++
 2. Are the included items relevant for the target population of interest?c+++
 3. Are the included items relevant for the context of use and interest?d
 4. Are the response options appropriate?++
 5. Is the recall period appropriate?+++
RELEVANCE RATING++++
Comprehensiveness
 6. Are all key concepts covered?
COMPREHENSIVENESS RATING
Comprehensibility
 7. Are the PROM instructions understood by the population of interest as intended?++
 8. Are the PROM items and response options understood by the population of interest as intended?++
 9. Are the PROM items appropriately worded?+
 10. Do the response options match the question?+
COMPREHENSIBILITY RATING±±±±
CONTENT VALIDITY RATING±High

Step 3b involves summarising the results of all available studies to provide an overall rating of relevance, comprehensiveness and comprehensibility and an overall content validity rating. This results in an outcome of ‘sufficient’, ‘insufficient’, ‘inconsistent’ or ‘indeterminate’. Finally, in Step 3c, the overall ratings determined in Step 3b are accompanied by a grading of the quality of the evidence using a modified Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach [29]. Using the modified GRADE approach, the quality of evidence is graded as ‘high’, ‘moderate’, ‘low’ or ‘very low’. The GRADE approach uses five factors to consider the quality of the evidence: risk of bias, inconsistency, indirectness, imprecision and publication bias [29]. Detailed information of the rating process is reported elsewhere [28]. The resultant evaluation of content validity includes an overall rating of: + (‘satisfactory’); − (‘unsatisfactory’); ± (‘inconsistent’); or ? (‘indeterminate’), with a measure of the quality of the evidence to support the content validity rating (‘high’, ‘moderate’, ‘low’, ‘very low’). A worked example of content validity rating and scoring is shown in Table 2. Detailed information on the COSMIN methodology applied is reported elsewhere [28].

Assessment of other psychometric properties

Table 1 defines each of the psychometric properties assessed. As above, a COSMIN rating was determined by assessment across the criteria for measurement properties using the same rating scale (‘sufficient’, ‘insufficient’, ‘inconsistent’ or ‘indeterminate’). The assessment of the quality of the evidence was applied using the GRADE approach. This results in a rating of: + (‘satisfactory’); − (‘unsatisfactory’); ± (‘inconsistent’); or ? (‘indeterminate’), with a measure of the quality of the evidence to support the structural validity rating (‘high’, ‘moderate’, ‘low’, ‘very low’). Full information on the COSMIN methodology applied in this review is reported elsewhere [23].

Quality assurance of the review

The quality of this review was assessed against a COSMIN checklist that was designed to evaluate the quality of systematic reviews of PROMs [30] (ESM Table 1).

Results

The search returned a total of 3661 unique records, from which 214 PROMs were identified as used in studies to assess the impact of hypoglycaemia on QoL or subdomains of QoL (Fig. 2, Table 3). Of these, 17 PROMs were initially identified as hypoglycaemia-specific and for consideration in this review, and nine were subsequently excluded following further scrutiny of the instruments. PROMs were excluded if they were: hypoglycaemia symptom measures that assessed attitudes, awareness and/or attitudes to awareness of symptoms (n = 3); related to specific treatments (n = 2); only a subscale of an overall PROM (n = 2); or not available for full inspection (n = 2). Consequently, the current review includes eight hypoglycaemia-specific PROMs that have been used to assess the impact of hypoglycaemia on QoL or at least one aspect of QoL: the Fear of Hypoglycemia 15-item scale (FH-15); the Hypoglycemia Fear Survey (HFS); the Hypoglycemia Fear Survey version II (HFS-II); the HFS-II short-form; the Hypoglycemic Attitudes and Behavior Scale (HABS); the Hypoglycemic Confidence Scale (HCS); the QoLHYPO questionnaire and the Treatment-Related Impact Measure-Non-severe Hypoglycemic Events (TRIM-HYPO) (Table 4).

PRISMA 2009 Flow Diagram: hypoglycaemia-specific PROMs used to assess the impact of hypoglycaemia on QoL
Fig. 2

PRISMA 2009 Flow Diagram: hypoglycaemia-specific PROMs used to assess the impact of hypoglycaemia on QoL

Table 3
Type and number of PROMs identified in title and abstract sift
Type of PROM measureNumber of PROMs
Designed for completion by children/adolescents22
Designed for completion by adults192
 Generic82
 Diabetes-specific51
 Treatment-specific37
 Glucose-monitoring-specific5
 Hypoglycaemia-specific17
Total214
Table 4
PROMs identified that have been used to assess the impact of hypoglycaemia on QoL (or its subdomains) in people with diabetes
PROMRecall periodN domains (items)Domains assessed by PROM (n items)Response optionsTotal score rangeOriginValidated English version available for review
FH-15Not stated3 (15)Fear (7), avoidance (3), interference (5)Never, almost never, sometimes, almost always, every day. 1–5 scale15–75SpainNo—Spanish only version
HFSNot stated2 (27)Behaviour (10), worry (17)Never, rarely, sometimes, often, very often. 1–5 scale27–135USAYes
HFS-II6 months2 (33)Behaviour (15), worry (18)Never, rarely, sometimes, often, almost always. 0–4 scale0–132USAYes
HFS-II short-form6 months2 (11)Behaviour (5), worry (6)Never, rarely, sometimes, often, almost always. 0–4 scale0–44USAYes
HABSPresent3 (14)Avoidance (4), confidence (5), anxiety (5)Strongly disagree, disagree, neutral, agree, strongly agree. 1–5 scale14–70USAYes
HCSNot stated1 (9)Confidence (9)Not confident at all, a little confident, moderately confident, very confident. 1–4 scale9–36USAYes
QoLHYPONot statedNot stated (13)Not reportedNever, rarely, sometimes, often, always. 0–4 scale0–52SpainNo—Spanish only version
TRIM-HYPOPast 30 days5 (33)Daily function (7), Emotional wellbeing (7), Work productivity (9), Sleep disruption (5), Diabetes management (5)Varies per item. 1–5 scale0–100France, Germany, UK, USAYes

Overall COSMIN assessment of PROMs

The overall results of the COSMIN assessment are shown in Table 5. There are considerable evidence gaps for the measurement properties of most of the PROMs. The HFS-II, QoLHYPO and TRIM-HYPO were the only instruments that could be rated across all the measurement properties.

Table 5
Summary of psychometric properties of hypoglycaemia-specific PROMs used to assess the impact of hypoglycaemia on QoL
PROMContent validityStructural validityReliability: internal consistencyReliability: test–retestHypothesis testing for construct validity
Rating of resultsQuality of evidenceRating of resultsQuality of evidenceRating of resultsQuality of evidenceRating of resultsQuality of evidenceRating of resultsQuality of evidence
FH-15±LowModerate+ModerateNRNR?Moderate
HFS±ModerateModerate?ModerateVery lowNRNR
HFS-II±LowModerate+ModerateModerate?Moderate
HFS-NorwegianNRNRHigh+High+HighNRNR
HFS-SingaporeNRNRHigh+High?High?High
HFS-SpanishNRNRModerate?Moderate?Very low?Moderate
HFS-SwedishNRNRModerate+ModerateNRNR?Moderate
HFS-II short-formNRNRHigh+HighNRNR?High
HABS±Very lowModerate+ModerateNRNR?Moderate
HCS±LowModerate+ModerateNRNR?Moderate
QoLHYPO±ModerateModerate+Moderate+Moderate?Moderate
TRIM-HYPO±Low?Moderate±ModerateVery low±Moderate

Content validity

ESM Table 2 summarises the key characteristics and COSMIN quality assessment of the PROM development studies. For five of the seven PROMs, there was evidence that adults with diabetes were involved in item generation (HFS, HABS, HCS, QoLHYPO and TRIM-HYPO). COSMIN quality ratings ranged from ‘inadequate’ (HFS, HABS and QoLHYPO), to ‘doubtful’ (HFS-II, HCS and TRIM-HYPO), to ‘very good’ (FH-15). The developers of the HFS-II short-form do not report on content validity, due to the scale being developed based on existing items in the HFS-II [31].

ESM Table 3 details characteristics of the PROM development studies. The overall quality of the PROM development studies was classified as ‘very good’ (FH-15), ‘inadequate’ (HFS, HABS and QoLHYPO) or ‘doubtful’ (HFS-II, HCS and TRIM-HYPO). Only five of the PROMs provided evidence of concept elicitation (all of which were of ‘doubtful’ or ‘inadequate’ quality) (HFS, HABS, HCS, QoLHYPO and TRIM-HYPO). The COSMIN rating for the PROM design ranged from ‘inadequate’ (HFS, HABS and QoLHYPO), to ‘doubtful’ (HFS-II, HCS and TRIM-HYPO), to ‘very good’ (FH-15). Three of the PROMs (HFS, QoLHYPO and TRIM-HYPO) reported on content validity. During the development of the HFS, health professionals were asked about the relevance and comprehensiveness of the PROM (‘doubtful’ COSMIN quality rating) [32]. For the QoLHYPO, adults with diabetes were asked about the comprehensibility, but not relevance, of the PROM (‘doubtful’ COSMIN quality rating) [33]. During the development of the TRIM-HYPO, adults with diabetes were asked about the comprehensibility and relevance of the PROM, but were not asked about comprehensiveness of the PROM ('doubtful' COSMIN quality rating) [34]. Aside from the development studies, no further studies were identified that independently assessed the content validity of the PROMs.

ESM Table 4 details the consensus ratings for the three groups of reviewers (researchers, clinicians, people living with diabetes), and an overall reviewer consensus rating for each PROM. FH-15 had an overall reviewer rating of ‘sufficient’; HFS-II, HABS, HCS and TRIM-HYPO were rated as ‘inconsistent’. For two of the PROMs (HFS and QoLHYPO), relevance, comprehensiveness and comprehensibility ratings resulted in a combination whereby COSMIN guidance is not explicit, and, thus, an overall rating could not be applied [28].

Structural validity

Twelve studies assessed the structural validity of the PROMs, all of which were reported in the development papers (ESM Table 5). No independent assessments of the structural validity were identified. Four studies examined the structural validity of a cultural adaptation/language translation of the HFS [3538]. A further study assessed the structural validity of the short-form of HFS-II [31]. COSMIN quality ratings of the HFS-Norwegian, HFS-Singapore and HFS short-form were ‘very good’ and ratings were ‘adequate’ for the remaining PROMs. The same principles as noted above were applied to assess the quality of the evidence for these instruments. The quality of evidence for the HFS-Norwegian, HFS-Singapore and HFS-II short-form instruments was assessed as ‘high’. The HFS-Spanish, HFS-Swedish and TRIM-HYPO instruments were assessed as ‘moderate’. Many of the studies reported exploratory factor analysis (EFA) (rather than the confirmatory factor analysis required to receive a ‘satisfactory’ rating). Those studies reporting confirmatory factor analysis (language versions of the HFS) did so to examine whether the expected two-factor structure (observed for the original HFS) fitted their dataset. However, they all rejected this a priori-defined structure, and therefore went on to explore the latent structure of the tool using EFA.

Internal consistency reliability

Thirteen studies were identified that reported evidence of the internal consistency of the PROMs [3134, 3643]. Some were undertaken by the instrument developers and some were independent assessments (ESM Table 6). Most studies [32, 33, 36, 39, 40, 42, 43] had an ‘adequate’ COSMIN quality rating. Five studies had a ‘very good’ COSMIN quality rating [31, 34, 37, 38, 41].

Reliability (test–retest)

Seven studies were identified that assessed the test–retest reliability of a PROM measure. Four of the studies were conducted by the instrument developers (FH-15, HFS-II, QoLHYPO and TRIM-HYPO). The remaining studies were assessments of the language versions of the HFS instrument (ESM Table 7). Four studies had an ‘adequate’ COSMIN quality rating [32, 33, 36, 40]. Two studies had a ‘very good’ COSMIN quality rating [37, 38]. One study had a 'doubtful' COSMIN quality rating [34].

Hypothesis testing for construct validity

Ten studies reported on hypothesis testing for construct validity (ESM Table 8) [31, 3336, 3840, 42, 43]. Of these, nine were comparing with other outcome measurement instruments (convergent validity) [31, 3336, 38, 40, 42, 43]. These were HFS-II, HFS-Spanish, HFS-Singapore, HFS-Sweden, HFS-II short-form, HABS, HCS, QoLHYPO and TRIM-HYPO. Six studies included comparisons between subgroups (discriminative or known-groups validity) [34, 3840, 42, 43]. These were FH-15, HFS-II, HFS-Singapore, HABS, HCS and TRIM-HYPO instruments.

Other psychometric properties

No studies were found to demonstrate evidence for cross-cultural validity, measurement error, criterion validity or responsiveness.

Discussion

This systematic review has summarised and critically evaluated published evidence on the psychometric characteristics of PROMs used to assess the impact of hypoglycaemia on QoL in adults with diabetes using COSMIN methodology. Our intention was to provide an evidence base that would help researchers and clinicians when selecting PROMs, based on the robust and comprehensive consensus-based COSMIN criteria. We identified eight PROMs that had been developed to assess the subjective impact of hypoglycaemia on QoL or a subdomain of QoL.

None of the PROMs included in this review had a ‘high’ rating for content validity (in relation to assessing the impact of hypoglycaemia on QoL), which is arguably the most important measurement property of a PROM [28, 44]. All had ‘inconsistent’ COSMIN ratings for content validity, but the quality of the evidence to support those ratings was greater for the HFS and QoLHYPO. To that end, there is some support to recommend the use of HFS and QoLHYPO instruments in research studies and/or clinical practice. However, it is important to acknowledge the conceptual framework from which these two instruments were developed, and how this diverges from our operationalisation of the concept of QoL (i.e. multidimensional, subjective and changing over time). The HFS was developed to measure fear of hypoglycaemia through two subscales—behaviour and worry. Fear is arguably a very specific aspect of the psychological subdomain of QoL. Furthermore, the developers were not explicit in describing the target population for the instrument (i.e. their sample included people with ‘insulin-dependent’ diabetes, but it is unclear whether this included people type 1 and/or type 2 diabetes, and whether it is also applicable to people who manage their diabetes without insulin but experience hypoglycaemia). While the content of the QoLHYPO instrument includes items that assess various domains of QoL (e.g. social relationships, mood, daily activities), it was designed for use only by people with type 2 diabetes. Furthermore, there have been no translations beyond the original Spanish version. Consequently, the format and layout of the QoLHYPO is not clear for English-speaking researchers, and the developers provide no information on domains. Further investigation would be required to determine the suitability of the QoLHYPO instrument in measuring the impact of hypoglycaemia in people with type 1 diabetes and in other language groups.

We have included details of psychometric properties of the PROMs identified as part of the original literature search. However, it is plausible that additional papers have also reported psychometric properties for one or more of the included PROMs (particularly in intervention studies). To that end, the information on measurement properties reported here should not be considered exhaustive. We did not adopt the approach taken by (some of) the PROM authors to consider HbA1c as the ‘gold standard’ in the assessment of criterion validity and criterion approach to responsiveness. Studies have shown that HbA1c it is not a reliable indicator of whether an individual experiences hypoglycaemia [45, 46], nor a surrogate for QoL [47], nor of the impact or burden of hypoglycaemia. Advances in glucose monitoring technologies are continually changing our understanding of diabetes and are contributing to a better understanding of the lived experience of diabetes and hypoglycaemia. Consequently, it may be appropriate in future studies to consider ‘time in range’ or ‘time in hypoglycaemia’ as a marker for the impact of hypoglycaemia on QoL—but the extent to which this will reflect the subjective experience has yet to be elucidated. In the absence of an agreed ‘gold standard’, it is not possible to determine the assessment of any criterion validity or criterion approach to responsiveness for any PROM.

In this systematic review, we followed the robust and comprehensive guidance developed by the COSMIN initiative [23, 28]. However, it is not without its limitations. The assessment of content validity and psychometric performance of PROMs is determined by taking the lowest rating of any standard in the criteria (i.e. the ‘worst score counts’ principle) [22, 28]. This means that a study could be rated as ‘very good’ or ‘good’ on all but one criterion; however, the overall rating could be affected by a ‘doubtful’ or ‘inadequate’ rating, thus reducing the overall score to ‘doubtful’ (or ‘inadequate’). The omission of one key component in reporting (such as whether interviews were recorded and transcribed verbatim) can result in a lower overall content validity rating, which could be argued as overly harsh and should be recognised as a limitation of the COSMIN approach. Where appropriate within this review, we consistently rated in favour of the PROM (rather than assuming the worst). Another limitation of the COSMIN approach was identified in the guidance for determining content validity ratings of studies. Here we noted that there was no information on how to determine overall content validity rating with the combinations achieved. We have documented our approach; however, if the review was to be replicated, others may opt to ‘down-grade’ the overall content validity rating. Furthermore, as part of the content validity assessment, we sought to include the opinion of stakeholders. The COSMIN guidance does not advise on how to ratify ratings should there be conflicting opinions between or within stakeholder groups.

It should be noted that some of the PROMs included within this review are legacy or ‘first generation’ measures; that is, they were developed at a time when there were no international standards for instrument development methods, so these were either not reported, or reported selectively or in little detail. Similarly, the way in which PROMs are developed has changed over time [27]. It is now more common to report the methodological steps undertaken during the instrument development phase. The COSMIN ratings should therefore be interpreted with a degree of caution, and do not provide evidence that the instrument development was not rigorous or that the instruments are not ‘fit for purpose’, but rather expose an absence of key evidence.

While there is published evidence of studies that report hypoglycaemia to negatively impact upon QoL [18], we have identified that those that utilise hypoglycaemia-specific PROMs have inadequate reliability and validity for this specific purpose. Thus, the current literature on the impact of hypoglycaemia on QoL is limited (if not flawed) and needs to be interpreted with caution. Given that the content validity of the instruments was lacking, it is plausible that hypoglycaemia impacts individuals in ways that are currently not being measured. It may be that the items within the instruments are no longer relevant (e.g. due to changes in diabetes treatments, monitoring, society, language use), or that the items are not comprehensive enough to fully capture the ways in which hypoglycaemia affects adults in the modern world.

In conclusion, none of the PROMs identified had sufficient evidence to demonstrate satisfactory content validity, i.e. they do not assess the impact of hypoglycaemia on QoL in adults living with diabetes. Furthermore, most were also limited in their published evidence of reliability, validity and responsiveness. There is an urgent need to follow contemporary guidance [27, 4850] to develop new instruments that can assess the impact of hypoglycaemia on QoL.

Abbreviations

COSMINCOnsensus-based Standards for the selection of health Measurement INstruments
EFAExploratory factor analysis
FH-15Fear of Hypoglycemia 15-item scale
GRADEGrading of Recommendations Assessment, Development and Evaluation
HABSHypoglycemic Attitudes and Behavior Scale
HCSHypoglycemic Confidence Scale
HFSHypoglycemia Fear Survey
HFS-IIHypoglycemia Fear Survey version II
Hypo-RESOLVEHypoglycaemia REdefining SOLutions for better liVEs
PACPatient Advisory Committee
NSHENon-severe hypoglycaemic event
PROMPatient-reported outcome measure
QoLQuality of life
QoLHYPOQoLHYPO questionnaire
TRIM-HYPOTreatment-Related Impact Measure-Non-severe Hypoglycemic Events
The original online version of this article was revised: The authors omitted one hypoglycaemia-specific patient-reported outcome measure (TRIM-HYPO) from their systematic review.
Publisher’s note:

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Change history:

3/23/2022

A Correction to this paper has been published: 10.1007/s00125-022-05668-8

Acknowledgements

We would like to acknowledge the following individuals who helped with the rating of PROMs included within this review: N. Ali (Radboud University Medical Center, the Netherlands); S. A. Amiel (Department of Diabetes, Faculty of Life Sciences and Medicine, King’s College London, UK); M. Hamid (Moroccan League for the Fight against Diabetes, Morocco); O. Mast (Consumer Health Care, Sanofi, Germany); E. Renard (Department of Endocrinology, Diabetes, Nutrition, Montpellier University Hospital, Montpellier, France); B. Riley (member of Lay ADvice for Diabetes and Endocrinology Research group, Sheffield Teaching Hospitals NHS Foundation Trust, UK); R. Scibilia (Diabetes Australia, Australia); and C. Tack (Radboud University Medical Center, the Netherlands).

Authors’ relationships and activities

JC and JS have developed and validated diabetes-specific PROMs that are not included in this review. SRH has acted as a consultant or speaker for NovoNordisk, Eli Lilly, Sanofi Aventis, Mannkind, Zealand, MSD and Boehringer Ingelheim. All other authors declare that there are no relationships or activities that might bias, or be perceived to bias, their work.

Contribution statement

JC, JL, FP, CH, MMB, MC, RJM, SRH and JS made substantial contributions to conception and design, acquisition of data, or analysis and interpretation of data. JC was responsible for initial drafting of the article. JC, JL, FP, CH, MMB, MC, RJM, SRH and JS revised the manuscript critically for important intellectual content. JC, JL, FP, CH, MMB, MC, RJM, SRH and JS gave final approval of the version to be published. JC is the guarantor of this work.

Funding

Hypo-RESOLVE has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) (https://www.imi.europa.eu/) under grant agreement No. 777460. The JU receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA and T1D Exchange, JDRF, the International Diabetes Federation (IDF) and The Leona M. and Harry B. Helmsley Charitable Trust. The authors are solely responsible for the content of this work, which reflects only the authors’ views, and the IMI JU is not responsible for any use that may be made of the information it contains. JS and CH are supported by the core funding to the Australian Centre for Behavioural Research in Diabetes provided by the collaboration between Diabetes Victoria and Deakin University.

Data availability

Data are available on request from the authors.

References

1. 

    Hendrieckx C, Ivory N, Singh H, Frier BM, Speight J. Impact of severe hypoglycaemia on psychological outcomes in adults with type 2 diabetes: a systematic review. Diabet Med2019. 36: 91082-1091 doi: 10.1111/dme.14067

2. 

    Wieringa TH, de Wit M, Twisk JWR, Snoek FJ. Does hypoglycaemia affect the improvement in QoL after the transition to insulin in people with type 2 diabetes?. J Endocrinol Investig2018. 41: 2249-258 doi: 10.1007/s40618-017-0744-5

3. 

    LaManna J, Litchman ML, Dickinson JK, . Diabetes education impact on hypoglycemia outcomes: a systematic review of evidence and gaps in the literature. Diabetes Educ2019. 45: 4349-369 doi: 10.1177/0145721719855931

4. 

    Barendse S, Singh H, Frier BM, Speight J. The impact of hypoglycaemia on quality of life and related patient-reported outcomes in type 2 diabetes: a narrative review. Diabet Med2012. 29: 3293-302 doi: 10.1111/j.1464-5491.2011.03416.x

5. 

    Lundkvist J, Berne C, Bolinder B, Jönsson L. The economic and quality of life impact of hypoglycaemia. Eur J Health Econ2005. 6: 197-202 doi: 10.1007/s10198-005-0276-3

6. 

    Jacobson AM, Braffett MS, Cleary PA, Gubitosi-Klug RA, Larkin ME, the DCCT/EDIC Research Group. A 23-year follow-up of the diabetes control and complications/epidemiology of diabetes interventions and complications cohort. Diabetes Care2013. 36: 103131-3138 doi: 10.2337/dc12-2109

7. 

    Rossi MC, Nicolucci A, Ozzello A, . Impact of severe and symptomatic hypoglycemia on quality of life and fear of hypoglycemia in type 1 and type 2 diabetes: results of the Hypos-1 Observational Study. Nutr Metab Cardiovasc Dis2019. 29: 7736-743 doi: 10.1016/j.numecd.2019.04.009

8. 

    Heindrieckx C, Gonder-Frederick L, Heller SR, Snoek FJ, Speight J. How has psycho-behavioural research advanced our understanding of hypoglycaemia in type 1 diabetes?. Diabet Med2020. 37: 411-419

9. 

    Speight J, Reaney MD, Barnard KD. Not all roads lead to Rome—a review of quality of life measurement in adults with diabetes. Diabet Med2009. 26: 4315-327 doi: 10.1111/j.1464-5491.2009.02682.x

10. 

    Reaney M, Gwaltney C. Measuring and interpreting patient-reported outcome data from clinical trials of diabetes medication. J Diabetes Res Clin Metab2014. 3: 7 doi: 10.7243/2050-0866-3-7

11. 

    Mokkink LB, Terwee CB, Patrick DL, . The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol2010. 63: 7737-745 doi: 10.1016/j.jclinepi.2010.02.006

12. 

    Terwee CB, Prinsen CAC, Chiarotto A, . COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res2018. 27: 1159-1170 doi: 10.1007/s11136-018-1829-0

13. 

    Chiarotto A, Ostelo RW, Boers M, Terwee CB. A systematic review highlights the need to investigate the content validity of patient-reported outcome measures for physical functioning in patients with low back pain. J Clin Epidemiol2018. 95: 73-93 doi: 10.1016/j.jclinepi.2017.11.005

14. 

    Craxford S, Deacon C, Myint Y, Ollivere B. Assessing outcome measures used after rib fracture: a COSMIN systematic review. Injury2019. 50: 111816-1825 doi: 10.1016/j.injury.2019.07.002

15. 

    Rezai M, Kolne K, Bui S, Lindsay S. Measures of workplace inclusion: a systematic review using the COSMIN methodology. J Occup Rehabil2020. 30: 420-454 doi: 10.1007/s10926-020-09872-4

16. 

    Ortega-Avila AB, Cervera-Garvi P, Ramos-Petersen L, Chicharro-Luna E, Gijon-Nogueron G. Patient-reported outcome measures for patients with diabetes mellitus associated with foot and ankle pathologies: a systematic review. J Clin Med2019. 8: 2146 doi: 10.3390/jcm8020146

17. 

    Powell PA, Carlton J, Buckley Woods H, Mazzone P. Measuring quality of life in Duchenne muscular dystrophy: a systematic review of the content and structural validity of commonly used instruments. Health Qual Life Outcomes2020. 18: 263 doi: 10.1186/s12955-020-01511-z

18. 

    Speight J, Holmes-Truscott E, Hendrieckx C, Skovlund S, Cooke D. Assessing the impact of diabetes on quality of life: what have the past 25 years taught us?. Diabet Med2020. 37: 3483-492 doi: 10.1111/dme.14196

19. 

McGee HM, O’Boyle CA, Hickey A, O’Malley K, Joyce CR (1991) Assessing the quality of life of the individual: The SEIQoL with a healthy and a gastroenterology unit population. Psychol Med 21(3):749–759

20. 

    Schipper HC, Clinch JJ, Olweny CLMSpilker B. Quality of life studies: definitions and conceptual issues. Quality of life and pharmacoeconomics in clinical trials1996. PhiladelphiaLippincott-Raven Publishers11-23

21. 

    de Galan BE, McCrimmon RJ, Ibberson M, for the Hypo-RESOLVE Consortium. Reducing the burden of hypoglycaemia in people with diabetes through increased understanding: design of the Hypoglycaemia REdefining SOLutions for better liVEs (Hypo-RESOLVE) project. Diabet Med2020. 37: 61066-1073 doi: 10.1111/dme.14240

22. 

    Terwee CB, Mokkink LB, Knol DL, Ostelo RW, Bouter LM, de Vet HC. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res2012. 21: 651-657 doi: 10.1007/s11136-011-9960-1

23. 

Mokkink L, Prinsen C, Patrick D et al (2018) COSMIN methodology for systematic reviews of patient-reported outcome measures (PROMs)—user manual. Available from https://www.cosmin.nl/wp-content/uploads/COSMIN-syst-review-for-PROMs-manual_version-1_feb-2018.pdf. Accessed 1 Dec 2018

24. 

    Prinsen CAC, Mokkink LB, Bouter LM, . COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res2018. 27: 51147-1157 doi: 10.1007/s11136-018-1798-3

25. 

Carlton J, Leaviss J, Clowes M, Hendrieckx C, Pouwer F, Speight J (2020) Psychometric characteristics of patient-reported outcome measures of the impact of hypoglycaemia on quality of life (QoL) in people with diabetes (PwD): a systematic review using COSMIN. PROSPERO 2019 CRD42019125153 Available from https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42019125153. Registered 27 Feb 2020

26. 

    Terwee CB, Jansma EP, Riphagen II, de Vet HCW. Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Qual Life Res2009. 18: 81115-1123 doi: 10.1007/s11136-009-9528-5

27. 

U.S Department of Health and Human Services Food and Drug Administration (FDA) (2009) Guidance for industry: patient-reported outcome measures: use in medicinal product development to support labelling claims. Available from https://www.fda.gov/media/77832/download. Accessed 1 Dec 2018

28. 

Terwee CB, Prinsen CAC, Chiarotto A et al (2018) COSMIN methodology for assessing the content validity of PROMs: user manual. Available from https://www.cosmin.nl/wp-content/uploads/COSMIN-methodology-for-content-validity-user-manual-v1.pdf. Accessed 1 Dec 2018

29. 

    Guyatt GH, Oxman AD, Vist GE, . GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ2008. 336: 924-926 doi: 10.1136/bmj.39489.470347.AD

30. 

    Terwee CB, Prinsen CAC, Ricci Garotti MG, Suman A, de Vet HCW, Mokkink LB. The quality of systematic reviews of health-related outcome measurement instruments. Qual Life Res2016. 25: 767-779 doi: 10.1007/s11136-015-1122-4

31. 

    Grabman J, Vajda Bailey K, Schmidt K, . An empirically derived short form of the Hypoglycaemia Fear Survey II. Diabet Med2017. 34: 4500-504 doi: 10.1111/dme.13162

32. 

    Cox DJ, Irvine A, Gonder-Frederick L, Nowacek G, Butterfield J. Fear of hypoglycemia: quantification, validation, and utilization. Diabetes Care1987. 10: 5617-621 doi: 10.2337/diacare.10.5.617

33. 

    Orozco-Beltràn D, Artola S, Jansa M, Lopez de la Torre-Casares M, Fuster E. Impact of hypoglycemic episodes on health-related quality of life of type-2 diabetes mellitus patients: development and validation of a specific QoLHYPO© questionnaire. Health Qual Life Outcomes2018. 16: 152 doi: 10.1186/s12955-018-0875-1

34. 

Brod M, Højbjerre L, Bushnell DM, Hansen CT (2015) Assessing the impact of non-severe hypoglycemic events and treatment in adults: development of the Treatment-Related Impact Measure-Non-severe Hypoglycemic Events (TRIM-HYPO). Qual Life Res 24(12):2971–2984. 10.1007/s11136-015-1023-6

35. 

Anderbro T, Amsberg S, Wredling R et al (2008) Psychometric evaluation of the Swedish version of the Hypoglycaemia Fear Survey. Patient Educ Couns 73(1):127–131. 10.1016/j.pec.2008.03.022

36. 

    Tasende C, Rubio JA, Alvarez J. Spanish translation, adaptation and validation of the Hypoglycemia Fear Survey in adults with type 1 diabetes in the Community of Madrid. Endocrinol Diabetes Nutr2018. 65: 5287-296 doi: 10.1016/j.endinu.2017.12.003

37. 

    Graue M, Iversen MM, Wentzel-Larsen T, Rokne B, Haugstvedt A. Assessing fear of hypoglycemia among adults with type 1 diabetes - psychometric properties of the Norwegian version of the Hypoglycemia Fear Survey II questionnaire. Norsk Epidemiologi2013. 23: 175-81 doi: 10.5324/nje.v23i1.1605

38. 

    Lam AYR, Xin X, Tan WB, Gardner DSL, Goh SY. Psychometric validation of the Hypoglycemia Fear Survey-II (HFS-II) in Singapore. BMJ Open Diabetes Res Care2017. 5: 1e000329 doi: 10.1136/bmjdrc-2016-000329

39. 

    Anarte Ortiz MT, Caballero FF, Ruiz de Adana MS, . Development of a new fear of hypoglycemia scale: FH-15. Psychol Assess2011. 23: 2398-405 doi: 10.1037/a0021927

40. 

    Gonder-Frederick LA, Schmidt KM, Vajda KA, . Psychometric properties of the Hypoglycemia Fear Survey-II for adults with type 1 diabetes. Diabetes Care2011. 34: 4801-806 doi: 10.2337/dc10-1343

41. 

    Pinhas-Hamiel O, Tisch E, Levek N, . Sexual lifestyle among young adults with type 1 diabetes. Diabetes Metab Res Rev2017. 33: 2e2837 doi: 10.1002/dmrr.2837

42. 

    Polonsky WH, Fisher L, Hessler D, Edelman SV. Development of a new measure for assessing insulin delivery device satisfaction in patients with type 1 and type 2 diabetes. Diabetes Technol Ther2015. 17: 11773-779 doi: 10.1089/dia.2015.0140

43. 

    Polonsky WH, Fisher L, Hessler D, Edelman SV. Investigating hypoglycemic confidence in type 1 and type 2 diabetes. Diabetes Technol Ther2017. 19: 2131-136 doi: 10.1089/dia.2016.0366

44. 

    Terwee CP, Prinsen CAC, Chiarotto A, . COSMIN standards and criteria for evaluating the content validity of health-related patient-reported outcome measures: a Delphi study. Qual Life Res2018. 27: 1159-1170 doi: 10.1007/s11136-018-1829-0

45. 

    Hendrieckx C, Halliday JA, Bowden JP, . Severe hypoglycaemia and its association with psychological well-being in Australian adults with type 1 diabetes attending specialist tertiary clinics. Diabetes Res Clin Pract2014. 103: 430-436 doi: 10.1016/j.diabres.2013.12.005

46. 

    Miller KM, Foster NC, Beck RW, T1D Exchange Clinic Network. Current state of type 1 diabetes treatment in the U.S.: updated data from the T1D Exchange clinic registry. Diabetes Care2015. 38: 971-978 doi: 10.2337/dc15-0078

47. 

    Walker J, Bradley C. Assessing the quality of life of adolescents with diabetes: using the SEIQoL, DQoL, patient and diabetes specialist nurse ratings. Pract Diab Int2002. 19: 5141-144 doi: 10.1002/pdi.348

48. 

    Wild D, Grove A, Martin M, . Principles of good practice for the translation and cultural adaptation process for patient-reported outcomes (PRO) measures: Report of the ISPOR task force for translation and cultural adaptation. Value Health2005. 8: 294-104 doi: 10.1111/j.1524-4733.2005.04054.x

49. 

International Consortium for Health Outcomes Measurement. Measuring results that matter: type 1 and type 2 diabetes in adults data collection reference guide. Version 1.0.0. ICHOM Diabetes in Adults Working Group, Type 1 and Type 2 Diabetes in Adults, November 2018. Available from https://ichom.org/files/medical-conditions/diabetes-in-adults/dia-reference-guide.pdf. Accessed 1 Dec 2019

50. 

    Carlton J, Peasgood T, Khan S, Barber R, Bostock J, Keetharuth AD. An emerging framework for fully incorporating public involvement (PI) into patient-reported outcome measures (PROMs). J Patient Rep Outcomes2020. 4: 4 doi: 10.1186/s41687-019-0172-8