PLoS ONE
Home Performance evaluation of survival regression models in analysing Swedish dental implant complication data with frailty
Performance evaluation of survival regression models in analysing Swedish dental implant complication data with frailty
Performance evaluation of survival regression models in analysing Swedish dental implant complication data with frailty

Competing Interests: The authors have declared that no competing interests exist.

Article Type: research-article Article History
Abstract

The use of inappropriate methods for estimating the effects of covariates in survival data with frailty leads to erroneous conclusions in medical research. This study evaluated the performance of 13 survival regression models in assessing the factors associated with the timing of complications in implant-supported dental restorations in a Swedish cohort. Data were obtained from randomly selected cohort (n = 596) of Swedish patients provided with dental restorations supported in 2003. Patients were evaluated over 9 years of implant loss, peri-implantitis or technical complications. Best Model was identified using goodness, AIC and BIC. The loglikelihood, the AIC and BIC were consistently lower in flexible parametric model with frailty (df = 2) than other models. Adjusted hazard of implant complications was 45% (adjusted Hazard Ratio (aHR) = 1.449; 95% Confidence Interval (CI): 1.153–1.821, p = 0.001) higher among patients with periodontitis. While controlling for other variables, the hazard of implant complications was about 5 times (aHR = 4.641; 95% CI: 2.911–7.401, p<0.001) and 2 times (aHR = 2.338; 95% CI: 1.553–3.519, p<0.001) higher among patients with full- and partial-jaw restorations than those with single crowns. Flexible parametric survival model with frailty are the most suitable for modelling implant complications among the studied patients.

Fagbamigbe,Karlsson,Derks,Petzold,and Chen: Performance evaluation of survival regression models in analysing Swedish dental implant complication data with frailty

Introduction

Survival regression methods are commonly used to explore heterogeneity among subjects in medical research [1] and to estimate prognostic factors for survival [26]. However, one of the major challenges in survival analysis modelling is clustering among followed subjects, otherwise known as frailty [7,8]. The concept of frailty is an issue of discourse in statistical modelling, including survival analysis. Frailty is a group-specific latent random effect that multiplies into the hazard function. The frailties are unobservable positive quantities. They follow a gamma distribution with a mean of 1 and variance to be estimated from the data. Theoretically, any distribution with positive support (mean = 1) and finite variance may be used to model frailty. In most cases, shared-frailty models are used to model the within-group correlation. Observations within a particular group are often referred to as correlated because they share the same frailty. Let us consider the case of patients attending a dental clinic. Any of the patients may have issues with any of their teeth after a particular intervention. The teeth of a single patient form a cluster. While it is reasonable to assume independence of the dental patients, it is incorrect to assume that the occurrence of infection to any of the teeth within each patient is independent. It is necessary therefore to accommodate the potential “dependency” by assuming that it was the result of a latent patient-level effect or frailty. Non-consideration of clustering in clustered data causes poor model fit and biased estimates. This suggests that a mixed-effects model that contains both the random and fixed effects would be most appropriate to model such outcomes.

The alternative, traditional survival regression models, divided into parametric (Poisson, Weibull), semiparametric (Cox), and nonparametric (Kaplan–Meier) have distinct disadvantages that could make them unsuitable to correctly predict survival outcomes [1,9]. For instance, the Kaplan-Meier model does not accommodate covariates, hence its utilization is limited [1,5]. Although the Cox proportional hazard (CPH) model is the most commonly used model in survival analyses [1,10,11] and has been used extensively in the literature [4,1113], its efficiency is limited for short observation periods [1]. Further, its distribution-free assumption is often violated in long-term studies. In either case, many of the subjects may not have experienced the event of interest and, thus, survival and cumulative hazard functions are incomplete and cannot be extrapolated in the CPH [1]. The CPH models assume a constant hazard, an assumption that is also frequently violated [14,15]. The Cox model has an advantage in that it does not assume the form of the baseline hazard function, therefore, not hindering the prediction of hazards and other related functions for a given set of covariates but this advantage gave birth to its major disadvantage [14]. Moreover, survival and cumulative hazard functions of the CPH model are step functions and, thus, limit the possibility of having smooth functions [10,16].

Parametric models, such as the exponential and Weibull models [1], attempted to overcome some of the shortcomings of the CPH model by producing smooth predictions by assuming a functional form of the hazard [1,17] and directly estimating the absolute and relative effects [14]. The models can be used to estimate the smooth cumulative hazard functions and hazard ratios of risk factors and extrapolate survival and cumulative functions [1]. Nonetheless, the models assume that the survival and hazard functions have a specific distribution which is often too structured and sometimes unrealistic for use with real data [9,10]. In addition, parametric models with complex underlying hazard fail to capture true effects [18,19]. Thus, in most cases, parametric models have insufficient flexibility and, thereby, produce biased cumulative hazard and survival functions [10].

While the disadvantages of non-parametric models can be overcome by the use of stratification, the number of factors used for such stratification may be limited [1]. Another way of alleviating the challenges of the CPH is to use a sufficiently large sample size and extensive study duration [20]. Also, parametric survival models may be useful if available data do not violate the underlying assumptions of the distributions. Despite these mitigations, none of the non-parametric, semi-parametric or parametric models is flexible enough to accommodate structural composition of all real-life data.

Royston and Parmar (RP) therefore developed flexible parametric survival regression (FPSR) models as a result of lack of adequate flexibility of the Cox and parametric survival models [9,10,21]. The FPSR model offers a compromise between the CPH and parametric models and retains the desired features of both types of models. The flexible parametric approach works by relaxing the assumption of linearity of log time by using restricted cubic splines [10,22]. The overall advantages of the FPSR models have been reported in the literature [1,9,14,18,19,23,24]. CPH, parametric and the FPSR models have incorporated frailty options.

Different factors could be associated with complications affecting dental restorations supported by implants but it is not known how non-consideration of the clustering nature of the implants (multiple implants in one subject) affect outcomes of the modelling approach. The need for appropriate statistical models for accurate medical inferences and decisions motivated this study. It is designed to evaluate and compare the application of CPH, parametric and FPSR models to complications affecting dental implants. Implants are clustered within patients and intra-cluster dependency may occur. We hypothesized that models with frailty, and the flexible parametric model with frailty, in particular, would perform better than all other range of models. We aimed to apply different survival analysis regression models to a dataset originating from Sweden [25,26] and assess the performance of the models to identify the model with the best data fit. We considered the (i) Cox proportional hazard models for frailty, (ii) Multilevel mixed-effects parametric survival models for proportional hazard and accelerated failure times and (iii) Flexible parametric survival regression models with frailty generally referred to as the Royston-Parmar (RP) models and their equivalents without frailty.

Methods and statistical models

Cox proportional hazard models with frailty

The Cox proportional hazard (PH) model with frailty is an extension of the Cox PH model developed in 1972 which assumed that hazards are multiplicatively proportional to baseline hazards [5] as shown in Eq (1).

The above equation provides estimates of β1, β2,…,βk, and its variance-covariance matrix but provides no direct estimate of the baseline hazard (h0(t)). However, the model provides an avenue to estimate the baseline cumulative hazard (H0(t)) and baseline survival (S0(t)) which can be used to estimate the h0(t) [10].

Let us assume groups i = 1,……………,n groups with j = 1,……………,ni observations in group i. For the jth observation in the ith group, the hazard is shown in Eq (2).

where group-level frailty is estimated by αi. The frailties are unobservable positive quantities and are assumed to have a mean of 1 and a variance θ. Shared-frailty models are used to model within-group correlation; observations within a group are correlated because they share the same frailty. The degree of within-group correlation can be measured by an estimate of “θ”, where θ is 0 in cases where there is no frailty.

By letting vi = log αi, the hazard is as shown in Eq (3)

which makes the log frailties vi, to be analogous to random effects obtainable in the corresponding standard linear models.

Numerically, let xi be the row vector of covariates for the time interval (t0i; ti) for the ith observation in a dataset with N subjects (i = 1,……….,N). The estimates of the coefficient (βi) of the covariates (Xi) can be estimated by maximizing the partial log-likelihood function in Eq (4)

Where j represents the index of the ordered failure times t(j), j = 1;………; D; such that Dj is the set of dj observations that fail at t(j); dj is the number of failures at t(j); and Rj is the set of observations k that are at risk at time t(j), that is, for all k such that t(0k)<t(j)≤t(k) [2729].

The data for Cox shared-frailty models are usually organized into G groups with the ith group consisting of n1 observations, i = 1,………,G. The estimation of θ is done via maximum profile log-likelihood. For fixed θ, estimates of β and v1,…….,vG are obtained by maximizing the parameters in Eq (5).

where Di is the number of death events in group i and logLCox(β, v1,………,vG) is the standard Cox partial log-likelihood, with the vi as the vector of the variables’ coefficients indicator which identifies the groups. The jth observation in the ith group has log relative hazard βxij+νi. The values that maximize logL(θ^) are the final estimates of β in νi [30].

Mixed-effects parametric survival models

The mixed-effects parametric survival models otherwise called the multilevel parametric survival models are well known [31]. These models contain both fixed and random effects. The accelerated failure-time (AFT) model and the multiplicative or proportional hazards (PH) model are the most-used models for adjusting survivor functions for the effects of covariates. In the AFT model, log t is expressed as a linear function of the covariates, when random-effects is incorporated, the function yields the function in Eq 6.

for j = 1,…………,M, clusters, with cluster j consisting of i = 1,…………,nj observations. The 1 X p row vector Xji contains the covariates for the fixed effects, with regression coefficients (fixed effects) β. The zji has 1 x q dimension and contains the covariates corresponding to the random effects. Also, vji are the observation-level errors with density φ(.). In the PH models, the model contains the covariates which have a multiplicative effect on the hazard function in Eq (7).
where h0(t), the baseline hazard function, is assumed to be parametric. Both the exponential and Weibull models can be implemented using the AFT and PH parameterizations, but the gamma and log-logistics and log-normal can only be implemented in AFT and implemented with “mestreg” in Stata.

Flexible parametric survival regression models

The FPSR model is based on a series of models that are modifications of several standard survival models [21] but has additional flexibility [21,23]. These models use restricted cubic splines to model a transformation of the survival function. The Weibull model is one of the most common parametric models and is an approximation of the PH model. It has been criticized for inflexibility in the shape of the baseline hazard function, which either increases or decreases monotonically. Weibull survival function is S(t) = exp(−λtγ), and the corresponding log cumulative hazard scale is ln {H(t)} = ln[−ln{S(t)}] = ln[−ln{exp (−λtγ)}] = ln(λ)+γ ln(t), after transformation. Addition of covariates to the model produces ln {H(t|xi)} = ln(λ)+γ ln(t)+xiβ.

Splines are flexible mathematical functions defined by piecewise polynomials with some constraints to ensure that the overall curve is smooth, are used. The polynomials join one another at points called knots. The fitted function is forced to have continuous 0th, 1st and 2nd derivatives. The most common splines used are cubic splines. Restricted cubic splines with k knots can be fitted by creating k-1 derived variables. For knots k1, k2,…………….kk, and parameters γ0, γ1,…………….γk−1, a restricted cubic spline function can be written as s(x) = γ01z1+⋯…………+γk−1zk−1; where z1 = x = ln(t) and zj(j≥2). The derived variables, zj, are computed as in Eq (8)

where j = 2,…,k−1; (xkj)+3=max{0,(xa)3};ϕj=(kkkj)/(kkk1);kk is the maximum k, and k1 is the minimum k. The derived variables can be highly correlated and are orthogonalized by using Gram–Schmidt orthogonalization.

The hazard function involves the derivatives of the restricted cubic splines functions as s′(x) = γ1z′12z′2+⋯…………+γk−1z′k−1. The choice of position of knots determines the complexity of the flexible models. Usually, k knots, maximum at 9 knots, has k+1 degrees of freedom (df). The position of the knots (internal) is usually in centiles computed as 100/df. So, for 3 knots, the df is 4 and the knots will be located at centiles 25, 50 and 75. The internal knots are bounded by “boundary knots” which are placed at the minimum and maximum of the distribution of uncensored survival times. The FPSR models become the Weibull model if the number of knots is 0, while γ0 and γ1 are equal to the scale parameter and shape parameter respectively. Royston et al. suggest using 1 or 2 knots for smaller (<10,000) datasets and 4 or 5 for larger (> = 10,000) datasets [10]. The FSPR models are implemented in Stata using “stpm2” with an option for frailty.

In the flexible parametric model, the contribution to the log-likelihood for the ith individual on the log cumulative hazard scale can be written as shown in Eq 9.

where di is the event indicator. The likelihood can be maximized by defining an additional equation for the derivatives of the spline function and constrain the parameters to be equal to the equivalent spline functions in the main linear prediction [22,32].

Model selection criteria

Log-likelihood, Akaike information criteria (AIC) [33] and the Bayesian information criteria (BIC) [34] were used for model selection. Lower values of AIC and BIC indicated a better model fit. AIC and BIC are usually computed and compared separately among different models to determine the best fitting model. However, confusion may arise if the best fitting model according to the AIC is different from that identified by the BIC [14]. Literature suggests that AIC will choose a more complex model irrespective of sample size while BIC is more likely to choose a simpler model [14]. AIC is often preferable in situations when a false negative finding would be considered to be more misleading than a false positive, and BIC is superior in situations where a false positive is as misleading as, or more misleading than, a false negative. AIC is best for prediction as it is asymptotically equivalent to cross-validation. BIC is best for an explanation as it allows consistent estimation of the underlying data generating process [14,35].

The data

The dataset used for this study originates from a project evaluating the effectiveness of dental implant therapy in Sweden. A cohort of 596 randomly selected adults, provided with implant-supported dental restorations in 2003, were followed over 9 years. The extent of dental treatment varied from the replacement of single teeth to the restoration of full jaws. The average number of implants per patient was 4.0 ±2.8 (range 1–12). Complications related to the restorations/implants were scored using the patient as the unit of analysis and timing was recorded in days relative to the time point of implant insertion. The complications included: Loss of a dental implant, development of peri-implantitis and/or occurrence of a technical complication. For details regarding case definitions of the different categories of complications, the reader is referred to Derks et al. [25,26,36]. The occurrence of any of the complications referred to above was considered as an event in the present analyses. There were a total of 1,038 events during the observation period with 469 complications in single-record/single-failure data.

Operational definitions

Median survival time: This is a statistic that refers to how long patients “survive” in general after dental restorative therapy including the use of implants.

The incidence rate is a measure of the frequency with which dental implant complications occurred per day.

Ethics approval and patient consent

The research protocol was approved by the regional Ethical Committee, Gothenburg, Sweden (Dnr 290–10), registered at ClinicalTrials.gov (NCT01825772) and study participants signed an informed consent form prior to inclusion.

Results

One of the 596 subjects was excluded from analysis due to missing data. The mean age (in 2003) of the 595 included participants was 62.3 (SD = 9.3) years, with 42% aged 60–69 years, 24% aged 70–79 years and 55% were females. Roughly 60% of patients presented without signs of periodontitis at the 9-year examination, 24% had periodontitis and 16% were edentulous (no teeth). A total of 28% had full-jaw restorations, 48% had partial-jaw restorations and 24% had single crowns, only. Regarding dental products, 31% received Straumann implants (Type A), 20% had Astra Tech implants (Type B), 40% had Nobel Biocare implants (Type C). The remaining 9% of subjects were treated with various other types of implants categorized as Type D.

The overall incidence rate of implant complications was 0.000241 per day. It was higher among those without remaining natural teeth (0.000387) and those who had full-jaw restorations (0.000439). The median survival time (when 50% of implant-carrying subjects would have “failed”) to implant complications was 2476 days, while the 25% survivorship was 820 days. The medium “survival” time was highest among those who had partial-jaw restorations (3347 days), females (3044 days), treated within the public dental service (3044 days), treated with Type A (3347 days) or other dental products (Type D) (3227 days respectively) as shown in Table 1.

Table 1
Distribution of incidence rate and quartile survival times by patients’ characteristics (n = 595).
CharacteristicsN(%)Days at riskincidence rateSurvival time (days)
25%Median (50%)75%
Age(years) in 2003 mean(sd)62.3(9.3)
<5080(13.5)2664360.0001351935cbccbc
50–59121(20.3)3999950.0001701185cbccbc
60–69252(42.4)8133590.0002997461874cbc
70–79145(24.4)4676010.0002615931927cbc
Age group
Younger (<60)201(33.8)6664310.0001561386cbccbc
Older (≥60)394(66.2)12809600.0002856851915cbc
Sex
Male267(44.9)8723440.0002886851927cbc
Female328(55.1)10750470.0002038993378cbc
Periodontal status
Healthy356(59.8)11684950.0001871081cbccbc
Periodontitis144(24.2)4686200.0002808681966cbc
No Teeth95(16.0)3102760.0003874711185cbc
Extent of treatment
Full jaw167(28.1)5374960.00043947010822840
Partial jaw283(47.6)9284680.0002089293347cbc
Single145(24.4)4814270.000083.cbccbc
Smoker
Yes76(12.8)2468400.0002929292118cbc
No519(87.2)17005510.0002348202509cbc
Clinical setting
Public dental service174(29.2)5752460.0002148683044cbc
Private dental service372(62.5)12119700.0002618072210cbc
Mix49(8.2)1601750.000187594cbccbc
Frequency of maintenance
Regular (annual)479(82.2)15653190.0002597462057cbc
Irregular (< annual)104(17.8)3419400.0001781752cbccbc
Ever Smoker
Yes209(35.1)6820820.0002387462515cbc
No386(64.9)12653090.0002438202280cbc
Dental product
Type A181(31.2)6050900.0002079593347cbc
Type B115(19.8)3749290.0002727461874cbc
Type C230(39.6)7404920.0002369292604cbc
Typ D55(9.5)1805800.0002228693227
Bone augmentation
No436(84.7)14285240.0002408202476cbc
Yes79(15.3)2565480.0002656542070cbc
Retention of restoration
Cemented198(34.1)6642740.0001172057cbccbc
Screw-retained346(59.6)11229310.0003146241661cbc
Both37(6.3)1197480.0002754042069cbc
Total59519473910.0002418202476cbc

cbc Cannot be computed

Missing data if the sum of categories <595.

The distribution of the incidence rate of implant complications by time is shown in Fig 1. The incidence rate reduced with increasing time of follow-up.

Distribution of incidence rate of dental failure by time.
Fig 1

Distribution of incidence rate of dental failure by time.

The hazard and survival functions of dental complications under different distributions

We assessed the hazard function under different models for periodontal status (one of the main prognostic factors) in Fig 2. In Fig 2, the chart in panel (a) is from the Weibull mixed effects parametric proportional hazard (b) Loglogistic mixed effects parametric proportional hazard (c) Cox proportional hazard and (d) Cox smoothed proportional hazard. The hazard for “periodontitis” was consistently higher than the hazard for “healthy” and “no teeth”. The hazard for “healthy” and “no teeth” was similar.

Comparison of the hazard functions of the models using the determinate variable.
Fig 2

Comparison of the hazard functions of the models using the determinate variable.

In Fig 3, we present the survival function under different models for periodontal status. The chart in panel (a) is from the Weibull mixed effects parametric proportional hazard (b) Loglogistic mixed effects parametric proportional hazard (c) Cox proportional hazard and (d) Cox smoothed proportional hazard for periodontal status. The survival for “periodontitis” was consistently lower than the survival for “healthy” and “no teeth”. The survival for “healthy” and “no teeth” was similar.

Comparison of the survival functions of the models using the determinate variable.
Fig 3

Comparison of the survival functions of the models using the determinate variable.

Test of assumption of proportionality

The test of violations of assumptions of the proportional hazard showed that the test was not violated (X2 = 5.50, df = 4, p = 0.240)

Comparison of the survival and hazard functions of the flexible model at different degrees of freedom

We compared the performance of the survival and hazard functions of the flexible model at various degrees of freedom (1, 2, 3 and 6) for the periodontal status of the patients. A degree of freedom of 1 is an equivalent of the Weibull distribution. The hazard functions of the Weibull distribution were different from the hazard functions at the other degrees of freedom. However, the function at 6 degrees of freedom was different and more flexible than at 2 and 3 degrees of freedom (Fig 4A). The survival functions at 2, 3 and 6 degrees of freedom were, however, similar but distinct at 1 degree of freedom (Fig 4B).

Comparison of the survival and hazard functions of the Weibull model and the flexible models.
Fig 4

Comparison of the survival and hazard functions of the Weibull model and the flexible models.

Selection of the best model

Loglikelihood, AIC BIC for all the models considered, with and without frailty, are presented in Table 2. All three parameters were consistently lower among the flexible frailty models at different degrees of freedom than the Cox PH frailty, parametric frailty models (Table 2). We observed that the AIC and BIC of the parametric models without frailty were consistently lower than those with frailty. Among the FPSR models at different degrees of freedom, the lowest loglikelihood was at df = 6, the lowest AIC was at df = 4, while the lowest BIC was at df = 2. However, for df >1, differences between the lowest and highest loglikelihood, between the lowest and highest AIC and between the lowest and highest BIC were 4.4 (0.45%), 1.47 (0.04%) and 18.7 (0.89%) respectively. According to the AICs, all the FPSR models at df>2 were similar. Hence we chose the simplest of all the FPSR models at df = 2. Our decision was further supported by the significance of the spline variables for the log baseline cumulative hazard (_rcsi) otherwise called the slope of the hazard curve within each of the knots generated by the degrees of freedom. At df = 2, the two slopes were statistically significant: _rcs1 = 2.26 (p<0.001) and _rcs2 = 1.150 (p<0.001). At df = 3, the first two slopes were statistically significant: _rcs1 = 2.25 (p<0.001) and _rcs2 = 1.134 (p<0.001) but the last slope was not significant (_rcs3 = 1.02 (p = 0.107). The slopes at df>2 had similar patterns with the slopes at df = 3.

Table 2
Comparison of the flexible models using different knots.
FrailtyModelModelll(null)ll(model)dfAICBIC
NoneCoxCox-2717.70-2633.63175301.265384.09
ParametricWeibull-1187.65-1134.54192307.072399.64
Exponential-1189.12-1134.86182305.732393.43
Log Logistic-1182.18-1130.25192298.512391.08
Gamma-1178.59-1130.46202300.922398.37
Flexible Modeldf = 1-1294.47-1134.54192307.072399.64
df = 2-1294.47-1128.29202296.582394.03
df = 3-1294.47-1127.95212297.902400.21
df = 4-1294.47-1125.68222295.362402.55
df = 5-1294.47-1124.84232295.692407.74
df = 6-1294.47-1123.84242295.682412.62
YesCoxCox with strata-2047.18-2001.85174037.704120.52
Cox frailty-2866.48-2577.24175188.485271.31
ParametricWeibull-4359.71-3900.89207841.787939.22
Exponential-4376.43-3910.82197859.647952.21
Log Logistic-4349.04-3898.90207837.807935.24
Gamma-4363.40-3905.27207850.547947.98
Flexible Modeldf = 1-1143.70-988.24192014.472107.04
df = 2-1143.70-972.51201985.012082.45
df = 3-1143.70-971.73211985.452087.76
df = 4-1143.70-969.99221983.982091.17
df = 5-1143.70-969.16231984.312096.37
df = 6-1143.70-968.11241984.222101.15

AIC Akaike Information Criteria BIC Bayesian Information Criteria df degrees of freedom ll loglikelihood PH Proportional Hazard AFT Accelerated Failure Rate

Modelling the risk factors of implant complications

We fitted an FPSR model at 2 degrees of freedom to the data and identified the adjusted determinants of implant complications among the patients. Table 3 showed that the adjusted hazard of implant complications was 45% (adjusted Hazard Ratio (aHR) = 1.449; 95% Confidence Interval (CI): 1.153–1.821, p = 0.001) higher among patients with periodontitis than those who were periodontally healthy. While controlling for other variables, the hazard of implant complications was about 5 times (aHR = 4.641; 95% CI: 2.911–7.401, p<0.001) and 2 times (aHR = 2.338; 95% CI: 1.553–3.519, p<0.001) higher among patients with full- and partial-jaw restorations respectively when compared to subjects with single crowns, only. The adjusted hazard of implant complications was 27% (aHR = 1.272; 95% CI: 1.047–1.548, p = 0.016) higher among male patients than females. The adjusted hazard of an implant complication was 40% (aHR = 1.397; 95% CI: 1.069–1.826, p = 0.014) higher among patients provided with Type B dental products with Type A products as reference. Smoking history, type of retention and age were not significant predictors of complications.

Table 3
Adjusted prognostic factors of dental implant complications using FPSR model (df = 2).
CharacteristicsAdjusted Hazard Ratio95% CIp-value
Periodontal status
Healthy1.000
Periodontitis1.4491.153–1.8210.001
No teeth1.0500.797–1.3830.730
Extent of treatment
Full jaw4.6412.911–7.401<0.001
Partial jaw2.3381.553–3.519<0.001
Single1.000
Age (years) in 2003
<501.000
50–591.0860.713–1.6520.702
60–691.1080.740–1.6580.620
70–790.8600.559–1.3220.491
Gender
Male1.2721.047–1.5480.016
Ever smoker
Yes1.0140.769–1.3370.922
Dental product
Type A1.000
Type B1.3971.069–1.8260.014
Type C1.0740.848–1.3600.554
Type D1.1160.779–1.5970.550
Retention of restoration
Screw-retained1.000
Cemented0.8700.627–1.2080.406
Both0.9200.607–1.3950.695
_rcs12.2622.080–2.459<0.001
_rcs21.1501.091–1.213<0.001

_rcs are the spline variables for the log baseline cumulative hazard

Discussion

This study was designed to apply and compare the performance of semi-parametric, parametric and flexible parametric survival regression models to a dataset on dental implant-related complications with or without frailty. This analytical study showed that models with frailty performed better than those without frailty except among the parametric models where the reverse was the case. This could be ascribed to inconsistencies and inflexibilities of parametric models (Royston and Lambert, 2011). Nonetheless, the AIC and the BIC of the flexible models were lower than those computed from the other models irrespective of whether the clustering nature of the implant data was considered or not.

Therefore, the flexible parametric survival regression model was the best of the three main models considered in this study. Our finding is consistent with findings in earlier studies [10,15,23]. All measures of model fit and model selection adopted in our study were consistently better in the flexible parametric survival regression models than in the other models. Loglikelihood, AIC and BIC were lower in the flexible parametric survival regression models than the Cox PH model and the parametric models. Similar findings have been reported in the literature [15,23,37]. The flexible models had a unique advantage by separating the hazard function into segments (splines) based on the specified degrees of freedom and computing the hazard within each spline [9,10,18,21].

We used AIC and BIC to select the ultimate degrees of freedom to use for the flexible parametric survival regression model. AIC and BIC are measures of the amount of information lost in the models [33,34]. The lower these values, the better the models. In this study, BIC was lowest at 2 degrees of freedom while AIC was lowest at 4 degrees of freedom. This discrepancy has previously been reported [35] and may be due to how the two information measures compute “complexities”. The problem of defining “N” (the number of observations) is not related to AIC because N is not used in computing AIC, which rather uses a constant 2 to weight complexity as measured by k (number of parameters estimated), rather than ln(N) in BIC. According to Stone at al. [38], AIC approximately minimizes the prediction error and is asymptotically equivalent to leave-1-out cross-validation (LOOCV) while BIC is equivalent to leave-k-out cross-validation (LKOCV) [39] and it is not consistent with the amount of data available. However, BIC has the advantage of being consistent. With a very large amount of data, and if the true model is among the candidate models, the probability of selecting the true model based on the BIC criterion would approach 1. This, however, may slightly affect prediction performance. We chose the flexible parametric survival regression model at 2 degrees of freedom as suggested by the BIC because the slopes of the curves within each spline were insignificant after 2 degrees of freedom, the differences in parameter estimates at 2, 3 and 4 degrees of freedom were negligible. Also, the AIC at 2, 3 and 4 degrees of freedom changed by 0.07%, which was considered negligible.

Our finding that the FPSR method with frailty fitted the data used in this study is further corroborated by the behaviour of the hazard and survival functions shown in Figs 24. However, there could be challenges of over-parametrization in the flexible model due to its adaptability and incorporation of up to ten knots. Also, the Cox model makes minimal assumptions about the form of the baseline hazard function and may have hindered the prediction of hazards and other related functions for a given set of covariates. It also results in unsmooth estimated curves and lack of information about what occurs between the observed failure times. Parametric models, on the other hand, produce smooth predictions by assuming a functional form of the hazard. Its assumed form is too structured for use with real data (Royston and Lambert, 2011). Therefore, the non-proportional hazards can be modelled using restricted cubic splines in FPSR models [14] and thereby produce a better fit.

The fitted flexible parametric survival regression model at 2 degrees of freedom showed that the hazard of implant complications was higher among male patients, patients with periodontitis, among patients with either full- or partial-jaw restorations and among patients that were provided with dental product Type B. It is plausible that more extensive restorations are at higher risk for complications through the simple fact that more implants and surfaces are exposed to potential events. More extensive restorations, however, may also serve as a surrogate parameter for the individual’s susceptibility to developing tooth- or implant-related problems. This may be illustrated by the fact that subjects presenting with periodontitis at remaining teeth were at higher risk for implant-related complications. This relationship is most likely explained by the strong association between periodontitis and peri-implantitis [40]. Peri-implantitis was one of the complications recorded in the present study. The background to the other factors identified in the model (sex and dental product) are not understood. It may be speculated that biting force and/or behaviour in terms of oral health may have had an impact.

The covariates included in the flexible model have shown that there is a wide range of factors that contribute to complications affecting dental implants. Their inclusion has influenced the performance of the models as they demonstrated reality. For instance, the risk of implant complications was generally higher among patients with periodontitis than those that were periodontally healthy. No difference, however, was noted between periodontally healthy and edentulous patients. Similar assertions have been made in earlier studies [41,42].

Conclusion

Flexible parametric survival model represents the best approach for estimating the hazard of clustered implant complications including (i) implant loss, (ii) peri-implantitis and (iii) technical complications. The study underscores the need to explore the multilevel (clustering) nature of datasets to be analysed. Non-consideration of the clustering nature of data is potentially misleading. The hazard of complications was higher among male patients, patients with periodontitis, patients with more extensive restorations and was dental product specific.

Acknowledgements

The authors appreciate the logistic supports provided by the Consortium for Advanced Research and Training in Africa (CARTA) to AFF to visit the University of Gothenburg as part of his fellowship at the University of Warwick.

Abbreviations

AFTAccelerated Failure Time
AICAkaike information criteria
aHRadjusted Hazard Ratio
BICBayesian information criteria
CIConfidence Interval
CPHCox Proportional Hazard
FPSRFlexible parametric survival regression
IQRInterquartile range
PHProportional hazard
RPRoyston-Palmar

References

XDu, MLi, PZhu, JWang, LHou, JLi, et al Comparison of the flexible parametric survival model and Cox model in estimating Markov transition probabilities using real-world data. PLoS One. 2018;13:113. 10.1371/journal.pone.0200807

WRSwindell. Accelerated failure time models provide a useful statistical framework for aging research. Exp. Gerontol. NIH Public Access; 2009;44:190200. 10.1016/j.exger.2008.10.005

PCAustin. A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications. Int. Stat. Rev. [Internet]. 2017;85:185203. Available from: 10.1111/insr.12214

AFFagbamigbe, ASAdebowale, IOMorhason-Bello. Survival analysis of time to uptake of modern contraceptives among sexually active women of reproductive age in Nigeria. BMJ Open [Internet]. 2015 [cited 2016 Jan 12];5:18. Available from: http://bmjopen.bmj.com/cgi/content/long/5/12/e008371 10.1136/bmjopen-2015-008371

DCox. Regression Models and Life Tables (with Discussion). J. R. Stat. Soc. Ser. B. 1972;34:187220.

JFox. Cox Proportional-Hazards Regression for Survival Data The Cox Proportional-Hazards Model. Most [Internet]. 2002;2008:118. Available from: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.110.2264&rep=rep1&type=pdf

AFFagbamigbe, BBBakre. Evaluating Likelihood Estimation Methods in Multilevel Analysis of Clustered Survey Data. African J. Appl. Stat. [Internet]. 2018;5:35176. Available from: http://www.statpas.org/ajas/admin/articles/pdfs/ajas_2018_01_04_def.pdf

AFFagbamigbe, RFAfolabi, KYAlade, ASAdebowale, BOYusuf. Unobserved Heterogeneity in the Determinants of Under-five Mortality in Nigeria: Frailty Modeling in Survival Analysis. African J. Appl. Stat. [Internet]. 2019;6:56583. Available from: http://www.statpas.org/ajas/admin/articles/pdfs/ajas_2019_01_04_def.pdf

NOrsini. Review of Flexible Parametric Survival Analysis Using Stata: Beyond the Cox Model by Patrick Royston and Paul C. Lambert. Stata J. 2013.

10 

PRoyston, PCLambert. Flexible Parametric Survival Analysis Using Stata: Beyond the Cox Model. 1st ed Texas: Stata Press; 2011.

11 

KNOtwombe, MPetzold, NMartinson, TChirwa. A review of the study designs and statistical methods used in the determination of predictors of all-cause mortality in HIV-infected cohorts: 2002–2011. PLoS One. 2014 10.1371/journal.pone.0087356

12 

AFFagbamigbe, AOAkintayo, OOshodi, FTMakinde, MBabalola, EADamilola, et al Survival Analysis and Prognostic Factors of Time to First Domestic Violence after Marriage among Married Women in Africa. Public Health [Internet]. Elsevier Ltd; 2020;181:12234. Available from: 10.1016/j.puhe.2019.12.003

13 

Wamala R, Kabagenyi A, Kasasa S. Predictors of time to contraceptive use from resumption of sexual intercourse after birth among women in Uganda. DHS Work. Pap. No. 118 [Internet]. 2015; Available from: http://dhsprogram.com/pubs/pdf/WP118/WP118.pdf

14 

HBower, MJCrowther, MJRutherford, TMLAndersson, MClements, XRLiu, et al Capturing simple and complex time-dependent effects using flexible parametric survival models: A simulation study. Commun. Stat. Simul. Comput. [Internet]. Taylor and Francis Inc.; 2019 [cited 2020 Mar 6];117. Available from: https://www.tandfonline.com/doi/full/10.1080/03610918.2019.1634201

15 

PWDickman. Comparing Cox and flexible parametric models [Internet]. 2020 [cited 2020 Mar 6]. p. 12. Available from: http://pauldickman.com/software/stata/sex-differences-cox/

16 

CPNelson, PCLambert, IBSquire, DRJones. Flexible parametric models for relative suvival, with application in coronary heart disease. Stat. Med. 2007 p. 548698. 10.1002/sim.3064

17 

CWilliams, JDLewsey, DFMackay, AHBriggs. Estimation of Survival Probabilities for Use in Cost-Effectiveness Analyses. Med. Decis. Mak. [Internet]. SAGE Publications Inc.; 2016 [cited 2020 Mar 6];37:0272989X1667061. Available from: http://journals.sagepub.com/doi/10.1177/0272989X16670617

18 

RNg, KKornas, RSutradhar, WPWodchis, LCRosella. The current application of the Royston-Parmar model for prognostic modeling in health research: a scoping review. Diagnostic Progn. Res. Springer Nature; 2018;2:115. 10.1186/s41512-018-0026-5

19 

MJRutherford, MJCrowther, PCLambert. The use of restricted cubic splines to approximate complex hazard functions in the analysis of time-to-event data: a simulation study. J. Stat. Comput. Simul. Taylor and Francis Ltd.; 2015;85:77793.

20 

RLNijhuis, TStijnen, APeeters, JCMWitteman, AHofman, MGMHunink. Apparent and internal validity of a Monte Carlo-Markov model for cardiovascular disease in a cohort follow-up study. Med. Decis. Making [Internet]. Sage PublicationsSage CA: Thousand Oaks, CA; 2006 [cited 2020 Mar 6];26:13444. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16525167 10.1177/0272989X05284103

21 

PRoyston, MKBParmar. Flexible parametric proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects. Stat. Med. 2002;21:21752197. 10.1002/sim.1203

22 

PCLambert, PRoyston. Further development of flexible parametric models for survival analysis. Stata J. 2009;9:265290.

23 

BMiladinovic, AKumar, RMhaskar, SKim, RSchonwetter, BDjulbegovic. A Flexible Alternative to the Cox Proportional Hazards Model for Assessing the Prognostic Accuracy of Hospice Patient Survival. PLoS One. 2012;7:18. 10.1371/journal.pone.0047804

24 

BKearns, MDStevenson, KTriantafyllopoulos, AManca. Generalized Linear Models for Flexible Parametric Modeling of the Hazard Function. Med. Decis. Mak. 2019;39:86778. 10.1177/0272989X19873661

25 

KKarlsson, JDerks, JHåkansson, JLWennström, MMolin Thorén, MPetzold, et al Technical complications following implant-supported restorative therapy performed in Sweden. Clin. Oral Implants Res. [Internet]. Blackwell Munksgaard; 2018 [cited 2020 Mar 26];29:60311. Available from: http://doi.wiley.com/10.1111/clr.13271

26 

JDerks, DSchaller, JHåkansson, JLWennström, CTomasi, TBerglundh. Effectiveness of Implant Therapy Analyzed in a Swedish Population: Prevalence of Peri-implantitis. J. Dent. Res. [Internet]. SAGE Publications Inc.; 2016 [cited 2020 Mar 26];95:439. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26701919 10.1177/0022034515608832

27 

EVittinghoff, VGlidden D., SCShiboski, CEMcCulloch. Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models [Internet]. Second Edi Washington DC: Springer; 2012 [cited 2020 Aug 17]. Available from: https://www.stata.com/bookstore/regression-methods-biostatistics/

28 

YXu, YBCheung. Frailty models and frailty-mixture models for recurrent event times. Stata J. DPC Nederland; 2015;15:13554.

29 

YXu, YBCheung. Frailty models and frailty-mixture models for recurrent event times: Update. Stata J. DPC Nederland; 2018;18:47784.

30 

GrambschTherneau. Modeling Survival Data: Extending the Cox Model. New delhi, India: Springer; 2000.

31 

MJCrowther. Multilevel mixed-effects parametric survival analysis: Estimation, simulation, and application. Stata J. Promot. Commun. Stat. Stata [Internet]. SAGE Publications Inc.; 2019 [cited 2020 Aug 17];19:93149. Available from: http://journals.sagepub.com/doi/10.1177/1536867X19893639

32 

TMLAndersson, PCLambert. Fitting and modeling cure in population-based cancer studies within the framework of flexible parametric survival models. Stata J. DPC Nederland; 2012;12:62338.

33 

HAkaike. Information theory as an extension of the maximum likelihood principle In: BNPetrov, FCsaki, editors. Second Int. Symp. Inf. Theory. Ed. Budapest: Akademiai Kiado; 1973 p. 267281.

34 

GSchwarz. Estimating the Dimension of a Model. Ann. Stat. Institute of Mathematical Statistics; 1978;6:4614.

35 

STATA. AIC and BIC. Stata J. 2020;21:19.

36 

JDerks, JHåkansson, JLWennström, CTomasi, MLarsson, TBerglundh. Effectiveness of implant therapy analyzed in a Swedish population: Early and late implant loss. J. Dent. Res. [Internet]. SAGE Publications Inc.; 2015 [cited 2020 Mar 26];94:44S51S. Available from: http://www.ncbi.nlm.nih.gov/pubmed/25503901 10.1177/0022034514563077

37 

MJBrewer, AButler, SLCooksley. The relative performance of AIC, AICC and BIC in the presence of unobserved heterogeneity. RFreckleton, editor. Methods Ecol. Evol. [Internet]. British Ecological Society; 2016 [cited 2020 Mar 9];7:67992. Available from: http://doi.wiley.com/10.1111/2041-210X.12541

38 

MStone. An asymptotic equivalence of choice of model by cross-validation and Akaike’s criterion. J. R. Stat. Soc. Ser. B. 1977;39:447.

39 

JShao. An asymptotic theory for linear model selection. Stat. Sin. 1997;7:22142.

40 

FSchwarz, JDerks, AMonje, HLWang. Peri-implantitis. J. Clin. Periodontol. Blackwell Munksgaard; 2018;45:S24666.

41 

SCulshaw. Periodontal Disease: Its Impact on Restorative Dentistry. Prim. Dent. J. 2017;6:2531. 10.1177/205016841700600103

42 

JGWittneben, DBuser, GESalvi. Complication and failure rates with implant-supported fixed dental prostheses and single crowns: a 10-year retrospective study. Clin Implant Dent Relat Res. 2014;16:356364. 10.1111/cid.12066