Skip to main content

Development of a prognostic model for 1-year survival after fragile hip fracture in Chinese



No prognostic model for the survival of fragile hip fracture has been developed for Asians. The goal of this study was to develop a simple and practical prognostic model to predict survival within 1 year after fragile hip fracture in Asians.


A single-center retrospective cohort study was designed. Under a multivariable Cox proportional hazards regression model, we used the preoperative characteristics of patients to predict survival within 1 year after hip fracture. We built a full model and then used the least absolute shrinkage and selection operator (LASSO) method to further shrink the model coefficients and achieved variable screening. Finally, we obtained a LASSO model. The model performance was evaluated with Nagelkerke’s R2 and the concordance (c) statistic. We assessed the internal validity with a bootstrapping procedure of 1 000 repetitions.


A total of 735 eligible patients were admitted to our department for hip fracture from January 2015 to December 2020, but 11 (1.5%) patients were lost to follow-up. Among the remaining patients, 68 (9.3%) died within 1 year after hip fracture. We identified 12 candidate predictors from the preoperative characteristics of the patients. The last model contained nine predictors: surgery, age, albumin, sex, serum creatinine, malignancy, hypertension, ability to live independently, and cardiovascular and cerebrovascular diseases. Among them, surgery, age, and albumin are effective predictors of survival. The discrimination c statistic of the model is 0.814 (95% confidence interval 0.762–0.865); the corrected value through internal validation is 0.795.


This prognostic model can accurately predict a 1-year survival rate for patients with fragile hip fractures. This information can help clinicians develop a reasonable and personalized treatment plan.


Background and objectives

Fragile hip fractures mainly occur in middle- and old-aged patients, especially elderly patients. These patients tend to have more medical complications. It is not uncommon for them to die shortly after discharge or even during hospitalization. This poses a challenge for orthopedic surgeons and patients when choosing treatment options. For example, for patients who die shortly after surgery, their family members may think that the doctor's recommendation to choose surgery is wrong; conservative treatment may have been more appropriate, whether in terms of economic benefits or survival time. How to use the preoperative characteristics of patients to quickly distinguish the high-risk patients who will die shortly after a hip fracture from the low-risk patients and thus inform the doctors and family members about the patient’s prognosis in advance is of great practical significance for both providing personalized treatment and alleviating doctor–patient conflicts.

The existing prognostic models [1,2,3,4,5,6,7] have many problems. First, some models use other models as predictive factors. This makes the model complex and difficult to understand. It inevitably reduces their practicality [4,5,6]. For example, one model contained only six predictors, three of which are other models [5].

Second, some models use outcome variables as predictors, such as postoperative complications. This greatly limits their applicability [1]. Furthermore, there are methodological shortcomings: improper selection of the statistical model, for instance, studying survival outcomes with a logistic regression model [5]; studying binary outcomes with a linear regression model [2]; converting all continuous variables into categorical variables [3, 6], resulting in lower information utilization; not correcting for overfitting [6]; and selecting all candidate predictors with univariate analysis [6]. Finally, no similar model has been developed for Asians.

The goal of this study was to develop a simple and practical prognostic model for Asians using simple and easy-to-obtain preoperative characteristics to predict survival within 1 year after fragile hip fracture (SFHF). This model is mainly aimed at clinicians. They can quickly predict the risk of death of patients based on this model before surgery. This provides a reference for both doctors and patients to jointly develop reasonable personalized treatment plans.

The basic principle of useful model development is that practicality takes precedence over accuracy.


The report for this paper follows the TRIPOD Statement [8].

Source of data

The data came from a retrospective cohort study designed specifically for the development of this model. The study was conducted at Fenyang Hospital, a general university hospital in Shanxi Province, China. The participants in the model were patients who had been admitted consecutively to the orthopedics department of the hospital for hip fractures from January 2015 to December 2020. The follow-up started on April 13, 2021, and ended on May 28, 2021.


Our hospital is a municipal tertiary hospital, and its patients mainly come from surrounding counties and cities. The inclusion criteria were: 1. Age ≥ 50 years . The author chose 50 as the cutoff point to include as many fragile hip fracture patients as possible, especially for some women whose fragile fractures occur quite early. 2. Fragile fracture. Fragile fractures or low-energy fractures (considered synonyms here) refer to hip fractures that occur when patients fall from a standing height or lower, for example, falling when walking or when standing up or sitting down. High-energy fractures such as high falls, traffic accidents, high-energy injuries, and fights were excluded. Hip fractures in this study include femoral neck and intertrochanteric and subtrochanteric fractures. Periprosthetic and pathological fractures were excluded.

In our hospital, the basic process after a patient is admitted is: They are considered for surgical treatment initially and complete the routine preoperative tests such as necessary laboratory tests, fracture sites, and chest X-rays. If the patient has a medical disease, we invite the relevant departments for consultation and give symptomatic treatment. With the consent of the patient, surgery is performed as soon as possible after stabilizing the patient's general condition. Such a scheme results in very few patients undergoing surgery within 24 h of hospitalization. Conservative treatment should be adopted if the doctor believes that the patient cannot tolerate surgery or if the patient refuses the procedure.

The general choice of surgical treatment methods is femoral neck fracture-3 hollow nail fixation (< 65 years old) or half hip or total hip replacement (≥ 65 years old) and intertrochanteric or subtrochanteric fracture-proximal femur intramedullary nail fixation.


The outcome of interest was any cause of death within 1 year after hip fracture. It was determined through telephone interviews. All interviews were conducted after the patient data had been collected.

To ensure a high successful follow-up rate, we made the following efforts.

  1. 1.

    We developed the principle of telephone interviews by minimizing the content of the interviews as much as possible. The interview content varied according to the condition of the patient. In addition to collecting death and time information, we collected the cause of death and the patient's self-care ability before the fracture and/or after recovery. The purpose was to make the interview easier to accept by the family. According to the treatment methods in our hospital, patients were divided into two categories: surgical and conservative. For the former, if the patient died, we asked about the time of death, the reason, and the self-care situation before the fracture; if the patient survived, we asked about the current self-care situation, whether the patient could take care of himself/herself now, and then ended the interview. For conservatively treated patients, we continued to ask about the self-care situation before the fracture. In this group, they could not simply be classified as conservative treatment because they might go to other places for surgery after leaving our hospital. Therefore, we first had to determine whether they had been following conservative treatment before proceeding with the interview. If not, they were classified as surgical patients and were interviewed like surgery patients in our hospital in addition to questions about the timing of the operation. In this way, at least one question was asked in each interview. For example, those who had recovered their self-care ability after surgery could end the interview after just asking about their current recovery status. At most five questions were asked. For example, among those who died after going to other hospitals for surgery, ⑴ whether they were operated on after discharge, ⑵ the operation time, ⑶ the time of death, ⑷ the cause, and ⑸ their ability of self-care before the fracture (Fig. 1).

  2. 2.

    We set up time reference points to help the patients' families recall when death had occurred. For example, we ask the family members of the patients who had surgery after discharge, "Which day after the discharge or fracture was the surgery was performed." For the patients who had died but their family members could not recall the specific time of death, we used a method of narrowing the time range gradually. For example, "In which season did the death occur? Before or after the Spring Festival? Did it occur after the Lantern Festival?"

  3. 3.

    In cases of an incorrect telephone call, the family could not determine the specific time of death, or unwillingness to cooperate, we returned to the original medical record, checked the telephone number, or looked for other telephone numbers that had been recorded and then attempted a re-interview.

  4. 4.

    When the original medical records were checked and the interviews were still unsuccessful, we turned to the community medical system, which registered the telephone numbers of all families living in the community. If we were able to obtain a new phone number, then we interviewed them again.

Fig. 1
figure 1

Telephone interview flow chart. (Note: -P: primary; -L: last; -S: surgery; -D: death; Y: yes; N: no; dotted arrow: performed only if ALI-A be no)

Through the above methods, most of the patients lost to follow-up could successfully be contacted. The rest were regarded as missing data.


Based on the principle of modeling (practicality is preferred to accuracy) and the purpose (to provide a reference for formulating a treatment plan), we determined the principle of selecting candidate predictors: 1. Simplicity. Predictors were selected from the patient's medical history and the routine preoperative preparations. 2. Stability. We measured predictors relatively stable across different testers or patients. 3. Independence. Predictors that could be determined independently by an orthopedic surgeon without consultation with other departments should be preferred. 4. Rapidity. The results of the predictors need to be determined quickly within 24–48 h after admission, without long waits. 5. Quantitative indicators are prioritized over qualitative indicators. 6. Relevance. Select relevant factors that affect the implementation of surgical treatment. 7. Subject matter knowledge. Select the indicators that have been shown to be relevant to survival rates.

Initially, 24 indicators were extracted for each patient to provide detailed sample characteristics, including their general characteristics: medical insurance, age, and sex; disease characteristics: fracture site, fracture type, time from the fracture to hospitalization, time from the hospitalization to surgery, and length of stay (LOS); medical history: diabetes, hypertension (HYP), malignancy (MAL), kidney disease (KD), and lung disease (LD) on admission; the ability to live independently (ALI) before the fracture; cardiovascular and cerebrovascular diseases (CCD); test results (all using the first test value after admission): partial pressure of oxygen (PaO2), fasting blood sugar (BS), serum creatinine (SC), hemoglobin (Hb), total protein (TP), albumin (ALB), and mean arterial pressure (MAP); and treatment: skeletal traction or surgery. To understand the impact of surgery (SUR) on survival, SUR was deliberately used as a predictor, although it was not a preoperative indicator.

Some of the indicators were defined as follows: 1. Medical insurance: This includes employee medical insurance and nonemployee medical insurance. China has achieved full medical insurance coverage since 2011. Nonemployee medical insurance mainly includes new rural cooperative medical insurance, a small number of other types of commercial insurance, and self-paid medical treatment. The type of medication can reflect the economic situation of a patient. 2. Fracture site: including the femoral neck, intertrochanteric, and subtrochanteric. 3. Fracture type: primary and secondary fractures. If there were fragility fractures before this fracture, such as the very common osteoporotic vertebral compression fractures, hip fractures, wrist fractures, and proximal humeral fractures, they were secondary fractures; if not, they were primary fractures. 4. Medical history: whether a patient had been diagnosed with diabetes, HYP, MAL, or KD was obtained from the medical records, which were provided by the patient and/or their family at the time of admission. 5. LD on admission: this can be positive or negative. Various types of pneumonia, tuberculosis, pleural effusion, and/or structural changes, such as pulmonary bullae and fibrosis, are positive but do not include locally stable small lesions, such as stable calcifications. They are determined by chest X-ray or chest CT. Otherwise, they are negative. In particular, this indicator did not include lung tumors, which were classified as MALs. 6. ALI: positive mainly means that the patient could not walk independently or complete the basic activities of daily life without the help of others before the fracture. This could be due to various reasons, including sequelae of cerebrovascular events, severe hip or knee arthritis, Alzheimer's disease, or severe depression. Otherwise, negative. 7. CCD: being positive for this indicator means that the patient had been diagnosed with diseases such as myocardial infarction, cerebral infarction, or cerebral hemorrhage; alternatively, CT and/or MRI examination of the head after admission showed the presence of old local infarct changes. In addition, this also includes intravascular thrombosis as shown by vascular ultrasound examination of the extremities but does not include atherosclerotic plaque formation, stenosis, or other changes. These examinations are not routine examinations after a patient is admitted. If a patient did not undergo these tests and had no history of a diagnosis of the abovementioned cardiovascular and cerebrovascular diseases, they were regarded as negative. 8. PaO2, BS, SC, Hb, TP, and ALB: These six indicators are routine laboratory test items after admission. Generally, specimens are collected at 7 o'clock in the morning on the second working day after admission, and all of the results are reported in the afternoon. 9. MAP: MAP is obtained from the one-third systolic blood pressure plus the two-thirds diastolic blood pressure. This value is obtained from the vital signs’ examination of a patient performed immediately after admission. 10. Skeletal traction: in general, skeletal traction was considered on the day of admission. If patients refused or if it was decided to perform surgery, bone traction was not applied. Almost all patients given this treatment underwent tibial tuberosity traction.

Through analysis of the above predictors, we reduced the number of candidate predictors. We excluded medical insurance, fracture site, fracture type, time from fracture to hospitalization, time from hospitalization to operation, and skeletal traction. Regardless of the results of these indicators, they did not affect the choice of surgical treatment. According to the principle of priority of quantitative indicators, diabetes, kidney disease, selected BS, and SC were excluded. Although MAP is a quantitative variable, it is easily affected by various factors and has poor stability, so we chose HYP instead. There is some overlap between serum TP and ALB. Previous studies [9, 10] have shown that low ALB is a prognostic factor for death after fracture, so TP was excluded. Based on the principle of stability, PaO2 was excluded. Because many elderly patients are often given oxygen immediately after admission, it was difficult to strictly follow the doctor’s instructions to stop oxygen inhalation two hours before measuring the index, making it very unstable across patients. As a result, 12 candidate predictors were retained, including age, sex, MLA, LD, ALI, CCD, SUR, BS, SC, Hb, ALB, and HYP.

Sample size

We did not specifically calculate the required sample size. The reason for this is that the sample size is affected by many factors, and there is no well-known calculation method [11,12,13]. Second, in reality, we cannot easily expand a single-center study into a multicenter trial or arbitrarily extend the research time to increase the sample size.

According to widely accepted empirical guidelines, we aimed to achieve at least 10 events per variable (EPV) or at least a total of 100 events in the sample. Fortunately, many empirical studies can provide guidance as to how to develop predictive models based on a small sample [14, 15].

Missing data

The characteristics of the missing data are shown in Fig. 2.

Fig. 2
figure 2

The characteristics of missing data. Note: Blue means no missing and red means missing. The label above the graph represents candidate predictors, the below represents the missing number of each candidate predictor, the left represents the number of observations with the same missing pattern, and the right represents the number of types with missing predictors. For instance, ALB predictor has 13 missing values. There are four observations missing both ALB and LD

Among the 14 variables (two outcome variables, death (Y) and time, and 12 covariates), there were eight variables with missing values, including SUR (containing nine missing values), BS (9), SC (9), Hb (9), ALB (13), LD (58), Y (11), and time (11). A total of 660 observations had no missing data, and five variable values were missing among five observations.

Whether the missing data came from covariates or outcomes, we used the mice package for single imputation (SI) in R software. Although multiple imputation (MI) could be used, the reasons why we chose SI are as follows. First, we had relatively little missing data. Among all of the variables with missing values, LD had the highest missing rate, which was 7.9%, and the missing rates of the remaining variables were less than 2%. Second, an SI dataset can be easily created from the first of a series of MI datasets, and it avoids complicated combinations over multiple MI datasets [16]. Third, no method has been found to combine LASSO models derived from multiple MI datasets [17]. Finally, empirical research has shown that the estimation of model regression coefficients is very consistent between the SI dataset and the MI stacked dataset [18].

Because our SI dataset came from the first of the MI data series, we also analyzed the missing mechanisms of the data and explained the method of generating the MI data series. The missing data were mainly related to the choice of treatment methods for the patients. When patients and their families were not actively involved or were skeptical about surgical treatment, they often refused any routine preoperative preparations. In this case, several variable values were often missing together. Therefore, missing data were often seen among patients who took conservative treatment. This is a "missing at random" (MAR) situation, and MI can effectively address this problem [19,20,21]. The 12 candidate predictors and the outcome of event and time were all included in the imputation procedure. In this way, eight variables, including LD, SUR, BS, SC, Hb, ALB, Y, and time, were imputed. Among them, the two factor variables, LD and SUR, were imputed using the logistic regression (logreg) method, while the remaining six variables were imputed using the predictive mean matching (pmm) method. No interaction terms were introduced during the MI procedure.

Statistical analysis methods

The distributions of the five continuous variables including age, BS, SC, Hb and ALB of the 12 candidate predictors were checked for extreme values. After excluding input errors, only SC was found to have obvious extreme values. The extreme values of SC were winsorized to avoid excessive leverage effects. This was done by shifting the values above the 99th centile (eight values, and among them, the maximum was 860 μmol/L) to the truncation points (99th centile, 190.75 μmol/L).

To make full use of the information, we did not categorize any continuous variables, and all kept their original scales. For all continuous variables, linear and nonlinear relationships with the outcome were fitted. The nonlinearity was fitted by using restricted cubic splines (RCSs), and 3, 4, and 5 knots were compared for each variable. In particular, we also checked the log transformation for SC. In addition, we plotted the relationship between the fitted variables and the outcome to check the biological rationality. Based on a higher Waldχ2 value but a lower degree of freedom (df) and biological rationality, the final coding of each number variable in the model was determined (Table 1). The optimal coding for each predictor was as follows: age: linear; BS: nonlinear (RCS, 3 knots); SC: linear; Hb: nonlinear (RCS, 3 knots); ALB: linear.

Table 1 Optimal coding exploration for continuous predictors (complete case analysis)

According to the principle of simplicity of modeling, all seven categorical variables of SEX, MLA, LD, ALI, CCD, HYP, and

SUR were coded as binary variables.

Type of model

Because our goal was a 1-year survival rate, a time-to-event outcome, we chose the Cox proportional hazards model.

Predictor selection during modeling

Based on our small sample and empirical studies [14, 15, 22, 23] showing that stepwise selection deteriorates the predictive quality of the model in small datasets, we chose to build a full model that included all 12 candidate predictors. We further refined the full model using the LASSO method, where selection was achieved through shrinking regression coefficients to zero [24, 25]. Therefore, we developed a LASSO model.

Interaction terms

To avoid the type I statistical error of multiple repeated detections of interactions [26], we tested the overall interactions of age with the other remaining predictors [27]; the same test was also performed for sex. If there was statistical significance as a whole, then we introduced the statistically significant interaction terms; otherwise, the possibility was excluded. We also tested the proportional hazards assumption.

Model performance

Overall performance measures of Nagelkerke’s R2 are presented.

The concordance (c) statistic is given as the LASSO model’s discrimination measure, which was further illustrated by dividing the predictions into three groups and plotting the Kaplan–Meier curves of each group.

We did not evaluate the calibration because the “assessment of calibration” makes little sense in the development data, while it is essential for external validation [28].

Internal validation

Then, we assessed the internal validity with a bootstrapping procedure of 1 000 repetitions for a realistic estimate of the performance of the LASSO and full model in similar future patients.



From January 2015 to December 2020, 923 patients with hip fractures were admitted to our hospital, and 735 patients were included in this study. As of the end of the last interview, May 28, 2021, 11 patients were lost to follow-up, and the average follow-up time was 325.6 days per person. The selection process and specific characteristics of the sample are shown in Fig. 3 and Table 2, respectively.

Fig. 3
figure 3

Participant flow diagram

Table 2 Participant characteristics

Model development

By performing SI on the raw data, we finally obtained a no missing value sample with 735 participants and 68 interesting events. The EPV ratio = 68/12 = 5.7. Admittedly, it was a small sample [15]. Based on this sample, all 12 candidate predictors were used to develop a full model. We also examined the unadjusted association between each predictor and the outcome (Table 3).

Table 3 Association between each predictor and outcome from the SI dataset

Model specification

In exploring the optimal coding of five continuous variables, we found that it was suitable for BS and Hb adopting nonlinearity (both RCS, 3 knots), while age, SC, and ALB maintained linearity. However, we all adopted linear coding when modeling because: 1. In the full model coding BS and Hb as nonlinear, the nonlinear Wald statistics had no statistical significance, with P values of 0.834 and 0.088, respectively. Compared with the full linear model (total Chi-square 82.01, 12 df), for the nonlinear model (total Chi-square 85.23, 14 df) the total Wald χ2 value only increased by 3.22 with more the 2 df. 2. “In smaller datasets, we may simply have to rely on the additivity assumption to be reasonable” [29]. Nonlinear coding makes the model complex and difficult to interpret, which violates the principle of simplicity. 3. Using the explained variation by R2, we can compare the effects of different encodings of predictors on the model [28]. Compared with the linear full model (R2 = 0.179), the R2 of the nonlinear full model (R2 = 0.184) only increased by 0.005. In view of the above results, it is reasonable to use nonlinear coding for BS and Hb.

Neither age nor sex had a significant total interaction with the remaining variables; their p values were 0.343 and 0.305, respectively. Therefore, no interaction terms were introduced in the full model. The coefficients of the full model and the LASSO model, as well as the encoding forms of each variable, are given in Table 4. The overall test for the proportional hazards assumption of the full model was not statistically significant (overall χ2 14.1975, 12 df, p = 0.288), with a nonproportionality suggested for LD.

Table 4 Presenting the model, including the baseline survival, for 1-year survival (from the SI dataset)

Based on the LASSO model, we created a monogram so that the readers can easily apply this model to similar patients (Fig. 4 and Table 5).

Fig. 4
figure 4

Nomogram based on the LASSO model. Note: For example, a man (SEX = 1.3) with fragility hip fracture, 80 years old (age = 5.1), no malignant tumor (MLA = 0), self-care ability well (ALI = 0), having cerebral infarction history (CCD = 1.4) and no hypertension (HYP = 0), serum creatinine 100 μmol/L (S C = 2.6), albumin 42 g/L ( ALB = 2.7), and choosing conservative treatment (SUR = 6.4), so the total score = 19.5, the 1-year survival close to 0.8 or 80%. The corresponding 1-year mortality is 20%

Table 5 A simple scoring system for calculating the survival/mortality

Model performance

The c statistic of the final LASSO model was 0.814 (95% CI 0.762–0.865). Discrimination was further illustrated by dividing the predictions into three groups, low, middle and high and plotting the Kaplan–Meier curves of each group (Fig. 5). We found that patients in the high-risk group (lp3) had a considerably worse chance of survival at 1 year after hip fracture: approximately 77%. In addition, we also presented the overall performance measures R2 and the optimism gained through internal validation of 1000 bootstrap repetitions so that readers can understand the predictive effectiveness of the model for similar patients. The same performance measures for the full model are also presented (Table 6).

Fig. 5
figure 5

Kaplan–Meier curves for three risk groups of low, middle, and high

Table 6 Model performance measures



This study has the following limitations. First, the worst shortcoming is the low effective sample size where EPV is less than 6 (5.7 events per variable) and the total interesting events are less than 100. This affects the stability of the LASSO model and usually leads to overfitting and optimism, although we tried to compensate for these shortcomings with the methodology. Second, because the LASSO models derived from each of the MI datasets have not been easily synthesized in an efficient way, the MI was not used for the missing data. This made the estimation of the model fail to fully consider the uncertainty of the inference of missing values. Third, to increase the EPV ratio, the number of candidate predictors was reduced as much as possible. This resulted in other indicators related to survival reported by other studies, such as the time to surgery [30,31,32,33,34,35], time from the trauma to operation [36, 37], the patient financial situation [38], fracture sites [39, 40], and fracture season [41] not being included in the model, not to mention other possible related factors, such as muscle loss [42] and multidisciplinary management [43, 44]. The calibrated R2 (0.158) also confirms that there is still much variability that cannot be explained by the model. Fourth, the design of this study was a retrospective cohort study. Compared with an ideal prospective cohort study, the collection of various values cannot be controlled in advance. This results in the accuracy of the data being somewhat inadequate, thus affecting the credibility of the model prediction. Fifth, the representativeness of the sample is insufficient. Most of the patients in this sample were rural patients (approximately 80% of the sample, but the urban population was approximately 64% in 2021 in China), which might affect the applicability and generalizability of the model. Finally, this study only performed internal validation. Although the results show that the performance of the model is acceptable, in view of the abovementioned shortcomings, external validation is obviously necessary.


Unlike the models of Elliott 2003 [5] and Anita 2009 [4], we avoided using various scoring models as predictors to keep the model simple and easy for clinical use. Different from the Maxwell 2008 [6] model, we avoided converting numerical variables into categorical variables on the basis of ensuring the stability of variable values to improve the utilization of valuable data. We used very common preoperative variables that can be quickly obtained after hospitalization to predict their 1-year survival. Our final model has no nonlinear coding or interaction terms and only contains nine covariates, including age, sex, a history of malignant tumors, self-care ability before fracture, history of cardiovascular and cerebrovascular diseases, history of hypertension, whether to choose surgery, and blood creatinine value and albumin value at admission. Therefore, we have achieved the goal of developing a simple model to guide doctors and patients to rationally choose treatment methods. Admittedly, whether the predictive model is practical needs subsequent clinical feedback, but its simplicity has laid a good foundation for practicality.

This may be the first survival prognosis model for fragile hip fracture developed for Chinese subjects. The multivariate regression analysis of our LASSO model showed that age, serum albumin value at admission, and whether to choose surgery were effective predictors of the 1-year survival rate, consistent with the results of many studies (Fig. 6) [9, 31, 34, 45,46,47]. In the full model without shrinkage of the coefficients, in addition to the above three predictors, sex, history of malignant tumors, and creatinine value at admission were also valid predictors. The prognostic validity of these three variables has been reported by practically all studies [31, 34, 39, 45, 46, 48,49,50]. This difference between the two models stems from the fact that the effect strength between the latter three predictors and the outcome is weaker than that of the first three. The patient's blood pressure, blood sugar, hemoglobin, lung disease, self-care ability before fracture, and history of cardiovascular and cerebrovascular disease were not significantly related to the 1-year survival rate. However, many studies [33, 37, 46, 49, 51,52,53] have shown that lung disease and preoperative walking ability are related to the postoperative mortality of hip fractures. The reasons for this difference may include the following: The study population was different since most of the research subjects were limited to patients treated by surgery; the age range was different, most of whom were patients over 65 years; and the race of the participants was different.

Fig. 6
figure 6

Relative contribution of each predictor to the full LASSO prediction model

The discrimination c value for the final LASSO model was 0.814 (95% CI 0.762–0.865), located in an intermediate accuracy range (0.71–0.90), so the discriminative ability of the model was acceptable. After correcting the optimism through internal validation, the c value was reduced from 0.814 to 0.795, a decrease of 0.019, which is still acceptable. Similar results are also seen in the full model performance evaluation. In contrast to the full model, the LASSO model decreased in optimism (from 0.026 to 0.019). These results prove that it is reasonable to use the LASSO method for model estimation based on the full model under the condition of a small sample. Similar results are also seen in the overall performance measure R2. This paper is only a prediction model development study, so we can only obtain the internal validity (or “reproducibility”) (the discriminative ability index c statistic was 0.795, close to the "good" level) of the model applied to potential populations through internal validation technology [54]. For the more concerned external validity (or “generalizability”/ “transportability”) applied to reasonably related populations, this needs to be obtained from the following external validation researches [55].


The applicable objects of this model are patients with fragile hip fractures. Obviously, it is not applicable to patients with nonfragility fractures. Because this model is simple and clear, in addition to orthopedic surgeons, patients and their family members can also easily grasp the model. The optimal application environment is general hospitals with nonurban populations as the main patients. The next external validation should be a general hospital with an urban population. Limited by sample size, this model does not consider many variables. It is recommended to include more relevant predictors in future model update studies.


Using the SFHF model, doctors can quickly classify patients with fragile hip fractures into low-, medium-, and high-risk patients. On this basis, the treatment plan can be more reasonably determined by the doctors, patients, and their families. When a patient is assessed as high risk, and surgery may not effectively improve the survival rate, conservative treatment may be more beneficial. It is believed that the use of this model will have a positive effect on avoiding doctor–patient conflicts.

Availability of data and materials

The datasets used during the current study are available from the corresponding author on reasonable request.



Least absolute shrinkage and selection operator;


Survival in patients with fragile hip fracture






Serum creatinine






Ability to live independently


Cardiovascular and cerebrovascular diseases


Length of stay

PaO2 :

Partial pressure of oxygen


Total protein


Events per variable




Single imputation


Multiple imputation


Restricted cubic splines


Degrees of freedom


Standard deviation


Confidence interval


  1. Sanz-Reig J, Salvador Marín J, Ferrández Martínez J, et al. Prognostic factors and predictive model for in-hospital mortality following hip fractures in the elderly. Chin J Traumatol. 2018;21:163–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  2. Niessen R, Bihin B, Gourdin M, et al. Prediction of postoperative mortality in elderly patient with hip fractures: a single-centre, retrospective cohort study. BMC Anesthesiol. 2018;18:183.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Jiang HX, Majumdar SR, Dick DA, et al. Development and initial validation of a risk score for predicting in-hospital and 1-year mortality in patients with hip fractures. J Bone Miner Res. 2005;20:494–500.

    Article  PubMed  CAS  Google Scholar 

  4. Söderqvist A, Ekström W, Ponzer S, et al. Prediction of mortality in elderly patients with hip fractures: a two-year prospective study of 1,944 patients. Gerontology. 2009;55:496–504.

    Article  PubMed  Google Scholar 

  5. Elliott J, Beringer T, Kee F, et al. Predicting survival after treatment for fracture of the proximal femur and the effect of delays to surgery. J Clin Epidemiol. 2003;56:788–95.

    Article  PubMed  CAS  Google Scholar 

  6. Maxwell MJ, Moran CG, Moppett IK. Development and validation of a preoperative scoring system to predict 30 day mortality in patients undergoing hip fracture surgery. Br J Anaesth. 2008;101:511–7.

    Article  PubMed  CAS  Google Scholar 

  7. Moppett IK, Parker M, Griffiths R, et al. Nottingham Hip Fracture Score: longitudinal and multi-assessment. Br J Anaesth. 2012;109:546–50.

    Article  PubMed  CAS  Google Scholar 

  8. Moons KGM, Altman DG, Reitsma JB, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162:W1.

    Article  Google Scholar 

  9. Aldebeyan S, Nooh A, Aoude A, et al. Hypoalbuminaemia-a marker of malnutrition and predictor of postoperative complications and mortality after hip fractures. Injury. 2017;48:436–40.

    Article  PubMed  Google Scholar 

  10. Ko Y, Baek SH, Ha YC. Predictive factors associated with mortality in Korean elderly patients with hip fractures. J Orthop Surg (Hong Kong). 2019;27:2309499019847848.

    Article  Google Scholar 

  11. Ogundimu EO, Altman DG, Collins GS. Adequate sample size for developing prediction models is not simply related to events per variable. J Clin Epidemiol. 2016;76:175–82.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Riley RD, Snell KI, Ensor J, et al. Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes. Stat Med. 2019;38:1276–96.

    Article  PubMed  Google Scholar 

  13. Vittinghoff E, McCulloch CE. Relaxing the rule of ten events per variable in logistic and Cox regression. Am J Epidemiol. 2007;165:710–8.

    Article  PubMed  Google Scholar 

  14. Steyerberg EW, Bleeker SE, Moll HA, et al. Internal and external validation of predictive models: a simulation study of bias and precision in small samples. J Clin Epidemiol. 2003;56:441–7.

    Article  PubMed  Google Scholar 

  15. Steyerberg EW, Eijkemans MJC, Harrell Jr FE, et al. Prognostic modelling with logistic regression analysis: a comparison of selection and estimation methods in small data sets. Statistics in MedicineStatistics in MedicineStatistics in Medicine. 2000;19:1059. 

  16. Steyerberg EW. Missing values. Clinical Prediction Models. ,2019.

  17. Schomaker M, Heumann C. Bootstrap inference when using multiple imputation. Stat Med. 2018;37:2252.

    Article  Google Scholar 

  18. Steyerberg EW. Case Study on Survival Analysis: Prediction of Cardiovascular Events. Clinical Prediction Models. ,2019.

  19. Schafer JL, Olsen MK. Multiple Imputation for Multivariate Missing-Data Problems: A Data Analyst’s Perspective. Multivariate Behav Res. 1998;33:545–71.

    Article  PubMed  CAS  Google Scholar 

  20. Sinharay S, Stern HS, Russell D. The use of multiple imputation for the analysis of missing data. Psychol Methods. 2001;6:317.

    Article  PubMed  CAS  Google Scholar 

  21. Schafer JL, Graham JW. Missing data: our view of the state of the art. Psychol Methods. 2002;7:147–77.

    Article  Google Scholar 

  22. Harrell Jr. FE, Lee KL, Califf RM, et al. Regression modelling strategies for improved prognostic prediction. Statistics in MedicineStatistics in MedicineStatistics in Medicine. 1984;3:143.

  23. Steyerberg EW, Eijkemans MJC, Harrell FE Jr, et al. Prognostic modeling with logistic regression analysis: in search of a sensible strategy in small data sets. Med Decis Making. 2001;21:45.

    Article  CAS  Google Scholar 

  24. Statistical learning with sparsity: the lasso and generalizations. 2019. Chapman and Hall/CRC.

  25. Steyerberg EW. Modern estimation methods. Clinical Prediction Models. ,2019.

  26. Bedogni G, TSYBAkOV AB, Berlin S. Clinical prediction models—a practical approach to development, validation and updating. Development. 2009;18:53.

  27. Regression modeling strategies: with applications to linear models, logistic and ordinal regression, and survival analysis. 2015. Springer.

  28. Steyerberg EW. Evaluation of performance. Clinical prediction models. ,2019.

  29. Steyerberg EW. Assumptions in regression models: additivity and linearity. Clinical Prediction Models. ,2019.

  30. Belmont PJ, E’Stephan JG, Romano D, et al. Risk factors for complications and in-hospital mortality following hip fractures: a study using the National Trauma Data Bank. Arch Orthop Trauma Surg. 2014;134:597.

    Article  Google Scholar 

  31. Chow SK, Qin JH, Wong RM, et al. One-year mortality in displaced intracapsular hip fractures and associated risk: a report of Chinese-based fragility fracture registry. J Orthop Surg Res. 2018;13:235.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Griffiths EJ, Cash DJ, Kalra S, et al. Time to surgery and 30-day morbidity and mortality of periprosthetic hip fractures. Injury. 2013;44:1949–52.

    Article  PubMed  CAS  Google Scholar 

  33. Gurger M. Factors impacting 1-year mortality after hip fractures in elderly patients: A retrospective clinical study. Niger J Clin Pract. 2019;22:648–51.

    Article  PubMed  CAS  Google Scholar 

  34. Sarabia-Cobo CM, Sáenz-Jalón M, Vélez-Carrera B, et al. Variables to Predict Mortality in Hip Fractures in Patients Over 65 Years of Age: A Study on the Role of Anticoagulation as a Risk Factor. J Trauma Nurs. 2017;24:326–34.

    Article  PubMed  Google Scholar 

  35. Trpeski S, Kaftandziev I, Kjaev A. The effects of time-to-surgery on mortality in elderly patients following hip fractures. Pril (Makedon Akad Nauk Umet Odd Med Nauki). 2013;34:115–21.

    Google Scholar 

  36. Kim JW, Kim DH, Jang EC, et al. Mortality and its risk factors in nonagenarians after hip fractures. J Orthop Sci. 2019;24:850–4.

    Article  PubMed  Google Scholar 

  37. Pitto RP. The mortality and social prognosis of hip fractures. A prospective multifactorial study Int Orthop. 1994;18:109–13.

    Article  PubMed  CAS  Google Scholar 

  38. Petrelli A, De Luca G, Landriscina T, et al. Effect of Socioeconomic Status on Surgery Waiting Times and Mortality After Hip Fractures in Italy. J Healthc Qual. 2018;40:209–16.

    Article  PubMed  Google Scholar 

  39. Epstein RS. Hip fractures in the elderly. How to reduce morbidity and mortality. Postgrad Med. 1988;84:254–7, 260.

  40. Keene GS, Parker MJ, Pryor GA. Mortality and morbidity after hip fractures. BMJ. 1993;307:1248–50.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Johansen A, Grose C, Havelock W. Hip fractures in the winter - Using the National Hip Fracture Database to examine seasonal variation in incidence and mortality. Injury. 2020;51:1011–4.

    Article  PubMed  Google Scholar 

  42. Chen YP, Kuo YJ, Hung SW, et al. Loss of skeletal muscle mass can be predicted by sarcopenia and reflects poor functional recovery at one year after surgery for geriatric hip fractures. Injury. 2021.

    Article  PubMed  Google Scholar 

  43. Quaranta M, Miranda L, Oliva F, et al. Haemoglobin and transfusions in elderly patients with hip fractures: the effect of a dedicated orthogeriatrician. J Orthop Surg Res. 2021;16:387.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Aletto C, Aicale R, Pezzuti G, et al. Impact of an orthogeriatrician on length of stay of elderly patient with hip fracture. Osteoporos Int. 2020;31:2161–6.

    Article  PubMed  CAS  Google Scholar 

  45. Amrayev S, AbuJazar U, Stucinskas J, et al. Outcomes and mortality after hip fractures treated in Kazakhstan. Hip Int. 2018;28:205–9.

    Article  PubMed  Google Scholar 

  46. Yombi JC, Putineanu DC, Cornu O, et al. Low haemoglobin at admission is associated with mortality after hip fractures in elderly patients. Bone Joint J. 2019;101-B:1122–8.

  47. Oztürk A, Ozkan Y, Akgöz S, et al. The risk factors for mortality in elderly patients with hip fractures: postoperative one-year results. Singapore Med J. 2010;51:137–43.

    PubMed  Google Scholar 

  48. Belmont PJ Jr, Garcia EJ, Romano D, et al. Risk factors for complications and in-hospital mortality following hip fractures: a study using the National Trauma Data Bank. Arch Orthop Trauma Surg. 2014;134:597–604.

    Article  PubMed  Google Scholar 

  49. Chariyalertsak S, Suriyawongpisal P, Thakkinstain A. Mortality after hip fractures in Thailand. Int Orthop. 2001;25:294–7.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  50. Paruk F, Matthews G, Gregson CL, et al. Hip fractures in South Africa: mortality outcomes over 12 months post-fracture. Arch Osteoporos. 2020;15:76.

    Article  PubMed  Google Scholar 

  51. Traven SA, Reeves RA, Althoff AD, et al. New Five-Factor Modified Frailty Index Predicts Morbidity and Mortality in Geriatric Hip Fractures. J Orthop Trauma. 2019;33:319–23.

    Article  PubMed  Google Scholar 

  52. Oztürk I, Toker S, Ertürer E, et al. Analysis of risk factors affecting mortality in elderly patients (aged over 65 years) operated on for hip fractures. Acta Orthop Traumatol Turc. 2008;42:16.

    Article  Google Scholar 

  53. Bombaci H, Erdoğan Ö, Çetinkaya F, et al. Preoperative indicators affecting postoperative mortality in elderly patients with hip fractures. Acta Orthop Traumatol Turc. 2012;46:425–9.

    Article  PubMed  Google Scholar 

  54. Steyerberg E.W. (2019) Evaluation of Clinical Usefulness. In: Clinical Prediction Models. Statistics for Biology and Health. Springer, Cham.

  55. Steyerberg E.W. (2019) Validation of Prediction Models. In: Clinical Prediction Models. Statistics for Biology and Health. Springer, Cham.

Download references


Special thanks to Chu Zheng for statistical assistance and supporting paper writing and publication; also thanks to Luyang Yang for supporting paper writing and publication; and thanks to Jia Zhang, Zhijie Hou, Yanrong Wang, Jinzhen Xia, and Xiang Zhen for their help in data verification and patient follow-up


This research did not receive any specific funding.

Author information

Authors and Affiliations



QL guided all processes of the research. HF was the main person in charge and executor of the research and the manuscript writer. BL was responsible for professional knowledge support for clinically related issues. XQ and WQ were responsible for data screening, collection, verification, and follow-up. All authors read and approved the final manuscript

Corresponding author

Correspondence to Qiang Liu.

Ethics declarations

Ethics approval and consent to participate

This research program was approved by the Medical Ethics Committee of Fenyang Hospital of Shanxi Province. Approved by the ethics committee, this research did not require informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Fu, H., Liang, B., Qin, W. et al. Development of a prognostic model for 1-year survival after fragile hip fracture in Chinese. J Orthop Surg Res 16, 695 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Prognostic model
  • Development
  • Fragile hip fracture
  • Survival