- Research article
- Open access
- Published:
Factor structure of the Oxford Shoulder Score: secondary analyses of the UK FROST and PROFHER trial populations
Journal of Orthopaedic Surgery and Research volume 18, Article number: 846 (2023)
Abstract
Aims
Frozen shoulder and proximal humeral fracture can cause pain, stiffness and loss of function. The impact of these symptoms on patients can be measured using the comprehensively validated, 12-item Oxford Shoulder Score (OSS). Evidence suggests that pain and function may have a differential impact on patients’ experience of shoulder conditions, and this may be important for clinical management. We therefore explored the factor structure of the OSS within the UK FROST and PROFHER trial populations.
Methods
We performed exploratory factor analysis (EFA), followed by confirmatory factor analysis (CFA), on baseline UK FROST data from 490 of the 503 trial participants. Data at 6 months post-randomisation were used for 228 of the 250 participants for the PROFHER trial.
Results
UK FROST factor extraction results, using Velicer's Minimum Average Partial and Horn's Parallel Analysis tests, suggested a unifactorial solution, but two factors were weakly indicated by the less reliable ‘Kaiser’s eigenvalue > 1’ and scree tests. We explored this further using EFA. Eight items (2 to 7, 9 and 10) loaded onto a ‘Function’ factor, three on a ‘Pain’ factor (1, 8 and 12) and item 11 cross-loaded. However, one- and two-factor models were rejected in CFA. Factor extraction of PROFHER data at 6 months demonstrated a single first-order factor solution, which was also subsequently rejected in CFA.
Conclusion
Insufficient evidence was found, within the constraints of the data available, to support the use of ‘Pain’ and ‘Function’ sub-scales of the OSS in either patient population.
Introduction
Primary frozen shoulder (adhesive capsulitis) is a common, idiopathic condition of insidious and spontaneous onset, involving progressively escalating pain, glenohumeral joint stiffness and global functional impairment [1]. It is prevalent in middle age, more common in women and in patients with diabetes mellitus [1, 2]. Symptom resolution and functional recovery can take up to three years and may lead to persistent disability [3, 4].
Proximal humeral fractures are common in adults, particularly the elderly and patients with osteoporotic bone [5]. They can be managed conservatively with immobilisation, or surgically with internal fixation or arthroplasty [6]. Shoulder pain and stiffness may persist beyond fracture union or with non-union.
The International Classification of Functioning, Disability and Health (ICF) description of functional health status includes impairments, participation restrictions and activity limitations [7]. Health-related patient-reported outcome measures (PROMs) that assess these limitations are important for evaluating the effectiveness of interventions for shoulder problems.
The Oxford Shoulder Score (OSS) is a widely used PROM for assessing outcomes of treatments for shoulder conditions and was originally conceptualised to be used as a single summary scale [8]. The 12-item single dimension measure has been validated in a range of patient populations [9,10,11]. Using baseline data from a UK shoulder surgery trial (UKUFF), Dawson and colleagues identified two potentially distinct component scales (pain and function) that could be used as outcomes in practice and research [12]. However, reproducibility in other populations is yet to be established.
The OSS was a patient-centred outcome in two large, UK randomised controlled trials (RCTs): UK FROST compared three interventions for adults with a primary frozen shoulder referred to secondary care [13] and PROFHER compared surgical and non-surgical management of adults with displaced proximal humeral fractures [6]. Our aim was to use these data to examine the dimensional structure of the OSS, to replicate the Dawson and colleagues study, as to whether there are two components of pain and function [12].
Methods
Secondary data analysis—UK FROST
The trial included 503 adults with a frozen shoulder recruited from 35 UK hospitals between April 2015 and December 2017. Eligibility criteria are listed in Additional file 1: Table S1 [13]. Patients were randomly assigned to either Early Structured Physiotherapy, manipulation under anaesthesia or arthroscopic capsular release. All participants provided written informed consent. Ethics approval was obtained from the National Research Ethics Service (NRES Committee North East, 14/NE/1176). Full trial details are reported elsewhere [13]. We used baseline data in the analyses.
Secondary data analysis—PROFHER
The trial included 250 adults with a proximal humeral fracture recruited from 32 UK hospitals between September 2008 and April 2011. Eligibility criteria are listed in Additional file 1: Table S1 [6]. Patients were randomly assigned to either non-operative management (sling immobilisation) or surgery (fracture fixation or humeral head replacement). Ethical approval was obtained from the York Research Ethics Committee (reference number 08/H131/12). Full details of the trial are published elsewhere [6]. Completion of the OSS is based on a recall period of 4 weeks, but as fractures are acute events, it was not possible to collect baseline measurements. We therefore used OSS data collected at 6 months post-randomisation.
Outcome measures
The OSS is a condition-specific measure focusing on pain and functional limitation of the shoulder following shoulder surgery. It comprises 12-items about symptoms over the past four weeks (items 1, 8, 11 and 12 on pain, the remainder on functionality), each with five response options. Responses are summed to create an overall score ranging from 0 (most severe) to 48 (least severe) [14].
Analysis
We used factor analysis (FA) to identify potential latent variables, the possible underlying traits that the OSS is measuring [15]. Exploratory Factor Analysis (EFA) was used to identify factors [16] and Confirmatory Factor Analysis (CFA) was used to establish ‘goodness-of-fit’ of the a priori factor structure models, i.e. whether the models explain the data structure well, generated from the EFA results [16].
Data management, descriptive statistics and EFA were conducted using Stata version 16.0 (StataCorp LLC, 4905 Lakeway Drive, College Station, Texas, USA), IBM SPSS Statistics version 26 (UK Head Office, IBM United Kingdom Limited, North Harbour, Portsmouth, Hampshire, UK) and FACTOR version 9.2 [17, 18] software. CFA was performed using LISREL 10.2 (Student) structural equation modelling software (Copyright by Scientific Software International Inc., 1981–2019, Skokie, IL, USA). Analyses were based on polychoric correlation matrices, instead of Pearson correlation coefficients, because of the lack of normality in the ordinal questionnaire data [19, 20].
EFA
Suitability for factor analysis was tested using Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy (> 0.8) and Bartlett’s test of sphericity (P value for x2 of < 0.01) [21] Principal axis factoring was selected for the extraction method to identify all factors and due to the violation of multivariate normality [22]. A number of common indices were used to determine the appropriate number of factors from the data, which are generally based on the assessment of common variance between items: Velicer’s minimum average partial (MAP) test [23], Horn’s parallel analysis (PA) [24], parallel analysis based on minimum rank factor analysis (PA-MRFA) [25, 26], Hull method [27], scree test [28], and Kaiser’s eigenvalue > 1 [29]. The ‘Common part Accounted For’ measure of fit was used for the Hull method, as this can accompany any factor extraction technique.
We extracted factors using oblique (Promax) rotation to allow correlation between factors, as they cannot be expected to be independent of one another [30]. The association of each OSS item with any identified factor was expressed as pattern matrix loadings. Items with a loading > 0.3 [16] and at least 0.2 greater loading than on other factors to discriminate from them were considered loaded onto a given factor [31]. Internal consistency of the unidimensional and multidimensional OSS, i.e. the degree to which all items measure a common construct, was tested using Cronbach’s alpha [32]; with ≥ 0.7 considered satisfactory internal consistency and > 0.8 reflecting optimal consistency [33].
CFA
Hypothetical factor models developed using EFA were tested for fit using CFA.
UK FROST Models
-
Model 1 tested the single factor solution for all 12 OSS items. Model fit was confirmed by Velicer’s MAP, Horn’s PA, PA-MRFA and Hull method tests. Cronbach’s α: 0.89.
-
Model 2 comprised two first-order factors based on the less reliable Kaiser’s eigenvalue > 1 and scree tests. Items 1, 8 and 12 loaded on the ‘Pain’ factor and the remaining items (including cross-loaded item 11) loaded ‘Function’. Cronbach’s α: 0.66 for ‘Pain’ and 0.90 for ‘Function’.
-
Model 3 replicated Model 2, but with the cross-loaded item 11 loaded on ‘Pain’. Cronbach’s α: 0.71 for ‘Pain’ and 0.88 for ‘Function’.
PROFHER Model
-
Model 1 tested the single factor solution for the 12 OSS items, which was supported by results from all factor extraction methods. Cronbach’s α: 0.95.
Due to lack of data normality, we used the diagonally weighted least squares (DWLS) extraction, based on polychoric correlations and asymptomatic covariances [20]. A chi-square test for model fit was conducted and standardised chi-square values reported for each model, with higher values indicating poorer fit. The following goodness-of-fit indices were considered satisfactory: comparative fit index (CFI) > 0.95, goodness-of-fit index (GFI) > 0.95, standardised root mean square residual (SRMSR) < 0.08 and root mean square error of approximation (RMSEA) < 0.1 [34].
Results
UK FROST
A complete (orthogonal) baseline dataset was available for 490 (97.4%) of the 503 trial participants. Mean age was 54.3 years (SD 8.2) and 36.7% were male (n = 180). Mean OSS total score at baseline was 19.8 (SD 8.2).
EFA
KMO (0.92) and Bartlett’s tests (p < 0.001) confirmed data suitability for factor analysis. A one-factor solution was supported by the four more reliable extraction methods, with a two-factor model solution suggested by the remaining less reliable tests (Table 1; Additional file 1: Fig. S1 for the scree plot).
Using Kaiser’s eigenvalue > 1 rule, eight OSS items (2 to 7, 9 and 10) loaded onto a ‘Function’ factor (eigenvalue 6.39) and three (1, 8 and 12) onto a ‘Pain’ factor (eigenvalue 1.19) (Table 2). Item 11 cross-loaded on both factors.
The ‘Pain’ and ‘Function’ factors were strongly correlated (r = 0.67), accounting for 63.2% of the total variance, with the majority (53.2%) attributed to ‘Function’. The single factor, unidimensional solution accounted for 53.2% of the total variance. The one-factor solution with all 12 items had excellent internal consistency (Cronbach’s α = 0.89), as did the 8-item (Cronbach’s α = 0.88) and 9-item (including item 11) (Cronbach’s α = 0.90) ‘Function’ factors. However, internal consistency for the 3-item ‘Pain’ factor was unsatisfactory (Cronbach’s α = 0.66) and the 4-item version (including item 11) was only borderline acceptable (Cronbach’s α = 0.71).
CFA
CFA models were hypothesised from the preceding EFA results. Model 1 is the one-factor unidimensional solution. Models 2 and 3 are two-factor multidimensional solutions, with cross-loaded item 11 incorporated in either ‘Pain’ or ‘Function’. All three CFA models failed goodness-of-fit tests with large normed chi-square values and were rejected based on CFI and RMSEA (Table 3).
PROFHER
An orthogonal dataset of complete OSS scores at the 6-month follow-up was available for 228 (91.2%) of the 250 patients recruited to the study. Of these 228 participants, mean age was 65.7 years old (SD 11.5) and 76.8% were male (n = 175). Mean OSS total score at 6 months was 34.5 (SD 10.6).
EFA
KMO (0.95) and Bartlett’s tests (P value for x2 of < 0.001) confirmed data suitability for factor analysis. A one-factor solution was confirmed by all extraction methods (Table 4; Additional file 1: Fig. S1 for the scree plot). The single latent factor (eigenvalue 9.15) explained 76.2% of the variance in the data and the unidimensional solution had excellent internal consistency (Cronbach’s α = 0.95).
CFA
Similarly to UK FROST analyses, the single factor model for PROFHER was also rejected in CFA, with an unsatisfactory RMSEA ‘goodness-of-fit’ test result (Table 5). The remainder of fit test results were within acceptable limits.
Discussion
Multidimensional PROMs comprise components with a differential impact on quality of life and may help understand complex effects of interventions and personalised clinical management. We therefore explored the OSS factor structure for two common shoulder conditions. Results using UK FROST data suggested a two-factor, multidimensional solution, but our hypothetical factor structure models could not confirm this. Analyses using PROFHER data indicated a unidimensional model, but also failed fit testing. There was, therefore, no evidence within the constraints of the data available to support the use of OSS component scales in adults with either primary frozen shoulder or proximal humeral fracture.
Dawson et al. [12] identified pain and function components in the OSS using data from a UK trial of surgical repair of degenerative rotator cuff tears. However, these findings may not be generalisable to the wider population of patients undergoing shoulder surgery that we examined in our study. Dawson et al. [12] and our UK FROST analyses show a consistent pattern of results. Two-factor models tested in both the UKUFF and UK FROST analyses were based on the less reliable scree and eigenvalue extraction methods, with the UKUFF 2-factor solution also supported by Horn’s PA [35]. In each study, similar items loaded on the pain and function scales, with item 11 cross-loaded on both factors. However, our two-factor solution did not strongly discriminate between the closely linked constructs of pain and function. The differential impact of post-surgical pain and function may be more salient for patients with rotator cuff problems compared to those with other conditions, such as frozen shoulder or fractures.
The main difference between the previous study and our analyses is the lack of model fit. Although polychoric matrices were used to address the ordinal OSS data, the two extracted factors were highly correlated and included a cross-loaded item, constituting a weak structure. In addition, the ‘Pain’ factor only included four items and accounted for less than 10% of the overall variance. Overall, factor loadings in our study were slightly weaker, and the sample size 24% smaller, than in the UKUFF analyses. While a strong factor structure may be resistant to sample size variation, the weak two-factor solution from UK FROST may require a larger sample size to achieve model fit [36]. Of note, the proximal humeral fracture cohort contained substantially fewer patients than in the UKUFF analysis which may have influenced model fit. The limited sample size of these secondary data precluded separate sampling for the exploratory and confirmatory analyses, thus some overestimation of fit statistics resulting from using a single source cannot be ruled out.
Participants recruited to UKUFF had symptomatic, degenerative, full-thickness rotator cuff tears, considered suitable for surgery. Eligibility for elective cuff repair (and post-operative immobilisation) means that the UKUFF population may have not included patients with stiffness, as they are generally not listed for surgery but instead receive physiotherapy at that stage. Thus, the UKUFF cohort may have had relatively good pre-operative shoulder movement and functionality at baseline, as evidenced with a mean OSS of 25 compared to 20 in UK FROST. In contrast, UK FROST recruited patients with significantly limited passive external rotation and therefore, by definition, stiff shoulders. Responses to ‘Function’ questions in UK FROST may therefore have been less variable compared to UKUFF. This more homogenous sample may have obscured the distinction between pain and function constructs in the frozen shoulder population.
This paper presents a thorough and rigorous analysis of the baseline UK FROST data using robust methodology. UK FROST recruited over five hundred patients from 35 hospitals in the UK. We are confident that, while it is a marginally smaller sample compared with UKUFF, it should be a representative sample of adults with a frozen shoulder and a sufficiently large dataset with which to perform the analyses.
The secondary analyses from the PROFHER trial have potential limitations. Use of non-baseline study data may interfere with the validity of the factor structures. Data at 6 months was the earliest assessment available because the recall of questions is over the preceding four weeks, which includes pre-fracture status at baseline. This limitation is evidenced with patients at 6 months post-randomisation having relatively ‘healthy’ index shoulders (mean OSS of 34.5), due to the length of time since being treated. The unifactorial solution demonstrated in the PROFHER factor extraction fits with these observations. Further analyses based on item-response theory (Rasch analysis [37]) and using larger data sets may further evaluate the unidimensionality of the OSS.
Conclusion
We were unable to replicate previous evidence of pain and function component scales in the OSS in either patient population. Although loading patterns were similar using data from frozen shoulder patients (UK FROST) and for those with a rotator cuff tear (UKUFF), our models could not confirm this. There is therefore insufficient evidence, within the constraints of the data available, to recommend the use of OSS component scales in frozen shoulder or proximal humeral fracture populations. Instead, within these settings, we would recommend using the PROM as a single summary scale as it was originally conceptualised.
Availability of data and materials
Supporting data is available.
References
Fields BK, Skalski MR, Patel DB, White EA, Tomasian A, Gross JS, et al. Adhesive capsulitis: review of imaging findings, pathophysiology, clinical presentation, and treatment options. Skelet Radiol. 2019;48:1–14.
Whelton C, Peach C. Review of diabetic frozen shoulder. Eur J Orthop Surg Traumatol. 2018;28(3):363–71.
Hand C, Clipsham K, Rees JL, Carr AJ. Long-term outcome of frozen shoulder. J Shoulder Elbow Surg. 2008;17(2):231–6.
Vastamäki H, Kettunen J, Vastamäki M. The natural history of idiopathic frozen shoulder: a 2-to 27-year followup study. Clin Orthopaed Relat Res®. 2012;470(4):1133–43.
Kim SH, Szabo RM, Marder RA. Epidemiology of humerus fractures in the United States: nationwide emergency department sample, 2008. Arthritis Care Res. 2012;64(3):407–14.
Rangan A, Handoll H, Brealey S, Jefferson L, Keding A, Martin BC, et al. Surgical vs nonsurgical treatment of adults with displaced fractures of the proximal humerus: the PROFHER randomized clinical trial. JAMA. 2015;313(10):1037–47.
Stucki G, Kostanjsek N, Ustün B, Cieza A. ICF-based classification and measurement of functioning. Eur J Phys Rehabil Med. 2008;44(3):315–28.
Dawson J, Fitzpatrick R, Carr A. Questionnaire on the perceptions of patients about shoulder surgery. J Bone Joint Surg Br Vol. 1996;78(4):593–600.
Dawson J, Hill G, Fitzpatrick R, Carr A. The benefits of using patient-based methods of assessment: medium-term results of an observational study of shoulder surgery. J Bone Joint Surg Brit Vol. 2001;83(6):877–82.
Olley L, Carr A. The use of a patient-based questionnaire (the Oxford Shoulder Score) to assess outcome after rotator cuff repair. Ann R Coll Surg Engl. 2008;90(4):326–31.
Dawson J, Hill G, Fitzpatrick R, Carr A. Comparison of clinical and patient-based measures to assess medium-term outcomes following shoulder surgery for disorders of the rotator cuff. Arthritis Care Res. 2002;47(5):513–9.
Dawson J, Harris KK, Doll H, Fitzpatrick R, Carr A. A comparison of the Oxford shoulder score and shoulder pain and disability index: factor structure in the context of a large randomized controlled trial. Pat Relat Outcome Measur. 2016;7:195.
Rangan A, Brealey SD, Keding A, Corbacho B, Northgraves M, Kottam L, et al. Management of adults with primary frozen shoulder in secondary care (UK FROST): a multicentre, pragmatic, three-arm, superiority randomised clinical trial. Lancet. 2020;396(10256):977–89.
Dawson J, Rogers K, Fitzpatrick R, Carr A. The Oxford shoulder score revisited. Arch Orthop Trauma Surg. 2009;129(1):119–23.
Child D. The essentials of factor analysis. A&C Black; 2006.
Kline P. An easy guide to factor analysis. Psychology Press; 1994.
Lorenzo-Seva U, Ferrando PJ. FACTOR: a computer program to fit the exploratory factor analysis model. Behav Res Methods. 2006;38(1):88–91.
Lorenzo-Seva U, Ferrando PJ. FACTOR 9.2: a comprehensive program for fitting exploratory and semiconfirmatory factor analysis and IRT models. Appl Psychol Measur. 2013;37(6):497–8.
Olsson U. Maximum likelihood estimation of the polychoric correlation coefficient. Psychometrika. 1979;44(4):443–60.
Flora DB, Curran PJ. An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychol Methods. 2004;9(4):466.
Tabachnick BG, Fidell LS, Ullman JB. Using multivariate statistics: Pearson Boston, MA; 2007.
Costello AB, Osborne J. Best practices in exploratory factor analysis: four recommendations for getting the most from your analysis. Pract Assess Res Eval. 2005;10(1):7.
Velicer WF. Determining the number of components from the matrix of partial correlations. Psychometrika. 1976;41(3):321–7.
Horn JL. A rationale and test for the number of factors in factor analysis. Psychometrika. 1965;30(2):179–85.
Shapiro A, Ten Berge JM. Statistical inference of minimum rank factor analysis. Psychometrika. 2002;67(1):79–94.
Ten Berge JM, Kiers HA. A numerical approach to the approximate and the exact minimum rank of a covariance matrix. Psychometrika. 1991;56(2):309–15.
Lorenzo-Seva U, Timmerman ME, Kiers HA. The Hull method for selecting the number of common factors. Multivar Behav Res. 2011;46(2):340–64.
Cattell RB. The scree test for the number of factors. Multivar Behav Res. 1966;1(2):245–76.
Kaiser HF. The application of electronic computers to factor analysis. Educ Psychol Measur. 1960;20(1):141–51.
Fabrigar LR, Wegener DT, MacCallum RC, Strahan EJ. Evaluating the use of exploratory factor analysis in psychological research. Psychol Methods. 1999;4(3):272.
Harris K, Dawson J, Doll H, Field RE, Murray DW, Fitzpatrick R, et al. Can pain and function be distinguished in the Oxford Knee Score in a meaningful way? An exploratory and confirmatory factor analysis. Qual Life Res. 2013;22(9):2561–8.
Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16(3):297–334.
Taber KS. The use of Cronbach’s alpha when developing and reporting research instruments in science education. Res Sci Educ. 2018;48(6):1273–96.
Browne MW, Cudeck R. Alternative ways of assessing model fit. In: Bollen KA, Long JS, editors. Testing structural equation models. Beverly Hills, CA: Sage; 1993. p. 111–35.
Courtney M, Gordon R. Determining the number of factors to retain in EFA: using the SPSS R-menu v2 0 to make more judicious estimations. Pract Assess Res Eval. 2013;18(1):8.
Kyriazos TA. Applied psychometrics: sample size and sample power considerations in factor analysis (EFA, CFA) and SEM in general. Psychology. 2018;9(08):2207.
Fischer GH, Molenaar IW, editors. Rasch models: foundations, recent developments, and applications. New York: Springer; 1995.
Acknowledgements
Foremost, we thank the patients participating in these trials; without this, the article would have not been possible. We also thank the teams at the participating hospitals for their commitment to the trials and the collection of data which are included in this study.
Funding
The data used for this article were funded by the National Institute for Health Research (NIHR) Health Technology Assessment (HTA) programme (project 06/404/53 and 13/26/01). The views expressed are those of the authors and do not necessarily reflect those of the HTA programme, NIHR, or the Department of Health and Social Care. In addition, we would like to thank Health Education Yorkshire and Humber and the Yorkshire and Humber Foundation School for Jonathan Simpson’s 4-month, salaried Academic Foundation Year 2 placement.
Author information
Authors and Affiliations
Contributions
JS contributed to formal analysis, investigation, writing—original draft, and writing—review and editing. AK and SS contributed to conceptualisation, methodology, writing—review and editing, and supervision. SB and AR conceptualisation, methodology, writing—review and editing, supervision and funding acquisition.
Corresponding author
Ethics declarations
Ethical approval and consent to participate
All patients within the trials provided written, informed consent to participate in the studies and for their data to be used for research purposes. Ethical approval was obtained for both trials and their analyses.
Consent for publication
As above, the written consent included for the dissemination of results.
Competing interests
Professor Amar Rangan reports grants from De Puy Ltd and JRI Ltd; both are outside the submitted work. None of these influenced the trials or this article. No further conflicts of interest are declared.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1: Table S1
. Eligibility criteria for UK FROST. Table S2. Eligibility criteria for PROFHER. Fig. S1. Factor extraction scree plot for baseline UK FROST data. Fig. S2. Factor extraction scree plot for PROFHER data at 6 months.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Simpson, J., Keding, A., Spencer, S. et al. Factor structure of the Oxford Shoulder Score: secondary analyses of the UK FROST and PROFHER trial populations. J Orthop Surg Res 18, 846 (2023). https://doi.org/10.1186/s13018-023-04319-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13018-023-04319-x