The Oxford Shoulder Instability Score; validation in Dutch and first-time assessment of its smallest detectable change
© van der Linde et al. 2015
Received: 6 July 2015
Accepted: 2 September 2015
Published: 17 September 2015
The Oxford Shoulder Instability Score (OSIS) is a short, self-reported outcome measurement for patients with shoulder instability.
In this study, the OSIS was validated in Dutch by testing the internal consistency, reliability, measurement error, validity and the floor and ceiling effects, and its smallest detectable change (SDC) was calculated.
A total of 138 patients were included. Internal consistency was calculated with Cronbach’s α. Reliability (test-retest) was calculated with the intraclass correlation coefficient (ICC). Measurement error was expressed as the standard error of measurement (SEM), and the SDC was estimated in a subgroup of 99 patients who completed the re-test after a mean of 13 days (range 5–30 days). Construct validity was evaluated by comparing the OSIS with the Western Ontario Shoulder Instability index (WOSI), the Simple Shoulder Test (SST), the Oxford Shoulder Score (OSS), the Disability of the Arm, Shoulder, and Hand assessment (DASH), and the Short Form-36 (SF-36).
Internal consistency was good, with a Cronbach’s α of 0.88. The reliability was excellent, with an ICC of 0.87. The SEM was 3.3 and the SDC was 9 points (on a scale of 0–48). Regarding the construct validity, 80 % of the results were in accordance with the hypotheses, including a high correlation (0.82) with the WOSI. No floor or ceiling effects were found.
The Dutch version of the OSIS showed good reliability and validity in a cohort of patients with shoulder instability.
Keywords: Validation; Oxford Shoulder Instability Score; Dutch
Research and evaluation of therapies for shoulder instability should focus both on objectively verifiable outcomes, such as the range of motion and re-dislocations, and on subjective functioning. A variety of patient-reported outcome measures (PROMs) exist, some of which are specifically designed to reflect the patient’s subjective assessment of function. They enable the practitioner to detect functional changes in a standardized format. Because patients and doctors do not always agree on functional outcome after therapeutic interventions, PROMs have become increasingly important in assessing patient health status. They can focus on general health; a physical domain or body part, such as the shoulder; or a specific condition or disease, such as instability [5–7].
The Oxford Shoulder Instability Score (OSIS) is a comprehensive questionnaire including 12 questions to assess shoulder instability. With a Cronbach’s α of 0.92, a Pearson correlation coefficient of 0.97 and a measurement error of 5.7, the original OSIS was shown to be valid and reliable, which makes it clinically relevant for patients with shoulder instability. The OSIS has been a useful outcome measure in several clinical studies [9–11], but it has not been translated and validated in languages other than English.
Translation and validation of internationally used PROMs will lead to culturally equivalent instruments and allow direct comparisons of national and international study results [12–14]. The aim of this study was to translate and validate the OSIS for the Dutch population and to evaluate its measurement properties according to the Consensus-based Standards for the selection of health Measurement Instruments (COSMIN) guidelines.
After we obtained the official licence for the original English version, the OSIS was independently translated into Dutch by three native Dutch-speaking, medically educated translators. When they reached a consensus, a professional translator and a native English speaker (without a medical background) independently translated the version back into English; both were blinded to the original version and focused specifically on the linguistic aspects. Finally, the back-translated version was compared to the original text to compose a pre-final version. All items were agreed to be relevant for this patient population, and taken together, the items represented a comprehensive measurement of shoulder instability.
The pre-final version was checked for cross-cultural differences. It was subsequently completed by 13 patients with shoulder instability, who were asked independently to assess the comprehensibility of all questions. These patients were not included in our final analysis.
Patients and procedures
To assess the reliability and validity of the OSIS in the Dutch population, 154 patients with shoulder instability were recruited. Institutional approval was obtained from the local ethics committee; Institutional Review Board (IRB): METC, OLVG Hospital, Amsterdam, The Netherlands. Written informed consent was obtained from all participants.
A total of 154 patients with shoulder instability were included; all were diagnosed by one of the doctors in the outpatient clinic or the emergency department.
Patients were eligible to participate when they were 16 years or older and had been diagnosed with shoulder instability, based on their history and clinical examination. All patients were included at the emergency department or the outpatient department of a hospital in Amsterdam. Exclusion criteria were an inability to master the Dutch language, a fracture of the glenoid, or a fracture of the humeral head. Patients with Hill-Sachs lesions or bony Bankart lesions were included. Tourists and temporary inhabitants of Amsterdam who were followed up in another clinic were also excluded, to avoid burdening patients with double follow-up.
All patients were assigned a study number and received either a web-based questionnaire, or alternatively, an identical paper questionnaire to complete at home. The order of administration was fixed. The web-based version required answers to all questions prior to submission. Missing values in paper submissions were completed in an interview by telephone.
Patients were asked to complete the questionnaire twice, without intervention between the two administrations. Both times, the questionnaire was either web-based or on paper. The repeated questionnaire was completed after an interval of 5 to 30 days; this interval was considered long enough to forget prior answers, and short enough to assume an unchanged shoulder condition [17, 18].
Oxford Shoulder Instability Score
The OSIS is a disease-specific PROM that was developed by Dawson et al. in 1999 in the UK for assessing the outcome of treatment for shoulder instability.
This 12-item questionnaire contains five response categories for each question. In the original scoring system, answers were scored from 1 to 5 points and summed to a total score that ranged from 12 (least impaired) to 60 (most impaired). The scoring system was revised in 2009, in accordance with the revised scoring for the Oxford Shoulder Score (OSS), which originated in the same institute. In the revised scoring system, answers are scored from 0 to 4, and the score is reversed; thus, the total score ranges from 0 (most impaired) to 48 (least impaired). We present the results in terms of the new scoring system.
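The relation between the two scoring systems can be illustrated with a short sketch. It assumes that the revision is a simple item-level reversal (an original item score of 1 becomes 4, and 5 becomes 0), which is what the reported total ranges imply; the function names are hypothetical and are not part of any official scoring tool.

```python
def osis_item_to_revised(original_item: int) -> int:
    """Map an original item score (1 = best, 5 = worst) to the revised scale (0 = worst, 4 = best).

    Assumption: the revision is a plain reversal, as implied by the total ranges
    12-60 (original) and 0-48 (revised).
    """
    if not 1 <= original_item <= 5:
        raise ValueError("original item scores must lie between 1 and 5")
    return 5 - original_item


def osis_total_revised(original_items: list[int]) -> int:
    """Sum the 12 reversed item scores into a revised total (0 = most impaired, 48 = least impaired)."""
    if len(original_items) != 12:
        raise ValueError("the OSIS has 12 items")
    return sum(osis_item_to_revised(item) for item in original_items)


# Under this assumption, an original total T corresponds to a revised total of 60 - T.
best_possible = osis_total_revised([1] * 12)   # original total 12
worst_possible = osis_total_revised([5] * 12)  # original total 60
```

Under this assumption, scores reported with the original system can be compared with revised scores by subtracting the original total from 60.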
The OSIS was originally validated in 92 patients with shoulder instability against the Rowe and Constant scores, with correlations of 0.51 and 0.56, respectively. The internal consistency (Cronbach’s α) was 0.92. The reliability was 0.97, calculated with a Pearson correlation coefficient. The measurement error was 5.7 points, calculated with the Bland and Altman method. No intraclass correlation coefficient (ICC) was calculated. To date, no cross-cultural validation has been conducted.
The following instruments were used solely to assess the construct validity of the OSIS. No other data from these additional questionnaires were used. All instruments have been validated in Dutch, with good to excellent reliability and internal consistency [17, 20–23].
Western Ontario Shoulder Instability index (WOSI)
The WOSI is a disease-specific PROM for assessing the outcome of treatment for shoulder instability [24, 25]. Responses to the 21-item questionnaire were summarized in a total score, ranging from 0 or 0 % (no limitations) to 2100 or 100 % (extreme limitations).
Simple Shoulder Test (SST)
The SST is a body-part-specific PROM. It was designed to measure functional limitations of patients with general shoulder complaints. A cumulative score is calculated based on 12 questions (yes/no) and ranges from 0 (poor) to 12 (excellent shoulder function). It was validated against the American Shoulder and Elbow Surgeons (ASES) survey with a correlation of 0.81.
Oxford Shoulder Score (OSS)
The OSS is a body-part-specific PROM. It was developed and validated for patients with general shoulder complaints. Responses to the 12-item questionnaire were originally summed to a total score that ranged from 12 (least impaired) to 60 (most impaired). This scoring system was revised in 2009. Currently, answers are scored from 0 to 4, and the summary is reversed; thus, the total score ranges from 0 (most impaired) to 48 (least impaired).
The OSS was originally validated against the Constant shoulder score and the SF-36 subscales. Since then, it has been validated in Danish, Korean, Turkish, Italian, German, and Dutch [22, 33–37].
Disability of the Arm, Shoulder, and Hand (DASH) assessment
The DASH assessment is a body-part-specific PROM designed to measure physical function and symptoms in patients with musculoskeletal disorders from any condition in any joint in the upper extremity.
Responses to the 30-item questionnaire are used to calculate the total score by averaging the item scores, subtracting 1, and multiplying the result by 25. The resulting score ranged from 0 (no disability) to 100 (extreme disability).
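The DASH calculation described above can be sketched as follows. The sketch assumes that each item is scored from 1 to 5, which is what the stated 0 to 100 range implies, and that all 30 items were answered; the function name is hypothetical, and no rule for missing items is implied.

```python
def dash_score(item_scores: list[float]) -> float:
    """Average the item scores, subtract 1, and multiply by 25 (0 = no disability, 100 = extreme)."""
    if not item_scores:
        raise ValueError("at least one item score is required")
    if any(not 1 <= score <= 5 for score in item_scores):
        # Assumption: items are scored 1-5, as implied by the 0-100 range of the total.
        raise ValueError("item scores are assumed to lie between 1 and 5")
    mean_item_score = sum(item_scores) / len(item_scores)
    return (mean_item_score - 1) * 25


no_disability = dash_score([1] * 30)       # lowest possible answers
extreme_disability = dash_score([5] * 30)  # highest possible answers
```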
Short form 36 Health Survey, version 1 (SF-36)
The SF-36 is a general health PROM that includes 36 questions for assessing the general health of patients with all kinds of disorders. It is the most widely used PROM for assessing general health. It includes eight domains: physical function, social function, role limitations caused by physical problems (role physical), role limitations caused by emotional problems (role emotional), general mental health, vitality, bodily pain and perception of general health. Each domain has a total score of 0 (extremely poor) to 100 points (no complaint).
The SF-36 was translated and validated in a Dutch general population, with a mean alpha coefficient across all scales and samples of 0.84. Previous studies have also validated the SF-36 specifically for shoulder complaints [43, 44].
Assessments of measurement properties
Internal consistency and factor analysis
Internal consistency indicates the extent to which different items within one questionnaire measure the same construct of interest (e.g. shoulder instability). Ideally, this score is high, indicating that all items measure the same construct. The internal consistency of the OSIS was assessed by calculating Cronbach’s α. For acceptable internal consistency, the Cronbach’s α should preferably be ≥0.7.
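As an illustration of the statistic, Cronbach’s α can be computed from the item variances and the variance of the total score using the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of totals). The sketch below is not the SPSS procedure used in this study, the function name is hypothetical, and the example scores are invented for illustration only.

```python
from statistics import variance


def cronbach_alpha(item_scores: list[list[float]]) -> float:
    """Cronbach's alpha for a questionnaire.

    item_scores holds one list per item; each list contains that item's
    score for every patient, in the same patient order.
    """
    k = len(item_scores)
    if k < 2:
        raise ValueError("at least two items are required")
    # Sum of the sample variances of the individual items.
    sum_item_variances = sum(variance(item) for item in item_scores)
    # Sample variance of each patient's total score across all items.
    totals = [sum(patient_answers) for patient_answers in zip(*item_scores)]
    return k / (k - 1) * (1 - sum_item_variances / variance(totals))


# Invented two-item example with four patients.
alpha = cronbach_alpha([[1, 2, 3, 4], [2, 1, 4, 3]])
```

For the invented example, the item variances sum to 10/3 and the variance of the totals is 16/3, so α = 2 × (1 − 0.625) = 0.75.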
Internal consistency can also be addressed using confirmatory factor analysis. See Additional file 1: Appendix 1.
Measurement error is the systematic and random error in a score that cannot be attributed to true changes in the patient’s condition. When a score changes within the range of measurement error, it is not clear whether the change is a true effect of therapy or whether it should be attributed to measurement error.
Measurement error can be expressed as the standard deviation of repeated measurements in a single patient, referred to as the standard error of measurement (SEM). The SEM was calculated as the square root of the sum of the variance between the measurements and the error variance, both taken from the ICC model. Subsequently, the SEM can be transformed into the smallest detectable change (SDC = 1.96 × √2 × SEM). The SDC represents the minimal change that a patient must show to ensure that the observed change is real, and not a measurement error. The SDC is thus calculated; it is not derived from clinical observations following treatment.
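The transformation from SEM to SDC is a single multiplication, sketched below with a hypothetical function name. Using the rounded SEM of 3.3 reported in this study gives approximately 9.1 points; the reported SDC of 9.0 presumably reflects the unrounded SEM, which is an assumption on our part.

```python
import math


def smallest_detectable_change(sem: float, z: float = 1.96) -> float:
    """SDC = z * sqrt(2) * SEM, where z = 1.96 corresponds to a 95 % confidence level."""
    return z * math.sqrt(2) * sem


sdc = smallest_detectable_change(3.3)  # rounded SEM from this study; roughly 9.1
share_of_scale = sdc / 48              # fraction of the 0-48 OSIS range
```

The factor √2 reflects that a change score is the difference between two measurements, each of which carries its own measurement error.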
Since each instrument has a degree of uncertainty due to measurement error, reliability is defined as the degree to which the measurement is free from measurement error. Reliability refers to the proportion of the total variance in the measurements that can be attributed to true differences between patients. Reliability was assessed by calculating the ICC with a two-way, mixed-effects model for absolute agreement. The mixed-effects model was used because a ‘fixed’ factor (all questions remained unchanged throughout the study) is combined with a ‘random’ factor (a cohort of patients was selected from all patients with shoulder instability). Scores ≥0.70 are considered adequate.
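A minimal sketch of how an agreement ICC and the corresponding SEM can be derived from test and re-test scores is given below. It uses the standard two-way ANOVA mean squares and assumes the single-measurement form of the agreement ICC; whether SPSS was run with single or average measures is not stated here, so that choice, the function name and the invented example scores are all assumptions.

```python
import math


def icc_agreement(scores: list[list[float]]) -> tuple[float, float]:
    """Return (ICC for absolute agreement, SEM for agreement).

    scores holds one row per patient and one column per administration
    (e.g. test and re-test).
    """
    n, k = len(scores), len(scores[0])
    grand_mean = sum(sum(row) for row in scores) / (n * k)
    patient_means = [sum(row) / k for row in scores]
    occasion_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares: patients, occasions, residual error.
    ss_patients = k * sum((m - grand_mean) ** 2 for m in patient_means)
    ss_occasions = n * sum((m - grand_mean) ** 2 for m in occasion_means)
    ss_total = sum((x - grand_mean) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_patients - ss_occasions

    ms_patients = ss_patients / (n - 1)
    ms_occasions = ss_occasions / (k - 1)
    ms_error = max(ss_error / ((n - 1) * (k - 1)), 0.0)  # guard against rounding below zero

    icc = (ms_patients - ms_error) / (
        ms_patients + (k - 1) * ms_error + k * (ms_occasions - ms_error) / n
    )
    # SEM = sqrt(variance between measurements + error variance).
    var_between_measurements = max((ms_occasions - ms_error) / n, 0.0)
    sem = math.sqrt(var_between_measurements + ms_error)
    return icc, sem


# Invented example: three patients, re-test scores systematically 2 points higher.
icc, sem = icc_agreement([[10, 12], [20, 22], [30, 32]])
```

In the invented example the patients are ranked identically on both occasions, yet the systematic 2-point shift lowers the agreement ICC to 200/204 and gives an SEM of √2; this is why an agreement model penalises systematic differences between administrations.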
Pre-determined hypotheses for testing the validity of the Dutch version of OSIS; expected correlations
1. OSIS and WOSI: ≥0.7
2. OSIS and SST: ≥0.6
3. OSIS and OSS: ≥0.6
4. OSIS and DASH: ≥0.6
5. Correlation between OSIS and body-part-specific PROMs (SST, OSS, and DASH) should be at least 0.1 higher than that between OSIS and the generic SF-36 subscales
6. Correlation between OSIS and SF-36 physical function scale should be at least 0.1 higher than the correlations between OSIS and the other SF-36 subscales
The highest correlation (≥0.7) was expected between the two disease-specific PROMs (OSIS and WOSI). High correlations (≥0.6) were expected between similar body-part-specific PROMs (OSIS and SST, OSS, and DASH). These correlation coefficients were expected to be at least 0.1 higher than the correlations between the OSIS and the more general subscales of the SF-36. Finally, because the OSIS predominantly measured physical function, we expected the correlation between the OSIS and the SF-36 physical function to be at least 0.1 higher than the correlations between the OSIS and the other SF-36 subscales.
Construct validity was considered good when at least 75 % of the results (correlations) were in accordance with our hypotheses.
Floor and ceiling effects
Floor and ceiling effects occur when more than 15 % of patients achieve the lowest or highest possible score, respectively. Moreover, when a patient scores close to one of the extremes at baseline, a real change (defined as the SDC) could cross that extreme. Patients who score within the SDC range of one of the extremes can thus also be regarded as being at their floor or ceiling.
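Both checks can be sketched in a few lines. The function name is hypothetical, the example scores are invented for illustration only, and "within the SDC range" is interpreted here as lying less than one SDC away from an extreme, which is an assumption.

```python
def floor_ceiling_summary(
    scores: list[float],
    min_score: float = 0,
    max_score: float = 48,
    sdc: float = 9.0,
) -> dict[str, float]:
    """Percentage of patients at each extreme and within one SDC of each extreme."""
    n = len(scores)
    if n == 0:
        raise ValueError("at least one score is required")

    def pct(count: int) -> float:
        return 100 * count / n

    return {
        "at_floor": pct(sum(s == min_score for s in scores)),
        "at_ceiling": pct(sum(s == max_score for s in scores)),
        # Assumption: "within the SDC range" means less than one SDC from the extreme.
        "within_sdc_of_floor": pct(sum(s - min_score < sdc for s in scores)),
        "within_sdc_of_ceiling": pct(sum(max_score - s < sdc for s in scores)),
    }


# Invented OSIS scores on the revised 0-48 scale.
summary = floor_ceiling_summary([5, 12, 20, 30, 36, 40, 44, 48, 25, 18])
effect_present = {name: value > 15 for name, value in summary.items()}
```

Each percentage can then be compared with the 15 % cut-off; in the invented example only the share of patients within one SDC of the ceiling (30 %) exceeds it.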
Statistical analyses were performed with SPSS software, version 18.0.0 (SPSS, Gorinchem, The Netherlands).
No major differences occurred between the OSIS translations into Dutch and back into English, and no content- or linguistic-related difficulties were reported. The final version was considered free of cross-cultural inconsistencies; all questions were applicable to the Dutch population.
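Demographic data of patients completing baseline and the reliability cohort
Characteristic | Baseline | Reliability cohort
Mean age, year (SD) | [value missing] | [value missing]
[row label missing] | 98 (71 %) | 66 (66 %)
[row label missing] | 72 (53 %) | 54 (55 %)
[row label missing] | 59 (43 %) | 40 (40 %)
[row label missing] | 6 (4 %) | 5 (5 %)
Dominant side dislocated | 73 (53 %) | 53 (54 %)
Time first dislocation to completion OSIS | |
  [row label missing] | 8 (6 %) | 8 (8 %)
  [row label missing] | 21 (15 %) | 17 (17 %)
  >6 months–2 years | 40 (29 %) | 25 (25 %)
  >2 years | 67 (49 %) | 49 (50 %)
Sports-related traumatic instability | 71 (54 %) | 47 (47 %)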
Internal consistency and factor analysis
For all 138 patients who completed the OSIS at baseline, the Cronbach’s α was 0.88, indicating good internal consistency.
Test-retest reliability (ICC with 95 % CI), standard error of measurement (SEM) and smallest detectable change (SDC) for the OSIS, new scoring system
The SEM was 3.3, which resulted in an SDC of 9.0 points, indicating that a patient has to show a change of at least 9.0 points to ensure the detection of a true change. This is 19 % of the total range.
Observed correlations for testing the validity of the Dutch version of OSIS
1. OSIS and WOSI
2. OSIS and SST
3. OSIS and OSS
4. OSIS and DASH
5. OSIS and SF-36 subscales:
Floor and ceiling effects
Floor and ceiling effects of the OSIS scoring system
% of patients scoring within SDC range
There is a growing interest in PROMs for both clinical and research purposes to supplement clinical outcome measures. To our knowledge, this is the first study to validate the OSIS in a language other than English and the first to report its smallest detectable change and evaluate floor and ceiling effects.
The results show a high internal consistency (Cronbach’s α = 0.88); it was only slightly lower than that described in the original article (Cronbach’s α = 0.91 at pre-treatment [n = 92] and 0.92 at follow-up [n = 64]). Compared to other Dutch-validated PROMs, our Cronbach’s α for the OSIS was higher than that of the SST (0.78) and lower than that of the OSS (0.92) [17, 22].
Considering the content of the questions, it is clear that the OSIS measures several constructs, such as pain, physical-, social-, and role functioning, frequency of dislocation and worries.
The reliability was addressed with a test-retest sample of 99 patients with a mean interval of 13 days (range 5–30) and showed an ICC of 0.87. This was lower than the 0.97 that Dawson et al. described after a 24-h interval in 34 patients; nevertheless, 0.87 is considered a very good ICC.
To our knowledge, the measurement error expressed as the SDC of the OSIS has not been reported previously. Our SDC value showed that, to determine a treatment effect, one must find a difference of at least 9 points between two scores from an individual patient to ensure that the difference was not due to measurement error.
To assess the construct validity, Dawson et al. calculated correlations with the Rowe and Constant scores. However, the Rowe and Constant scores are not PROMs but observer-based measurement instruments. Moreover, the Constant score is not considered applicable to shoulder instability [49, 50]. Therefore, the construct validity was assessed by calculating correlations with the WOSI, the SST, the OSS, the DASH and the SF-36 subscales. With 80 % of the results in accordance with our hypotheses, the construct validity was considered good. The highest correlation (0.82) was observed between the two instability-specific PROMs (OSIS and the WOSI).
A high correlation was observed with the DASH (0.79), which addresses daily activities more specifically than the OSIS. However, many questions overlapped, such as ‘putting on a pullover sweater’ (DASH) and ‘during the last three months, have you had any trouble (or worry) dressing, because of your shoulder?’ (OSIS). This similarity might explain the high correlation between the two instruments.
The OSIS was more closely correlated with the SF-36 subscales ‘pain’ (0.78) and ‘role physical’ (0.69) than with the subscale ‘physical function’ (0.65). These correlations were comparable to those described by Dawson et al. This may indicate that, in addition to physical function, the OSIS measures aspects of pain and role limitations due to physical problems.
In previous studies, floor and ceiling effects were not addressed. In this study, no patient had the maximum or minimum score. The estimation of the smallest detectable change indicated that the baseline patient scores should ideally be at least 9 points different from the extremes. That margin would enable detection of improvements and deteriorations that are distinct from measurement errors at follow-up. At most, 12 % of patients scored within the SDC margin; thus, this percentage was lower than the commonly used cut-off of 15 %.
A strong aspect of this study was the large size of our patient population without missing values.
Conversely, an unavoidable limitation of this study was the total number of questions posed to the patients. Completing six questionnaires at once requires considerable time and concentration, and patients might have digressed or lost focus. Also, although web-based versions have many advantages over paper versions such as an increased follow-up ratio and prevention of missing data, validation of digital formats should still be performed. Here, the results are expressed according to the new scoring system. It is important to be aware of the changed scoring system, and we recommend that future studies should specify the scoring system used.
Finally, for future studies, it would be very interesting to determine responsiveness and the minimal important change (MIC) of the OSIS. This information can be used to determine whether the observed change is important to the patient and to calculate the percentage of patients that report changes greater than the MIC (responders) in each arm of a trial. Then, the percentage of responders can be compared between groups.
This study found that the Dutch version of the OSIS was a reliable outcome measure in patients with shoulder instability, with a Cronbach’s α of 0.88 and an ICC of 0.87. In addition, the construct validity was considered good. Comprising 12 questions, the OSIS is user-friendly and can be easily administered. Furthermore, in the absence of floor or ceiling effects, it is a valuable PROM in clinical practice. A patient’s score needs to change by at least 9 points to ensure that the difference is not due to measurement error.
The Dutch version of the OSIS can be obtained from its managing institution, Isis Outcomes, Isis Innovation Ltd, which holds its copyright (http://isis-innovation.com/outcome-measures/oxford-shoulder-instability-score-osis/).
We thank Mr. R. Cohen for his help in the translation and Mrs. R. Pepping for her help in coordinating patients’ affairs. We thank the Department of Orthopedic Surgery and Traumatology at the Waterlandziekenhuis, the Netherlands for their financial contribution to publish this study.
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
1. Leroux T, Wasserstein D, Veillette C, Khoshbin A, Henry P, Chahal J, et al. Epidemiology of primary anterior shoulder dislocation requiring closed reduction in Ontario, Canada. Am J Sports Med. 2014;42:442–50.
2. Liavaag S, Svenningsen S, Reikeras O, Enger M, Fjalestad T, Pripp AH, et al. The epidemiology of shoulder dislocations in Oslo. Scand J Med Sci Sports. 2011;21:e334–40.
3. Zacchilli MA, Owens BD. Epidemiology of shoulder dislocations presenting to emergency departments in the United States. J Bone Joint Surg Am. 2010;92:542–9.
4. Janse AJ, Gemke RJ, Uiterwaal CS, van der Tweel I, Kimpen JL, Sinnema G. Quality of life: patients and doctors don’t always agree: a meta-analysis. J Clin Epidemiol. 2004;57:653–61.
5. Wright RW, Baumgarten KM. Shoulder outcomes measures. J Am Acad Orthop Surg. 2010;18:436–44.
6. Irrgang JJ, Lubowitz JH. Measuring arthroscopic outcome. Arthroscopy. 2008;24:718–22.
7. Poolman RW, Swiontkowski MF, Fairbank JC, Schemitsch EH, Sprague S, de Vet HC. Outcome instruments: rationale for their use. J Bone Joint Surg Am. 2009;91 Suppl 3:41–9.
8. Dawson J, Fitzpatrick R, Carr A. The assessment of shoulder instability. The development and validation of a questionnaire. J Bone Joint Surg Br. 1999;81:420–6.
9. Steffen V, Hertel R. Rim reconstruction with autogenous iliac crest for anterior glenoid deficiency: forty-three instability cases followed for 5–19 years. J Shoulder Elbow Surg. 2013;22:550–9.
10. Tan CK, Guisasola I, Machani B, Kemp G, Sinopidis C, Brownson P, et al. Arthroscopic stabilization of the shoulder: a prospective randomized study of absorbable versus nonabsorbable suture anchors. Arthroscopy. 2006;22:716–20.
11. van der Linde JA, van Kampen DA, Terwee CB, Dijksman LM, Kleinjan G, Willems WJ. Long-term results after arthroscopic shoulder stabilization using suture anchors: an 8- to 10-year follow-up. Am J Sports Med. 2011;39:2396–403.
12. Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phila Pa 1976). 2000;25:3186–91.
13. Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol. 1993;46:1417–32.
14. Wild D, Grove A, Martin M, Eremenco S, McElroy S, Verjee-Lorenz A, et al. Principles of good practice for the translation and cultural adaptation process for patient-reported outcomes (PRO) measures: report of the ISPOR task force for translation and cultural adaptation. Value Health. 2005;8:94–104.
15. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19:539–49.
16. Terwee CB, Mokkink LB, Knol DL, Ostelo RW, Bouter LM, de Vet HC. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21:651–7.
17. van Kampen DA, van Beers LW, Scholtes VA, Terwee CB, Willems WJ. Validation of the Dutch version of the simple shoulder test. J Shoulder Elbow Surg. 2012;21:808–14.
18. de Vet HC, Terwee CB, Mokkink LB, Knol DL. Design of simple reliability studies. Measurement in medicine. New York: Cambridge University Press; 2011. p. 125.
19. Dawson J, Rogers K, Fitzpatrick R, Carr A. The Oxford shoulder score revisited. Arch Orthop Trauma Surg. 2009;129:119–23.
20. van der Linde JA, Willems WJ, van Kampen DA, van Beers LW, van Deurzen DF, Terwee CB. Measurement properties of the Western Ontario Shoulder Instability index in Dutch patients with shoulder instability. BMC Musculoskelet Disord. 2014;15:211.
21. Godfrey J, Hamman R, Lowenstein S, Briggs K, Kocher M. Reliability, validity, and responsiveness of the simple shoulder test: psychometric properties by age and injury type. J Shoulder Elbow Surg. 2007;16:260–7.
22. Berendes T, Pilot P, Willems J, Verburg H, te Slaa R. Validation of the Dutch version of the Oxford Shoulder Score. J Shoulder Elbow Surg. 2010;19:829–36.
23. Veehof MM, Sleegers EJ, van Veldhoven NH, Schuurman AH, van Meeteren NL. Psychometric qualities of the Dutch language version of the Disabilities of the Arm, Shoulder, and Hand questionnaire (DASH-DLV). J Hand Ther. 2002;15:347–54.
24. Kirkley A, Griffin S, McLintock H, Ng L. The development and evaluation of a disease-specific quality of life measurement tool for shoulder instability. The Western Ontario Shoulder Instability Index (WOSI). Am J Sports Med. 1998;26:764–72.
25. Kirkley A, Griffin S, Dainty K. Scoring systems for the functional assessment of the shoulder. Arthroscopy. 2003;19:1109–20.
26. Cacchio A, Paoloni M, Griffin SH, Rosa F, Properzi G, Padua L, et al. Cross-cultural adaptation and measurement properties of an Italian version of the Western Ontario Shoulder Instability Index (WOSI). J Orthop Sports Phys Ther. 2012;42:559–67.
27. Hatta T, Shinozaki N, Omi R, Sano H, Yamamoto N, Ando A, et al. Reliability and validity of the Western Ontario Shoulder Instability Index (WOSI) in the Japanese population. J Orthop Sci. 2011;16:732–6.
28. Salomonsson B, Ahlstrom S, Dalen N, Lillkrona U. The Western Ontario Shoulder Instability Index (WOSI): validity, reliability, and responsiveness retested with a Swedish translation. Acta Orthop. 2009;80:233–8.
29. Hofstaetter JG, Hanslik-Schnabel B, Hofstaetter SG, Wurnig C, Huber W. Cross-cultural adaptation and validation of the German version of the Western Ontario Shoulder Instability index. Arch Orthop Trauma Surg. 2010;130:787–96.
30. Drerup S, Angst F, Griffin S, Flury MP, Simmen BR, Goldhahn J. Western Ontario shoulder instability index (WOSI): translation and cross-cultural adaptation for use by German speakers. Orthopade. 2010;39:711–8.
31. Lippitt SB, Harryman DT II, Matsen FA III. A practical tool for evaluating function: the Simple Shoulder Test. In: Matsen III FA, Fu FH, Hawkins RJ, editors. The shoulder: a balance of mobility and stability. Rosemont (IL): American Academy of Orthopaedic Surgeons; 1993.
32. Dawson J, Fitzpatrick R, Carr A. Questionnaire on the perceptions of patients about shoulder surgery. J Bone Joint Surg Br. 1996;78:593–600.
33. Frich LH, Noergaard PM, Brorson S. Validation of the Danish version of Oxford Shoulder Score. Dan Med Bull. 2011;58:A4335.
34. Huber W, Hofstaetter JG, Hanslik-Schnabel B, Posch M, Wurnig C. The German version of the Oxford Shoulder Score: cross-cultural adaptation and validation. Arch Orthop Trauma Surg. 2004;124:531–6.
35. Murena L, Vulcano E, D'Angelo F, Monti M, Cherubino P. Italian cross-cultural adaptation and validation of the Oxford Shoulder Score. J Shoulder Elbow Surg. 2010;19:335–41.
36. Roh YH, Noh JH, Kim W, Oh JH, Gong HS, Baek GH. Cross-cultural adaptation and validation of the Korean version of the Oxford shoulder score. Arch Orthop Trauma Surg. 2012;132:93–9.
37. Tugay U, Tugay N, Gelecek N, Ozkan M. Oxford Shoulder Score: cross-cultural adaptation and validation of the Turkish version. Arch Orthop Trauma Surg. 2011;131:687–94.
38. Hudak PL, Amadio PC, Bombardier C. Development of an upper extremity outcome measure: the DASH (disabilities of the arm, shoulder and hand) [corrected]. The Upper Extremity Collaborative Group (UECG). Am J Ind Med. 1996;29:602–8.
39. Beaton DE, Katz JN, Fossel AH, Wright JG, Tarasuk V, Bombardier C. Measuring the whole or the parts? Validity, reliability, and responsiveness of the Disabilities of the Arm, Shoulder and Hand outcome measure in different regions of the upper extremity. J Hand Ther. 2001;14:128–46.
40. Desai AS, Dramis A, Hearnden AJ. Critical appraisal of subjective outcome measures used in the assessment of shoulder disability. Ann R Coll Surg Engl. 2010;92:9–13.
41. Garratt A, Schmidt L, Mackintosh A, Fitzpatrick R. Quality of life measurement: bibliographic study of patient assessed health outcome measures. BMJ. 2002;324:1417.
42. Salaffi F, De Angelis R, Stancati A, Grassi W. Health-related quality of life in multiple musculoskeletal conditions: a cross-sectional population based epidemiological study. II. The MAPPING study. Clin Exp Rheumatol. 2005;23:829–39.
43. Gartsman GM, Brinker MR, Khan M, Karahan M. Self-assessment of general health status in patients with five common shoulder conditions. J Shoulder Elbow Surg. 1998;7:228–37.
44. Ostor AJ, Richards CA, Prevost AT, Speed CA, Hazleman BL. Diagnosis and relation to general health of shoulder disorders presenting to primary care. Rheumatology (Oxford). 2005;44:800–5.
45. Snyder CF, Aaronson NK, Choucair AK, Elliott TE, Greenhalgh J, Halyard MY, et al. Implementing patient-reported outcomes assessment in clinical practice: a review of the options and considerations. Qual Life Res. 2012;21:1305–14.
46. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.
47. McHorney CA, Tarlov AR. Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res. 1995;4:293–307.
48. de Vet HC, Terwee CB, Knol DL, Bouter LM. When to use agreement versus reliability measures. J Clin Epidemiol. 2006;59:1033–9.
49. Jensen KU, Bongaerts G, Bruhn R, Schneider S. Not all Rowe scores are the same! Which Rowe score do you use? J Shoulder Elbow Surg. 2009;18:511–4.
50. Lillkrona U. How should we use the Constant Score? A commentary. J Shoulder Elbow Surg. 2008;17:362–3.
51. Schunemann HJ, Akl EA, Guyatt GH. Interpreting the results of patient reported outcome measures in clinical trials: the clinician's perspective. Health Qual Life Outcomes. 2006;4:62.