Skip to main content
Log in

The interobserver reliability of clinical relevance in orthopaedic research

  • Original Article
  • Published:
European Journal of Orthopaedic Surgery & Traumatology Aims and scope Submit manuscript

Abstract

Purpose

A ratio of observed difference (OD) over the 95% confidence interval (CI) has been shown to be strongly associated with the perceived clinical relevance (CR) of medical research results. The purpose of this study was to evaluate the association between the OD/CI ratio and perceived CR in orthopaedic research.

Methods

Sixty-seven orthopaedic surgeons completed a survey with 15 study outcomes (mean difference and CI) and were asked if they perceived the findings as clinically relevant. The interobserver reliability of perceived CR and the association between CR and the OD/CI ratio and p-value were assessed.

Results

The interobserver reliability of CR between respondents was moderate (kappa = 0.46, CI 0.45 to 0.48). P-values did not differ between results with and without CR (median difference (MD) − 0.12, CI − 0.74 to 0.0009, p = 0.07). The OD/CI ratio, however, was greater for results with CR (MD 1.01, CI 0.3 to 3.9, p = 0.004). The area under the curve (AUC) for the p-value and OD/CI ratio receiver operating characteristic (ROC) curves was 0.80 (p = 0.01) and 0.97 (p = 0.0003). The cutoff p -value and OD/CI ratio that maximized the sensitivity (SN) and specificity (SP) for CR were 0.001 (SN 80%, SP 80%) and 0.84 (SN 100%, SP 90%). The SN and SP of a p-value cutoff of 0.05 was 100% and 50%.

Conclusion

The interobserver reliability of the perceived CR of orthopaedic research findings was moderate. The OD/CI ratio, in contrast to the p-value, was strongly associated with perceived CR making it a potentially useful measure to evaluate research results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Similar content being viewed by others

References

  1. Fayaz HC, Haas N, Kellam J et al (2013) Improvement of research quality in the fields of orthopaedics and trauma: a global perspective. Int Orthop 37:1205–1212. https://doi.org/10.1007/s00264-013-1897-2

    Article  PubMed  PubMed Central  Google Scholar 

  2. Houle TT (2007) Statistical reporting for current and future readers. Anesthesiology 107:193–194

    Article  PubMed  Google Scholar 

  3. Spreckelsen TF (2018) Editorial: changes in the field: banning p values (or not), transparency, and the opportunities of a renewed discussion on rigorous (quantitative) research. Child Adolesc Ment Health 23:61–62

    Article  PubMed  Google Scholar 

  4. Kelley K, Preacher KJ (2012) On effect size. Psychol Methods 17:137–152. https://doi.org/10.1037/a0028086

    Article  PubMed  Google Scholar 

  5. Strage K, Stacey S, Mauffrey C, Parry JA (2022) The interobserver reliability of clinical relevance in medical research. Injury. https://doi.org/10.1016/j.injury.2021.12.044

    Article  PubMed  Google Scholar 

  6. Tirefort J, Schwitzguebel AJ, Collin P et al (2019) Postoperative mobilization after superior rotator cuff repair: sling versus no sling: a randomized prospective study. J Bone Jt Surg Am 101:494–503. https://doi.org/10.2106/JBJS.18.00773

    Article  Google Scholar 

  7. Kelly MP, Lurie JD, Yanik EL et al (2019) Operative versus nonoperative treatment for adult symptomatic lumbar scoliosis. J Bone Jt Surg Am 101:338–352. https://doi.org/10.2106/JBJS.18.00483

    Article  Google Scholar 

  8. McKearney DA, Stender CJ, Cook BK et al (2019) Altered range of motion and plantar pressure in anterior and posterior malaligned total ankle arthroplasty: a cadaveric gait study. J Bone Jt Surg Am 101:e93. https://doi.org/10.2106/JBJS.18.00867

    Article  Google Scholar 

  9. Norvell DC, Ledoux WR, Shofer JB et al (2019) Effectiveness and safety of ankle arthrodesis versus arthroplasty: a prospective multicenter study. J Bone Jt Surg Am 101:1485–1494. https://doi.org/10.2106/JBJS.18.01257

    Article  Google Scholar 

  10. Jules-Elysee KM, Tseng A, Sculco TP et al (2019) Comparison of topical and intravenous tranexamic acid for total knee replacement: a randomized double-blinded controlled study of effects on tranexamic acid levels and thrombogenic and inflammatory marker levels. J Bone Jt Surg Am 101:2120–2128. https://doi.org/10.2106/JBJS.19.00258

    Article  Google Scholar 

  11. Weekes DG, Campbell RE, Shi WJ et al (2019) Prevalence of clinical depression among patients after shoulder stabilization: a prospective study. J Bone Jt Surg Am 101:1628–1635. https://doi.org/10.2106/JBJS.18.01460

    Article  Google Scholar 

  12. Sepucha K, Bedair H, Yu L et al (2019) Decision support strategies for hip and knee osteoarthritis: less is more: a randomized comparative effectiveness trial (DECIDE-OA Study). J Bone Jt Surg Am 101:1645–1653. https://doi.org/10.2106/JBJS.19.00004

    Article  Google Scholar 

  13. Keeney BJ, Austin DC, Jevsevar DS (2019) Preoperative weight loss for morbidly obese patients undergoing total knee arthroplasty: determining the necessary amount. J Bone Jt Surg Am 101:1440–1450. https://doi.org/10.2106/JBJS.18.01136

    Article  Google Scholar 

  14. Sattler LN, Hing WA, Vertullo CJ (2019) Pedaling-based protocol superior to a 10-exercise, non-pedaling protocol for postoperative rehabilitation after total knee replacement: a randomized controlled trial. J Bone Jt Surg Am 101:688–695. https://doi.org/10.2106/JBJS.18.00898

    Article  Google Scholar 

  15. Stepan JG, Lovecchio FC, Premkumar A et al (2019) Development of an institutional opioid prescriber education program and opioid-prescribing guidelines. J Bone Jt Surg 101:5–13. https://doi.org/10.2106/JBJS.17.01645

    Article  Google Scholar 

  16. Wang D, Wang H-Y, Luo Z-Y et al (2019) Finding the optimal regimen for oral tranexamic acid administration in primary total hip arthroplasty: a randomized controlled trial. J Bone Jt Surg Am 101:438–445. https://doi.org/10.2106/JBJS.18.00128

    Article  Google Scholar 

  17. Ferle M, Guo R, Hurschler C (2019) The laxity of the native knee: a meta-analysis of in vitro studies. J Bone Jt Surg Am 101:1119–1131. https://doi.org/10.2106/JBJS.18.00754

    Article  Google Scholar 

  18. Lian J, Diermeier T, Meghpara M et al (2020) Rotatory knee laxity exists on a continuum in anterior cruciate ligament injury. J Bone Jt Surg Am 102:213–220. https://doi.org/10.2106/JBJS.19.00502

    Article  Google Scholar 

  19. McHugh ML (2012) Interrater reliability: the kappa statistic. Biochem Medica 22:276–282. https://doi.org/10.11613/bm.2012.031

    Article  Google Scholar 

  20. Ma H, Bandos AI, Gur D (2015) On the use of partial area under the ROC curve for comparison of two diagnostic tests. Biom J 57:304–320. https://doi.org/10.1002/bimj.201400023

    Article  PubMed  Google Scholar 

  21. Martínez-Camblor P, Pardo-Fernández JC (2019) The youden index in the generalized receiver operating characteristic curve context. Int J Biostat 15:20180060. https://doi.org/10.1515/ijb-2018-0060

    Article  Google Scholar 

  22. Staffa SJ, Zurakowski D (2020) Calculation of confidence intervals for differences in medians between groups and comparison of methods. Anesth Analg 130:542–546. https://doi.org/10.1213/ANE.0000000000004535

    Article  PubMed  Google Scholar 

  23. Lee DK (2016) Alternatives to p value: confidence interval and effect size. Korean J Anesthesiol 69:555–562. https://doi.org/10.4097/kjae.2016.69.6.555

    Article  PubMed  PubMed Central  Google Scholar 

  24. Sullivan GM, Feinn R (2012) Using effect size-or why the p value is not enough. J Grad Med Educ 4:279–282. https://doi.org/10.4300/JGME-D-12-00156.1

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We would like to thank all of the surgeons who participated in this survey.

Funding

This study had no funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joshua A. Parry.

Ethics declarations

Conflict of Interest

The authors have no conflict of interest to declare.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Informed consent

Informed consent was not obtained due to the nature of study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 362 KB)

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Strage, K.E., Stacey, S.C., Mauffrey, C. et al. The interobserver reliability of clinical relevance in orthopaedic research. Eur J Orthop Surg Traumatol 33, 1721–1725 (2023). https://doi.org/10.1007/s00590-022-03346-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00590-022-03346-4

Keywords

Navigation