Included under terms of UK Non-commercial Government License.
NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.
Blyth M, Anthony I, Francq B, et al. Diagnostic accuracy of the Thessaly test, standardised clinical history and other clinical examination tests (Apley’s, McMurray’s and joint line tenderness) for meniscal tears in comparison with magnetic resonance imaging diagnosis. Southampton (UK): NIHR Journals Library; 2015 Aug. (Health Technology Assessment, No. 19.62.)
Diagnostic accuracy of the Thessaly test, standardised clinical history and other clinical examination tests (Apley’s, McMurray’s and joint line tenderness) for meniscal tears in comparison with magnetic resonance imaging diagnosis.
Show detailsThis study was a single-centre diagnostic accuracy study comparing two cohorts of patients with and without knee pathology, assessed by two groups of clinicians – primary care clinicians and musculoskeletal clinicians. The primary goal was to assess the diagnostic accuracy of the Thessaly Test for meniscal tears; however, the diagnostic accuracy of three other physical tests and a standardised clinical history were also analysed.
Musculoskeletal clinicians included both orthopaedic specialist knee surgeons and extended scope physiotherapists with specialist knee interest (operating in a secondary care orthopaedic setting). Primary care clinicians were either general practitioner- (GP-) or community-based physiotherapists.
The study protocol was developed in response to a call for proposals from the National Institute for Health Research Heath Technology Assessment (HTA) programme. Funding was provided by HTA.
Ethical approval was obtained from the West of Scotland Research Ethics Committee (reference 12/WS/0225) prior to initiation of the study. The study was registered on the International Randomised Controlled Trial Number registry (ISRCTN 43527822).
The study was performed as a collaboration between the Orthopaedic Research Unit at Glasgow Royal Infirmary and the Robertson Centre for Biostatistics at the University of Glasgow.
Study objectives
- To determine the diagnostic accuracy of the Thessaly Test for meniscal tears in the knee and whether or not this test can obviate the need for further investigation by arthroscopy or MRI.
- To determine how the Thessaly Test compares with clinical history and to other commonly used physical examinations (McMurray’s Test, Apley’s Test and the joint line tenderness Test) in diagnosing meniscal tears.
- To determine if the presence of arthritis or other knee pathologies influences the accuracy of the Thessaly Test.
- To determine if the use of combinations of physical tests (such as the Thessaly Test, McMurray’s Test, Apley’s Test and the joint line tenderness Test) provides better specificity and sensitivity than a single test alone in the diagnosis of meniscal tears.
- To determine the ability of non-specialist GPs to use the Thessaly Test in comparison with specialist knee clinicians.
Patient selection
Two cohorts of patients were recruited to the study between October 2012 and March 2014 (Figure 7). The first group had unspecified knee pathology and were typical of the target patient group that a primary care clinician may wish to assess using the Thessaly Test. These patients were primarily drawn from patients referred to a specialist knee clinic at Glasgow Royal Infirmary by GPs and other primary care clinicians. A proportion of this group were recruited directly from a single GP practice (n = 11). This group were used as an internal control to verify that subjects recruited at Glasgow Royal Infirmary were representative of the target population of patients who attend general practice with knee pain. The second control group consisted of patients with no current or recent knee pathology. These subjects were included in order to test the specificity of the Thessaly Test. Control subjects were recruited from two sources – from the hand injury clinic at Glasgow Royal Infirmary and from advertisement posters placed within Glasgow Royal Infirmary and local universities.
Only one knee from any one individual was included in the study.
Written informed consent was obtained from each patient.
As all patients attending the knee clinic were essentially eligible for the study, there were no screening failures from the knee clinic. Similarly, patients in the control group responding to our advert were effectively self-screened and therefore there were no screen failures from this source either.
Knee pathology group
Inclusion criteria
- Patients referred to the knee clinic at Glasgow Royal Infirmary.
Exclusion criteria
- Age < 18 years.
- Unable to give informed consent.
- Previous knee replacement surgery.
Control group
Inclusion criteria
- Patients attending the hand clinic at Glasgow Royal Infirmary or responding to advertisement posters placed within Glasgow Royal Infirmary.
Exclusion criteria
- Age < 18 years.
- Unable to give informed consent.
- Previous knee replacement surgery.
- A history of knee pain in the last 6 months.
- Osteoarthritis.
- Rheumatoid arthritis.
Sample size
Sample size calculation was based on the following assumptions. Assuming the sensitivity of the Thessaly Test is around 75%, the study would need around 300 subjects to estimate the sensitivity to within ± 5%. A similar calculation for the width of the CI for a binomial proportion is appropriate for the specificity (e.g. if the specificity was around 90%, the required sample size to estimate the specificity to within ± 8% would be n = 50 participants). The power for the pairwise comparison of tests, or combinations of tests, will depend on the degree of disagreement between the tests [e.g. with around 220 pairs of measurements the study would have 90% power to detect a difference in proportions of 0.10 when the proportion of discordant pairs is expected to be 0.15 (using McNemar’s Test)].
Patient assessment
Patients were examined by two types of independent practitioners: a trained orthopaedic musculoskeletal specialist (n = 11) and a primary care clinician (n = 12). The orthopaedic musculoskeletal specialist was either an orthopaedic consultant knee surgeon (n = 3) or an orthopaedic extended scope physiotherapist with a specialist interest in the knee (n = 8). Primary care clinicians were either a GP (n = 4) or a community physiotherapist (n = 8). There was a broad range of experience within the primary care clinician group ranging from newly qualified practitioners to a practitioner with more than 30 years of experience. We believe that the diverse mix of experience reflects reality in the primary care setting.
Each clinician performed the following physical tests: the Thessaly Test, McMurray’s Test, Apley’s Test and the joint line tenderness Test. In addition, each clinician took a clinical history from the patient. The order in which physical tests were carried out was randomised to avoid bias in knowledge gained from performing any previous test. This was achieved using a random sequence generator to determine the order that each test should be performed in. The sequences were generated in advance and printed on study data collection forms. The forms were handed out in the order they were printed by the study co-ordinator (KB) to sequential examiners.
All patients in both groups were asked to attend for a MRI scan of their knee, any patient not attending for MRI was excluded from the final analysis. MRI diagnosis of meniscal tears was used as the gold standard reference for each of the physical tests examined in this study. All MRI scans were reported by radiologists who were not linked to the study and who were blind to the patients’ status with regard to clinical history and examination findings. MRI scans for the knee pathology patients were carried out within 6 weeks of assessment.
Magnetic resonance imaging scan sequences are detailed below:
- T1 spin-echo Sagittal
- T2 fl2d Sagittal (flash-gradient ECHO)
- proton-density turbo-spin echo fat saturation transverse.
- proton-density turbo-spin echo fat saturation coronal.
In order to determine if the presence of osteoarthritis of the knee influenced the outcome of physical tests for meniscal tears, knee radiography was performed on all patients with knee pathology. No radiography was performed on control patients as it was deemed unethical to expose control patients to unnecessary X-rays. Degree of osteoarthritis of the knee was graded from knee radiography using the Kellgren and Lawrence system.32 Kellgren and Lawrence grading was undertaken by a single-blinded examiner (AP).
A proportion of patients with a knee pathology subsequently had a routine arthroscopy as part of their treatment/diagnosis of their knee condition. In these patients we recorded the presence of any meniscal tears at the time of arthroscopy. Arthroscopy was normally carried out within 6 weeks of the MRI scan. These data were then correlated with the diagnosis of meniscal tears obtained from MRI images of the patient, in order to verify the accuracy of MRI imaging for meniscal tears by computing specificity, sensitivity, negative and positive predictive values (NPVs and PPVs) for MRI, with arthroscopy as the gold standard.
Physical tests
The Thessaly Test was carried out as described by Karachalios et al.25 The Thessaly Test, McMurray’s Test, Apley’s Test and the joint line tenderness Test were all carried out as described in Chapter 1.
Standardised clinical history
A simple standardised clinical history was developed for this study (see Standardised minimal clinical history data set). Our concept was that any test that proved successful in this study required to be simple enough for national roll-out without requiring an extensive training programme. All clinicians were therefore given basic instruction in the use of the standardised clinical history in Standardised minimal clinical history data set and asked to use this as a minimum in their diagnosis.
Standardised minimal clinical history data set
- History of knee injury?
- Was this a sports injury?
- Past history of anterior cruciate ligament (ACL) injury?
- Past history of arthritis?
- Previous surgery to the knee?
- Past history of medial meniscal tear?
- Past history of lateral meniscal tear?
- Presence of associated patella–femoral symptoms?
Primary outcome measures
- Sensitivity and specificity of the Thessaly Test in determining the presence of meniscal tears when employed by primary care clinicians.
Secondary outcomes measures
- Sensitivity and specificity of the Thessaly Test in determining the presence of meniscal tears when employed by specialist musculoskeletal clinicians.
- Sensitivity and specificity of McMurray’s Test, Apley’s Test, the joint line tenderness Test and clinical history in determining the presence of meniscal tears.
- Determination of the influence of osteoarthritis on the sensitivity and specificity of the Thessaly Test, McMurray’s Test, Apley’s Test and the joint line tenderness Test.
- Determination of the influence of other knee pathologies such as ACL damage or patellofemoral instability on the sensitivity and specificity of the Thessaly Test, McMurray’s Test, Apley’s Test and the joint line tenderness Test.
- Determination of the optimal combination of tests for most accurate diagnosis of meniscal tears.
Statistical analysis
The full statistical analysis plan is described in detail in Appendix 1. In brief, we calculated the specificity, sensitivity, PPV and NPV for each of the following individual tests: Thessaly Test, McMurray’s Test, Apley’s Test, the joint line tenderness Test and clinical history. We have used evidence of meniscal tear on MRI as our ‘gold standard’ reference for diagnosis. We have compared the relative rates between each individual test. These results are reported separately for specialist musculoskeletal clinicians and for primary care clinicians.
The following subanalysis was also performed:
- Patients with and without osteoarthritis of the knee were compared to determine if osteoarthritis influences the outcome of the Thessaly Test or any of the other physical tests. Other patient factors were also considered [body mass index (BMI), sex, age and previous surgery].
- A comparison of results of all four physical tests between specialist musculoskeletal clinicians and primary care clinicians was performed to determine if specialised training influences the outcome of test results.
- We have examined whether or not a combination of physical tests can produce greater diagnostic accuracy than a single test alone.
- In a subset of patients, we have compared the accuracy of MRI scan diagnosis of meniscal tears to that achieved using arthroscopy.
Statistical tests applied to data
A chi-squared test was used for comparisons between musculoskeletal clinicians and primary care clinicians for each physical test and clinical history, as well as to compare the sensitivity or specificity of the four physical tests along the different bins of the categorical (or categorised) covariates.
McNemar’s Test was used to for comparisons between musculoskeletal clinicians and primary care clinicians for each physical test and clinical history. McNemar’s Test assesses the significance of the difference between two correlated proportions, where the two proportions are based on the same sample of subjects. In this study we wished to determine the difference between the proportion of successful diagnoses obtained by the two groups of observers; hence, the choice of McNemar’s Test over intraclass correlation.
Logistic regression was used to assess the influence of different covariates on the sensitivity or specificity of the four physical tests for musculoskeletal clinicians and primary care clinicians (all patients or knee patients).
Logistic regression using a stepwise procedure from the covariates and the four physical tests to build a model to predict MRI.
Guide to likelihood ratio graphs
The diagnostic abilities of two or more diagnostic tests are traditionally compared by their respective sensitivities and specificities. Comparison is relatively simple if both tests have similar sensitivity, but one has a superior specificity. However, comparison is more complicated when the specificities and sensitivities are both different and a trade-off of one over the other is required.
Likelihood ratio graphs are a simple graphic that readily facilitates comparison between two or more diagnostic tests, allowing an assessment of whether or not a decrease in specificity may be offset by a sufficient gain in sensitivity to yield a test that, nevertheless, has superior diagnostic ability.
For all likelihood ratio graphs in this report, the Thessaly Test is plotted as the reference test (solid black lines). Any comparator test plotted in zone I is superior in all aspects to the Thessaly Test. Comparators plotted in zone II are superior at detecting the absence of a meniscal tear, but inferior at detecting the presence of a tear. Comparators plotted in zone III are superior at detecting the presence of a tear, but inferior at detecting the absence of a tear. Any comparator plotted in zone IV is inferior to the Thessaly Test (Figure 8).
- Study design/methods - Diagnostic accuracy of the Thessaly test, standardised cl...Study design/methods - Diagnostic accuracy of the Thessaly test, standardised clinical history and other clinical examination tests (Apley’s, McMurray’s and joint line tenderness) for meniscal tears in comparison with magnetic resonance imaging diagnosis
- Discussion - Total hip replacement and surface replacement for the treatment of ...Discussion - Total hip replacement and surface replacement for the treatment of pain and disability resulting from end-stage arthritis of the hip (review of technology appraisal guidance 2 and 44): systematic review and economic evaluation
- Introduction - The relative clinical effectiveness and cost-effectiveness of thr...Introduction - The relative clinical effectiveness and cost-effectiveness of three contrasting approaches to partner notification for curable sexually transmitted infections: a cluster randomised trial in primary care
- Introduction - Start2quit: a randomised clinical controlled trial to evaluate th...Introduction - Start2quit: a randomised clinical controlled trial to evaluate the effectiveness and cost-effectiveness of using personal tailored risk information and taster sessions to increase the uptake of the NHS Stop Smoking Services
- List of abbreviations - Total hip replacement and surface replacement for the tr...List of abbreviations - Total hip replacement and surface replacement for the treatment of pain and disability resulting from end-stage arthritis of the hip (review of technology appraisal guidance 2 and 44): systematic review and economic evaluation
Your browsing activity is empty.
Activity recording is turned off.
See more...