Abstract
Background and aims
Femoroacetabular impingement is the abnormal contact of the proximal femur and acetabulum during motion. It causes hip pain and joint degeneration in young patients. This systematic review aims to clarify the clinical effect of arthroscopic femoral osteochondroplasty for cam lesions and to review the available literature for the general medical readership, including providers of primary and secondary care.
Methods and results
Electronic databases were searched for studies of arthroscopic femoral osteochondroplasty in primary femoroacetabular impingement. A total of 2618 article titles, 242 abstracts and 33 full text articles were considered. Ultimately nine studies with clinical outcome scores met the inclusion criteria and were included in the qualitative systematic review. Six studies were suitable for meta-analysis using an inverse variance, random effects model (RevMan software).
In the nine studies, improvements were seen in Western Ontario and McMaster Universities Osteoarthritis index, Non-arthritic Hip Score and Modified Harris Hip Scores. Across the six studies suitable for meta-analysis (537 patients), a 24-point weighted mean improvement in Non-arthritic hip score was seen. This yielded a large overall effect size of 1.6.
Conclusion
Arthroscopic femoral osteochondroplasty appears to be a beneficial treatment for primary femoroacetabular impingement, with a large effect size seen across six eligible studies.
Introduction
Primary femoroacetabular impingement (FAI) was described by Ganz as a cause of hip pain and osteoarthritis. 1 It results from abnormal contact/impingement of the femoral neck and acetabular rim. This reduced joint clearance can be a consequence of increased non-sphericity of the femoral head–neck junction (cam) and/or acetabular over-coverage (pincer).1,2 These are described by alpha angle of ≥55° (cam lesion)3–5 and a centre edge angel ≥40°(acetabular over-coverage, i.e. pincer impingement). 6 These pathologies can be combined (mixed FAI). Early reports from a prospective cohort suggest that cam deformity is associated with a significant increase in the risk of end stage osteoarthritis (OA) at 5 years. 7
Symptoms of the FAI include groin pain associated with sporting activity and prolonged or repetitive hip flexion.1,2 The clinical impingement sign involves the elicitation of groin/hip pain from forceful flexion and internal rotation of the hip. 8 Conservative measures can be trialled, e.g. non-steroidal anti-inflammatory drugs and activity modification, but these may mask the underlying joint destruction. 2
Before the advent of arthroscopic osteochondroplasty of the hip, Ganz pioneered the use of surgical dislocation via a trochanteric osteotomy to facilitate adequate, safe exposure of the hip joint in treatment of FAI and other disorders. 9 Due to the trochanteric osteotomy patients’ flexion and weight-bearing status must be restricted for up to eight weeks to allow the osteotomy to heal. 2 This type of surgery has shown improvements in the Merle d’Aubigne Hip Score, 10 Harris Hip Score 11 and in both the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and short-form 12. 12
Scoring systems in the literature.
Aim
A critical review of the literature was performed followed by analysis of preoperative and postoperative clinical outcome scores. The hypothesis was that arthroscopic femoral osteochondroplasty for cam lesions resulted in an improvement in clinical outcome scores. 23
Methods
Search process and article selection
Electronic database and hand searches were performed in March 2013. Medline and EMBASE databases were interrogated. Since FAI was first described in 1999, 24 the search was confined to 1999 to March 2013.
The following short search strings on the title and abstract: ‘femoroacetabular’; ‘femoro-acetabular’; ‘hip AND impingement’; ‘hip AND debridement’; ‘hip AND arthroscop$’ (where $ is a wildcard); ‘hip AND (osteochondroplasty OR osteoplasty)’; ‘femoroplasty’; ‘cam AND impingement’; ‘pincer AND impingement’. Each search string’s results was added to the project workspace on OVID. This simultaneous search within OVID allows an initial deduplication of articles with the same unique identifier.
PUBMED was interrogated for citations using the equivalent search strategies, specifically looking for articles which are not from Medline. Furthermore, The Cochrane Central Register of Controlled Trials was also searched for trials not indexed in the aforementioned databases. 25
To compensate for the lag time from publication to database indexing, a hand search of the table of contents of relevant journals covered the period October 2012–March 2013. Potentially relevant journals were: (1) American Journal of Sports Medicine, (2) British Journal of Sports Medicine, (3) Clinical Orthopaedic Related Research, (4) Journal of Bone and Joint Surgery (Am), (5) Bone and Joint Journal, (6) Operative Techniques in Orthopaedics, (7) Arthroscopy, (8) Hip international, (9) Journal of Arthroplasty, (10) Techniques in Orthopaedics, (11) Orthopaedics & Traumatology: Surgery and Research.
Citations were imported into Zotero reference manager (version 3.0.14) and an initial screening of article titles was made. Thereafter, potential studies were identified for further abstract review with suitable articles undergoing subsequent full text review.
Inclusion criteria:
Studies concern skeletally mature patients with primary FAI; Cam/mixed impingement undergoing arthroscopic femoral osteochondroplasty ± procedure; Minimum six months follow-up; Peer reviewed journals; Levels of evidence 1–4;26 Have pre and postoperative clinical outcome scores.
Exclusion criteria:
Skeletally immature patients; Secondary FAI (e.g. hip dysplasia); Revision surgery; Small case series < 10 patients; Conference proceeding abstracts; Duplicate/overlapping cohorts – the most recent will be used; Review articles/ systematic reviews / meta-analysis / cadaveric studies.
Figure 1 demonstrates the selection process according to the template recommended by Preferred Reporting Items for Systematic Reviews and Meta-Analyses.
Article selection process.
Data extraction
Details extracted include study design; population/age; timeframe for recruitment; inclusion criteria; surgical procedures performed; follow-up time; loss-to-follow-up rate, preoperative and postoperative clinical outcome scores and where applicable the p value for the difference; definition and rates of failure, e.g. conversion to total hip replacement (THR), revision surgery and complications. An overview is presented to establish similarities and attempt to draw wider statistical conclusions, i.e. meta-analysis.
Statistical analysis
Where possible, for each study, RevMan 27 was used to calculate the absolute mean change in clinical outcome scores and standardised mean difference (effect size). 28 The Non-Arthritic Hip Score (NAHS) was used across multiple studies. Due to the heterogeneity of the studies, a Random Effects (inverse variance) model was used to calculate the weighted mean change in NAHS and overall effect size. 29 An effect size of >0.8 is generally considered large. 30
Some studies do not describe the SD in the mean clinical outcome score. The authors were contacted to request these, but at this time some data were not available. Assuming an approximate normal distribution of data, the SD can be estimated for Cochrane Reviews and other meta-analysis using the method described by Walter and Yao. 31
Results
Study design and patient population.
Total patients (some may be bilateral).
FAI: femoroacetabular impingement; MSK: musculoskeletal; OB: outerbridge cartilage grading; RA: rheumatoid arthritis; SCFE: slipped capital femoral epiphysis.
Results for individual studies.
IQ: range interquartile range for non-parametric data; MHHS; Modified Harris Hip Score (0–100); N: no; NAHS; Non-arthritic Hip Score (0-100); OCP: osteochondroplasty; #: number of procedures not detailed; SD: standard deviation; WOMAC: Western Ontario and McMaster Universities Osteoarthritis Index (0–96); Y: yes; ¥: calculated using method described by Walter and Yao (see ‘Methods’ section).
Meta-analysis including studies by Palmer and Stahelin.
Weight is illustrated for weighted mean rather than effect size.
df: degrees of freedom; IV: inverse variance; NAHS: Non-arthritic Hip Score (0–100); SD: standard deviation; ¥: calculated using method described by Walter and Yao (see ‘Methods’ section).
The meta-analysis (weighted mean change and effect size) has been performed twice. Table 4 includes the non-parametric data of Palmer et al. 38 and Stahelin et al. 13 (mean change in NAHS 24.1 and effect size 1.6) and then repeated excluding these two studies yielding a mean change in NAHS of 24.6 and an effect size of 1.7 (this separate calculation has not been tabulated).
Using the data from all NAHS studies, a forest plot with each study’s mean change in NAHS, 95% confidence interval and the overall weighted mean change is shown in Figure 2.
Weighted mean change in Non-arthritic Hip Score.
Discussion
There were no randomised controlled trials which met the inclusion criteria that compared non-operative and operative treatments. Three studies were retrospective,32,36,37 while six were prospective.13,21,33–35,38 One of the papers by Brunner compared computer navigated with conventional arthroscopic surgery 33 but no clinical advantage was noted with navigation.
The NAHS was the most common outcome measure used in six studies,13,33,34,36–38 the Modified Harris Hip Score (MHHS) was used in four32,35–37 and WOMAC in one. 21 All nine studies reported improvements in the clinical outcome scores used, and those that reported complication rates described these as being low. Variable definitions of failure were used, including decreased clinical outcome score, conversion to THR and revision surgery. Where complications were discussed, generally a <1% rate of dysaesthesia/paraesthesia was reported, however this was as high as 10% in one paper. 33 Across all nine studies (786 patients), only two cases of heterotopic ossification were reported, and no cases of femoral neck fracture or avascular necrosis. There may be underreporting of complications.
All nine studies clearly stated their proportions of cam/mixed impingement and had otherwise similar inclusion/exclusion criteria. Seven studies mentioned Tönnis grade of osteoarthritis with five studies excluding grade 313,33,34,36,38 and two studies excluding grade 2 and above.32,37 Byrd and Jones 35 excluded those with ‘bone-on-bone contact’ but did not use the Tönnis grading system. Ilizaliturri included many patients with osteoarthritis but again, did not use Tönnis grade. Few studies mentioned the specific radiological criteria for inclusion, e.g. femoral alpha angle. The mean ages of the patients ranged from 33 to 65 years. If Javed’s study 37 was excluded the range would have been 33–43. Javed had specifically looked at patients over 60 years of age. It could be anticipated that his patients may have had a poorer outcomes based on age and likelihood of osteoarthritis, however all his patients were Tönnis grade 0 or 1. Interestingly, Javed’s study and that of Haviv et al. 36 had the poorest mean improvement in NAHS (15.1 and 15.0 points, respectively). Haviv’s patients had a high number of patients with Outerbridge grade 3/4 chondral damage. Javed and O’Donnell 37 and Haviv et al. 36 used both MHHS and NAHS with similar improvements for both scoring systems. For the purpose of meta-analysis and estimating the overall size of effect, both measures could not be included as the resultant overall size of effect would be disproportionately weighted in favour of these papers. The NAHS was the common system used across all eligible studies; it was therefore the only measure used to calculate the overall size of effect across these studies.
Three studies were excluded from the quantitative meta-analysis.21,32,35 Byrd and Jones 35 described a mean ‘change in score’ of 20 for MHHS without stating the pre and postoperative means and standard deviations. Their data whilst interesting, and appearing in keeping with the other data sets, could not be used to estimate an effect size or weighted mean. Bardakos’ study 32 was excluded since it described medians with interquartile ranges and used Wilcoxon’s signed rank test for comparing pre and postoperative MHHS. These data could therefore not be used to calculate an effect size. 39 Ilizaliturri was the only author to use the WOMAC scale, 21 which was designed for osteoarthritis 18 and has not been validated for use in hip arthroscopy. This raised concerns regarding its suitability for hip arthroscopy. Furthermore, the postoperative results had a non-parametric distribution of data. For these reasons, the study is not included in the quantitative meta-analysis.
Three papers used validated outcome measures (MHHS and NAHS) and provided enough data (pre and postoperative means, standard deviations and statistical tests) to be directly included in the meta-analysis.13,37,38 Haviv provided the SDs for his data 36 by email and therefore had his study included. The two papers by Brunner et al.33,34 had pre and post op means, the NAHS ranges and used parametric statistical tests, suggesting normally distributed data. These papers could therefore reasonably be included in the meta-analysis after having their standard deviations estimated using the method detailed by Walter and Yao. 31
As discussed in the individual results section, the papers by Palmer et al. 38 and Stahelin et al. 13 were analysed by their authors using Wilcoxon Signed Rank tests suggesting non-parametric data. The authors did, however, describe means and standard deviations. An attempt was made given the limited information available to consider the extent of this potentially abnormal distribution. RevMan is considered by the Cochrane Group to being robust enough to tolerate a small amount of skew if the data were reported as mean and SD.39,40 The meta-analysis was therefore performed with these non-parametric studies and also without them. The weighted mean improvements in NAHS were 24.1 and 24.6, respectively, and the overall sizes of effects were 1.6 and 1.7, respectively (both considered large effects 30 ). We can therefore see that subjectively little difference is noted with and without their inclusion. Therefore, the full data set is presented in Table 3. This analysis included the data from 537 patients across six studies13,33,34,36–38 and gave a weighted mean improvement in NAHS of 24.09 points (95% CI 18.42–29.76) which has been demonstrated in Figure 2.
Table 1 demonstrates the considerable differences in mean age, gender and follow-up. Heterogeneity is considered to be extremely high based on tau 2 ; the ratios of chi 2 to df (degrees of freedom), and the high I2 statistics 41 shown in Table 3. Heterogeneity is further demonstrated in Figure 2, where the 95% confidence intervals do not all overlap. This heterogeneity is due to differences in condition of articular cartilage, age, gender, activity levels, occupation, comorbidity and patient expectation in addition to surgical proficiency or technique. The random effects (inverse variance) model was therefore more appropriate than a fixed effects meta-analysis. Differences in the selection criteria between studies may affect the variance, i.e. SD. More restrictive inclusion criteria may reduce the variance and increase the weight given to that particular study using an inverse variance model. This would influence the meta-analysis result. Furthermore, as described, the high heterogeneity reduces the integrity of the weighted mean/overall size of effect.
Nonetheless, the individual mean changes and individual effect sizes can always be considered by the reader without such inter-study statistics/meta-analysis. Each study has therefore had a separate description of their patient characteristics, interventions and outcomes to allow the reader to make a qualitative assessment based on the narrative review in addition to the attempted quantitative meta-analysis. However, with extremely high heterogeneity, the meta-analysis results should be interpreted with caution.
What this review contributes to our understanding
Despite hundreds of articles having been published on FAI,42,43 this review includes only nine studies in the qualitative review and six studies in the meta-analysis. It therefore may not represent the true effect seen in other studies that did not meet the inclusion criteria. Previous systematic reviews have covered a wide range and combination of treatments which vary in approach (open/arthroscopic/combined) and technique (femoral/acetabular osteochondroplasty, chondral and labral procedures). Unlike previous general reviews,44–47 this review focuses on the outcomes of arthroscopic femoral osteochondroplasty, where objective clinical outcome scores are available. The general nature of previous reviews made it difficult for authors to perform inter-study statistics. Kemp et al. reviewed FAI studies in 2010 and calculated the individual sizes of effect. 48 They included studies that would not fit our inclusion criteria, i.e. not all patients underwent arthroscopic femoral osteochondroplasty, and they included groups of ‘pure pincer’ patients. Our study focuses on the effect and treatment of cam lesions, i.e. pure cam and mixed patients. It would otherwise be difficult to compare and contrast so many different studies. It is appreciated that this review does include studies of varied additional chondral and labral procedures, but this is very typical of FAI treatment in day-to-day practice. It would have been interesting to have focused on pure cam rather than including mixed FAI, but there was extreme paucity of such eligible studies.
Although the studies included in this review are of a ‘low level’ of evidence, 6 and are particularly prone to selection/reporting/publication/other biases, their cohorts experienced statistically and clinically significant improvements in their MHHS and NAHS. One author has previously highlighted difficulty in getting a highly motivated and opinionated group of patients to agree to randomisation. 49 Nevertheless, McMaster University is currently recruiting patients for a randomised controlled trial of cam/mixed FAI with two treatment arms: arthroscopic femoral osteochondroplasty versus lavage. 50 Their results are expected in 2015.
Conclusion
We believe this review is the first to calculate a weighted mean difference in the NAHS but it remains necessary to improve the quality of FAI research. Until then, this systematic review does support a benefit from arthroscopic femoral osteochondroplasty for cam lesions but patient selection and magnitude of benefit remains unclear.
Footnotes
Declaration of conflicting interests
None declared.
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
