Abstract
BACKGROUND:
Bronchopulmonary dysplasia (BPD) is used to clinically describe the severity of lung disease and to serve as a common surrogate endpoint for long-term pulmonary morbidity in clinical trials, but its performance as a surrogate end-point warrants evaluation. Our objective was to assess real-world performance of BPD as a surrogate marker for long-term pulmonary outcomes.
METHODS:
We performed a systematic review of large, multi-centered, blinded, randomized control trials to evaluate the use of BPD as a surrogate marker for long-term pulmonary outcomes. Long-term pulmonary outcomes occurred within two years and included measures of hospital utilization, respiratory illness, respiratory medication, and mortality. Direction and magnitude of effect were evaluated using number needed to treat analysis.
RESULTS:
Five studies were included in our review. Studies varied in definition of BPD and in long-term outcomes measured. Only one study found a significant, consistent risk reduction in both BPD and any long-term pulmonary outcome. Two studies found significant reductions in long-term pulmonary outcomes with a non-significant reduction in BPD.
CONCLUSIONS:
BPD is an imperfect surrogate marker for long-term pulmonary outcomes. It did not consistently predict the magnitude or direction of the effect of an intervention on longer-term pulmonary outcomes. Furthermore, there was significant variation in the definitions of BPD and in the long-term pulmonary outcomes used. There is a need for future work to identify more predictive surrogate markers and a need for better standardization of assessments of long-term pulmonary outcomes.
Introduction
Bronchopulmonary dysplasia (BPD) was first described in 1967 by Northway and colleagues as a group of clinical, radiological, and pathologic findings in infants with chronic lung disease [1]. The lungs of these preterm infants were acutely injured leading to inflammation, fibrosis, and smooth muscle hypertrophy. Since that time, the definition of BPD has evolved to reflect the changing patient population and pathology of the disease [2–7]. In 1988, Sheehan et al proposed a new definition for BPD of oxygen dependence at 36 weeks post menstrual age (PMA), which could serve as a useful predictor of abnormal pulmonary outcomes in the first two years of life. They believed that this definition more accurately portrayed the increasing incidence in earlier gestation infants and better reflected long-term pulmonary outcomes when compared to the previously used definition of oxygen use at 28 days [3]. Given the lack of uniformity in the administration of supplemental oxygen, efforts have been made to standardize the definition of BPD. Subsequently, Walsh et al. proposed a physiological definition designed to account for discrepancies in oxygen administration across institutions, and the National Institute of Child Health and Human Development, the National Heart, Lung, and Blood Institute and the Office of Rare Diseases proposed a new severity-based definition of BPD [8, 9]. Currently, the diagnosis of BPD is used to clinically describe the severity of lung disease and continues to serve as a common surrogate endpoint for long-term pulmonary morbidity in clinical trials, but its performance as a surrogate end-point for trials warrants evaluation.
The National Institutes of Health put forth a preferred definition of a surrogate endpoint as “a biomarker that is intended to substitute for a clinical endpoint” [10]. According to the US Institute of Medicine of the National Academies, evaluating biomarkers consists of three steps: 1) analytical validation, to ensure that biomarker tests are reliable, reproducible, and adequately sensitive and specific; 2) qualification, to ensure the biomarker is associated with the clinical outcome of concern; 3) utilization analysis, to determine that the biomarker is appropriate for the proposed use [11]. A proper surrogate marker should go beyond correlation by accurately predicting a clinical endpoint with high positive and negative predictive values [12].
Recently, it has been suggested that BPD is an inappropriate surrogate marker [13–15]. Critics point to the evolution of the basis of the term to a clinical, functional definition (i.e. the need for oxygen) rather than one based on histopathology, as well as a lack of evidence supporting its use as a sufficiently predictive surrogate marker for long-term outcomes. Therefore, we have performed a systematic review designed to assess real-world performance of BPD in clinical trials as a surrogate marker for long-term pulmonary outcomes.
Methods
We conducted a PubMed search using the search terms “bronchopulmonary dysplasia” or “chronic lung disease,” “premature,” and “infant.” Search limitations included randomized control trials and clinical trials, English language, human subjects, age group “birth to 23 months,” and publication dates from January 2000 to May 2015 with full-text available on-line. Study enrollment was limited to at least 200 subjects.
We included only published articles in the systematic review. Study design was limited to multi-center, blinded, randomized control trials. Cohort studies, case-control studies, abstracts, case series, and review articles were excluded from analysis. Controls consisted of a separate group of randomly assigned patients receiving either alternative treatment or placebo.
To be included in the review, papers were required to have used bronchopulmonary dysplasia or chronic lung disease as a primary outcome. We included studies using any definition of BPD including the traditional definition (any oxygen requirement at 36 weeks PMA), physiological definition (ventilator support, >30% oxygen, or oxygen saturation under 88% on room air with a step-wise reduction in oxygen if receiving under 30% oxygen at 36 weeks PMA), as well as study definitions. Studies were required to include long-term respiratory outcomes or have a follow up study with long-term pulmonary outcomes. Long-term pulmonary outcomes evaluated included healthcare utilization (e.g. physician visits, admissions to hospital), respiratory illness (e.g. asthma, wheezing), use of respiratory medications (e.g. bronchodilators, inhaled or systemic steroids, diuretics), use of oxygen post discharge, and mortality. Evaluated outcomes occurred within the first two years of life.
Follow-up studies that included long-term pulmonary outcomes were searched for and included for studies that met initial search criteria but lacked long-term outcome data.
Studies were identified by one author (BKC) and reviewed for eligibility by two co-authors (AMH, ANT).
Results
The PubMed search returned 361 articles. After review, five studies met inclusion criteria [16–24]. All studies were multi-center, blinded, randomized control trials with study size ranging from 209 to 1,316. Follow-up rates ranged from 69.8% to 93.5% . All studies reported summary statistics. Two studies were sufficiently powered to evaluate for BPD and long-term outcomes, [18, 23] while the remaining studies were powered for BPD but not long-term outcomes.
Patient population
The studies differed in their patient populations (Table 1). Four of the five studies had a mean or median gestational age between 25–27 weeks and birth weights between 750–900 g. The DINO trial enrolled neonates with a mean gestational age (GA) of 30 weeks and a mean birth weight (BW) over 1300 g. Four of the five studies excluded infants with major congenital abnormalities or severe co-morbidities [16–19, 22]. Three of the five studies had an eligibility requirement of receipt of respiratory intervention (e.g. supplemental oxygen, mechanical ventilation) [17–19, 22]. Two studies [17, 22] compared inhaled nitric oxide (iNO) to placebo, one study [18] compared intra-tracheal recombinant human CuZn superoxide dismutase (r-h CuZnSOD) to placebo, one study [16] compared high docosahexaenoic acid (DHA) diet to standard DHA diet, and one study [20, 23] compared continuous positive airway pressure (CPAP) to surfactant and mechanical ventilation as well as low target oxygen saturation (85–89%) to high target oxygen saturation (91–95%).
Summary of studies meeting inclusion criteria
Summary of studies meeting inclusion criteria
BW, birth weight; DHA, docosahexaenoic acid; ER, emergency room; GA, gestational age; ICU, intensive care unit; iNO, inhaled nitric oxide; PMA, post-menstrual age; RAD, reactive airway disease; RDS, respiratory distress syndrome.
Duration of study follow-up ranged from 12 months to 18–22 months, with two studies [17, 18`] ending at 12 months, two studies [16, 22] ending at 18 months, and one study [23] ending at 18–22 months. All studies conducted parent or primary caregiver interviews at 12 months of age. The studies otherwise varied in frequency of follow-up. Two studies also conducted in-person assessments of study participants.
All studies evaluated rate of BPD in the initial study and/or secondary analysis. The studies used various definitions for defining BPD with two studies [18, 23] evaluating multiple definitions of BPD (Table 1). Although the observed rate of BPD was lower in the intervention group than the control group for all five studies, statistical significance was only achieved in the NO CLD trial (Table 2).
BPD and long-term respiratory outcomes
BPD and long-term respiratory outcomes
BPD, bronchopulmonary dysplasia; ER, emergency room; ICU, intensive care unit; NNT, number needed to treat; OR, odds ratio; RAD, reactive airway disease; RB, risk benefit; RR, relative risk. Significant data in bold. aTraditional definition of BPD at 36 weeks. bPhysiological definition of BPD at 36 weeks. cTraditional definition of BPD at 36 weeks with abnormal findings on chest radiograph. dData from follow-up study. eRisk benefit for survival without chronic lung disease.
In stratified analyses, the DINO trial found a significant difference in rate of BPD between intervention groups among those with BW < 1250 g (p: 0.04) and male gender (0.03); however, there were no significant differences between genders (p: 0.20) and BWs (p: 0.84). The Early iNO trial also found a significant difference in rates of BPD for infants with BW 1000–1250 g (p: 0.001). The SUPPORT trial found a significant difference in rate of BPD by the traditional definition with secondary randomization between target oxygen saturation (p: < 0.01).
All studies included utilization of healthcare services as a secondary outcome (e.g. hospitalizations, emergency room (ER)/physician visits) (Table 1). Three studies reported respiratory illness (e.g. asthma, signs and symptoms of asthma) as a secondary outcome [16, 23]. Four studies used requirement for respiratory medications (e.g. bronchodilators, steroids, diuretics, oxygen) [17, 23]. One study used infant mortality at one year as a secondary outcome [22].
Relationship of BPD and long-term respiratory outcomes between intervention groups
Healthcare utilization
Long-term measures of healthcare utilization included any hospitalizations, respiratory hospitalizations, ER/physician visits, intensive care unit (ICU) readmissions, and mechanical ventilation during readmission. When comparing the impact of the treatment on BPD and post-neonatal intensive care unit (NICU) healthcare utilization, expressed in terms of number needed to treat, two studies [16, 18] showed consistent direction of risk difference, two studies [17, 23] showed mixed direction of risk difference, and one study [22] showed inconsistent direction of risk difference between rates of BPD and rates of healthcare utilization outcomes (Table 2). All studies evaluated the rate of any hospitalization. With the exception of the Early iNO trial, the remaining studies showed non-significant reductions in rates of any hospitalization. The Early iNO trial showed a non-significant increase in the rates of any hospitalization. Both trials evaluating the rates of respiratory hospitalizations showed non-significant changes between the intervention and control groups. The NO CLD trial and the oxygen saturation study arm of the SUPPORT trial showed increases in respiratory hospitalizations that were inconsistent with the reductions seen in BPD. The CPAP study arm of the SUPPORT trial showed a consistent reduction in respiratory hospitalizations. The three studies that evaluated ER/physician visits found mixed directions of effect when compared to rates of BPD. The only significant change was seen in the CPAP study arm of the SUPPORT trial which observed a decrease in the rate of ER/physician visits despite a non-significant decrease in the rate of BPD. Non-significant reductions in the rates of ER/physician visits were seen in the oxygen saturation study arm of the SUPPORT trial and the SOD trial. The Early iNO trial found a non-significant increase in ER/physician visits that was inconsistent with the reduction in BPD. They also found inconsistent, non-significant increases in ICU readmissions and mechanical ventilation during readmission.
Respiratory illness
Long-term measures of respiratory illness included asthma or asthma-like symptoms, cough, and other respiratory illnesses such as pneumonia. Of the three studies that evaluated for asthma or asthma-like symptoms, one study [16] used asthma only, one study [17] used wheezing or whistling in the chest only, and one study [23] used asthma, asthma-like symptoms, and other respiratory illness such as reactive airway disease and bronchiolitis as secondary outcomes. All measures of respiratory illness showed consistent risk reduction with BPD; however, the only significant reduction was seen in the SUPPORT trial when evaluating wheezing or whistling in the chest apart from colds (Table 2). The magnitude of the number needed to treat (NNT) relative to the NNT of BPD varied by study. The DINO trial had a risk difference less than that of BPD, the NO CLD study had a risk difference greater than that of BPD, while the SUPPORT trial had risk differences both less than and greater than that of BPD.
Respiratory medication
Long-term measures in the four studies evaluating respiratory medication use included respiratory illness requiring asthma medications, inhaled steroids, systemic steroids, diuretics, and home and follow-up oxygen use [17, 23]. Of these four trials, two showed consistent, significant decreases in the rates of respiratory medications (Table 2). The NO CLD trial showed a significant, consistent risk reduction between rates of BPD and use of all respiratory medications with reductions greater than the reduction seen in BPD. The SOD trial showed a significant, consistent risk reduction in respiratory illness requiring asthma medications with a greater reduction than that for BPD. The SUPPORT trial found mixed, non-significant direction of risk differences in both study arms. The Early iNO trial showed an inconsistent, non-significant risk increase in follow-up oxygen use with no difference in any/home oxygen use.
Mortality
Only the Early iNO trial used infant mortality as a long-term secondary outcome. This study showed a consistent, non-significant risk reduction in mortality at 1 year corrected age (NNT: 21.2).
Results stratified by BPD
Only the SUPPORT trial stratified results by BPD. They found significant increases in the rate of long-term respiratory outcomes in those with BPD for all measures of healthcare utilization, all measures of respiratory illness except cough for more than 3 days without a cold, and all measures of respiratory medications except use of systemic steroids and nebulized medications (p < 0.05). No significant differences were seen in those three measures.
Summary of significant results
A consistent relationship between BPD and subsequent markers of respiratory morbidity was not seen in the trials studied (Table 3). Only the NO CLD trial found a significant decrease in rates of BPD. This study also saw significant decreases in the use of respiratory medications including oxygen, but in no other outcome measures. The SOD study also found a significant decrease in respiratory illness requiring asthma medications, but failed to find a significant decrease in BPD. Similarly, the SUPPORT study failed to find a significant decrease in BPD, but did see significant decreases in measures of healthcare utilization and respiratory illness.
Summary of significant interactions of bpd and long-term respiratory outcomes
Summary of significant interactions of bpd and long-term respiratory outcomes
We performed a systematic review designed to evaluate interventional effects on BPD and long-term pulmonary outcomes to see if BPD is performing as a consistent surrogate marker of later respiratory morbidity in neonates in large, multi-center, randomized control trials. The results of this review indicate that BPD has not performed as a strong surrogate marker for long-term pulmonary outcomes. It did not consistently predict the magnitude or direction of the effect of an intervention on longer-term pulmonary outcomes.
In the trials reviewed, BPD may have identified infants at higher risk for respiratory disease overall, but was not highly predictive of requirement for medication and/or treatment. Only the NO CLD trial found a consistent, significant decrease between BPD and any measure of long-term pulmonary outcomes. Of those outcomes, only a decrease in respiratory medications including oxygen was significant. Moreover, while that study found consistent reductions in the need for respiratory medications, the reduction exceeded that of BPD. The SOD trial found a similar magnitude of reduction in the need for respiratory medications, albeit without a significant decrease in rates of BPD.
Taken together, studies varied both in the definition of BPD, as well as the in the long-term respiratory outcomes evaluated. The variation in the definition of BPD creates differing populations of patients under the term BPD, making it difficult to evaluate its use as a surrogate marker for long-term pulmonary outcomes. Similarly, there is not one marker for long-term pulmonary outcomes. While many of the studies used similar themes to evaluate long-term pulmonary outcomes, the definitions of the outcomes measures used varied significantly. For example, multiples studies evaluated asthma and asthma-like symptoms; however, where one study used “asthma” as an outcome measure [16], a second study used multiple wheezing episodes to evaluate asthma [23]. There is a need for both validated questionnaires to assess clinically relevant symptoms, as well as physiologic measurements to complement such. The lack of standardization in assessment of long-term pulmonary outcomes is a design issue for randomized controlled trials in the field, just as the variety of BPD definitions is. Outcomes for extremely preterm birth infants provide information that is critical for future clinical and policy decisions, and a failure to develop appropriate outcome measures can have far-reaching effects [25]. Steps are currently underway to standardize the definitions of pulmonary outcome assessments in preterm infants, and several studies of pulmonary outcomes in infants have collaborated to standardize wheezing assessments (NCT01601847, NCT00920621) [26, 27].
The findings of this review are consistent with previous findings stating that BPD is an imperfect surrogate marker for long-term pulmonary outcomes [13–15]. Future steps can be undertaken to try to identify other surrogate markers that more accurately predict long-term pulmonary outcomes. Certainly, work has already been done to better characterize extremely pre-term infants with biological, physiological, and clinical data to then provide surrogate markers to predict respiratory morbidity [14, 28].
Limitations of our review include only five studies that met our inclusion criteria. However, this also underscores the paucity of studies directly measuring long-term pulmonary outcomes. The included studies employed diverse methodologies with a lack of standardized definitions of BPD and long-term pulmonary outcomes. As such, it is not only difficult to directly compare rates of BPD between studies, but also how rates of BPD amongst the study populations compare to respiratory clinical endpoints. A further limitation of our study is the focus only on randomized control trials. Other study designs, including longitudinal cohort studies, have documented BPD in the neonatal period and long-term outcomes beyond two years, and analogous study design issues with regards to the functioning of BDP as a surrogate outcome and validation and standardization of ascertainment of long-term pulmonary outcomes. Finally, there was a lack of power for long-term pulmonary outcomes in the trials themselves, so we have reported not only significance levels but also direction and magnitude of effects.
Overall, BPD appears to be an imperfect surrogate marker for long-term pulmonary outcomes at two years of age. Interventions in large, multi-center, randomized control trials showed inconsistent directionality and magnitude of effect between rates of BPD and long-term pulmonary outcomes. Moreover, there is still variation in the definition of BPD used for diagnosis and in the specific long-term pulmonary outcomes used to evaluate infant morbidity. Our aim was to assess real-life performance of BPD in randomized controlled trials; however, a lack of standardization in both the definition of BPD and in pulmonary outcome measures complicates the design of the study and this assessment. Further studies may benefit from using individual patient data analyses in working to identify a more predictive surrogate marker and more standardized assessments of long-term pulmonary outcomes.
Disclosure statements
The authors have no competing financial interests or other conflicts of interest to disclose. The authors received no honorarium, grant or other form of payment to produce the manuscript.
