Abstract
Background:
Preterm infants are at risk for neonatal morbidity, transiently reduced thyroid hormone (TH) levels, and impaired visual abilities. To determine the interrelationship between these factors, we measured TH levels in the period ex utero and compared their visual abilities with those of term infants at 6 months (corrected) of age.
Methods:
The preterm group consisted of 62 infants stratified by gestational age: Group A (23–26 weeks, n = 10), Group B (27–29 weeks, n = 23), Group C (30–32 weeks, n = 19), and Group D (33–35 weeks, n = 10). Controls were 31 healthy full-term infants. In the preterm group, free thyroxine, triiodothyronine, and thyroid-stimulating hormone levels were measured at 2 and 4 weeks of life and 40 weeks postconceptional age. All infants were assessed for visual acuity, contrast sensitivity, and color vision using electrophysiological techniques.
Results:
Compared with controls, the preterm infants demonstrated reduced contrast sensitivity at low temporal frequencies and slower blue–yellow color processing. Groups did not differ from controls in visual acuity. In the preterm group, reduced contrast sensitivity and slow blue–yellow and red–green color vision processing were associated with low TH levels, low gestational age, and several medical morbidities.
Conclusions:
Our findings signify that some of the weak visual abilities in preterm infants can be accounted for, in part, by their reduced TH levels in the early postnatal period.
Introduction
One potential factor contributing is the transient reductions in thyroid hormone (TH) level that typically follow preterm delivery (11,12). It is well established that TH is critical for normal brain development, including development of structures necessary for visual processing (13 –15). During pregnancy, maternal TH crosses the placenta into the fetal circulation (16). Because of the delayed maturation of the fetal thyroid system, maternal TH is the fetus' sole source of hormone in early gestation (17), and over the course of gestation, this reliance on maternal TH supply diminishes to assume full thyroid function at term. However, because preterm infants are severed prematurely from the mother and consequently her supply of TH, they often experience an insufficiency of TH in the period ex utero. This state of reduced TH, which is known as transient hypothyroxinemia of prematurity (THOP) (18), can also be exacerbated by neonatal illness (19) and illness severity (20). Relevantly, THOP has been linked to suboptimal neurodevelopment (21 –23), including weak visuospatial abilities (12,24). Although treatment with L-thyroxine has been shown to improve outcome in infants with THOP, a benefit was only seen in infants born before 29 weeks gestation (25,26) and the specific effects of TH treatment on subsequent visual functioning were not directly examined.
In this study, we used objective electrophysiological techniques involving visual evoked potentials (VEPs) to examine whether early postnatal TH levels are related to later visual abilities, namely contrast sensitivity, visual acuity, and color vision. In light of animal studies showing early TH insufficiency affects retinal development (15), we predicted a relationship between visual deficits and low early postnatal levels of TH in preterm infants and that this relationship would be further modified by factors relating to timing of birth and neonatal illness.
Methods
Participants
The preterm sample consisted of 67 infants recruited from the Sunnybrook Health Sciences Centre (SHSC) in Toronto, Canada, a University of Toronto-affiliated hospital supporting Levels I, II, and III neonatal nurseries. The infants were drawn from all eligible sequential cases at 23–35 weeks gestation between November 2004 and January 2007. One investigator (E.A.) informed all prospective parents about the study during child's first week of life, and if they were in agreement, she obtained via heelprick a sample of the child's blood (0.5 cc) at 2 weeks of life.
Controls (Group E, 37–42 weeks gestation) were 38 infants born during the same period in the SHSC Level I neonatal nursery. Based on careful review of hospital records by the investigator, only parents of healthy full-term infants were solicited via letter for participation.
As per the standard procedure in Ontario until May 2006,* all infants were screened at 2–3 days of age for both phenylketonuria and congenital hypothyroidism, with the latter using the thyroid-stimulating hormone (TSH) test. Subsequently born infants were screened for congenital hypothyroidism (CH) and up to 21 other diseases. Any infant with a positive screen or known or suspected congenital or chromosomal abnormality (e.g., down syndrome) was excluded. Additional exclusion criteria were as follows: child or maternal history of thyroid disease; maternal exposure to organic solvents, alcohol, and other substances of abuse; maternal history of anticonvulsant or antirhythmic medication during pregnancy. Of the original 67 preterm participants, two in Group C were subsequently diagnosed with congenital hypothyroidism and one in Group D died while one infant from Group C, one from Group D, and seven from Group E (who participated in our study of infant attention at 3 months of age) dropped out prior to the 6-month vision assessment. Final sample sizes were Group A = 10, Group B = 23, Group C = 19, Group D = 10, and Group E = 31. There were 13 sets of twins in the study: 4 in Group B, 5 Group C, 3 Group D, and 1 Group E.
The overall study permitting recruitment, blood sampling, and medical chart review was approved by the SHSC Research Ethics Board. All subsequent follow-up phases were approved by The Hospital for Sick Children (HSC), where all of the assessments were carried out. Parents or guardians of all participants provided signed informed consent.
Measures
Demographic data were obtained on gestational age, birth weight, gender, and socioeconomic status (SES) using the Hollingshead index (27). For each preterm neonate, the following medical morbidities were recorded from the child's medical chart at SHSC: highest bilirubin point and age at peak point; presence of sepsis (positive blood or cerebral spinal fluid culture), respiratory distress syndrome, patent ductus arteriosus (PDA), bronchopulmonary dysplasia, or necrotizing entercolitis; evidence of severe white matter brain injury (periventricular leukomalacia, late ventriculomegaly, peri- or intraparenchymal hemorrhage); use of surfactants or phototherapy; and ROP. Except for ROP and bilirubin, for which the highest stage and zone and highest point and age at peak, respectively, were recorded, each morbidity factor was categorized as present or absent.
TH levels were determined for preterm infants at 2 and 4 weeks of age and 40 weeks postconceptional age (PCA). However, given the overlap, or near overlap, between the 4- and 40-week PCA sampling for Group D, they were only sampled at 2 and 4 weeks of life, and their 4-week results were substituted in the database for their 40-week PCA interval values. Later blood samples were drawn at SHSC or a suburban satellite hospital to which the infant was transferred. If the child was discharged before collection of the second or third samples, a nurse traveled to the home and drew the blood. All samples were assayed in the Department of Laboratory Medicine at HSC for levels of free thyroxine (fT4), total triiodothyronine (T3), and TSH using the Bayer Immuno 1 or Bayer IMS analyzer.
To assess vision, two noninvasive VEP methods were used: (i) sweep VEP for contrast sensitivity and visual acuity and (ii) transient VEP for color perception. The sweep method presented vertical black-and-white sinusoidal (striped) gratings using PowerDiva software (v.1.8.5; Smith Kettlewell Eye Research Institute, San Francisco, CA) to generate stimuli and record responses. To measure contrast thresholds, VEPs were recorded while infants viewed a grating of fixed spatial frequency (0.5 cycles per degree [cpd]), which was “swept” from low (0.5%) to high contrast (20%) at three temporal frequencies (6, 10, and 15 Hz). To measure visual acuity, infants viewed gratings of fixed contrast (80%) that were “swept” across spatial frequencies from 3 to 23 cpd. For further details on stimulus presentation and threshold estimation, see articles by Mirabella et al. and Till et al. (28,29). Average levels of contrast sensitivity and visual acuity across trials were provided by the PowerDiva system for each infant. However, if an average score was unavailable for a particular condition, we then derived individual trial scores using Power Diva tools and recorded the score with the largest signal to noise ratio of above 4, if possible (30).
The transient VEP system served to measure responses to chromatic stimuli. Stimuli consisted of three vertical sinusoidal (striped) gratings of low spatial frequency (0.5 cpd) presented in an onset–offset (100–400 ms) mode at 2 Hz. One grating was presented along a tritanopic confusion axis, which produced modulation of the short (S) wavelength sensitive cones; a second grating was presented along an axis orthogonal to this, which modulated long (L) and medium (M) wavelength sensitive cones selectively; and a third grating (achromatic condition) was presented with all cones modulated proportionally. The stimuli were generated using VisionWorks Stimulus Maker software (Vision Research Graphics, Durham, NH) and evoked responses were acquired on a Dell Dimension 4100 and analyzed using the NeuroScan 4.2 program (Compumedics Neuroscan and Compumedics DWL, Charlotte, NC). Stimuli were presented in a pseudo-random order and evoked responses were averaged over a minimum of 30 presentations per stimulus type. Recordings were deemed reliable if the P1 waveform components from each of the two averaged recordings were within 10 ms of each other. If not deemed reliable, no response was recorded in the statistics database for that condition. Waveform latencies were measured from pattern onset to peak of the component [see Ref. (29) for complete details].
All VEPs were administered at 6 months corrected age and measured using the International 10–20 system (31) with active electrodes placed over the infant's scalp at O1, Oz, and O2, and at Cz (reference) and Pz (ground). During testing, each infant sat on the parent's lap and viewed the monitor binocularly from a distance of 100 cm. Child's attention was attracted to the stimuli by dangling a small toy ∼2 cm from the screen. At the end of testing, each infant received an ophthalmologic examination for refraction and fundus examination purposes.
Data management and analyses
Any missing hormone and visual ability data were imputed by using the mean value of the child's gestational group or a further value adjusted for child's overall level of performance (31). For the contrast sensitivity data, replacement involved first adjusting the gestational-age group mean value by child's score relative to the subgroup mean for the other two temporal conditions if these data were available; otherwise, the gestational-age group mean was used. For visual acuity, we used the gestational-age subgroup mean. Regarding color vision, we imputed missing values accordingly: (a) for those infants who had a reliable achromatic response but no reliable chromatic VEP waveform, we substituted their missing red–green or blue–yellow latencies with the mean of the corresponding gestational-age group adjusted by child's achromatic latency relative to the one for the gestational-age group; (b) for infants without an achromatic latency, the gestational-age group mean was used. This set of procedures resulted in an average of four imputed values per gestational-age group. For each subgroup, all outliers were also replaced by winsorizing (32), an established procedure whereby the outlying value was replaced with the score of the nearest neighbor within the 95th confidence range.
One-way analyses of variance (ANOVA) was used to test for group differences in participant demographics, nonbinary indices of illness, and TH levels. A homogeneity of variance test was used to account for unequal sample sizes between groups, and for significant variables, we reported the Brown-Forsythe statistic. Post hoc analyses employed the Dunnett's T3 or Dunnett t-test, where applicable, comparing each preterm group with controls. Chi-square analyses evaluated group differences on binary (present/absent) indices of illness. Repeated measures ANOVA served to compare TH levels at the different time periods within the preterm group. Multivariate ANOVA tested for group and gender differences in sweep and transient VEP measures with the Dunnett t-test for post hoc analyses. If gender was significant for any measure, multivariate analysis of covariance (MANCOVA) with gender as covariate was used.
To assess the combined contribution of demographics (gestational age, birth weight, gender, SES), neonatal illness, and TH levels on visual functioning in the preterm group, each condition significantly differentiating preterm and control groups was independently correlated with each predictor variable. Factors showing significant associations were entered together into a multiple linear regression to determine their combined impact on visual ability. A two-tailed significance level of 0.05 was applied to all analyses.
Results
Table 1 shows participant demographics by gestational-age groupings. Groups differed significantly (p < 0.01) in gestational age, birth weight, SES, and gender composition but not mean APGAR* scores at 1 or 5 minutes. As expected, birth weight was strongly correlated with gestational age, but because birth weight is expected to covary with gestational age (infants born earlier in gestation are typically smaller), birth weight was not used as a covariate. Although SES was lowest in Groups A and B, it was not used as a covariate because it did not correlate with visual abilities. The results indicated that Group A had more females than males (90% vs. 10%) and Group C had more males than females (84% vs. 16%), whereas males and females were roughly equally distributed in the other three groups. Gender was therefore used as a covariate in assessing for group differences in visual ability.
Data are presented as mean ± standard deviation (SD) (gestational age, birth weight, and bilirubin), as medians (APGAR), and as percentage of the group demonstrating presence of each medical morbidity.
SES, socioeconomic status; ns, not significant; NA, not available; PDA, patent ductus arteriosus.
Table 1 also summarizes the different gestational age groups' medical morbidity findings. Significant differences were seen for presence of respiratory distress syndrome (p < 0.001), PDA (p < 0.001), use of surfactants (p < 0.01) or phototherapy (p < 0.01), any evidence of brain injury (p < 0.05), and highest point (μmol/L) of bilirubin (p < 0.001), all reflecting higher morbidity in the lower gestational age groups. A trend toward significance (p = 0.075) was seen for ROP stage, reflecting the higher incidence of ROP in infants born before 30 weeks gestation. No differences were seen in frequency of sepsis, bronchopulmonary dysplasia, necrotizing enterocolitis, and age at peak bilirubin.
Table 2 summarizes the 2-, 4-, and 40-week PCA TH levels for preterm groups. For TSH, no group differences and no significant changes in TSH over time were observed. However, a significant interaction was seen between group and time of blood sample (p = 0.039), reflecting the greater decline in TSH levels over time for infants born before than after 32 weeks gestation.
Data are presented as mean ± SD.
TSH, thyroid-stimulating hormone; fT4, free thyroxine; T3, triiodothyronine; PCA, postconceptional age.
For fT4, significant group differences were seen at all three time points (p < 0.05). Post hoc tests indicated that Group A had significantly lower 2-week fT4 levels than the other groups, whereas Groups B and C differed from D but not each other. Four-week fT4 levels were significantly lower in Group A than C. For 40 weeks PCA values, a trend (p = 0.051) was seen for lower fT4 values in Groups A, B, and C than Group D. Repeated measures ANOVA revealed significant effects of time of blood sampling (p = 0.022) and a significant group by time interaction (p < 0.001). The latter was accounted for by the larger group differences at 2 weeks of life than at 40 weeks PCA and by the larger increase in fT4 between 2 and 4 weeks in Group A than the others, thus signifying that fT4 increased more with time for infants born with earlier than later gestation.
For T3, significant group differences were seen at 2 and 4 weeks of life (p < 0.001), but not at 40 weeks PCA. Post hoc tests revealed that at 2 weeks, Group A's T3 levels were significantly lower than the others, and Groups B and C did differ from D. At 4 weeks of life, T3 levels were significantly lower in Groups A and B than Group D. Repeated measures ANOVA revealed significant effects of time since birth (p < 0.001) and a significant group by time interaction (p < 0.001), which reflected a greater increase in T3 with time for infants born with lower gestation.
Fundus ophthalmoscopy showed no abnormality in any infant. However, when the groups were examined for refractive error, they differed (p < 0.05) in spherical error, but not cylindrical error size, in both left and right eyes. Post hoc analyses revealed that Group A was different from the other groups. The mean of this group was myopic compared with the normal hyperopic error of 6-month olds (33) (Table 3).
D, dioptres.
On the sweep VEP, groups differed significantly (p < 0.01) in contrast sensitivity only at the slow (6 Hz) temporal frequency (Table 4). Post hoc tests revealed lower sensitivity (p < 0.05) for Groups A, B, and C than controls. Although the other temporal frequencies were not different among groups, contrast sensitivity at the medium (10 Hz) temporal frequency was lower for all preterm groups than controls and a linear relation between detection and gestational-age grouping was observed. Groups did not differ in the fast (15 Hz) temporal frequency condition, reflecting the poor contrast detection of all groups. There was no effect of gender on sweep VEP results. There were no significant group differences in visual acuity.
Data are presented as mean ± SD.
Sample size decreased in these groups on chromatic measures such that for Group B, n = 14; Group C, n = 12; Group D, n = 7; and Group E, n = 20.
p < 0.05, post hoc analysis.
p < 0.001, post hoc analysis.
Groups differed significantly (p < 0.05) on achromatic, blue–yellow, and red–green transient VEP measures (Table 4). Post hoc analyses indicated that on the achromatic and blue–yellow conditions, these group differences were mainly driven by longer latencies in Group A infants, whereas they were driven by the shorter latencies of Group B in the red–green condition. Controlling for gender did not affect the group differences on any condition. However, a main effect of gender on red–green latencies (p = 0.037) revealed that males had shorter latencies than females, contrary to expectation. As these results are based on reliable red–green responses, it is unlikely that our results can be explained by congenital red–green color deficiencies.
The proportional contributions of demographic factors, TH levels, and medical morbidities to visual functioning were examined only for those endpoints on which preterm infants performed significantly more poorly than full-term controls (see Table 5). In the slow (6 Hz) temporal frequency contrast sensitivity condition, the combination of sepsis, PDA, and fT4 at 40 weeks PCA best explained this ability, accounting for 20% of its variance; the strongest predictor was PDA. On the transient VEP, achromatic latencies were best explained by the combination of gestational age, phototherapy use, T3 at 2-week, and fT4 at 40-week PCA, which accounted for 45% of its variance. Although not significant at an individual level, the 40-week PCA fT4 value was the strongest predictor of achromatic responses. For blue–yellow latencies, the combination of gestational age, PDA, bronchopulmonary dysplasia, and fT4 at 2 weeks accounted for 28% of its variance, with bronchopulmonary dysplasia being the strongest predictor. Red–green latencies were best explained by gender and 2-week TSH, fT4, and T3 levels, which accounted for 34% of its variance; the effects of fT4 at 2 weeks were highly significant.
SE, standard error; BPD, bronchopulmonary dysplasia.
Discussion
TH is essential for normal visual development (13). Because infants born preterm are at risk of both transiently reduced TH levels (18) and impaired visual processing (1), we investigated whether differences in their neonatal TH levels could account for visual processing deficits at 6 months (corrected) age. Regression analyses revealed that it was not gestational age alone, but rather gestational age plus low TH levels as well as several medical morbidities that predicted poor contrast sensitivity and slow color vision processing.
A primary finding of this study was that contrast sensitivity effects were mainly driven by infants born at 23–32 weeks gestation, whereas color vision effects were confined to infants born before 30 weeks gestation. These findings suggest that different visual abilities may have discrete time points when affected by medical morbidity and when TH is needed. In our study, lower contrast sensitivity was associated with the presence of sepsis and PDA in the early neonatal period, as well as with decreased fT4 levels at 40 weeks PCA. Although previous research has shown that sepsis in extremely preterm infants contributes to vision impairment at 18–22 months (34) and TH deficiency during pregnancy to large contrast sensitivity deficits in offspring of hypothyroid women or with congenital hypothyroidism (35), this study is the first to demonstrate that low TH, particularly low fT4, is associated with contrast sensitivity deficits in 6-month-old preterm infants.
Lower 2-week T3 and 40-week PCA fT4 levels were associated with longer achromatic latencies in the transient VEP assessments, as were gestational age and phototherapy use. Although animal literature provides strong support for a role of TH in retinal development, the human literature describing such a relationship is virtually nonexistent. Regarding color vision, research with rodents has shown that in the developing retina, TH regulates cone opsin expression (15) and TH receptors are required for the development of green cone photoreceptors (36). Although preterm infants are known to be at increased risk of developing color vision deficits (1), this is the first work to examine the association between TH deficiency and color vision development in children known to be at increased risk of blue–yellow deficits (37). We presently showed that longer preterm blue–yellow latencies were associated with lower 2-week fT4 levels, as well as with shorter gestation and presence of PDA and bronchopulmonary dysplasia. Although preterm infants' red–green vision is usually comparable to that seen in infants born at term (38), with the exception of those infants born with the shortest gestation, we found that the preterm group as a whole had faster red–green responses than controls. In preterm infants, longer red–green latencies were associated with lower 2-week fT4 and T3 levels. Thus we demonstrated an association between hypothyroxinemia due to preterm birth and color vision development.
We did not find any effect for visual acuity, consistent with the extant literature showing either accelerated development (39) or no effect of birth before 32 weeks gestation (18).
Although this study is the first of its kind to examine how early TH levels influence several core visual abilities in preterm infants, it is not without limitations. First, and most notably, the sample sizes were small in Groups A and D, for whom recruitment was more difficult than anticipated. Indeed, the lack of significance in the 10 Hz contrast sensitivity condition, despite a clear difference between subgroups and a stepwise increase with gestational age, may reflect a lack of power rather than a lack of effect. Second, we experienced loss of some data due to equipment problems. Although a few mothers did bring their babies back when the system was fixed, this was not possible for all cases. Third, a few babies did not have a complete set of TH measurements, despite our concerted efforts to obtain these.
In conclusion, hypothyroxinemia in preterm infants, particularly their low postnatal fT4 levels, is associated with suboptimal visual processing at 6 months corrected age. We hope our findings will encourage further study of the role of TH on brain development following preterm birth as well as the effects of TH supplementation on visual development of extremely preterm infants.
Footnotes
Acknowledgments
The authors are grateful to Kelly Nash and Meagan Williamson for carrying out this study and its many facets, Dr. Christina Gerth for conducting the ophthalmologic examinations, and Jennifer Vaughan for collecting blood samples at SHSC and from discharged infants in their homes and for sending them to HSC. Funding for this research was provided by the March of Dimes, RESTRACOMP, and the Vision Science Research Program.
Disclosure Statement
The authors declare that no competing financial interests exist.
*
This was when the Provincial screening laboratory moved from Toronto to the Children's Hospital of Eastern Ontario in Ottawa and the program was expanded to 22 diseases.
*
APGAR scores (appearance, pulse, grimace, activity, respiration) are used to assess the health of a newborn immediately after birth.
