Abstract
Introduction:
Prevention of childhood overweight should start as early as possible preferably in “the first 1000 days of life.” Sleep is one of the modifiable health behaviors during this age period, besides dietary intake and physical activity. The aim of this systematic review is to summarize the existing literature regarding the association between sleep during infancy (age ≤24 months) and body composition measures during childhood (age ≤12 years).
Methods:
We registered the protocol of this systematic review (PROSPERO registration no. CRD42018087088) and conducted the review following the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement. We searched for articles published until July 31, 2019 reporting on longitudinal associations with a minimal follow-up of 6 months. Methodological Quality was assessed and a narrative synthesis was performed.
Results:
We included 19 studies. Sleep was reported as sleep duration (n = 18) or sleep problems (n = 2). Sleep was assessed at least once before the age of 12 months in 14 out of the 19 studies. Methodological quality was rated as strong for five studies, moderate for five studies, and weak for nine studies.
Conclusion:
This narrative synthesis found inconsistent evidence that longer infant sleep duration during the first 2 years of life is associated with a healthier body composition during childhood.
Introduction
Childhood obesity is becoming a permanent problem in our society even at the toddler age.1,2 Because childhood obesity often tracks into adulthood, 3 prevention should start as early as possible, preferably in the first 1000 days of life: between conception and second birthday. 4 Childhood obesity risk factors during the first 1000 days that are modifiable and identified in a recent systematic review include: higher maternal prepregnancy BMI, prenatal tobacco exposure, maternal excess gestational weight gain, and accelerated infant weight gain. 5 Less consistent evidence was found for curtailed infant sleep, bedtime routine with bottle feeding at age 9 months, bottle feeding at age 24 months, and introduction of solid food intake before age 4 months. 5 Curtailed infant sleep and bedtime routine have recently been under the attention of pediatricians and youth health care workers and healthy sleep is part of newly developed lifestyle interventions during infancy.4,6–10 Infant sleep problems can be problematic and include sleep latency and night awakenings, both leading to reduced sleep duration, quality, and efficiency (% of time spent in bed that the child really sleeps). Systematic reviews and meta-analysis so far have looked at the association and dose–response effects of sleep duration at age 0–4 years11,12 and age 0–18 years13–15 and concluded that there is an inverse association between sleep duration and weight gain. From these reviews we cannot conclude if infant sleep duration is a determinant of childhood overweight, as these reviews did not investigate infant age separately. Also, new studies in infants have been published that are not included in the last reviews. Infant sleep is different from childhood sleep in timing (frequent day-time naps, nighttime wakening's), initiation (feeding, pacifiers, room or bedsharing), and optimal measurements (diaries vs. accelerometry detecting the duration of laying and sleeping in bed or other places).
Infancy (age 0–2 years) is a defining period in life and overweight prevention should ideally start as early as possible. Sleep is one of the modifiable health behaviors during this age period, besides dietary intake and physical activity. The aim of this systematic review is to summarize the existing literature regarding the association of sleep duration and sleep problems during infancy (age ≤24 months) with body composition outcomes during childhood (age ≤12 years).
Methods
Protocol and Registration
We registered the protocol of this systematic review with the International Prospective Register of Systematic Reviews (PROSPERO; Registration no. CRD42018087088; available from www.crd.york.ac.uk/PROSPERO/display_record.php?ID=CRD42018087088. We conducted the review following the Preferred Reporting Items For Systematic Reviews and Meta-analysis (PRISMA) statement for reporting systematic reviews and meta-analysis. 16
Information Sources and Search Strategy
We searched Ovid MEDLINE and EMBASE for articles published from inception until July 31, 2019 in collaboration with a clinical librarian (J.G.D.). Detailed search terms and MeSH terms are provided in Supplementary Table S1. The search strategy can be displayed as the following Boolean combination of concepts: ([children 0–2 years] AND [sleep duration] AND [body composition and adverse cardiometabolic outcomes]) OR [relevant cohorts| -studies]. No language limitation or filter was applied. In addition, we reviewed the reference lists from the relevant articles and earlier published review articles.
Study Selection
Eligible studies were longitudinal studies with a minimal follow-up time of 6 months, reporting the determinant sleep (either duration or problems) at any age between 0 and 24 months and as outcome BMI, another indicator of weight status up until 12 years of age. We included all definitions of sleep problems or different measurements of sleep quality or sleep duration. Two independent reviewers (M.W.H. and T.G.M.V., I.H., or K.O.A.) screened titles and abstracts using Rayyan, a web-app for screening titles and abstracts. 17 Reviewers were blinded for each other's decision and discrepancies were discussed until consensus was reached. Full texts were screened for eligibility by M.W.H. and in case of doubt screened by T.G.M.V. as well. If more than one study analyzed associations in the same children at the same age, we selected the study that reported the most complete analysis for that age period, preferring continuous outcomes over binary outcomes and panel data over linear regressions on single determinants.
Data Extraction
Data extraction was performed by M.W.H. and checked by T.G.M.V. or I.H. for all articles, using a predefined data extraction table. Discrepancies were discussed until consensus was reached. We chose to present the adjustment model with the most complete set of covariates, also if this included confounders that could be considered potential mediators in a causal pathway. We additionally report confounders that were considered, but not included in the final model. Table 1 presents associations with BMI or weight-for-age outcomes, Table 2 associations with other body composition measures.
Summary of Observational Studies Assessing the Association between Sleep Duration and Indicators of Body Composition Other Than BMI/Weight Sorted by Methodological Quality Assessment and Age at Exposure
BIA, bioelectrical impedance; DEXA, dual energy X-ray absorptiometry; FMI, fat mass index; FFMI, fat-free mass index; SES, socioeconomic status.
Summary of Studies Assessing the Association between Sleep and BMI or Weight for Age z-Score/Category Sorted by Methodological Quality Assessment, and Age at Exposure
Only studies with strong quality are included in the narrative synthesis on the association between sleep duration and BMI- or weight-for-age z-scores.
BISQ, Brief Infant Sleep Questionnaire; CDC, Centers for Disease Control and Prevention; CFI, Comparative Fit Index; CI, confidence interval; IOTF, International Obesity Task Force; OR, odds ratio; RMSEA, root mean squared error of approximation; SE, standard error; SES, socioeconomic status; WHO, World Health Organization; +, significant association; 0, no significant association.
Methodological Quality Assessment
For the assessment of the methodological quality of the studies, we used a tool adapted from the “Effective public health practice project quality assessment tool,” see Supplementary Data. 18 We choose this tool as a specific tool on quantitative analysis in the public health field. We customized our criteria list and the corresponding judgment rules so they were fitting longitudinal cohort and interventional studies. We assessed the following five dimensions: selection bias, confounding, measurement, study attrition, and data analysis. Each dimension was judged as strong, moderate, or weak based on two questions. For the global quality rating of an article, no weak ratings resulted in a strong rating; one weak rating in a moderate rating; and two or more weak ratings in a weak rating. For a strong quality of the dimension confounding, the analysis had to adjust for at least: maternal BMI, socioeconomic status/maternal education/income, ethnicity, and birthweight/dysmaturity/baseline BMI. Studies that were not adjusted for maternal BMI were qualified as weak, regardless of other covariates, as we value maternal BMI as an important predictor for childhood BMI. 19
The customized tool was pretested by assessing four articles by three independent researchers and improved by specifying sleep duration measurement methods. At the same time, researchers also improved their quality assessment skills by discussing discrepancies in these first four articles. Thereafter, all articles were assessed by two independent researchers (M.W.H. and T.G.M.V. or I.H.) using the tool as displayed in Supplementary Table S2. If needed, references were checked for study protocol details and selection procedures. Inconsistencies were discussed and resolved between the two researchers.
Narrative Synthesis
A best evidence synthesis was applied to summarize the results for the association between sleep duration and body composition outcomes and draw conclusions regarding the level of evidence (Table 3). We based the rating system on that of Chinapaw et al., 20 consisting of four levels (i.e., strong, moderate, inconsistent, and insufficient). The levels of evidence take into account the number, the methodological quality, and the consistency of outcomes of the studies:
Level of Evidence in Narrative Synthesis, Presented Per Study, for the Association between Sleep Duration and Body Composition Measurement
Strong evidence: consistent findings in multiple (≥2) high-quality studies.
Moderate evidence: consistent findings in one high-quality study and at least one moderate-quality study, or consistent findings in multiple (≥2) moderate-quality studies.
Inconsistent evidence: inconsistent findings in multiple (≥2) studies.
Insufficient evidence: only one study available.
Following the directions of Chinapaw et al., 20 results were considered consistent when at least 75% of the studies showed results in the same direction and when the results were significant (p < 0.05). If there were two or more high-quality studies, we disregarded the studies of low methodological quality in the evidence synthesis. Associations are considered significant per study if 75% of the tests found significant associations in the same direction per outcome.
Results
Description of Included Studies
We identified 7539 unique records through database searches and 4 records through hand search. After titles and abstracts were screened, 7471 records were excluded (Fig. 1). We assessed 77 full-text articles for eligibility and excluded 58 full texts (reasons displayed in Fig. 1). Four studies were excluded after comparison with other publications from the same cohorts as they studied similar associations, but with binary outcomes or single outcomes vs. panel data.7,21–23

PRISMA flow diagram for the identification, screening, eligibility, and inclusion of studies. PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analysis.
We included 19 studies in the systematic review (Tables 1 and 2). Two studies were based on analyses from the same cohort; project Viva,24,25 presenting different follow-up times. Data across studies involved 51,963 unique participants. Studies were published between 2008 and 2019, with 11 of the 19 published in 2016 or later. All studies had a longitudinal design: 14 observational studies24–37 and 5 randomized clinical trials6,38–40,53 were included.
Sleep was reported as sleep duration (n = 18) or sleep problems (n = 2). Sleep was assessed at least once before the age of 12 months in 14 out of the 19 studies. Sleep duration was measured by parental report of sleep duration (n = 8), Brief Infant Sleep Questionnaire (n = 2); 24 hours activity diary (n = 2); and accelerometry (waist worn n = 2, thigh worn n = 1). Eight studies reported a combined sleep duration (deprivation score) for an age range. Sleep duration was reported as 24 hours sleep duration (n = 14); nighttime sleep duration (n = 3); both night- and daytime sleep duration separately (n = 1); nap duration (n = 1); or sleep variability (n = 2). Sleep problems were defined by the Zuckerman definition by two articles.26,34 Alamian compared three sleep definitions: Zuckerman 41 ; Richman 42 ; and Lozoff.26,43
Methodological quality was rated as strong for five studies, moderate for five studies, and weak for nine studies. Weak ratings were often due to selection bias, low response rates at baseline, and poor study attrition at follow-up (Supplementary Table S3).
Association between Sleep Problems and Weight-for-Age z-Score or BMI Category
Two studies reported sleep problems using the Zuckerman definition, that is, waking three or more times per night, waking event lasting at least 1 hour, or parental report of “severe” disturbance. Using this definition, Sha et al. found no significant association between sleep problems at age 1, 3, 6, 8, and 12 months and weight-for-age z-score at the same ages in a pooled effects model. 34 Alamian et al. found that infants with sleep problems at age 6 and/or 15 months had 1.7 times higher odds of overweight at age 12 years [confidence interval [95% CI] 1.05–2.97], but they found no association with obesity. 26 With only these two studies from moderate and low quality we concluded there is no evidence for an association between infant sleep problems and childhood body composition.
Narrative Synthesis from Studies on Sleep Duration
We decided that a meta-analysis of the association between sleep duration and body composition measurements was not in place, due to high levels of heterogeneity in outcomes and ages. Instead we performed a narrative synthesis on the association between sleep duration and BMI or weight-for-age z-score or category (Table 1) and other body composition measures (Table 2). We did not include studies that assessed sleep problems as determinant. Found associations are displayed per study in Table 3.
Sleep Duration and BMI or Weight-for-Age z-Score or Category
With more than two high-quality studies available, we used only the five high-quality studies of Table 1 for this evidence synthesis. Four studies report inconsistent findings, with only significant associations for certain subgroups,28,37 follow-up times, 33 or ages. 29 Therefore, we concluded there is inconsistent evidence for an association between infant sleep duration and childhood BMI or weight-for-age z-score/category.
Kupers et al. found that per 1 hour increase in 24 hours sleep duration at age 4 months, weight-for-age between 6 and 12 months decreased with 0.00011 (95% CI −0.00020 to −0.00002), but associations were smaller and not significant at age 12–24 months. 33 Derks et al. found that in the Generation R cohort per 1 hour increase in 24 hours sleep duration at age 2 months, there was a significantly lower BMI SD score at age 6 years. [−0.018 (95% CI −0.026 to −0.009)], but for sleep duration at age 6 and 24 months there was no significant association after adjustment for confounders and baseline BMI. 29 The same first author found no association in the PEAS Kids Growth Study, where sleep duration was measured at 2, 4, and 12 months by 24-hour diary and BMI measured at age 4, 6, and 10 years. 40 Zhou et al. found a significant negative association between sleep duration and BMI in Malay infants in their longitudinal mixed model and no significant association in Chinese and Indian infants in Singapore. In the overall group they found no significant association. 37 Collings et al. only found a significant association after stratification by ethnicity. They measured sleep at age 6–36 months and found that per one SD higher sleep duration in South Asians, BMI at the same age range decreased with 0.030 SD (95% CI −0.061 to −0.00016). For white families this association was smaller and nonsignificant. 28
Sleep Duration and Other Body Composition Measures
With less than two high-quality studies for all except one outcome, we used all eight studies of Table 2 for the narrative synthesis on other body composition measurements. There was inconsistent evidence for an association between sleep duration and Fat Mass Index. Derks et al. found in Generation R a significant inverse association with Fat Mass Index at one out of three ages at which sleep duration was measured. 29 The same first author found no association in the PEAS Kids Growth Study at three different ages at which sleep was measured and three ages at which Fat Mass Index was measured. 40 Diethelm et al. compared consistent short sleepers at age 18 and 24 months with consistent long sleepers and found that short sleepers had a 0.03 higher Fat Mass Index gain between age 2 and 7 years (standard error 0.11, p = 0.04). 30
There was no evidence for an association between sleep duration and fat-free mass index, % body fat, sum skin folds, and waist circumference (Table 3).
Discussion
This systematic review summarizes the existing evidence on the longitudinal association between sleep duration and sleep problems during infancy (<24 months) and indicators of body composition during infancy or childhood. We assessed the methodological quality of each study and performed a narrative synthesis. There was inconsistent evidence for an association between infant sleep during the first 24 months of life and BMI, weight-for age z-scores and Fat Mass Index during childhood. The significant associations reflect small effects; each hour off additional 24-hour sleep during infancy was associated with a 0.0001 to 0.0300 point lower BMI. There was no evidence for an association with fat-free mass index, sum of skin folds; % body fat; and waist circumference.
Comparison with Earlier Reviews
This systematic review summarizes all longitudinal studies on infant sleep duration (age period 0–2 years) and childhood overweight and obesity. Prior reviews of longitudinal data for sleep during age 0–4 years11,12 and age 0–18 years13–15 concluded that there is an inverse association between childhood sleep duration and weight gain, but that the underlying explanatory mechanisms are still uncertain. We cannot confirm this association in infants. Evidence for an association with sleep duration during infancy could also come from interventional studies. So far, results from three intervention studies on promotion of sleep quality and sleep duration during the first months of life have been published. Two of them showed healthier weight at age 1–2 years, but not necessarily due to longer sleep duration.8,53
The first explanation for the lack of consistent evidence for an association is that sleep duration at infant age may only have a temporary effect on body composition, but not a longer-term effect. If so, this opens opportunity to change sleep habits at an early age to improve later healthy sleep habits and body composition. A second explanation for our findings can be the lack of high-quality studies. Methodological quality was strong for only 5 out of 19 studies. These cohorts were more representative of the base population from the start and had lower study attrition during follow-up. We encourage more high-quality studies on infant sleep and growth, specifically in the first 6 months of life, as existing studies have considerable risk of bias. A third explanation may be the different methods of sleep measurement across studies.
Measurement of Sleep During Infancy
The measurement of sleep duration and quality during infancy is complex due to a combination of longer nighttime stretches of sleep and shorter daytime naps and subtle differences in movements between awake stages and sleeping stages. The three most common methods of measurement are: parental report by questionnaire (e.g., Brief Infants Sleep Questionnaire); parent-reported 24-hour activity diaries; and accelerometry. A recent study by Tikotzky examined how well these three agree in the assessment of nocturnal wakefulness, but also sleep duration. They found that parents are only aware of nocturnal awakenings that involve signaling (e.g., crying), which makes parental report less valid after the age of 6 months when infants increasingly learn to self-soothe. They advise to use both objective and subjective measures of sleep, especially when measuring nocturnal wakefulness after the age of 6 months. They also found that sleep duration was more than 30 minutes longer based on diary report vs. accelerometry at the ages of 3, 6, 12, and 18 months. They additionally conclude that parents report infant sleep more accurately by diary than by the Brief Infants Sleep Questionnaire. 45 Accelerometry can provide an objective estimate of movement, but does not detect the difference between sleep and wakefully lying still in small infants. It is considered a valid method to estimate sleep duration in older children,46,47 but under the age of 2 years, it is so far only valid for nighttime sleep duration.
Sleep patterns and habits during infancy and toddlerhood differ across cultures.48,49 According to the National Sleep Foundation, appropriate sleep duration for healthy newborns (0–28 days after birth) ranges from 14 to 17 hours; for infants (0–12 months) from 12 to 15 hours; and for toddlers (1 and 2 years of age) from 11 to 14 hours. 50 The cutoff values for short sleep duration that are used in the studies in this systematic review vary and might have altered the findings of individual studies and therewith of our review and narrative synthesis.
Strengths and Limitations
A strength of this review is that we have performed a very thorough literature search guided by a clinical librarian (J.G.D.). Our search identified 19 longitudinal studies, of which only 6 were included in prior reviews. Our search for outcomes included all different body size measurements and body composition measurements. Another strength is that we selected longitudinal studies with a follow-up of at least 6 months, so we could study the sequence of events. Reversed causation was minimized by adjusting for baseline body composition in many studies. We could not investigate causal mechanisms due to a lack of studies that investigated these. We selected the most complete reported model of each study for data extraction, even if this included confounders that could be regarded as potential mediators in the causal pathway of an association between sleep and body composition outcomes. Ideally these covariates should be taken out of all models across studies, something that could be done in a future pooled analysis. A weakness of this review and narrative synthesis is heterogeneity of the studies. The measurements of the determinant (continuous 24-hour sleep duration) and outcomes (age- and sex-adjusted BMI or weight-for-age scores) are very similar in all five strong-quality studies in the narrative review, but not the age at which they were measured and the analyses did not consider the same confounders.
Implications
Despite the lack of evidence for an association between infant sleep and childhood body composition, attention to adequate sleep is important. Both parents and professionals need to be aware that adopting healthy sleep habits might be easier during early infancy than in later childhood and adulthood. The inconsistent finding in our narrative synthesis might be due to a lack of strong-quality studies and due to heterogeneity. There is a need for more strong-quality observational studies. We recommend future observational and interventional studies to incorporate other sleep measurements than sleep duration alone in their study design and to measure sleep at different ages (starting between age 1 or 6 months) with a combination of objective and subjective measures.45,46,51 Objective measures are accelerometry or video recording 46 and subjective measures are 24-hour activity/sleep diaries and validated questionnaires, in which parents report sleep duration and regularity of bedtime as well as sleep quality, and sleep problems. 52
We assessed the methodological quality of each study with an adapted tool. Using this we have rated the validated questionnaires stronger than accelerometry, no matter of the specific outcome measurement. We recommend future researchers to qualify accelerometry in combination with parental report more accurate than parental report alone when nighttime or 24-hour sleep duration is measured after infant age of 6 months. 45 For the measurement of infant sleep problems, we recommend standardized questionnaires (e.a. Brief Infant Sleep Questionnaire).
Conclusion
This systematic review provides a thorough overview of studies investigating the longitudinal association of sleep during infancy (<24 months) and body composition during childhood. There was inconsistent evidence for a longitudinal association between infant sleep duration and child body composition.
Footnotes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
