Abstract
The vocabulary size of 16- to 30-month-old children (N = 1235) was assessed using the Estonian adaptation of the MacArthur–Bates Communicative Development Inventory: Words and Sentences (ECDI-II). The relationship between children’s expressive vocabulary size and different factors of the child and his/her social environment was examined. Results confirm the findings of studies from other languages showing that girls have larger vocabularies than boys, and first-born children are at an advantage compared to later-born children. Children of highly educated parents have bigger vocabularies than children whose parents have a lower educational level; this effect was significant for both maternal and paternal education. It was found that children of less-educated fathers who spend a high amount of time per week (> 40 hours) in daycare have smaller vocabularies than children of fathers with higher educational attainment.
In this article we present normative data for the Estonian adaptation of the MacArthur–Bates Communicative Development Inventory: Words and Sentences (ECDI-II) and report on the relationship between Estonian toddlers’ vocabulary skills and various characteristics of the child and his/her social environment. The main focus is on the variability of expressive vocabulary, which has been thoroughly researched in various language and cultural contexts (e.g. Bates et al., 1994; Eriksson et al., 2012; Le Normand, Parisse, & Cohen, 2008; McNally & Quigley, 2014; Stokes & Klee, 2009; Trudeau & Sutton, 2011; Westerlund & Lagerberg, 2008). The studies demonstrate that across these contexts the developmental pathways are similar: vocabulary development is characterized by high variability, a vocabulary spurt and the subsequent vast growth of vocabulary. Expressive vocabulary is one of the most important indicators of development at the second year of life (Van Hulle, Goldsmith, & Lemery, 2004). Previous studies have shown that language development at these early stages is significantly associated with later cognitive and language development (Botting & Conti-Ramsden, 2000) as well as psychosocial development later in adulthood (Schoon, Parsons, Rush, & Law, 2010). This highlights the importance of studying the sources of individual variation.
It is widely acknowledged that certain characteristics of the child and the child’s social environment, such as the child’s gender (Gleason & Ely, 2002), birth-order (Pine, 1995) and parental educational level (Hoff, 2003), contribute to explaining this variation. Yet some of the findings regarding the effects of the characteristics of the social environment have been inconsistent across studies conducted in different language communities and socio-cultural contexts and vary based on the age of the children studied, as well as methods of assessment, which are discussed in more detail as follows.
Sources of individual variation
A number of studies have investigated gender-related variability in early vocabulary development. It has been consistently demonstrated that girls compared to boys are at an advantage in emerging language skills (Berglund, Eriksson, & Westerlund, 2005; Eriksson et al., 2012; Feldman et al., 2000; Fenson et al., 1994; Huttenlocher, Haight, Bryk, Seltzer, & Lyons, 1991; Hyde & Linn, 1988; Le Normand et al., 2008; Schults, Tulviste, & Konstabel, 2012; Simonsen, Kristoffersen, Bleses, Wehberg, & Jørgensen, 2014; Stokes & Klee, 2009; Wehberg et al., 2008; Zambrana, Ystrom, & Pons, 2012). Several studies with methodological approaches ranging from parental-report measures of child language to speech sampling of toddlers’ verbalizations show that from about the second year of life girls’ vocabularies tend to grow much more quickly than boys (Bornstein, Hahn, & Haynes, 2004). According to some researchers, this might be due to the differences in boys’ and girls’ physiological development and maturation (Le Normand et al., 2008). It is also explained by the differences in parental input to their sons and daughters (Hoff & Naigles, 2002; Huttenlocher et al., 1991). There have been some indications in previous research that gender differences at the first stages of vocabulary acquisition do not warrant the need for separate norms for boys and girls (Kern, 2007). For example, Fenson and colleagues (1994) argue that even though there is a slight advantage for girls, the variance in early language skills that is explained by gender is small (1–2%).
Another frequently analysed variable associated with variation in language growth is birth-order (Berglund et al., 2005; Fenson et al., 1994; Hoff-Ginsberg, 1998; Jones & Adamson, 1987; Oshima-Takane & Robbins, 2003; Oshima-Takane, Goodz, & Derevensky, 1996; Reese et al., 2015; Zambrana et al., 2012). The findings about the effect of birth-order on language outcomes are controversial. For example, it has been suggested that first-born children are at an advantage in some aspects of their speech (e.g. clausal diversity, Huttenlocher, Waterfall, Vasilyeva, Vevea, & Hedges, 2010), have bigger vocabularies (Gayraud & Kern, 2007; Hoff-Ginsberg, 1998) and learn 50 words a month earlier than their second-born siblings (Pine, 1995). By contrast, Oshima-Takane and her colleagues (1996) compared spontaneous language data of 32 children and found that second-born children are more advanced in pronoun production, while not differing in general language development compared to their first-born siblings. A reason for the divergent findings may lie in the various measures used to assess children’s language abilities in these studies. For example, Jones and Adamson (1987) found birth-order related differences in children’s language skills using parental-report measures but not with spontaneous speech samples.
Hoff-Ginsberg (1998) has shown that mothers use longer utterances with their first-born rather than later-born children. Berglund and her collaborators (2005) point out that, while later-born children have to compete with their siblings for their mothers’ attention, this can be alleviated with the attention they get from their first-born siblings, in addition to learning from the overheard conversations between caregivers and older siblings (Oshima-Takane et al., 1996). It is likely that there are differences in the social environment of the first-born and later-born child but it is uncertain how these differences in the environment influence language development.
Family socio-economic status (SES) – most frequently indexed by maternal education (Hoff, 2006) in studies of child language – is another widely examined predictor (Rowe, 2008). In their study, Huttenlocher and colleagues (2010) found that SES is a highly significant predictor of language, with either maternal education or family income as the SES measure. According to their explanation the influence of SES on language growth is mediated by the differences in caregiver speech. It has been suggested that mothers with higher educational levels offer their children a richer environment in which different words can be acquired (Hoff, Laursen, Tardif, & Bornstein, 2002). General knowledge and beliefs about children’s development as well as parenting and conversational style have been found to be the factor mediating the effect of parental education on language development (Hoff, 2003; Rowe, 2008). For example, it has been shown that highly educated mothers talk more and use a richer vocabulary, use less directive speech, ask more questions, answer their children’s questions more frequently and start conversations more often with their children as compared to mothers with lower educational attainment, who are more focused on regulating their children’s behaviour (Hoff et al., 2002; Hoff-Ginsberg, 1998). Similarly, Fernald, Marchman, and Weisleder (2013) found that at 18 months, children from lower-SES families had significantly lower vocabulary scores and they were less efficient in real-time processing of language than children from higher-SES families at the same age. Since different cultures vary based on their socio-economic structure, the extent of the influence SES has on early vocabulary development might vary across cultures (Berglund et al., 2005).
In recent years more attention has been paid to the second caregiver in the family – the father. Pancsofar, Vernon-Feagans, and the Family Life Project Investigators (2010) argue that the greater emphasis on fathers stems from the changing roles of fathers in the lives of their children, with less stereotyped roles of fathers, and increasing cultural and linguistic diversity among fathers (Pancsofar, Vernon-Feagans et al., 2010). This reflects in the social environment of the child, which is associated with children’s developmental outcomes. It is widely acknowledged that sensitive, nurturing and cognitively stimulating interactions between fathers and their children are associated with children’s linguistic and cognitive abilities (Cabrera, Fagan, Wight, & Schadler, 2011; Cabrera, Shannon, & Tamis-LeMonda, 2007) as well as social and emotional development (Cabrera et al., 2007). Tamis-LeMonda, Shannon, Cabrera, and Lamb (2004) explain that fathers who are more educated and economically secure have more positive exchanges with their children, thus providing an opportune environment for the child’s vocabulary to grow. For example, Tamis-LeMonda and her colleagues (2004) found that fathers’ as well as mothers’ supportive parenting predicted children’s outcomes, whereas fathers’ education and income were directly associated with child measures and indirectly via the quality of mother–child engagements.
Daycare facilities offer another different learning environment for young toddlers where their socio-communicative skills develop. From the point of view of emerging language skills it is necessary to receive a lot of child-directed and referential language at the early stages of language development. In studies of language development and non-parental care (Berglund et al., 2005; National Institute of Child Health and Human Development [NICHD], 2000), it has been proposed that childcare quality – indexed by number of employees per child, employee’s educational level – is the most important variable associated with language development (Berglund et al., 2005). High-quality care occurs when caregivers provide frequent warm, stimulating and responsive interactions with children, as well as clear intentional instruction (Li, Farkas, Duncan, Burchinal, & Vandell, 2013). It is uncertain if a daycare employee is able to pay enough personal attention to each of the 14 children (the average number of children per adults in Estonian daycare groups) and engage every child in one-to-one conversations throughout the day which would result in the growth of vocabulary. Previous studies have reported that children who experience high-quality care in early childhood have higher cognitive, language and pre-academic skills prior to school entry (Li et al., 2013) as well as a reduced risk for late talking (Lekhal, Zachrisson, Wang, Schjølberg, & von Soest, 2011). In a large-scale longitudinal study with Dutch preschoolers, Luijk and colleagues (2015) investigated associations between non-parental care and language development of preschoolers, using the Dutch CDI as one of the assessment methods. Their analyses showed that after controlling for the effect of other variables (i.e. ethnicity, SES, gender, parity) more time spent in non-parental care is associated with better language skills. They also found that more hours spent in non-parental care in the first year of life are associated with a drop in language proficiency at 1;0–1;6 years of age, but this effect was reversed at later age and followed by language acceleration in preschool children (Luijk et al., 2015). In another example, Bleses and her collaborators (2008a) analysed Danish 0;8 to 1;8 year olds’ daycare attendance, based on the amount of hours children spend at daycare in a week, in relation to children’s language comprehension scores on the Danish CDI. They found a high and consistent decrease in the comprehension scores from children who attend daycare a few hours per week to children who attend a medium number of hours per week and children who attend daycare a high number of hours per week, but no significant difference between children who do not attend daycare and those who do (Bleses et al., 2008a). These studies suggest that daycare attendance benefits some aspects of children’s language proficiency at certain stages of development.
Communicative Development Inventories
Development of parent-report measures (i.e. MacArthur–Bates Communicative Development Inventories; Fenson et al., 1994) in language acquisition research has provided an important tool for gathering data from large representative samples in a cost-efficient manner. Longitudinal studies have shown the predictive validity of the parental-report measures as well as the stability of the CDI scores across time and different assessment methodologies (Bates, Dale, & Thal, 1995; Gatt, Grech, & Dodd, 2014; Perez-Pereira & Resches, 2011). For example Ring and Fenson (2000) compared the parent-reported CDI score with laboratory assessments of children’s expressive and receptive language and found a significant correspondence between the two different approaches. In a more recent study, Gatt et al. (2014) reported that children’s CDI score is significantly related to word use elicited through picture naming and vocabulary sampled naturalistically through play. Fenson et al. (1994) reason that parents have broad knowledge of their children’s language abilities and parental-report measures can give a better overview of the range of children’s linguistic competencies, because they are based on parents’ knowledge and experience of their children’s behaviour across different situations. Although there are some limitations related to parental reports, such as report bias – some parents might overestimate their children’s skills, others underestimate – Bates et al. (1995) conclude that parental reports give an accurate view of the variation and developmental changes from 8 to 30 months of age.
Due to their high reliability and validity as well as applicability in research and clinical settings, the Communicative Development Inventories have now been adapted for up to 60 different languages across the world (Dale & Penfold, 2011). Data from a wide range of languages help understand early language development. The current study makes a contribution to the broader documentation of variation in early language development by compiling data from a less studied language.
The Estonian language belongs to the Finnic branch of the Finno-Ugric languages and is spoken by approximately 1 million people. Estonian is an agglutinative type of language and it is characterized by a large number of cases (14 productive cases), no grammatical gender (either of nouns or personal pronouns), no articles (either definite or indefinite) and differentiation between three quantities both in vowels and consonants (see Sutrop, 2004 for detail). The adaptation of the ECDI-II (Schults, Tulviste, & Haan, 2013; Tulviste, 2007) was based on the ECDI-I (Schults et al., 2012; Tulviste, 2007). Both adaptations followed closely the original American versions (Fenson et al., 1994), and the Polish (Smoczynska, unpublished material) adaptations of the CDI. In constructing the inventory the main purpose was to follow the example of the original CDI vocabulary checklist by keeping the same semantic categories and the amount of words roughly the same. Due to cultural differences a number of words were added and/or replaced across the vocabulary checklist. In piloting the ECDI-II, additional information was gathered from the parents about words their children use most frequently. Based on this information and data analysed following the CHILDES conventions (MacWhinney, 2000) from Estonian mother–child spontaneous speech samples, an adapted version of the ECDI-II was created, with 631 words in 21 semantic categories.
Current study
Based on the literature reviewed above, the main aim of our study is to determine the predictors of expressive vocabulary development in a sample of Estonian 16- to 30-month-old toddlers. Our first hypothesis assumes that age is the main predictor of vocabulary scores at ages 16–30 months. Our second hypothesis states that girls, first-born children and children of highly educated mothers and fathers have a bigger vocabulary compared to boys, later-born children and children of less-educated parents, recruited respectively. Lastly, the third hypothesis assumes that children who spend more time at daycare have smaller expressive vocabularies compared to children who do not attend daycare.
Method
Participants
This study is based on data drawn from a large number of children who made up the standardization sample for the Estonian adaptation of the MacArthur–Bates Communicative Development Inventory: Words and Sentences (ECDI-II; Schults et al., 2013; Tulviste, 2007). The distribution of the sample by age and gender is given in Table 1. Children with serious health problems were excluded from the sample. Children who have contact with other languages besides Estonian are included in the sample (N = 131). An ANCOVA with age as a covariate did not reveal significant differences in the vocabulary size of children who are in contact with other languages compared to children who are brought up in an Estonian-only language environment.
Distribution of participants by age and gender.
The sample consisted of 1235 16- to 30-month-old native Estonian speakers of whom 48% were boys and 52% girls. The distribution of the sample according to birth-order, parental education and daycare attendance is given in Table 2. Among the sample 60% of children were first-born (N = 718), whereas 474 children were born second or later (40%). With regard to parental education, 42% of the mothers (N = 498) and 32% of the fathers (N = 352) had acquired tertiary education. This is reflective of the educational distribution among the Estonian population. Based on the OECD adult education level indicator, 38.4% of 25- to 64-year-old Estonians have acquired tertiary education, which is one of the highest among the EU member states (Organisation for Economic Co-operation and Development [OECD)], 2015). The indicator is especially high for Estonian women (46.8%), while it is 29.4% for Estonian men.
Children’s observed and adjusted mean expressive vocabulary size by gender and factors of the child’s social environment.
Notes: Obs. mean = observed mean; Adj. mean = mean vocabulary adjusted by age.
Approximately 66% of the 16- to 30-month-olds attended daycare at the time of assessment (N = 783). On average the children started daycare at 18 months of age (SD = 7.5), they spend 4 days of the week (SD = 1.54) and on average 6.4 hours per day (SD = 2.93) in daycare; 33% of the children are at daycare for approximately 25–40 hours (N = 373). This is quite representative of the Estonian population. Based on the data from 2013, 75% of Estonian 1- to 6-year-olds attend daycare or preschool institutions with approximately 20% of them being 2 years of age or under and the distribution of boys to girls is fairly equal (48% vs 51%). (Statistical Yearbook of Estonia, 2014).
Material and procedure
The ECDI-II, covering 16–30 months of age, is a parental-report measure of children’s semantic and grammatical development (Eriksson et al., 2012; Schults et al., 2013; Tulviste, 2007). It consists of two major parts – expressive vocabulary checklist and early grammar.
The participants were recruited through childcare facilities, health clinics, parenting groups and websites by posting advertisements. Parents were asked to fill in the ECDI-II form and a background information sheet containing questions about children’s developmental history, socio-demographic characteristics of the family and language(s) spoken in the home.
Parents were presented with a list of 631 words in 21 semantic categories (see Table 3) and asked to mark the words their children produce on their own. The parent was instructed to mark the words even if the child had used a different pronunciation or a synonym for the word represented in the form. If the parent was unsure whether or not his/her child uses certain words they were instructed to observe their child for a couple of days and then fill in the form. The forms were mostly filled in by mothers, with a few instances where fathers or both parents had noted themselves as filling in the form. Table 3 shows the number of items in the semantic categories and the scale reliabilities (Cronbach’s alpha) for each category in order of appearance in the vocabulary checklist section of the ECDI-II.
Summary of number of words and scale reliabilities for each of the semantic categories in order of appearance in the vocabulary checklist of the ECDI-II.
Analyses
First, we examined the internal consistency of the ECDI-II vocabulary checklist. Descriptive statistics with observed and adjusted vocabulary scores (by age in months) for the whole sample are given in Table 2. Chi-square tests were used to analyse associations between the factors of the child and his/her social environment. Finally multiple regression analyses were carried out to determine the predictors of expressive vocabulary size. Age (in months) was treated as a continuous variable; gender was treated as a categorical predictor with two values (boys – 0, girls – 1). Birth-order was dichotomized into a variable with two values (first-born children – 0, second or later-born children – 1). Parental education (maternal and paternal separately) was coded into variables with three categories each (0 – primary/vocational, 1 – secondary/post-secondary studies, 2 – tertiary education/graduate degree) to represent low, medium and high educational level respectively. Daycare attendance was coded as a variable with four categories based on the average hours per week children spend at daycare (0 – children not attending daycare, 1 – at daycare 1–24 hours per week, 2 – at daycare 25–40 hours per week, 3 – at daycare more than 40 hours per week). For the regression equation parental educational level and daycare attendance were treated as categorical variables and were therefore dummy-coded into several binary variables. In addition to observing the main effects of these variables, we included a full set of interactions into the final regression equation.
Results
Reliability
Reliability of the ECDI-II vocabulary checklist was examined for the entire sample based on the internal consistency of scores for each of the semantic categories. Calculations of Cronbach’s alpha yielded a coefficient of .961 overall, which shows high internal consistency and is comparable to previously published values (Berglund & Eriksson, 2000; Bleses et al., 2008b; Simonsen et al., 2014) including the CDI normative study (Fenson et al., 1994). Thus, the ECDI-II appears to be a reliable measure of vocabulary development in normally developing Estonian toddlers 16–30 months of age (Figure 1). Percentile scores for boys’ and girls’ expressive vocabulary size across five age groups where age is scored in 3-month intervals are given in Table 4.

Percentile scores of expressive vocabulary size by children’s age (grouped in 3-month intervals) and gender.
Percentile scores for boys and girls’ expressive vocabulary size on the ECDI-II across five age groups in 3-month intervals.
Predictors
Chi-square tests were used to examine the relations between age, gender, birth-order, daycare attendance and parental educational level (Table 5). Significant associations were observed between age and daycare attendance with the number of daycare attendees growing as children get older. Additionally, daycare attendance was significantly related to both maternal and paternal education. Among children who attend daycare and whose mothers have a high educational level most of the children are at daycare 1–24 hours per week (41%, N = 128). Among the low and medium maternal education categories, the proportion of children who are at daycare 1–24 hours is approximately 20% (N = 43 for low maternal education; N = 57 for medium maternal education). Based on paternal education, the distribution is similar, with highly educated fathers having children who spend less time in daycare (43%, N = 94), compared to children of fathers with low or medium educational levels where approximately 70% of the children are at daycare from 25 up to 40 or more hours per week (N = 171 for low paternal education, N = 172 for medium paternal education). Lastly we observed a significant association between maternal and paternal education, with mothers and fathers having spouses with similar educational attainment. While there are observed significant associations between the factors of interest, tests to see if the data met the assumption of collinearity indicated that multicollinearity was not a concern. The Variance Inflation Index (VIF) for all the factors of interest was close to 1.
Associations between predictor variables age, gender, maternal and paternal educational level, daycare attendance and birth-order (chi-square tests).
p < .05.
Sources of individual variation
A hierarchical multiple linear regression analysis was carried out to ascertain the extent to which child’s age, gender, birth-order, parental educational level (maternal and paternal separately) and daycare attendance predicted children’s expressive vocabulary size. In the first step we included age, gender and birth-order in the regression equation, and obtained a statistically significant model explaining 46% of the variance in expressive vocabulary (R2 = .46, F(3,1190) = 335.4, p < .001). In the next step we included the dummy-coded categorical predictors maternal and paternal education and daycare attendance into the regression equation, which significantly improved the predictive power of the regression model (R2 = .49, F(10,1190) = 114.6, p < .001). In the next step we included a full set of interactions among age, gender, birth-order, maternal and paternal education as well as daycare attendance into the regression equation, out of which only age by gender and paternal high educational level by daycare attendance more than 40 hours per week appeared as significant predictors of children’s expressive vocabulary. Since the age by gender interaction introduced multicollinearity into the model, we opted to remove the age by gender interaction from the final regression model, because the effect of these variables is already represented by age and gender as separate predictors. The age by gender interaction indicates that gender differences become more pronounced as the children grow older and their vocabularies grow larger. There is a levelling off at the end of the observed age range, due to a ceiling effect among children at 30 months of age (see Figure 1). To facilitate interpretation, we only added the paternal education and daycare attendance interaction term into the final model. This yielded a model explaining 50% of the expressive vocabulary size (R2 = .50, F(12,1190) = 98.6, p < .001). The inclusion of the interaction term into the regression equation accounted for an additional 0.01% of the variance. The raw and standardized coefficients for the predictors in the three observed regression models with R2 change statistics are presented in Table 6.
Summary of hierarchical multiple regression analysis examining the main and interaction effects of child’s age, gender, birth-order, maternal and paternal education and daycare attendance in predicting expressive vocabulary size.
Notes: *p < .05; **p < .01; ***p < .001. aMaternal and paternal education were represented as two dummy variables respectively, with low educational level (primary, vocational education) serving as the reference category. bDaycare attendance was represented as three dummy variables with children not attending daycare as the reference category. cOnly the significant interaction term is presented here.
Age was the strongest predictor of expressive vocabulary size, explaining most of the variation in the sample. However gender, maternal and paternal education and birth-order still accounted for a significant amount of the variance. Gender differences were statistically significant with girls scoring on average 65 words more than boys. In total gender accounted for approximately 2% of the variance in expressive vocabulary size. Children whose mothers have acquired a university degree have significantly bigger vocabularies than children whose parents have a low educational level even when age, gender and birth-order are controlled for. On average, high maternal education accounted for 1% of the total variance in expressive vocabulary size. Children who attended daycare more than 40 hours per week had on average a smaller vocabulary compared to children who do not attend childcare, which accounted for an additional 1% of the variance in expressive vocabulary size. Both results were highly statistically significant (p < .001). In addition to observing the main effects of our explanatory variables, we observed interactions among the predictor variables and found that daycare attendance for more than 40 hours per week predicts children’s vocabulary as a function of paternal education. Figure 2 presents the unstandardized predicted vocabulary scores for children according to their father’s educational level and daycare attendance. It can be observed that the biggest differences relating to paternal education are among children who attend daycare more than 40 hours per week, out of whom children of highly educated fathers score on average 350 words on the CDI (SD = 97, N = 38) and children of fathers with a low educational level score 131 words less (mean = 219, SD = 96, N = 98). These effects are significant after accounting for the effect of other explanatory variables.

Predicted mean scores of expressive vocabulary size as a function of paternal educational level and daycare attendance.
Discussion
This study examined the relations between child’s age, gender, factors of the child’s social environment and children’s expressive vocabulary in a representative sample of Estonian 16- to 30-month-old toddlers. The results demonstrate a significant effect of age, gender, birth-order and parental education as well as daycare attendance on toddlers’ vocabulary size. Our data corroborate the findings of earlier studies showing that vocabulary grows rapidly from 16 to 30 months of age, indicating that the ECDI-II is a sensitive measure of communicative development at this early age range. Another important finding was that Estonian girls scored consistently higher than boys, first-born children had bigger vocabularies than later-born children and children of highly educated parents had bigger vocabularies than children whose parents had lower educational attainment; this effect was significant for both maternal and paternal education. Daycare attendance also revealed an interesting link with children’s vocabulary, showing that daycare attendance is associated with vocabulary size, more specifically spending a high amount of time (> 40 hours) in daycare per week seems to result in a smaller vocabulary compared to children not attending daycare.
Our results are in accordance with those of Trudeau and Sutton (2011): in their study with 16- to 30-month-old children acquiring Quebec French, age accounted for the highest amount of variance in children’s expressive vocabularies, but gender, maternal education and birth-order were also significant predictors. While the effect of daycare setting, paternal education and whether or not the child lives with one or two parents did not account for any additional variation in Trudeau and Sutton’s study (2011), paternal education and daycare attendance (as well as their interaction) were found to be significantly related to the variance in the present investigation of expressive vocabulary of Estonian children.
In our sample, girls had larger vocabularies than boys in almost every age group. This result is in accordance with a number of studies conducted with the same instrument in various language communities (Berglund et al., 2005; Eriksson et al., 2012; Le Normand et al., 2008). Although Fenson and his collaborators (1994) report an advantage in favour of girls in their analysis, they argue that gender accounts for only 1–2% of the variance in children’s vocabulary. Similarly, our data indicated a relatively small advantage for girls that accounted for about 2% of the variance in Estonian children’s expressive vocabulary size. Significant gender-related differences suggest the use of separate norms for Estonian girls and boys. For example, the 50th percentile of the expressive vocabulary score for a 22- to 24-month-old is approximately 193 words. But for girls the 50th percentile score at this age is at 226 words and for boys, 160 words. If we look at the 10th percentile, which is often used as a cut-off for children at risk of language delay (Dale, Price, Bishop, & Plomin, 2003), we see that for 22- to 24-month-old girls the 10th percentile score cut-off is at approximately 47 words, for boys at 24 words and for the total sample the 10th percentile is at 37 words. Using combined norms increases the risk of not noticing delay in girls’ early communicative development, therefore applying separate norms for ECDI-II-based assessments increases the sensitivity of ECDI-II and allows for more accurate estimation of the need for further detailed assessments of risk of language delay.
Our results also support the previously reported advantage of first-born children over later-born children in lexical productivity (Berglund et al., 2005; Pine, 1995). It is likely that it is because first-born children are subjected to more child-directed speech. In contrast, the conversational context of a later-born child might often include an older sibling, which results in fewer opportunities for joint attention between the caregiver and the child, which is facilitative of language growth at this early age.
Parental education, whether it be mothers’ or fathers’, appears an influential factor in these early stages of development. Our results indicate consistently that children whose mothers have acquired a post-secondary education are more advanced in their language abilities compared to children whose mothers have not attended university or college. Children whose mothers have secondary education or whose mothers are in the process of acquiring a university degree are also at an advantage compared to children whose mothers have primary or vocational educational levels.
Our results show that paternal education plays an important role in the early vocabulary development of their children. This result is in accordance with a wealth of studies on fathers’ involvement which show that interactions between fathers and their children are positively associated with child cognitive and language outcomes (Pancsofar, Vernon-Feagans et al., 2010; Sarkadi, Kristiansson, Oberklaid, & Bremberg, 2008; Tamis-LeMonda et al., 2004). While it has been consistently found that maternal and paternal educational levels explain the variation in child development, specifically language development (e.g. Hoff et al., 2002; Rowe, 2008), further analysis is needed on the specific measures of parents’ child-directed speech in order to determine which parental variables (e.g. speech measures, communicative style, the amount of talk, contingency of the speech and so on) are associated with child language outcomes.
Our findings revealed a significant interaction indicating that high daycare attendance per week relates to child’s expressive vocabulary as a function of paternal educational level. Children whose fathers have high educational attainment have bigger vocabularies, even if they spend a high amount of time per week at daycare, whereas children who spend the same amount of time in daycare but have less-educated fathers, have significantly smaller vocabularies. This result might indicate that families where fathers are highly educated can provide more stimulating at-home environments and communicative opportunities for their children which facilitate language acquisition, even after the long days children spend at daycare. It is likely that it is because their knowledge is more up to date with scientific approaches to development (see Hoff et al., 2002). This result is in accordance with several studies observing the associations of family SES and the early communicative environment of the child (Dollaghan et al., 1999; Huttenlocher et al., 1991; Zambrana et al., 2012). At the same time, the differences in expressive vocabulary related to parental education may also reflect the differences in how parents fill out the parental-report measure of their child’s vocabulary. Parental-report measures rely on the parents’ knowledge and abilities to recollect and notice their child’s growing vocabulary.
In order to analyse the effect of daycare on vocabulary development, we compared the vocabularies of children who do not attend daycare with those spending a different amount of hours per week in daycare. We looked specifically at the average time children spend at daycare per week and found that children who spend more than 40 hours per week at daycare have significantly smaller vocabularies. The findings differ from those of Luijk and colleagues (), who found in their study with Dutch 1- to 6-year-olds that more hours in non-parental care after the first year of life was associated with better language skills. In our study, no statistically significant differences emerged in the size of vocabulary between children who do not attend daycare and children who go to daycare 1–40 hours per week. At the same time, our findings imply that spending long days at daycare (over 40 hours) is a risk factor for children’s vocabulary development. This result is similar to what Bleses and her colleagues (2008a) found with Danish children, whose word comprehension decreased by the number of hours spent in daycare per week. Our study adds to an accumulating body of evidence indicating too much daycare is related to a smaller expressive vocabulary. Overall, these results show that parental decisions regarding their children’s daycare attendance might have far-reaching consequences. It is likely that these decisions are influenced by the family SES, which in turn determines the age of daycare admittance and the amount of time the child spends in daycare per week. Since the current study design is cross-sectional we cannot draw any conclusion on whether these effects will have a lasting impact on children’s language development. We also do not have data on the size of children’s vocabulary at daycare entry. It is possible that children with a bigger vocabulary at daycare entry interact more with their teachers and peers, and benefit more from attending daycare. Longitudinal designs are needed to study the long-term effects of early daycare attendance.
The current results have several implications. First, determining the importance of gender, birth-order and parental education factors in accounting for the variation of vocabulary development offers support to a substantive amount of previous research. Our findings indicate that different norms should be applied for Estonian boys and girls at ages 16–30 months. This can help childcare specialists assess children’s language abilities more accurately and determine the risk of possible language delays. The association between children’s language production and parental education has to be considered when developing strategies and interventions in order to educate parents from families with lower socio-economic status.
There are some limitations that have to be considered while interpreting the current results. For instance, the assessment of children’s expressive vocabulary development is based on parental reports, which have been shown to be a valid representation of children’s abilities (Bates et al., 1994; Gatt et al., 2014; Ring & Fenson, 2000) but they can be influenced by subjective components associated with the characteristics of the reporter, such as social desirability, value orientation and personality (Bornstein & Putnick, 2012). That is why it is beneficial to include various assessment methods to study child language at these early stages of development.
In conclusion, it can be stated that the vocabulary development of Estonian children has similarities with developmental trends observed in other languages and cultural contexts measured with the same instrument – the MacArthur–Bates Communicative Development Inventory. The results of our study extend the literature on the relationship between expressive language and factors of the child’s social environment by demonstrating the importance of age, gender, birth-order and parental education in predicting the growth of expressive vocabulary on a representative sample of Estonian toddlers. Furthermore, our study demonstrated the negative effect daycare attendance can have on expressive vocabulary size, by providing information on how children of fathers with lower education are at risk if they spend long days in daycare.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study received funding from the Estonian Science Foundation grant No. ETF9033.
