Abstract
In most studies investigating the educational outcomes of linguistically diverse students, variables that identify this population have been considered as static. In reality, owing to the dynamic nature of students and their families, students’ home language environments change over time. This study aims to understand how elementary school students’ home language environments change over time, and how longitudinal patterns of English literacy achievement across grades 3, 6, and 10 differ among students with various home language shift patterns in Ontario, Canada. The longitudinal cohort data of 89,609 students between grades 3 and 10 from the provincial assessments were analyzed for changes in their home language environment. A subsample of 18,000 students was used to examine different patterns of relative literacy performance over time and their associations with immigration background and early intervention programming using multi-group latent growth curve modeling. Our findings suggest a strong movement toward an English-dominant home language environment among multilingual students; yet, students whose homes remained as multilingual demonstrated the highest literacy achievement in the early grade as well as the highest improvement in relative performance over time. The paper draws implications for promoting students’ home language, instilling a positive view of multilingual competence.
The educational and literacy outcomes of students from diverse home language environments have been an important concern for educators and policymakers. In most studies investigating this issue, variables that identify this population have been considered as static. In other words, such labels as English language learners, English-as-a-second-language learners, or language minority students, tend to serve as a time-invariant variable, reflecting the view of students’ status as fixed. In reality, however, students’ language use and their home language environment change over time, as a result of ecological interactions among students, their families, and communities. Although it is common for students from immigrant families to change their primary language from that spoken at home (L1) to the socially dominant language (L2) (Wong-Fillmore, 1991), the extent to which they maintain their L1 in their home environments widely varies. For example, some students continue to hear and speak the L1 at home, whereas others gradually increase the use of the L2 with family members and can even lose aspects of their L1 competence.
In one study (Hammer et al., 2009) on children’s language and literacy acquisition, the authors acknowledged that students’ home language environment can change over time (e.g., the use of the L1 in their homes can increase or decrease) and that this can influence students’ L2 literacy development. However, large-scale research is scant regarding how students and their family members in multilingual homes change their language use over time. Under-researched as well is the association of such changes with students’ literacy development over time.
In the present study, we build on Hammer et al.’s (2009) findings and contribute to the current literature by recognizing home language environments as a dynamic (rather than static) variable. Whereas Hammer et al. (2009) used a small-scale dataset covering two years, we used population-level literacy performance data from Ontario, Canada over a seven-year time span in order to understand how students’ home language environments change, and how their relative achievements in English literacy differ among multiple subgroups of dynamic home language environments.
Home language environment and language shift
For many children from immigrant families, reaching the same level of language proficiency in both their L1 and L2 is often challenging. Children who were once proficient in their L1 have been reported to go through a process called language shift once they begin to receive intensive input in the L2 and become linguistically assimilated into the larger society (Flores, 2015; Wong-Fillmore, 1991). Traditionally, language shift was thought to be a process that takes generations; yet, researchers have begun to address how language shift can, in fact, take place during a single generation. Stevens and Ishizawa (2007) argued that, based on their findings that the language repertoires of siblings in the same households may differ, the inter-generation perspective of L1 shift is too simplistic. In the same vein, Pease-Alvarez (2002) acknowledged an intra-generation language shift and criticized the traditional view for characterizing bilingualism as “a temporary inter-generational bridge between monolingualisms” without taking “the complex and dynamic network of sociocultural processes” into account (p. 115).
Research on language shift in the home environment concerns families’ daily interactions, their language ideologies and beliefs, and their goals for shaping language learning and use (King & Fogle, 2017). One of the earliest studies was by Scheff (1965), who examined changes in immigrant families’ home and community language use. Scheff found that home language use was more resistant to change than community language use, but this varied by age, education, length of residence, and income. Although many researchers have focused on parents’ influence on the prevalence of languages spoken at home, children are agentive in influencing their parents’ language behavior, actively shaping their own home language environment. The process may be intuitively viewed as a negotiation between parents and their children (Fogle & King, 2013). Children have been shown to be more likely to use, prefer, and maintain their L1 if they view their parents positively and also see their family as cohesive and egalitarian (Tannenbaum & Howie, 2002). The younger the age of language-minority children when they enroll in majority-language schools, the faster the attrition of the home language, which can hold negative implications for within-family communication (Wong-Fillmore, 1991).
Although the phenomenon of language shift among multilingual children and their families itself has been extensively researched, the dynamic nature of the home language environment has not drawn much attention among educators and policymakers as a factor to consider in relation to academic achievement. In the context of schools, Slavkov (2018) examined registration forms in three Canadian provinces and found that only a small portion conceptualized home language environments as dynamic, by asking multiple language-profiling questions (e.g., first language, primary language spoken at home, other languages); the other forms operationalized home language environment as static (e.g., home language). Furthermore, in the current literature, research that examines the association between changes in the home language environment and educational outcomes over time is scarce.
Related to this intra-generation view of language shift is the idea of complex systems, or dynamic systems, which is gaining traction among many researchers in developmental psychology and education (Fusella, 2013; Larsen-Freeman & Cameron, 2008). These theories focus on changes, dynamism, and interdependencies within a complicated system (e.g., process, person, society). Larsen-Freeman and Cameron (2008) argued that focusing on both individual growth and variability as well as stability can help researchers and educators gain a better understanding of the developing system. They further encourage researchers to use appropriate analysis techniques that can capture variability at different levels and timescales, such as multivariate time-series modeling and growth curve analysis. As such, examining language shift and literacy development from a complex systems perspective can provide valuable insights into multilingual students’ learning processes and help educators construe ways to provide better support for the students.
Variation in literacy achievement by home language
The language shift within individual students and their dynamic home language environments deserves attention in research on literacy development and academic achievement in general. This line of inquiry has the potential not only to further demonstrate the benefits of bi/multilingualism but, more importantly, to inform researchers about the effects of changes in the home language environment on literacy development from a longitudinal perspective.
It is well established that language and literacy skills transfer across languages and that bilingualism and biliteracy have substantial cognitive benefits (Bialystok, 2011; Cummins, 1993). At a minimum, L1 maintenance shows no correlation with L2 literacy development (Nguyen et al., 2001), but a more common finding is that a rich home language environment and parental use of the L1 are associated with high academic achievement, as Sneddon (2000) found in a study of Gujarati-speaking families in London, and Dolson (1985) found for Spanish speakers in Los Angeles.
Similarly, in the context of Ontario, Jang et al. (2013) found that elementary school students who use multiple languages at home transcend early reading achievement gaps and have the most competent reading comprehension skill profile of all home language subgroups (including domestic students) after living in Canada for five years. Findings by Sinclair et al. (2019) also suggested students who use multiple languages at home in grade 6 are likely to have strong literacy profiles and retain those strengths in high school. However, in the latter study, students who exclusively use languages other than English at home (a group that showed high achievement in elementary school) had a tendency toward literacy skill attrition in high school. This implies that such students may benefit from intervention in middle school and early high school. Indeed, Fox and Cheng (2007) and Han and Cheng (2011) found that high school students who immigrated to Ontario and who were learning English as an L2 experienced a complex set of challenges while attempting to meet the literacy standards required for graduation.
Many factors interact to impact immigrant students’ literacy and academic achievement, including their multilingual proficiencies, national origin, educational ambition, school context, their families’ social, human, and economic capital, and the level, length, and nature of acculturation (Gunderson, 2007; Portes & Rumbaut, 2001). Relatedly, Jia et al. (2014) investigated the relationship between English reading comprehension, length of residence, acculturation to mainstream Canadian culture, and heritage enculturation (maintenance of one’s own culture) within a sample of Ontario-based Chinese immigrant adolescents. Length of residence and acculturation significantly predicted increases in reading comprehension, but heritage enculturation did not demonstrate a significant effect in the model. This suggested that acculturation positively impacts reading comprehension in L2, but retaining one’s culture has neither a positive nor a negative impact.
Both cross-sectional and longitudinal studies have examined the association between home language environment and literacy development. Some cross-sectional studies showed no statistically significant correlation between parental use of English in a multilingual home environment and English vocabulary in grade 2 (Gutiérrez-Clellen & Kreiter, 2003) or English grammatical ability in grade 5 (Duursma et al., 2007). With regard to longitudinal studies, Mancilla-Martinez and Lesaux (2011) investigated the relationship between the home language environment of Spanish-speaking language-minority students in preschool, and their vocabulary growth in Spanish and English up to the age of 12 years. Intuitively, preschool students from homes where English was used significantly more than Spanish demonstrated the highest level of English vocabulary, followed by students who used equal amounts of both languages and those who used mostly Spanish at home. However, these latter two groups’ rates of growth were significantly higher, and their rates of deceleration significantly lower, than that of the group who spoke mostly English, significantly reducing the vocabulary gap by the age of 12 years.
The longitudinal study by Hammer et al. (2009), mentioned briefly above, uniquely conceptualized home language as a dynamic, time-variant factor. These authors examined how changes in bilingual (Spanish–English) maternal language usage influenced young children’s bilingual vocabulary development. Mothers’ increased English usage did not significantly impact the children’s English vocabulary growth or emergent English literacy abilities skills, but it was associated with a decreased rate of Spanish vocabulary growth. Although this study offered insightful findings, it was limited to a relatively small sample size (n = 72), covered a short period of time (2.5 years), and had a narrow focus on language usage by mothers. With the current study, we aimed to overcome such limitations.
Latent growth curve modeling and its applications
As already noted, the focus on changes and dynamism within and between individuals necessitates the use of innovative longitudinal analytic methods. Latent growth curve modeling (LGCM) is a flexible modeling approach appropriate for tracking changes over time. Unlike analysis of variance (ANOVA), which treats within-group differences as error variance, LGCM evaluates both inter-individual and intra-individual variability (i.e., differences among individuals and changes within individuals over time) (Preacher, 2010). Specifically, LGCM can be used to examine linear and curvilinear trends of change for individuals as well as the overall sample’s change (Duncan et al., 2013). Under the structural equation modeling framework, LGCM can account for measurement error, evaluate the influence of time-invariant and time-varying covariates, and determine the variation in longitudinal patterns among multiple subpopulations (Duncan & Duncan, 2009).
Figure 1 displays a sample path diagram of an unconditional linear LGCM with three time points. Two latent factors in circles (intercept and slope) are estimated through three manifest variables in rectangles at Time 0, 1, and 2 (with the same time intervals). The model depicted in Figure 1 can be described in the following equations:

Path diagram of linear unconditional LGCM.
LGCM has been implemented to evaluate the development of literacy skills, for instance, the growth in morphological awareness and receptive vocabulary (Kieffer & Lesaux, 2012), word reading and productive vocabulary (Mancilla-Martinez & Lesaux, 2010), oral reading fluency (Jimerson et al., 2013; Yeo & Park, 2014), and reading comprehension (Guglielmi, 2008; Lervåg & Aukrust, 2010). The extent to which mediating factors predict initial status and rate of growth can be evaluated by the inclusion of covariates. For instance, Kieffer and Lesaux (2012) found that phonological awareness did not statistically significantly predict the initial status or rate of growth in morphological awareness and vocabulary among Spanish-speaking language-minority students in grades 4–7.
LGCM allows the evaluation of how the initial status, growth trajectories, and influence of covariates vary across groups through multi-group LGCM or dummy-coded covariates. For example, previous studies utilizing LGCM have found that language-minority students often demonstrate lower initial oral reading fluency (Jimerson et al., 2013; Yeo & Park, 2014) and reading comprehension (Lervåg & Aukrust, 2010) than their language-majority peers, although differences in growth trajectories remain inconclusive. The inclusion of covariates in multi-group LGCM allows for the investigation of variables that predict initial status and rate of growth among groups differently. For instance, self-reported L1 proficiency has significantly predicted initial English reading achievement among Hispanic, but not Asian students in grades 8 to 12 (Guglielmi, 2008). Similarly, vocabulary knowledge has been shown to have a significantly stronger relationship with reading achievement growth among language-minority than language-majority students (Lervåg & Aukrust, 2010).
Current LGCM research on literacy exemplifies the capacity for this method to answer longitudinal research questions and to determine differences in growth trajectories or change patterns among multiple subgroups (Guglielmi, 2008; Halle et al., 2012). Few studies to date have specifically investigated longitudinal literacy achievement while incorporating dynamically measured home language environment as a grouping variable.
Current study
The purpose of this seven-year longitudinal cohort study was threefold: (1) to explore the patterns of changes in students’ home language environment over time; (2) to model longitudinal literacy achievement patterns among multiple subgroups with different home language shifts; and (3) to determine the influence of immigration status and English-as-a-second-language (ESL) program support on the relative literacy achievement pattern of each group. The last set of variables—immigration status (Canadian-born vs. first-generation immigrant) and ESL program support (whether or not the student was receiving ESL instruction at G3)—was included as they have been reported to predict literacy outcomes in the previous studies focusing on multilingual students in Ontario (Jang et al., 2013; Sinclair et al., 2019). The present study, inspired by a complex systems perspective (Larsen-Freeman & Cameron, 2008) and Hammer et al.’s (2009) conceptualization of home language as a dynamic variable, answers the following research questions:
How do students and their families change their home language use between grades 3 and 6? How are changes in home language environment associated with students’ relative literacy achievement patterns between grades 3 and 10? Do immigration status and ESL program enrollment in grade 3 predict the relative literacy achievement patterns of students with different home language environments?
Method
Participants
We used longitudinal cohort data of students who participated in Ontario’s provincial assessment program administered by the Education Quality and Accountability Office (EQAO). The literacy achievement of students in Ontario is measured by this assessment in the spring of grades 3, 6, and 10 (G3, G6, and G10, respectively). The data utilized in this study are from students who took the G3 assessment in 2010, G6 in 2013, and G10 in 2017. The original datasets included approximately 130,000 students in G3 and G6, and 210,000 students in G10. Students were included in the present study if they satisfied these three criteria: (1) they wrote the tests in all three years; (2) they took the English version throughout (as opposed to in French) 1 ; and (3) their home language information was available in both G3 and G6. This resulted in 89,609 students (49.6% female) in the final dataset.
Among these 89,609 students, 8.6% were born outside Canada, among which 60.0% had been in Canada for at least five years, and an additional 15.4% had been in Canada for at least three years at the time of the G3 assessment administration. By the G6 administration, 93.0% of non-Canadian-born students had been in Canada for five years or more. With regard to early intervention, 9.5% of all students included in the analysis were enrolled in an ESL program in G3. Among these students, 66.2% were Canadian-born, comprising the early-identified “domestic language learners” (Jang et al., 2013, p. 425). Owing to its longitudinal nature, the data did not include students who arrived in Canada or entered the Ontario school system after G3.
Measures
During the G3 and G6 EQAO literacy assessment administrations, students were asked to complete a questionnaire that included two questions about home language environment: “In which language(s) do people speak to you at home?” and “Which language(s) do you speak at home?” with options of “only English,” “mostly English,” “another language(s) as often as English,” “mostly another language(s),” or “only another language(s).” To be consistent with the classification and notations used by Jang et al. (2013) and Sinclair et al. (2019), students’ home language status was coded in this paper as language(s) heard at home followed by language(s) spoken at home; for example, EnEn (English–English) means hearing mostly or only English and speaking mostly or only English, and OtEq (Other–Equal) means hearing mostly or only another language(s) and speaking both English and the other language(s) equally. Each student was assigned to one of the nine groups (i.e., EnEn, EnEq, EnOt, EqEn, EqEq, EqOt, OtEn, OtEq, OtOt) in G3 and G6, separately.
The English version of the EQAO literacy assessment elicits the same English literacy skills at G3, G6, and G10: in reading, explicit comprehension, implicit comprehension, and making connections; in writing, topic development, and writing conventions. The G3 and G6 reading measures consist of 26 multiple-choice and 10 open-response questions using four reading passages (Cronbach’s α = .84, .88). The G3 and G6 writing measures (α = .79, .81) include eight multiple-choice and six open-response questions. The G10 literacy measure, also known as the Ontario Secondary School Language Test, integrated reading and writing measures into five sections. The reading measure incorporated 31 multiple-choice and five open-response questions (α = .81), with the writing measure including eight multiple-choice and eight open-response questions (α = .77). The EQAO reports a composite score as overall literacy skill on a scale of 0.1 to 4.9 in G3 and G6, or 200 to 400 in G10. Due to the different metrics used between G3–6 and G10, the composite scores in each grade were transformed to z-scores using all students who completed the English version of the test. With a mean of 0 and a standard deviation (SD) of 1, a student’s z-score expresses their relative standing of overall literacy skills among the test-taker population.
Analysis
In order to examine changes in home language environment over time (RQ1), each student’s home language status at G3 and G6 was determined separately. Changes in students’ home language environment between two grades were analyzed descriptively and visually.
To investigate relative literacy performance among multiple groups of home language environment (RQ2) and the predictive power of two covariates (i.e., immigration status and ESL enrollment in G3) (RQ3), we conducted multi-group LGCM. The group membership for each student was determined using the home language status at both G3 and G6. At each time point, students were identified as to whether they were living in an English-dominant (EnEn) or multilingual home (all other eight categories other than EnEn). Then, a dynamic variable that focuses on the type of changes in home language environment between G3 and G6 was created, resulting in four groups: (a) consistently English-dominant (Eng to Eng), (b) English-dominant to multilingual (Eng to Multi), (c) multilingual to English-dominant (Multi to Eng), and (d) consistently multilingual (Multi to Multi).
LGCM analyses were conducted using Mplus Version 8.2 (Muthén & Muthén, 1998–2017). All analyses used maximum likelihood estimation with robust standard errors and a Satorra-Bentler scaled test statistic, or MLM (Satorra & Bentler, 2001), which is robust to the violation of the normality assumption (Muthén & Muthén, 1998–2017). For model building, we started by testing three baseline models: intercept-only, linear, and curvilinear models (Geiser, 2012). Models were evaluated based on multiple fit indices: chi-square goodness-of-fit index, the comparative fit index (CFI, > .95), Tucker-Lewis index (TLI, > .95), root mean square error of approximation (RMSEA, < .08), and standardized root mean square residual (SRMR, < .10) (Cangur & Ercan, 2015). After identifying the best-fitting functional form, the parameters for the latent factors were allowed to differ by group (Curran et al., 2010). We then tested whether the means of the latent intercept factor (initial status in G3) and latent slope factor (rate of change) were invariant across four groups. All 12 possible pairs of the parameter estimates (six pairs for intercept, six pairs for slope) were tested using a series of Wald tests (Wang & Wang, 2012). An a priori significance level of α = .05 was applied. Finally, two covariates (immigration status and ESL enrollment in G3) were incorporated into the model to predict the latent growth factors. The path diagram of the final hypothesized model in a linear form is illustrated in Figure 2. The factor loadings (or, time scores) of the slope factor, which represent values of the time metric, were fixed at 0, 0.43, and 1 (instead of 0, 1, and 2.33) to relate the interpretation of the slope parameters to the entire seven years rather than every three years.

Final hypothesized model as a linear function with two covariates.
Results
RQ1: How do students and their families change their home language use between grades 3 and 6?
We first report students’ home language use at G3 and G6, cross-sectionally, and then track changes in home language environment for individual students during this time period. Figure 3 displays the compositions of home language environment of 89,609 students in G3 and G6. For each grade, nine boxes represent different home language environments. The boxes in the top row are homes where students hear only or mostly English, whereas the boxes in the bottom row are homes where they hear only or mostly another language(s). Similarly, the boxes on the far-right side are homes where students speak only or mostly English, whereas the boxes on the far-left side are homes where they speak only or mostly another language(s). Boxes located higher and to the right represent students and their families who use more English at home.

Home language use of students and their families at G3 and G6 (N = 89,609).
In G3, the majority of students (66.1%) heard and spoke only or mostly English (EnEn). The second largest group was EqEq (8.7%), where students heard and spoke English and another language(s) equally, followed by OtOt (8.2%). For these three groups (74.0%), the language(s) they spoke corresponded with the language(s) they heard. These three groups were followed by EqEn (4.8%) and OtEq (4.3%), who spoke more English than they heard. Fewer students were identified as EnEq (3.1%) or OtEn (3.0%), whereas only 1.0% and 0.9% of students were in EqOt and EnOt, respectively.
Moving on to G6, the order of the home language environment groups in terms of the group size remained largely the same. Compared to G3, more students were living in English-dominant homes (EnEn, 70.7%), followed by EqEq (6.8%) with a lower proportion. The proportion of students in EqEn (6.3%) increased by over 30%, becoming the third largest group; on the contrary, the proportion of OtOt (5.6%) decreased by 32%.
Subsequently, we tracked changes in home language environment for individual students by comparing their reported home language use between G3 and G6. Among 81 possible combinations of home language status at two time points (nine groups at G3 times nine groups at G6), the majority of students (54,613, 61.0%) reported their status as EnEn in both G3 and G6, suggesting that their home language environment remained as English-dominant throughout the time period.
In order to focus our investigation on multilingual students, we excluded these constant English-monolingual students and re-examined the data comprised of 34,996 students whose home was multilingual at one or more time points. Figure 4 illustrates the most prominent ten patterns of changes in home language environment, with the thickness of the arrows representing the relative size of groups with the corresponding pattern. Although the home language environment of some multilingual students remained unchanged (7.3% for OtOt, 4.9% for EqEq), many students tended to hear and speak more English than another language(s) at home in G6 compared to G3. For example, 6.6% of the students reported hearing and speaking another language(s) as often as English in G3, but changed to hearing and speaking mostly English in G6 (EqEq to EnEn, 6.6%). Similarly, 4.0% of the students used to hear and speak mostly another language(s) in G3, but ended up hearing and speaking English as often as another language(s) in G6 (OtOt to EqEq, 4.0%). Many students who reported hearing or speaking another language(s) as much as English in G3 found themselves in a mostly English-speaking home environment in G6 (EqEn to EnEn, 6.1%; EnEq to EnEn, 5.1%). Although some students reported their home language environment changed from most English to multilingual (EnEn to EqEn, 5.4%; EnEn to EqEq, 2.7%), students and their families, in general, tended to increase their use of English at home.

Most prominent patterns of changes in home language environment of multilingual children from G3 to G6 (n = 34,996).
RQ2: How are changes in home language environment associated with students’ relative literacy achievement patterns between grades 3 and 10?
As mentioned earlier, students were classified into four groups based on the patterns of changes in home language environment: (a) Eng to Eng, (b) Eng to Multi, (c) Multi to Eng, and (d) Multi to Multi. The left side of Table 1 provides the means and standard deviations of literacy test scores (in standardized z-scores) with the sample size by group. Owing to the substantial differences in the number of students across the four groups, 4,500 students from each group were randomly selected to comprise a subsample that was used in subsequent LGCM analyses (n = 18,000). The minimal discrepancies of the descriptive statistics between the population data and the subsample (on the right side of Table 1) reflect the representativeness of the subsample. The observed literacy scores between any two time points for each group were strongly correlated (r = .62–.73).
Sample sizes, means, and standard deviations of literacy test performance by group.
Note. Standard deviations are shown in parentheses.
We used multi-group LGCM to examine differences in comparative literacy achievement patterns among students with different patterns of change in their home language environment. Literacy scores showed high kurtosis at all three points (3.14–3.88) and thus violated univariate and multivariate normality. To compensate, an estimator robust to the violation of multivariate normality was used, as mentioned in the previous section. Although homoscedasticity was also violated (p < .001), LGCM can estimate different time-point-specific variances and does not require this assumption (Preacher et al., 2008). As a preliminary step to evaluate whether there was enough variability across students, we calculated the intra-class correlation coefficient (ICC), the proportion of variance that is explained by student demographics rather than within-student longitudinal growth. The ICC was .220, which is large enough to warrant the use of LGCM (Hox, 2010). To determine the appropriate shape of the change-in-rank curve, we fitted three unconditional models with different functional forms to the data before applying multi-group analysis. Although the intercept-only model showed adequate fit, except RMSEA, χ2(4) = 1088.168 (p < .001), CFI = .954, TLI = .966, RMSEA = .123, CI90 [.117, .129], SRMR = .054, the linear model demonstrated better fit, χ2(1) = 10.219 (p = .001), CFI = 1.000, TLI = .999, RMSEA = .023, CI90 [012, .036], SRMR = .004. The curvilinear model with a latent quadratic factor could not be identified due to the limited number of time points in the data (Preacher, 2010).
Based on the linear model, multi-group LGCM modeling was conducted by allowing the estimated latent intercept and slope factors to vary by group. The model fit the data well, χ2(4) = 110.595 (p < .001), CFI = .995, TLI = .986, RMSEA = .077, CI90 [.065, .090], SRMR = .015. This resulting model was used to investigate possible differences in relative literacy achievement patterns among multiple subgroups with various home language shift patterns.
The left side of Table 2 presents the results of the fitted multi-group model with parameter estimates for each group (see Figure 5 for the results in a diagram). The estimated mean intercepts (π0) indicate that in G3, Multi to Multi outperformed all other three groups with their literacy performance estimated as 0.078 SD higher than the population average. Although the achievement of Eng to Eng (0.053) was close to that of Multi to Multi, the other two groups, Eng to Multi (–0.157) and Multi to Eng (–0.076), performed lower than the average. With regard to the rate of change, the estimated mean slopes (π1) suggest that literacy achievement of Eng to Eng increased, on average, by 0.099 SD between G3 and G10. This estimate of the rate of relative performance change doubled for Eng to Multi and Multi to Eng (0.195 and 0.184, respectively) and almost tripled for Multi to Multi (0.280), suggesting that the rate of change in relative literacy achievement of Multi to Multi over time was the highest among all four groups.
Results from the fitted multi-group LGCM without covariates (left) and with covariates (right) (n = 18,000).
***p < .001, **p < .01, *p < .05
Note. Intercept and slope variances for the model with covariates (right) refer to residual variances after variability in their prediction by two covariates.

A path diagram for the fitted multi-group LGCM without covariates, with associated fitted parameter estimates for means (π0, π1), variances (ζ0, ζ1), and covariance (ψ) for each group (n = 18,000; ***p < .001, **p < .01, *p < .05)
In order to test invariance of the means of latent intercept and slope factors across groups, a series of Wald tests was conducted. Table 3 shows that most of the p-values from Wald tests comparing all 12 possible pairs were significant, except for two: (1) the intercept factor means of Eng to Eng and Multi to Multi (p = .194), and (2) the slope factor means of Eng to Multi and Multi to Eng (p = .501). These results suggest that all the differences in mean initial status in G3 (the intercept factor) and the mean rate of change between G3 and G10 (the slope factor) across groups are statistically significant, except for the initial score in G3 between Eng to Eng (0.053) and Multi to Multi (0.078), and the rate of change for Eng to Multi (0.195) and Multi to Eng (0.184). These findings are further illustrated visually in Figure 6, which displays the estimated relative performance trajectories for each group.
Pairwise Wald test statistics of invariance of latent intercept means (π0, upper diagonal) and slope means (π1, lower diagonal) across groups (n = 18,000).
***p < .001, ** p < .01, *p < .05

Estimated changes in comparative literacy performance by group (n = 18,000).
The parameter estimates for the variance of intercept and slope (
Effective error and effective curve reliability values for each group were reported at the bottom of Table 2 as measures of the precision and reliability, respectively, of the estimated rates of change (Brandmaier, 2020; Brandmaier et al., 2018). Effective error is an unstandardized index that captures how (im)precisely an individual slope estimate measures that individual’s true rate of change in a longitudinal study design, and its inverse is proportional to the precision of measurement. On the other hand, effective curve reliability considers slope variance (ζ1) and can be interpreted as a standardized effect size. Although the reliability value of Multi to Multi (0.681) appears substantially lower than the other groups (0.919–0.961), this seemingly unsatisfying reliability should be accounted for by relatively small slope variance (0.025), rather than the precision of the instrument (Brandmaier et al., 2018; Singer & Willett, 2003). That is, despite its comparable precision (0.012), detecting inter-individual differences was more difficult for Multi to Multi than for other groups due to its distinctively small individual differences within the group.
RQ3: Do immigration status and ESL program enrollment in grade 3 predict the relative literacy achievement patterns of students with different home language environments?
Two covariates (immigration status and ESL in G3) were added to the model as predictors, resulting in improved model fit compared to the previous model, χ2(12) = 188.999 (p < .001), CFI = .993, TLI = .979, RMSEA = .057, CI90 [.050, .065], SRMR = .015. With regard to the effects of covariates, the intercept and slope estimates on covariates (α0, α1, β0, β1) on the right side of Table 2 suggest that neither of the two covariates had significant effect on Eng to Eng. Intuitively, students in Eng to Eng are less likely to have immigrated or been enrolled in ESL; the low variability in the covariates in this group must have reduced their predictive power. Yet, for all the other three groups, both covariates were found to be statistically significantly associated with initial status in G3 and rate of rank change. Born Outside was positively associated with literacy achievement in G3 (α0) across three groups, and these significant, positive associations were also found for the rate of change (α1) for Eng to Multi and Multi to Multi. These results indicate that, on average, students from (at least at one point) multilingual homes and who were born outside Canada outperformed their Canadian-born peers in G3 and showed a higher rate of change in relative performance over time. Students who were enrolled in ESL in G3 also demonstrated a higher rate of change (β1) across three groups, although their initial literacy achievement in G3 (β0) was significantly lower than non-ESL students in G3.
The addition of the covariates to the model resulted in some changes to the means of intercept and slope (π0, π1) for these three groups, especially for Multi to Multi. Once the immigration status and ESL enrollment in G3 are controlled for, the advantage of Multi to Multi’s initial performance in G3 becomes stronger (0.078 to 0.155). Their rate of change in rank over time somewhat decreases compared to the previous model (0.280 to 0.203) along with the rates of change of Eng to Multi and Multi to Eng, but still maintains the advantage over the other groups. Yet, the decrease in variances (ζ0, ζ1) after adding the covariates was rather marginal across groups. Particularly for Eng to Eng, the residual variances are almost identical to those in the previous model, suggesting that, again, the two covariates explained little variance in the intercept or slope for this group.
The R2 values in Table 2 indicate the explained variances in the latent intercept and slope factors by the covariates. Most of the R2 values are minimal, particularly for Eng to Eng, the group for which the covariates have little to predict. Contrariwise, for Multi to Multi, the two covariates collectively explain 41.1% of the variation of the slope factor of the previous unconditional model. This implies that the immigration status and ESL enrollment in G3 have remarkably strong prediction power on the rate of change in relative literacy achievement over time for this group.
Discussion and conclusion
The present study was designed to examine changes in students’ home language environment and the association between these changes and their relative literacy achievement patterns. By investigating province-wide longitudinal data, we were able to portray the home language maps of a linguistically diverse province at different time points and delineate how home language use by individual students and their families changes over time. Between G3 and G6, the proportion of students with an English-dominant home environment increased, whereas students hearing or speaking other language(s) at least as often as English (i.e., OtOt, EqOt, OtEq, EqEq) declined. When comparing these two home language maps (one in G3 and the other in G6), linguistic diversity in the family domain appears to decrease as students become older.
Our findings on changes in home language environment from a longitudinal perspective are consistent with the literature that witnessed prevalent, and rapid, intra-generation language shift (Flores, 2015; Swidinsky & Swidinsky, 1997). Eight out of 10 of the most prominent patterns of change that we found were those moving from more use of non-English language(s) towards heavier use of English at home. Overall, nearly half of the students’ homes used English to a greater extent in G6 than in G3, whereas approximately a quarter of our sample used less English during the same period. In the homes of the remaining quarter of the students, the extent to which English was used remained the same. Some interesting patterns of changes involved the direction from English-dominant to multilingual homes led by other family member(s) (e.g., EnEn to EqEn, EnEn to EqEq, considering the former part of the notation refers to the language(s) students hear). A possible explanation of these patterns may be that once parents assumed that their children had become proficient enough in English, they might attempt to support their children’s L1 development.
Our LGCM analysis with four groups suggests that students whose home language environment shifted across time regardless of the direction (i.e., Multi to Eng, Eng to Multi) showed significantly lower literacy performance compared to their peers in the earlier grade, but a higher rate of change in relative achievement over time compared to the consistently English-dominant group (Eng to Eng) whose rate of change in rank was negative. Furthermore, students who maintained their home language (Multi to Multi) had significantly greater initial relative achievement in G3 and a higher rate of change in rank compared to the other three groups. Although their advantage (approximately 0.2 standard deviation in both the initial status and rank-change rate) could be seen as small, we would argue that this difference in population data should not be considered negligible as this interpretation should be only relative rather than absolute. Our findings suggest at a minimum that maintaining home language in the school years may not impede English literacy development of linguistically diverse students; rather, although we cannot make causal inferences based on our analyses, one possible explanation could be that they may benefit from maintaining and developing multilingual competence (Bialystok, 2011).
Examination of the roles of the covariates reveals an interesting finding that immigrant students outperformed their domestic peers in G3 across groups. Considering that 77.3% of these non-Canadian-born students were living in multilingual homes, their higher comparative outcome in English literacy may seem counter-intuitive. Yet, it is important to note that, at the time of G3 administration, the majority of immigrant students had lived in Canada at least for five years and had received formal instruction in English in school for almost five years (two years of kindergarten and three elementary school years). These considerations make it clear that our finding is consistent with that of previous research studies (Cummins, 1981; Jang et al., 2013), which report higher performance of immigrant students after five years of residence in the same geographical context as ours. Furthermore, the fact that students who were born outside Canada and whose home was multilingual a later year (i.e., Eng to Multi, Multi to Multi) demonstrated higher rates of change in rank, may imply the cognitive benefits of bi/multilingualism. Nonetheless, we acknowledge that this result may have been confounded by students’ socioeconomic status, which was not controlled for in the current study. In fact, Sinclair et al. (2019) pointed out an important sociopolitical factor, Canada’s selective immigration policy, as a plausible explanation of higher achievement among immigrant students. Because this policy is highly selective in approving immigration applications from applicants with higher language proficiency, education, and financial assets (Haque, 2017), a fair proportion of multilingual students are likely to have parents with a relatively high socioeconomic background.
Another noteworthy finding is that a relatively large proportion of students receiving ESL support in G3 was Canadian-born (66.2%). When looking by group, 9.3% of Canadian-born, Eng to Multi students were enrolled in an ESL program in G3. That these students who heard and spoke mostly English at home in G3 were identified as having limited English proficiency highlights that not all Canadian-born students from English-speaking homes are fully prepared for academic language demands at school. Early identification and intervention for these students, who have a lower chance of being identified as English learners than their non-Canadian-born peers (McGloin, 2011), might effectively support them in the long term, as evidenced in the current study by the positive relationship between ESL support in G3 and rate of change in comparative literacy achievement.
Due to the nature of Ontario’s provincial assessment system and items included in the student questionnaire, the current study is limited in that the measures were administered at three time points only; students’ home language environment was self-reported by arguably immature students and was not followed in G10; students’ socioeconomic status was not controlled for; and outcome measures were standardized via z-scores in order to place the scores in the same metric. As for the last point, the z-score metric captured students’ relative growth compared to peers at the time of testing but is not truly sufficient for measuring growth as, to measure each student’s growth independent of their peer-to-peer comparison, the underlying assessments across the years must be vertically scaled. Psychometrically, the metric scale utilized in LGCM analyses can greatly influence the results associated with growth trajectories (Goldschmidt et al., 2010). Modeling individual students’ growth trajectories in academic achievement requires vertically scaled test scores for consistent interpretation across time points (Briggs et al., 2008). Item response theory (IRT)-based vertical scaling is recommended, owing to its increased measurement accuracy (Seltzer et al., 1994); however, it requires common items between grades for either separate or concurrent calibration (Kolen & Brennan, 2004). Unless the tests assess students in adjacent grades, it is often practically challenging to use common items for achievement testing. IRT-based vertical scaling was not feasible in the current study because the tests used in the study are not vertically scaled. As such, the rate of changes in relative achievement reported here should be interpreted with caution until the tests themselves are vertically scaled and the results of this study are replicated with an IRT-based, vertically scaled metric.
The present study contributes to the literature in several meaningful ways. One of the contributions of this study is our specific focus on the complex, dynamic nature of students’ home language status, using large-scale longitudinal data. Investigating relative achievement patterns longitudinally, together with changes in home language environment, provides important evidence for identifying how such contextual factors influence educational outcomes over time. Moreover, our application of LGCM to literacy testing highlights the importance of longitudinal modeling of both intra-personal and inter-personal variations in relating individual student differences to differential learning trajectories.
Finally, this paper draws implications for promoting students’ home language, instilling a positive view of multilingual competence. Our LGCM analysis demonstrates that students whose homes maintain a multilingual environment (Multi to Multi) show the highest academic achievement in the early grades as well as the highest rate of change in relative performance over time. These students were found to have a clear, although not large, advantage over other groups, especially when they have an immigrant background and received early intervention services (ESL in G3). Yet, caution must be applied as this study was not designed as a randomized controlled trial. For example, it is possible that parents who are more supportive of education are more invested in the maintenance of the L1 and are less apt to linguistic assimilation.
Yet, it is alarming that, ironically, our findings clearly indicate a movement toward an English-dominant home environment among multilingual students and their families in Canada—a country in which linguistic diversity is supposedly celebrated (Kim et al., 2019). Although language shift from L1 to L2 among language-minority children is viewed as highly detrimental in the existing literature, especially among families in which parents’ L2 skills are limited (Wong-Fillmore, 1991), many immigrant parents have been reported to believe that maintaining the L1 would impede their children’s academic success and, thus, encourage them to maximize the use of L2 even at home (King & Fogle, 2006). Indeed, parents’ perception of, and attitude towards, maintaining the L1 has been known to be one of the most influential factors in their children’s L1 maintenance (Park & Sarkar, 2007; Swidinsky & Swidinsky, 1997). Parents who are learners of L2 (English in the context of this study) have a right to make an informed decision on their home language practices based on rigorous research. We close the paper with a call for more research on the relationship between changes in students’ home language environment and their L2 development, the latter of which is directly linked to their academic achievement. Of equal necessity is more attention from researchers to the factors affecting the shift in home language use in multilingual families, their perspectives about language and identity, and effective support strategies for publicly disseminating the positive aspects of multilingualism.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
