Abstract
Characterization of language in naturalistic settings in autism spectrum disorder has been lacking, particularly at young ages, but such information is important for parents, teachers, and clinicians to better support language development in real-world settings. Factors contributing to this lack of clarity include conflicting definitions of language abilities, use of non-naturalistic standardized assessments, and restricted samples. The current study examined one of the largest datasets of naturalistic language samples in toddlers with autism spectrum disorder, and language delay and typically developing contrast groups at age 3. A range of indices including length of phrase, grammatical markings, and social use of language was assayed during a naturalistic observation of a parent–child play session. In contrast to historical estimates, results indicated only 3.7% of children with autism spectrum disorder used no words, and 34% were minimally verbal. Children with autism spectrum disorder and language delay exhibited similar usage of grammatical markings, although both were reduced compared to typically developing children. The greatest difference between autism spectrum disorder and language delay groups was the quantity of social language. Overall, findings highlight a range of language deficits in autism spectrum disorder, but also illustrate that the most severe level of impairments is not as common in naturalistic settings as previously estimated by standardized assessments.
There is a deepening appreciation of the fact that autism spectrum disorder (ASD) is a heterogeneous disorder, not only in terms of clinical profile (Kim et al., 2015; Rice et al., 2012), but also likely etiology (Bill and Geschwind, 2009). This heterogeneity is particularly striking within the domain of language (Pickles et al., 2014) wherein two individuals—both with the same diagnostic label of ASD—can display opposite profiles with one person being nonverbal throughout life while the other verbally fluent and may even obtain a college degree or beyond. Rapidly changing levels of public awareness and enhanced efforts toward earlier identification and treatment of ASD (Pierce et al., 2011, 2016) provide a difficult landscape for accurately defining language profiles in ASD. Yet, this understanding is essential within a modern cohort as a guide for physicians, clinicians, educators, and parents alike, particularly since language ability prior to treatment has been linked to more successful outcomes (Anderson et al., 2007; Fossum et al., 2017; Mayo et al., 2013; Moulton et al., 2016; Pickles et al., 2014). Moreover, as we hope for a future that includes precision medicine in ASD (Geschwind and State, 2015), accurately indexing language ability in ASD before and following treatment becomes an essential component for clinical trials.
Studies that index and characterize the speech and language abilities of children with ASD are among the oldest in the literature (Bartak et al., 1975; Hartung, 1970; Rutter, 1967; Sulzbacher and Costello, 1970). One such study, conducted over 40 years ago by one of the founders of the field, Michael Rutter, concluded that “language disability is probably necessary for the behavioral syndrome of autism” (Bartak et al., 1975). This early study reported that 17 out of the 19 ASD children examined had no phrase speech by age 2.5 years (Bartak et al., 1975). Reports that followed continued to suggest that a large percentage, as high as 50%, of children with ASD fail to develop verbal communication (Eigsti et al., 2011; Lord et al., 2004; Prizant, 1996; Tager-Flusberg et al., 2005).
Recently, however, this view of nonverbal and minimally verbal rates in ASD has changed. Awareness of the disorder has grown and the definition of ASD has shifted to include higher ability individuals within the autism spectrum. Moreover, there are now widespread efforts toward earlier identification and treatment to improve outcome. New studies suggest that the rates of nonverbal and minimally verbal children with ASD may be considerably lower than 50%. For example, Anderson et al. (2007) followed a clinic referred sample of 130 children with ASD and 42 children considered non-spectrum from ages 2 to 9 years and examined changes in expressive language over time. At age 9, only 20% of the ASD sample was considered to have “no-to-few words” based on parent responses during the Autism Diagnostic Interview—Revised. Similarly, Norrelgen et al. (2015) indexed the verbal ability of 165 4- to 6-year-old children with ASD following intervention based on the Vineland Adaptive Behavior Scales (VABS; Sparrow et al., 2005), a parent report measure of child functioning. Fifteen percentage of the sample was considered nonverbal (fewer than three words) and an additional 10% were minimally verbal (use of at least three words, but limited phrase speech). Rose et al. (2016) also examined 246 children with ASD following intervention at 5 years of age, but used a more comprehensive battery that included the VABS, the Autism Diagnostic Observation Schedule (Lord et al., 2012), and the Mullen Scales of Early Learning (Mullen, 1995). There was variation in the proportion of children designated as having limited verbal language depending on which measure was used for classification, ranging from 36% to 43% demonstrating use of single words only, of which 25%–29% demonstrated fewer than a total of five words. However, all children were attending an autism-specific, center-based intervention program, which may have led to sampling bias and an over inclusion of children with more severe symptoms.
Bal et al. (2016) also examined verbal abilities in a large cohort of 1470 4–17-year-olds from the Simons Simplex Collection. The proportion of minimally verbal children was determined using five different assessments and definitions including parent estimate of language, scores from standardized parent report measures, and individual items from diagnostic assessments. Definitions across measures ranged from 25 words or fewer and no use of phrase speech to age equivalents below 18 months on standardized assessments. There was substantial lack of agreement across different definitions ranging from 3% to 99% classification agreement, resulting in a range of 5%–16% of the sample being identified as minimally verbal. Due to these differences, and bias of the sample itself from simplex families only, the authors warn against using results to determine rates of minimally verbal children with ASD. While the aforementioned studies provide important insights into language ability in ASD, a range of issues including use of varied definitions of language ability, reliance on standardized tests and lack of information about language in naturalistic settings, and use of restricted samples that may not result in representative ASD cohorts continue to contribute to a lack of clarity regarding language ability and disability in ASD.
Differences in definitions of verbal abilities are prevalent throughout research on language use in individuals with ASD. Studies that have focused on nonverbal or minimally verbal individuals have used definitions for inclusion criteria ranging anywhere from no spoken words to fewer than 20 spoken words (Bal et al., 2016; Chenausky et al., 2016; DiStefano et al., 2016; Kasari et al., 2008, 2014; Koegel et al., 2009; Paul et al., 2013; Romski et al., 2010; Thurm et al., 2015; Woynaroski et al., 2016; Yoder and Stone, 2006). As exemplified by Bal et al. (2016), depending on what type of measure is used and how stringent the definition is for inclusion, agreement for determining verbal status across participants varies greatly. To combat the inconsistencies within the field, Tager-Flusberg et al. (2009) put forth comprehensive guidelines to better define levels of spoken language. Within these guidelines, the first verbal stage is defined as children using first words that have an age equivalent of 15 months or older on standardized language assessment, using at least 5 word types (number of distinct words used) and 20 word tokens (total number of words used during an observation), producing consonant–vowel syllables, producing at least four consonants, and using at least two communicative functions (requesting, commenting, etc.). Therefore, children who are performing at a lower rate than described above are considered “preverbal,” which is most consistent with individuals who would be considered “minimally verbal” in other studies. These detailed guidelines provide a comprehensive framework to categorize language and allow for easier comparison of results across future studies.
Another issue contributing to the continued lack of clarity regarding rates of nonverbal and minimally verbal children with ASD surrounds the fact that standardized measures are commonly used to measure language level. However, different measures lend themselves to varied definitions of language level due to the nature of the exact questions or skills assessed by the measure, leading back to the issue of inconsistent definitions of language status. Children with ASD also have considerable fluctuations in motivation and attention and standardized test instruments may yield an inaccurate profile of abilities (Koegel et al., 1997). Moreover, standardized testing often fails to capture more nuanced aspects of language such as a child’s appropriate use morphemes and the quantity that he or she initiates verbally. One method to overcome the shortcomings of standardized assessments is through the fine-grained analysis of language during natural observations. In a rare example of this approach, Tek et al. (2014) performed detailed video coding analyses to study structural language profiles in 17 children with ASD and 18 typically developing (TD) children longitudinally across 18–60 months of age. A computerized language program was employed to analyze grammatical markings (e.g. plurals, past tense, use of articles) used by the children during a parent–child play session. Overall, differences in mean length of utterance (MLU), number of utterances, as well as differences in amount of use of various grammatical markings, were observed between diagnostic groups cross-sectionally and over time. However, substantial variability in expressive language was present in the ASD group and those with higher expressive speech skills demonstrated structural language similar to TD children, while children with ASD with lower verbal skills showed more widespread impairment. One of the most compelling features of this data analysis was the concrete information regarding actual language skills at a detailed level. Rather than interpreting performance on a standardized measure by comparing scaled scores across groups, scores relating to the actual frequency of use of each skill were compared. Such information is easy to interpret, clinically meaningful, and derived from an example of natural interaction. However, studies utilizing a video coding approach, such as the aforementioned study, often rely on small sample sizes to draw conclusions given the labor-intensive process of quantifying the data. As such, there are few studies on use of grammatical markings in children with ASD due to the difficult analysis process (Eigsti et al., 2011).
Despite newer research, due to inconsistencies of definitions, and limitations inherent in standardized assessments, it remains unclear if the rates of nonverbal or minimally verbal toddlers with ASD in an early detected and early treated contemporary cohort are indeed lower than previously reported. Particularly important, but lacking in the literature, are precision studies of language use in naturalistic social communication settings in toddlers with ASD; how such use varies with language skill level; and how social use and skill levels compare with toddlers without ASD who also have language delay (LD). Furthermore, toddlers with ASD and toddlers with LD are often confused with each other because early age standardized language tests may place them at similar levels of delay. Conversely, some toddlers later diagnosed as ASD are misdiagnosed at early ages because they score within the normal range on standardized language tests and are mistaken for typical at early ages because they display good rule-based language (Bacon et al., 2017). Including both ASD and LD toddlers within the same study would allow for the analysis of language deficits specific to a diagnosis of ASD, rather than general delays in language.
Ideally, fine-grained analysis of language use within large samples of ASD and non-ASD contrast groups should provide a robust approach for determining verbal status in a precise manner, rather than relying on rough estimations based on scores from standardized assessments that lack translation to clinically meaningful information. Additionally, this method would offer the opportunity to analyze details of language use that cannot be assessed using broad standardized assessments, such as the structural use of language, to further our understanding of the acquisition of smaller units of language in children with ASD. Finally, such analyses also offer the advantage of allowing for observation of language use during natural interactions, rather than in response to specific, structured prompts. Overall, detailed coding of language samples offers a way to combat restrictions in determining language level by allowing for fine-grained analysis of vocalizations in a more naturalistic setting.
While most studies examine language in ASD across a relatively wide age range (e.g. 5–7 years), here we aimed to provide a detailed characterization of language abilities of children with ASD (compared to language delayed and typical toddlers) at a very specific point in time—age 3—when many children receive their initial diagnosis. Our sample was largely generated by virtue of failure of a broadband screen at a well-baby check-up in pediatric office, and not parent concern. Although this approach is not entirely bias-free, because most referrals were the result of failing a screen at a well-baby check-up within the context of a Level 1 screening program where all children are screened in the general population, the resulting sample may be more representative of ASD in general pediatric community settings than other sample ascertainment strategies used in previous efforts, such as recruiting from special education classrooms and focusing on ASD specialty clinics. Our approach of recruiting from communities throughout San Diego County also mitigates disparities in ASD identification driven by education level, Socioeconomic status (SES), or minority status (Fountain et al., 2011) as well as biases associated with recruiting children who are already receiving community services.Developmental screening is a viable method to reduce these disparities and improve age of detection of ASD for all community members (Herlihy et al., 2014). In the current study, spontaneous language during a naturalistic parent–child play session was examined in one of the largest natural language samples, including over 100 ASD and 100 contrast toddlers, further improving the possibility of a more representative sample. Language was examined in a multifaceted method through analysis of performance on standardized measures and naturalistic language samples using clear definitions of language level (Tager-Flusberg et al., 2009), quantification of usage of grammatical markings, and rate of conversational exchanges between parent and child to provide a comprehensive summary of language ability. Understanding language use of children with ASD in more naturalistic settings at the time of diagnosis may aid clinicians when attempting to disentangle ASD from LD or global developmental delay and may be useful information for future modifications of diagnostic criteria.
Methods
Participants
Eligible participants were recruited from a larger ongoing longitudinal study examining ASD detection in the general population using universal broadband screening starting at 12 months (i.e. “The 1-Year Well-Baby Check-Up Approach,” see Pierce et al., 2011 for more information) as well as from community referrals. One-hundred and nine toddlers with ASD, 61 TD, and 40 with a history of LD were included in analyses. Note that 55% of toddlers in the LD group presented with a transient LD, meaning they showed a delay in language at an early age during intake into our program, but by age 3 they no longer demonstrated that delay. All toddlers in the ASD and TD groups showed consistent diagnoses from intake until age 3. Diagnoses for all toddlers were determined using best practice guidelines, including the combination of standardized testing, observational measures of child behavior, and parent report. Specifically, PhD level psychologists with specialized experience in child development and ASD interviewed the parents about the child’s development, observed the child’s performance on a battery of assessments (described below), and used clinical judgment to make a final diagnosis. Most toddlers entered the program between 12 and 24 months and were reassessed every 9–12 months until 3 years of age when a final diagnosis was given and participation in the current study measures occurred (see Table 1 for further participant demographics). All procedures were approved by the institutional Human Research Protections Program.
Participant demographics.
ADOS: Autism Diagnostic Observation Schedule; ASD: autism spectrum disorder; LD: language delay; RRB; Restricted and Repetitive Behavior; TD: typically developing.
For assessment scores (Mullen, Vineland, ADOS), means are listed with standard deviations in parentheses. Test scores are based on assessments conducted at 3 years of age. Children included in the LD group showed a history of LD, with 45% of the sample continuing to show a LD at 3 years of age (mean initial age detected = 21.1 months; Mullen Receptive Language T Score = 42.1; Mullen Expressive Language T Score = 33.8). Significant differences between groups are represented by asterisks (*p ⩽ 0.05, **p ⩽ 0.01, ***p ⩽ 0.001).
Measures
Autism Diagnostic Observation Schedule
The Autism Diagnostic Observation Schedule (ADOS; Lord et al., 2012) is a semi-structured assessment used to measure behavioral features of ASD. The appropriate module of the ADOS (i.e. Toddler, 1 or 2) was used as a tool to help inform the clinician’s overall diagnostic judgment. All psychologists were research reliable on administering the ADOS.
Mullen Scales of Early Learning
The Mullen Scales of Early Learning (MSEL) (Mullen, 1995) assesses cognitive and motor development through a series of structured tasks and provides standardized scores for visual reception, receptive language, expressive language, and fine motor skills.
VABS
The VABS (Sparrow et al., 2005) is a measure of adaptive behavior through caregiver report and provides standardized scores for communication, daily living skills, socialization, and motor skills.
Parent–Child Interaction Task
The Parent–Child Interaction (PCI) task consisted of a 10-min video recorded, free-play interaction between the child and one parent. The parent–child dyad was given access to a standardized set of age-appropriate toys placed in standardized locations within an observation room. The parent was instructed to play with their child as they normally would at home and the play session was video recorded using two cameras with high-definition zoom and frame switching capability to ensure that the child’s face could be observed if he or she moved. PCI observations were coded for child and parent language use as described below and results compared between diagnostic groups.
PCI Coding
Videos of the PCI sessions were transcribed and then coded using a continuous 5-s partial interval scoring procedure. Using this procedure, the 10-min observation was broken down into 5-s intervals. For the child, videos were coded at the vocalization/word level, morpheme level, and phrase level. Using this information, and guidelines described below, preverbal status was then determined. Parent phrases were also coded based on their function (see Table 2 for further description and examples).
Vocalization/word, phrase, and grammatical coding variables and definitions.
Grammatical morphemes based on Brown’s 14 grammatical morphemes that are acquired prior to age 3 (adapted from Brown, 1973 and Tek et al., 2014).
Child vocalization and word level coding
Utterances coded at the vocalization/word level included any sound made by the child and ranged from unidentified sounds to fully articulated words. Raters recorded whether any of the six possible target behaviors occurred at any point during the interval (see Table 2 for further description). The interval was not marked if the behavior did not occur during the interval, and the interval was only marked once if the behavior occurred multiple times during the interval. If two or more behaviors occurred in the same interval, both were scored (e.g. babbling and a word approximation). The number of full words used provided a quantification of the number of word tokens used during the observation.
Child morpheme coding
Child morpheme coding was modeled after previous research (Tek et al., 2014) and incorporated Brown’s 14 morphemes (Brown, 1973) which delineate morphemes first acquired by toddlers and are ordered by acquisition, as well as use of Wh- questions. Similar to vocalization and word coding, the scorer recorded whether any morpheme was used at any point during the interval.
Child phrase level coding
For child and parent phrases, an exclusive category was selected that best fit the function of the phrase (echolalia, initiation, or response) and was recorded in the interval that it started. The length of each phrase spoken by the child was also coded and the MLU was determined by calculating the average across all phrases spoken.
Parent phrase coding
Parent phrase production was coded based on its function as either an initiation, response, narration, or feedback. Identical to the phrase level coding for the child participants, all parent phrase production was coded in the interval that it started.
Coding interpretation
For child vocalization and word, child morpheme, child phrase, and parent phrase coding variables, a percentage of time engaged in each behavior was quantified. The number of intervals the child engaged in the behaviors was divided by the total number of intervals during the observation and multiplied by 100. This method allowed for ease of clinical interpretation of the frequency of the behavior during the short observation.
Preverbal status
Using the guidelines developed by Tager-Flusberg et al. (2009), performance during the PCI observation was analyzed to determine how many children met criteria for “preverbal status.” Tager-Flusberg et al.’s (2009) guidelines provide several methods for determining preverbal status across different language domains (phonology, vocabulary, pragmatics, and grammar). As we were most interested in actual word use, the definitions for determining preverbal status for vocabulary were used. Therefore, the number of word tokens and word types during the PCI were used to determine preverbal status for vocabulary. Word tokens were determined by calculating the frequency of word approximations and full word used from the PCI Coding. Frequencies of word type were determined for children with fewer than 20 word types and were analyzed through transcription analysis. Children with fewer than 20 word tokens or 5 word types were considered “preverbal status.”
Inter-rater reliability
To assess inter-rater reliability, 33% of the videos were randomly selected and coded independently by two raters. Ratings were compared using partial interval exact scoring where reliability is calculated by determining whether the two raters are in agreement for each interval (both raters say the behavior occurred, or both raters say the behavior did not occur) or are in disagreement (one rater said the behavior did occur while the other said it did not occur). This type of reliability method is a robust, more conservative method of calculating percent agreement as the observers must agree not only exactly when the behavior did occur, but also when it did not occur and can be susceptible to underestimating reliability due to minor disagreements in the exact timing of the behaviors when used during short intervals, such as 5-s intervals as used in the current study (Vollmer et al., 2008). Mean percent agreement for all variables (word, morpheme, and phrase level coding variables) was 96.7% (range, 82.9%–99.9% across variables, and 65%–100% across individual variables for individual participants).
Data analysis
Multivariate analysis of variance (MANOVA) analyses was conducted for each level of coding including vocalization/word level coding, child morpheme coding, and child and parent phrase level coding, to assess for differences between diagnostic groups. Bonferroni corrections were used to adjust for multiple comparisons. The relationship between language ability and social behavior was of particular interest for the ASD group to understand the relationship between measures of language and social behavior and the potential impact of language skills across domains. Therefore, Pearson’s correlations were conducted comparing the relationship between child full word use and child initiations and scores of social skills on the VABS and ADOS.
Results
As expected, differences were seen between diagnostic groups on the battery of standardized assessments—MSEL, VABS, and ADOS within all subdomains (see Table 1). Our naturalistic language sample obtained during the PCI provided more detailed information and revealed that in terms of overall word use, only 3.7% (4 of 109) of the ASD sample and 0% of the LD and 0% of the TD sample used zero words. Results also indicated that 34% (37 of 109) of toddlers with ASD and 7% (3 of 40) toddlers with LD used fewer than 20 word tokens during the PCI observation. Figure 1 shows the distribution of number of word tokens used by those toddlers that used fewer than 20 word tokens in each diagnostic group. Word type was analyzed for toddlers with ASD who presented with 20 or fewer word tokens, and 19 of the 37 toddlers (17% of the entire sample) used fewer than five word types and 4 used no words (3.7% of the entire sample). No toddlers with LD used fewer than five word types.

Distribution of word tokens for participants who used fewer than 20 word tokens during the PCI. Note, all TD children used over 20 word tokens and are not depicted.
MANOVA analyses revealed significant effects between groups for many language features (see Table 3 for full results). The largest differentiation between groups was seen for child babbling (F = 9.85, p < 0.001, η2 = 0.090), child word approximations (F = 38.42, p < 0.001, η2 = 0.271), child full words (F = 56.17, p < 0.001, η2 = 0.351), Wh- questions (F = 27.21, p < 0.001, η2 = 0.208), child mean length of utterance (F = 41.29, p < 0.001, η2 = 0.285), child initiations (F = 42.775, p < 0.001, η2 = 0.292), child response (F = 18.89, p < 0.001, η2 = 0.154), and parent narration (F = 8.06, p = 0.001, η2 = 0.072). The aforementioned variables showed significant differences between ASD and TD toddlers, as well as between ASD and LD (see Figures 2 and 3 for a sample of language features across groups).
Results of MANOVA analyses.
MANOVA: multivariate analysis of variance; ASD: autism spectrum disorder; LD: language delay; TD: typically developing.
Bonferroni corrections used for multiple comparisons.
p ⩽ 0.05, **p ⩽ 0.01, ***p ⩽ 0.001.

Average percentage of intervals child and parent language behaviors occurred by diagnostic group. Significant differences between groups are represented by asterisks (*p ⩽ 0.05, **p ⩽ 0.01, ***p ⩽ 0.001).

Mean length of utterance spoken during the PCI observation by diagnostic group. Significant differences between groups are represented by asterisks (*p ⩽ 0.05, **p ⩽ 0.01, ***p ⩽ 0.001).
Pearson’s correlations revealed overall significant positive correlations between child full word use and VABS Socialization Standard Score (r = 0.593, p < 0.001) and child initiations and VABS Socialization Standard Score (r = 0.469, p < 0.001), and significant negative correlations between child full word use and ADOS Total Score (r = –0.564, p < 0.001) and child initiations and ADOS Total Score (r = –0.585, p < 0.001). However, these differences were largely driven by diagnostic group status. When correlations were performed separately for each diagnostic group, the ASD group was the only group to show significant correlations between child full word use and VABS Socialization Standard Score (r = 0.456, p < 0.001), child full word use and ADOS Total Score (r = –0.242, p = 0.011), and child initiations and ADOS Total Score (r = –0.384, p < 0.001). The ASD and TD groups both showed significant correlations between child initiations and VABS Socialization Standard Scores (ASD: r = 0.363, p < 0.001; TD: r = –0.344, p = 0.007) although in opposite directions.
Discussion
Leveraging one of the largest natural language sample datasets containing over 200 3-year-old toddlers, the results of the present study overwhelmingly refute the idea that up to 50% of toddlers with ASD are either nonverbal or minimally verbal. No matter how nonverbal or minimally verbal is defined, only a small proportion of toddlers with ASD meet such definitions. Only 3.7% (n = 4) of the toddlers with ASD used no words or word approximations at all, and 17% (n = 19) had fewer than five types of words or word approximations. Furthermore, only 34% (n = 37) had fewer than 20 word tokens during a naturalistic free-play observation with a parent. These estimates of rates of preverbal or minimally verbal cases of toddlers with ASD in a naturalistic social setting are much lower than previous standardized assessment-based estimates of about 50% of the ASD population. More recent efforts suggest minimally verbal/preverbal rates are anywhere from 5% to 43% (Anderson et al., 2007; Bal et al., 2016; Norrelgen et al., 2015; Rose et al., 2016). While it is generally agreed that nonverbal rates are now lower than 50% and could be due to a multitude of factors including increased awareness and detection of ASD, changes in diagnostic criteria, and increased access to early intervention (Dawson and Bernier, 2013; Zwaigenbaum et al., 2015), inconsistencies in defining verbal abilities, broad analyses of general performance on standardized measures, and sampling methods have led to a continued lack of clarity of verbal capabilities in children with ASD. In our cohort, there were significant differences between diagnostic groups on standardized assessment of communication (e.g. the MSEL and VABS). These differences were unsurprising as these measures were used by psychologists to determine developmental ability and inform diagnostic impression. Group differences were observed in language and communication scores, providing rough evidence of language deficits in children with ASD. However, further detailed examination of language abilities through video coding of natural language samples in our study provided much more in depth and accurate estimates of verbal abilities. Our study also confirms the assumptions that rates of children with ASD that are preverbal are much lower than documented in early literature.
When examining language abilities across the domains measured, toddlers with ASD showed delays across variables and more impairment than LD and TD groups. For word and phrase level coding, children with ASD performed at a lower level than toddlers with LD, who performed at a lower level than TD toddlers. The communication impairments seen in toddlers with ASD seem to go beyond the deficits seen in children with a history of non-ASD LDs, especially when examining language use during a social interaction rather than a standardized assessment. The social use of language was markedly different in toddlers with ASD compared to LD and TD, as toddlers with ASD also showed marked reductions in initiations, use of Wh-questions, and responses to parent. However, parents of toddlers with ASD initiated to their children just as frequently as other parents, showing they were creating just as many opportunities for reciprocal communication as other parents. Toddlers with LD also showed reduced word use; however, they responded to their parents’ initiations, used Wh- questions, and initiated interaction in general at a similar rate as TD toddlers. Previous work has highlighted a lack of difference in ratings of autism symptoms (e.g. social communication) for children with ASD with and without co-occurring language impairment (Lindgren et al., 2009; Loucas et al., 2008). This distinction further highlights how toddlers with ASD have a specific impairment related to social communication that cannot be fully accounted for by lower language ability.
Toddlers with ASD and LD did show similar usage of grammatical markings, although both were reduced compared to TD toddlers. This suggests that although behind, toddlers with ASD and LD may show typical progression of grammar rather than atypical development. This lack of difference in language ability between ASD and LD highlights how toddlers with ASD may struggle more with social nuances of language rather than learning routine rules of language. Although reduced, grammatical skills were not significantly different from toddlers with LD, as contrasted with differences seen on several measures of social use of language. This is similar to findings from previous literature showing similar rates of acquisition and composition of word types to normative development, although reduced vocabulary size (Charman et al., 2003; Luyster et al., 2007). Hudry et al. (2010) have also demonstrated that children with ASD tend to show more impairment in receptive language relative to deficits in expressive language. The TD toddler understands the meaning of words well before using them expressively, leading to a much larger receptive vocabulary than expressively vocabulary as they are learning to talk. Toddlers with ASD often do not follow this profile showing much weaker receptive language than expressive skills relative to expected performance. Hudry and colleagues suggest that this atypical pattern in language development may contribute to difficulties with social communication and understanding social interactions because others may overestimate a child’s receptive understanding leading to increased difficultly in participating in social interactions. Therefore, although there may be general delays in expressive language and acquisition of grammar, this may not be the driving force behind difficulties with social communication.
Although toddlers with ASD showed reduced language skills overall, there was considerable variability in performance within the group as well. This variation may be due to differences in brain activation to language as described in a recent fMRI study (Lombardo et al., 2015). In examining language heterogeneity within ASD, infants and toddlers with ASD who had poor later language outcomes defined as demonstrating scores lower than one standard deviation from the mean on the MSEL (which would likely include the nonverbal or minimally verbal ASD toddlers in the present study) had little to no brain activation in canonical language systems such as superior temporal cortex at the age of first clinical detection. In contrast, ASD infants and toddlers who later had good or optimal language outcomes defined has MSEL scores in the average or above average range (such as the more language able ASD toddlers in the present study) had strong language activation in typical language processing brain regions. Thus, in ASD, distinctly different neurological language activation patterns are strongly associated with and may explain the wide range language and social communication outcomes described in the present study. Furthermore, correlations between language level and social behavior revealed positive correlations between language level and higher ratings of social skills on the VABS and a negative correlation between language level and rating of social impairment on the ADOS (see Figure 4). However, these differences appear to be largely driven by diagnostic classification as the LD and TD groups show a clear difference in performance on standardized measures as compared to the ASD group, and show overlapping ranges, yet different, rates of language use during the PCI observations. It is largely the ASD group that shows a unique relationship between language and social measures, with the toddlers who have more impaired language use, also having more impairment of social skills. Interestingly, the same relationship is not observed in LD cases, who also show language impairment, albeit, not to the same severity as many of the cases of children with ASD. These correlations highlight the wide range of abilities across children and emphasize the impairments in social skills and language faced by children with ASD in particular. Therefore, increasing the vocabulary, language flexibility, and initiation of social communication are goals of utmost importance for toddlers with ASD to improve core symptoms of the disorder, especially for those children demonstrating severe language deficits.

Correlations between child language use during the PCI and standardized scores of social skills. Panels (a) and (b) represent correlations between VABS Socialization Standard Scores and percentage of full word used by the child and child initiations during the PCI. Panels (c) and (d) depict correlations between ADOS Total Scores and percentage of full word used by the child and child initiations during the PCI.
Overall, our study provides a highly reliable estimate of language profiles of ASD at the 3 years of age due to its relatively large sample size, use of TD and language delayed contrast groups, measurement through detailed naturalistic language sampling, and the fact that the majority of subjects entered the study as the result of being detected by a routine screening questionnaire in general pediatric community settings, rather than high risk (e.g. baby sibling) referrals, autism services settings, or self-referrals from the community. Although not a completely bias-free sampling method, our sample may have fewer biases than samples obtained from treatment centers or diagnostic evaluation centers that may contain children with more severe phenotypes. Limitations of the current study include that 55% of the toddlers in the LD group did not retain their LD classification at 3 years of age. Eighty-five percent of toddlers were referred to our center and evaluated prior to the evaluation at 3 years of age, the focus of this article. Due to the transient nature of language impairments at young ages (Dale et al., 2003; Everitt et al., 2013), a substantial portion of children no longer scored as delayed by age 3. Little is known on how to differentially identify persistent and transient cases of LD at early ages. Moreover, it is still unclear how early delays may affect later outcomes. However, all children were referred for treatment (e.g. speech therapy) once a delay was identified. All children were analyzed in one group due to power limitations and due to similar history. However, this difference may make it more difficult to interpret differences between toddlers with ASD and those with non-ASD LD.
In sum, our study provides updated estimates for expected language outcomes in a contemporary population of children with ASD who began early intervention prior to age 3. The findings highlight that while toddlers with ASD show delays in communication on average, the vast majority demonstrate at least some verbal language, with only 3.7% failing to demonstrate any verbal communication at age 3. Additionally, using the language definition guidelines put forth by Tager-Flusberg et al. (2009), only 34% of the ASD sample was considered to have the lowest categorical rating of vocabulary, further supporting that most children go on to use meaningful verbal communication. The findings here also highlight how toddlers with ASD struggle with the social aspects of language over learning rules such as grammatical markings. Although language was reduced as compared to children with LD, grammatical markings were similar, showing that when they did communicate, grammatical use was only slightly impacted as compared to the ability to initiate conversation or respond to questions. In particular, they showed a marked fivefold reduction in asking questions in a natural social situation (Table 3). This difference provides insight into language goals as well. As toddlers with ASD may be better apt to learn rules of language, intervention may be best suited to focusing on conversation exchanges regardless of the sophistication of language. This type of treatment goal lends itself nicely to naturalistic developmental behavioral interventions (Schreibman et al., 2015). These types of interventions incorporate following the child’s lead, reinforcing and expanding communication, and a focus on play-based teaching environments, rather than therapist led and drill-based teaching in a structured setting. Support for naturalistic developmental behavioral interventions for young toddlers with ASD has been documented by robust RCTs showing improvements in language, social skills, and adaptive behavior following treatment (Baranek et al., 2015; Dawson et al., 2010; Green et al., 2010; Wetherby et al., 2014). Together with the increased push for early identification, early treatment start and use of progressive intervention strategies, the expectations for communication and language abilities for toddlers with ASD have advanced tremendously.
Supplementary Material
Supplementary Material, AUT766241_Lay_Abstract – Naturalistic language sampling to characterize the language abilities of 3-year-olds with autism spectrum disorder
Supplementary Material, AUT766241_Lay_Abstract for Naturalistic language sampling to characterize the language abilities of 3-year-olds with autism spectrum disorder by Elizabeth C Bacon, Suzanna Osuna, Eric Courchesne and Karen Pierce in Autism
Footnotes
Funding
This work was supported by NIH grants ACE P50-MH081755 awarded to E.C. and R01-MH080134 awarded to K.P. The authors sincerely thank the toddlers and families who participated in this research.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
