Abstract
This article reports on the development and use of the Developmental Sentence Scoring for Japanese (DSSJ), a new morpho-syntactical measure for Japanese constructed after the model of Lee’s English Developmental Sentence Scoring model. Using this measure, the authors calculated DSSJ scores for 84 children divided into six age groups between 2;8 and 5;2 on the basis of 100-sentence samples collected from free-play child–adult conversations. The analysis showed a high correlation of the DSSJ overall score with the Mean Length of Utterance. The analysis of the DSSJ sub-area scores revealed large variations between these sub-area scores for children with similar overall DSSJ scores. When investigating the high-scoring children (over 1 SD over group average), most children scored high in three to five sub-areas, but the combination of scores for these sub-areas varied from child to child. It is concluded that DSSJ is a valuable tool especially for language acquisition research. The overall DSSJ score reliably reflects the overall morpho-syntactic development of Japanese children, and the sub-area scores provide specific information on individual acquisition patterns.
Keywords
Introduction
Assessment tools for language development are not available for all languages. Even Japanese, which has a relatively long history of linguistic research, suffers from a shortage of linguistic measures for clinical and research use (Miyata, Nisisawa, & Otomo, 2005; Miyata, Otomo, & Nisisawa, 2007). One way of improving this unsatisfactory situation is through the adaptation of approved English assessment tools. For obvious reasons, this is more straightforwardly performed in the area of vocabulary acquisition than for morphology or syntax. In fact, vocabulary measures, such as the Peabody Picture Vocabulary Test (Dunn & Dunn, 1981) or the MacArthur Communicative Development Inventories (Fenson et al., 1993), have been adapted to a large number of languages including Japanese. Dale (2011) reports an impressive number of 60 languages, but still this represents only 1% of all languages.
In the field of grammar development, the Mean Length of Utterance (MLU; Brown, 1973) is probably the most widely adapted tool. This has to do with the fact that the definitions of MLU can be easily modified to work with other languages. Counting the number of morphemes per sentence is possible also for languages whose grammatical acquisition process is still unexplored. For Japanese, the MLU measure has been in use since the 1990s (Miyata, 1999, 2009; Ogura, Naka, Yamashita, Murase, & Mahieu, 1997; Watamaki, 1993, 1994, 1999). Although MLU has not yet been standardized for Japanese, its usage is currently predominant for Japanese language development research purposes.
While MLU provides a simple and relatively efficient means of measuring grammar development, it is not clear to what degree sentence length reflects grammatical complexity (Crystal, 1974). This is particularly true for a highly elliptical null-argument language like Japanese. The Developmental Sentence Score (DSS; Lee, 1974) measures grammatical development more directly than MLU by scoring the use of selected morphological and syntactical items within sentences. Furthermore, DSS provides more detailed information on the morpho-syntactical development of a child. Sub-scores for individual grammatical areas allow a more detailed assessment and provides valuable information for language therapy (Hughes, Fey, & Long, 1992).
The Developmental Sentence Score (DSS)
DSS was developed in the early 1970s as a paper and pencil procedure. It consists of two parts. In the first part, called DST (Developmental Sentence Types), 100 consecutively spoken sentences are divided into ‘pre-sentences’ and ‘sentences’. Sentences include both subject and predicate, while pre-sentences are incomplete because either the subject or predicate have been omitted. Identical sentences are excluded, even when they are not direct repetitions and/or occur at different locations in the transcript. When more than 50 complete sentences are found in the transcript, it is possible then to proceed with the computations of the DSS (Developmental Sentence Structures).
In the DSS analysis, each sentence is checked for selected grammatical items listed in a DSS table. The table covers eight grammatical sub-areas (‘indefinite pronouns and noun modifiers’, ‘personal pronouns’, ‘main verbs’, ‘secondary verbs’, ‘negatives’, ‘conjunctions’, ‘interrogative reversals’, and ‘WH-questions’). According to the level of grammatical complexity, each item used receives between 1 and 8 points corresponding to eight developmental levels. For example, in the area of ‘conjunctions’, the lowest scoring conjunction is the coordinating and with 3 points, the adversative conjunctions but and if are granted 5 points, while the conjunctive use of question words (e.g. I know where you are.) scores highest with 8 points, according to their increasing syntactic and cognitive difficulty (Lee, 1974, pp. 155–156).
Each sentence is checked for the grammatical items listed in the DSS table. For example, if the child utters a sentence like Who broke my chair, she will obtain 2 points for the WH-question word who, 2 points for the main verb form broke, 1 point for the personal pronoun my, and 1 additional ‘sentence point’ for the complete sentence ‘meeting all adult standards’ (Lee, 1974, p. 157), making a total score of 6 points for this sentence (p. 179). This procedure is repeated for 50 sentences. The scores of all sentences are added and divided by the number of sentences to find an average score per sentence. This average score per sentence constitutes the DSS overall score. Further, it is possible to compute the individual scores for each grammatical sub-area.
The possibility to extract detailed scores for grammatical sub-areas makes DSS an attractive diagnostic tool for assessment, as well as the evaluation of therapeutic intervention. Hughes et al. (1992) studied the development of 21 children with language delay. They compared 10 children receiving language therapy to 11 children without therapeutic treatment. Samples of 50 sentences were collected before and after five months of treatment. Higher DSS scores clearly reflected the effect of the intervention for the group who received language therapeutic treatment.
A number of studies found a dissociation of MLU and DSS for several clinical populations. Rondal and DeFays (1978) found that children with Down syndrome in MLU stage III (2.50–3.00) had a lower DSS score than MLU-matched typically developing children. Mortimer and Rvachew (2010) examined the development of toddlers with speech sound disorders and found that these children had problems with verb morphology. Even in the case of such a toddler with a typical MLU, they scored lower on DSS than typically developing children.
Several studies compared the scores of the Index of Productive Syntax (IPSyn; Scarborough, 1990) with DSS and MLU. IPSyn is a grammatical measure similar to DSS, but using a different scoring method. While DSS produces an accumulative score based on each occurrence of a given grammatical form, IPSyn only scores the first and the second occurrence of a form. Oetting, Cantrell, and Horohov (1999) studied the influence of dialect variation on MLU, DSS, and IPSyn scores of children with Specific Language Impairment (SLI) in comparison with age-matched as well as MLU-matched typically developing children in the age range of 4–6 (N = 31). They found that DSS, but not IPSyn, was sensitive to the morphological limitations due to language impairment. Dialect variation had no influence on any of the measures. However, Holdgrafer (1995), who compared DSS and IPSyn of samples from 19 typically developing and 10 language delayed children between 3;7 and 5;0, found IPSyn more sensitive to language impairment than DSS. Both studies found a strong correlation between DSS and IPSyn scores for typically developing children. Rice, Redmond, and Hoffman (2006) found MLU as useful as DSS and IPSyn for the assessment of SLI children. Their study compared MLU with DSS and IPSyn for 124 SLI and MLU- and age-matched typically developing children. The SLI and MLU-matched children developed in a similar way, and no mismatch of MLU and DSS was observed for the SLI children. DSS and MLU were highly correlated for both groups.
Some studies included also older non-typically developing children. Finestack and Abbeduto (2010) compared narrative samples from highly verbally expressive adolescents with Down syndrome, Fragile X syndrome, and mental-age matched typically developing children on DSS and Oral and Written Language Scales (OWLS). The participants with Fragile X syndrome scored higher on DSS than the Down syndrome group. These higher scores were caused by a higher frequency of grammatical items rather than qualitative differences. This corresponds to the study of Reed, Griffith, and Rasmussen (1998) who examined 8- to 17-year-old typically developing children and adolescents. They found a substantial difference between the 8-year-olds and the older children for the DSS overall score and all sub-areas except secondary verbs. The higher scores for older children were mostly due to an increased frequency of use. This suggests that DSS can be applied to a wider age range than MLU, which is commonly used to evaluate children only up to age 3 or 4.
When extending the age limit of DSS, the method of elicitation appears to play a role. Especially open-ended questions and narratives seem to be effective in eliciting morpho-syntactically complex utterances. Fields and Ashmore (1980) specifically investigated the influence of data elicitation on the morpho-syntactical complexity captured by DSS, MLU, and inflectional scoring for 10 language delayed and 10 typically developing children between 4;6 and 6;6. They found that open-ended questions yielded higher DSS scores than picture tasks and also higher than data obtained from unstructured wireless telemetry recordings. Finestack and Abbeduto (2010) sampled narratives from their adolescent participants with Down syndrome and Fragile X syndrome. In fact, narratives or structured interviews with open-ended questions might be more effective than free-play situations for eliciting morpho-syntactically complex samples from older children. Kemper, Rice, and Chen (1995) analyzed narratives from 62 children 5–10 years of age and found a rapid developmental increase of MLU and DSS up to the age of 6, followed by a slowdown until approximately 8 years of age.
A second important factor is sample size. Johnson and Tomblin (1975) compared the reliability of DSS based on 50-sentence speech samples of 50 children aged 4;8–5;2. The original speech samples were cut into five-sentence slices and recombined to sample sizes varying between 10 and 250 sentences. As expected, the reliability was growing with sample size, but was still relatively low for 50 sentences (r = .75) and 100 sentences (r = .86). Only with 150 sentences a reliability of r = .90 was reached. This method of combining samples from different children, however, is based on the assumption that the children of an age group obtain a similar DSS score, which is not necessarily the case. We think that an intra-individual comparison of different sample sizes is necessary for the investigation of sample size effects. Even so, this study suggests that the commonly recommended sample size of 50 sentences for DSS might be too low to yield reliable results.
Although a number of studies indicate an interesting relationship between DSS and MLU, as well as IPSyn, there has not yet been any systematic comparison of profiles for typically developing children and children with different types of disorders. Also, the sampling method and the minimal sample size issue should be further investigated, in order to obtain reliable cross-sectional results and to determine the upper age limit for typically developing children.
Constructing a Developmental Sentence Score for Japanese (DSSJ)
Overall, DSS appears to be an attractive measure of morpho-syntactic development. DSS covers a relatively wide age range and delivers detailed information on development within separate grammatical areas, which makes it a promising tool for the research and the assessment of typically and atypically developing children. We therefore decided to construct a morpho-syntactic index for Japanese based on the model of the English DSS. When adapting DSS to Japanese we faced the following problems.
Selection of items and stage assignment
Japanese morpho-syntactical structure is very different from English. This means that DSS items and their scoring cannot be directly transferred from English to Japanese. Even for items existing in both languages, the timing of acquisition can be different because of sociolinguistic factors (e.g. personal pronouns). Rather than looking for equivalents of the original English items, we selected grammatical items based on close observation of the development of eight Japanese children aged 2–5.
In order to reflect the general acquisition process of typically developing Japanese children, we decided to only include items that were acquired in the same order for the eight children who had been investigated longitudinally (Miyata et al., 2006, 2009; see Online Appendix A). The selected items were grouped in the following nine sub-areas: (1) verb final inflection, (2) verb middle inflection, (3) copula inflection, (4) conjunctions and conjunctive particles, (5) noun phrase structure and compounds, (6) case, topic, focus, and quotative particles, (7) adverbs, (8) sentence modality markers and formal nouns, and (9) final particles.
The selected items were grouped into five developmental stages. For example, for verb final inflection, present tense (taberu eat-PRES ‘he eats’), past tense (e.g. tabeta eat-PAST ‘he ate’), and colloquial imperative (tabete! eat-IMP:te ‘eat!’) were grouped into stage 1, while the hortative (tabeyoo eat-HORT ‘let’s eat’) was classified as a stage 2 item. For case particles, the nominative ga and locative ni were grouped into stage 1, while the accusative particle
Sentence point
The DSS score includes a ‘sentence point’ added for complete sentences ‘meeting all adult standards’ (Lee, 1974, p. 157) as a criterion reflecting growing grammatical competence. The sentence point cannot easily be transferred to Japanese. Because of the optionality of arguments and case particles, only a few utterances of young Japanese children would be considered ungrammatical, especially in the early stages. Only when the child starts to use more complicated constructions will inflection or case particle errors increase and non-adult standard utterances appear. Ironically, the application of the sentence point system would therefore result in a reverse effect: younger children are more likely to gain sentence points than older children. Because of this, we decided to forego the ‘sentence point’ in the Japanese version of DSS.
Pre-sentences and sentences
Similarly, the distinction between ‘pre-sentences’ and ‘sentences’ proved difficult in Japanese. Lee (1974, p. 82) defines sentences as utterances where ‘subject and verb are both spoken’, whereas pre-sentences are ‘utterances which contain only a partial subject-verb grammatical structure’. It is obvious that this distinction is not applicable to a null-argument language. In Japanese the portion of sentences without any overt subject and/or object is high (Guerriero, Oshima-Takane, & Kuriyama, 2006; Hirakawa, Oshima-Takane, & Ito, 2009; Kuno, 1973; Martin, 1975), and sentences consisting only of verb or adjective are ‘conceived of as complete and are grammatical’ (Tsujimura, 2007, p. 255). The DSSJ therefore disregards the distinction of pre-sentence and sentence and analyzes the complete sample of 100 sentences.
How reliable is this resulting Japanese version of DSS? In order to test DSSJ, we analyzed cross-sectional speech samples from children between 2;8 and 5;2. On the basis of these samples, we investigated the following questions. (1) Does DSSJ accurately evaluate the grammatical development of Japanese children? (2) How strong is the correlation of DSSJ with MLU? (3) Do children who score high on DSSJ also display high MLU scores, and vice versa? (4) Do children with high overall scores also score highly in the same DSSJ sub-areas, or do individual differences exist? (5) Which sub-area scores increase most during development? (6) Can DSSJ be computed with fewer than 100 sentences?
Method
Sample
The cross-sectional samples used for this study consisted of dyadic adult–child interactions of 84 children (31 boys and 53 girls) acquiring Japanese as their native language. The observations included six age groups ranging from 2;8 to 5;2, each group consisting of 14 children (Table 1).
Specification of the data samples.
We used cross-sectional data deriving from a variety of studies. Fifty-three recordings were collected especially for this study in kindergartens in the Chubu area (Prefectures Aichi, Mie, and Gifu; children 2, 8–14, 21–28, 41, 42, 55–84) and in the Tokyo area (children 15–19). The samples were video-recorded using an external microphone for better sound quality. Additionally, a digital voice recorder was placed near the children. After a warming up book-reading session, the trained interviewers engaged each child in dough-play and elicited about 120 utterances. The elicitation usually required 20 minutes of interaction.
Another 24 samples were taken from a larger sample collected by Jessika Tsubakita at several kindergartens in the cities of Kyoto and Uji (children 29–40, 43–54). For our study we extracted 100 child sentences from 20-minute mother–child sessions using play-dough. These sessions took place in a room separate from other students in the kindergarten. Children who uttered fewer than 100 fully comprehensible sentences and one child with very unclear pronunciation were excluded from this study. The following samples were randomly taken from longer free-play mother–child interactions at home (father–child interaction in the case of the Ishii corpus). Children 1, 4, and 5 belong to a longitudinal data set collected in Tokyo and Saitama area by one of the co-authors, Keiko Itoh, and child 57 belongs to a data set of six mother–child conversations provided by the co-author Kiyoshi Otomo. The Ishii data (child 20; Jun) is taken from the longitudinal data of the Ishii corpus (Ishii, 2004), and children 3 (Njd), 6 (Tom), and 7 (Als) are early recordings (2;8) from the MiiPro corpus (Miyata & Nisisawa, 2009, 2010; Nisisawa & Miyata, 2009), available through CHILDES from http://childes.talkbank.org (MacWhinney, 2000).
Transcription and scoring
All samples were transcribed in Japanese and Latin script (Miyata, Muraki, & Morikawa, 2004) using the Wakachi2002 v.3.0 format proposed by Miyata (2006) and provided with morphological tags in JMOR03 format (Miyata & Naka, 2006). We selected 100 fully comprehensible consecutive utterances. Complete and immediate imitations, self-repetitions, rote-learned phrases like song texts, nursery rhymes, and commercials were excluded. Furthermore, identical utterances were excluded, even if occurring at different locations in the transcript.
From these ‘cleaned’ transcripts, we ran the MLU and the DSS commands of the CLAN program (MacWhinney, 2000). For MLUm (Mean Length of Utterance in Morphemes) we used the following command based on the definitions outlined in Miyata (2012): mlu +t%mor +b+ -sco* -sonoma* +d1 +t*CHI @. For DSS we compiled a specific Japanese library file (dssrulesjp.cut) that is now included in the Japanese MOR package distributed by CHILDES. For DSS we used the following CLAN command: dss +ddssrulesjp.cut +lj +b*CHI +e @.
Results
Mean DSSJ and MLUm scores of the six age groups.

Mean DSSJ overall scores in comparison to MLUm of the six age groups.
Mean DSSJ sub-area scores for the six age groups (SD in parentheses).
Abbreviations of the areas: VM middle verb inflection, VL last verb inflection, ADJ adjective inflection, COP copula inflection, CNJ conjunctive particles and conjunctions, NP noun phrase structure and compounds, CASFQ case, topic, focus, and quotative particles, ADV adverbs, SMOD sentence modalizers, FINP final particles.
Discussion
DSSJ overall scores
Overall, we found a steady increase of the DSSJ overall score as age increased for the six age groups between 2;8 and 5;2, and a strong correlation with average DSSJ scores and age. However, the oldest age group (5;2) scored slightly lower than the 4;8 age group. As we can assume that children’s morpho-syntax improves over the years, we conclude that DSSJ reflects this development, at least for the age span of 3–5.
We found a strong correlation between DSSJ and MLUm. This corresponds to previous research on the English DSS, which showed a strong correlation to MLU, at least for typically developing children, while children with language disorders regularly scored lower on DSS than on MLU (Mortimer & Rvachew, 2010; Oetting et al., 1999; Rice et al., 2006; Rondal & DeFays, 1978). Indeed, we found a very strong correlation for the Japanese DSSJ and MLUm (r = .94 for the group scores and r = .95 for the individual children). This was also true for the children scoring high on DSSJ (1 SD higher than average): they all scored higher than average on MLU as well.
The strong correlations show that there was no dissociation of DSSJ and MLUm in the older age groups. This suggests that DSSJ as well as MLUm might be valuable even after age 4 or MLU 4.0, respectively, the upper reliability limit that has been suggested for the English MLU (Bernstein & Tiegerman-Farber, 1997; Bloom & Lahey, 1978; Brown, 1973; Klee & Fitzgerald, 1985; Scarborough, Wyckoff, & Davidson, 1986). Other researchers found MLU to be reliable up to age 5 (Miller & Chapman, 1981) or even 6 (Chabon, Kent-Udolf, & Egolf, 1982), and found correlations between MLU and other grammatical measures for these age groups (Kemper et al., 1995). This upper limit does not mean that MLU scores do not continue to increase after age 6, but we would expect a larger degree of variation within the samples of individual children and thus a lower reliability. There are fewer studies investigating the upper age or score limit for DSS, but it appears that DSS is applicable for an age range longer than that for MLU. Reed et al. (1998) found that 11-year-olds scored higher on DSS than 8-year-olds for most grammatical items mostly because the items were used more frequently. On this basis we might expect a higher age limit for the Japanese DSSJ as well.
The evidence mentioned above suggests that the lack of increase of the DSSJ score which we found for the 5-year-olds compared with the younger 4;8 group is not necessarily due to an upper limit of applicability of DSSJ. The MLU scores for the older group were similarly low and strongly correlated to the DSSJ overall scores. The slightly lower average for the older group could also not be attributed to a single low-performing child but rather reflected an overall tendency of this age group. Ten out of 14 children of the older group scored lower than the average 4;8 child. Also, in most sub-areas, the average scores for the 5;2 age children were slightly below the scores of the 4;8 age group, although the difference was not significant.
Furthermore, the case of the two highest scoring children (DSSJ 7.93, MLU 7.71, and DSSJ 6.29, MLU 6.65, respectively) speaks against any leveling off of DSSJ at age 5 or DSSJ 4.0. These figures show that complex morpho-syntactic structures are recognized by DSSJ and MLUm alike, at least up to a level of DSSJ 7.0.
If we disregard the possibility of DSSJ leveling off at this early stage, what prevented our 5;2 age children to perform better than our 4;8 age group? The reasons for the comparably low DSSJ and MLU results remain unclear. It is possible that our elicitation method of free-play child–adult interaction is less effective for this age group.
In our study, free-play with play-dough was used to stimulate child–adult conversations. This same style of play was used for all children, no matter the age group, to ensure comparability of data for each age group. Play-dough is a preferred toy of kindergarten children, and most children over 3 years old like to play with it because of its formability, colorfulness, and creative usage.
Nevertheless, using play-dough during child–adult conversation might not be the ideal speech elicitation method for older children. In our study the 5-year-old children tended to be less communicative than the 4-year-olds. When fascinated by a topic, the older children produced detailed explanations using complex sentence structures, but there were a number of situations where they answered only monosyllabically when prompted by the interviewer. This is in line with our experience with longitudinal observation. Children over 4 years old, especially boys, were less intrigued by the prospect of playing with their mother, and preferred to play outside with their friends. It is also possible that older children are more inhibited in conversations with adults than are younger ones. Japanese children of this age live in a world that is only partially shared by adults, and they increasingly become aware that adults are not only less knowledgeable about Pokémon, Purikyua, and Gokaiger, but also consider this information as irrelevant. As a result they become more reluctant to talk about their interests with adults. For older children, therefore, free conversation might be less effective for elicitation of complex speech.
Although requiring more effort from both children and investigator, narratives might be a better option for older children. Kemper et al. (1995) analyzed narratives of children aged 5–10, and found an increasing MLU up to age 7. Similarly, a preliminary analysis of Japanese narratives elicited from 3- to 11-year-old children (Inaba, 2013) suggests that also in the case of Japanese children, MLU scores continue to increase up to at least age 6. While the 3-year-olds found it difficult to retell the story even with much support from the investigator, most children over 5 years old retold the story willingly and more skillfully, even though it was obvious that they struggled with the complicated story line (Inaba, personal communication).
Further investigation of the elicitation method used, especially with respect to the older age groups, is necessary, but it is possible that for more accurate results, a switch from free conversation to a more suitable task like narration is necessary for children over 5 years of age.
DSSJ sub-areas
While we found a steady increase in the overall score of DSSJ for all ages, the scores of the single sub-areas increased to a differing degree. The scores for verb inflection were high even in the youngest age group and continued to increase as the age increased. Likewise, noun phrase structure and compounds, conjunctions, and case particles also showed substantial increases as the age increased. However, other sub-areas like adjective inflection, adverbs, and sentence modalizers, increased to a lesser degree.
In addition, individual children showed differential performance across sub-areas. When we investigated the sub-area scores of the high-scoring children (more than 1 SD over the average of their age group), individually they displayed very different patterns. Only one child scored higher than average in all sub-areas but one. Most of the other children scored highly on three to five sub-areas, but these sub-areas varied according to the individual child reflecting his or her specific speech style. As a result there are different ways to reach a high DSSJ overall score. This recalls the study by Rollins, Snow, and Willett (1996), who found significantly different rates of morpheme use within a group of children of the same age and MLU for English. They concluded that MLU is a global measure behind which differences of morpho-syntactical developmental style might hide. The same can be said of the DSSJ overall score.
Overall, we can see tendencies that are different for the younger children than for the older ones. Most younger high-scoring children up to 3;8 scored higher than average on finite verb inflection, conjunctions, and case particles, while the older children’s high scores were concentrated on conjunctions, complex noun phrases, and compounds. Therefore, many high-scoring children appeared to be advanced especially in the areas of basic syntax and sentence conjunction, although the individual pattern differed from child to child.
Sample size
The research on the English DSS as well as on MLU suggests that the sample size plays an important role in the reliability of the results and suggests that the recommended size of 50 sentences for DSS might be too small (Johnson & Tomblin, 1975). We therefore compared DSSJ scores based on 100 sentences to samples of 50 sentences, analyzing the first and the second halves of the sample separately. Although we found relatively strong correlations of these half-sized samples with MLUm, it turned out that the correlation between the corresponding halves was low (r = .51), and we conclude that DSSJ needs more than 50 utterances to yield a reliable result. This is possibly related to a relatively high portion of sentences consisting of only one morpheme. As mentioned above, we had to disregard the notion of pre-sentences as opposed to complete sentences because Japanese is a null-argument language, and as such, sentences consisting only of a verb are as complete as sentences that include both subject and verb. As a result, our samples include sentences consisting of only one morpheme, for example a simple noun. These simple sentences, which are quite commonly used even by children in the older age groups, display no morpho-syntactic knowledge and therefore do not contribute to the increase of the DSSJ overall score.
It is possible that this portion of single-morpheme sentences makes a higher number of sentences necessary for DSSJ compared with the English DSS. In fact, it is not clear whether 100 sentences give a sufficient sample size. Future research should investigate the reliability of samples larger than 100 sentences in combination with different elicitation methods. For the time being we assume that a sample size of 50 utterances is not sufficient for the Japanese DSSJ.
Conclusions
Overall we conclude that the DSSJ overall score reliably reflects the morpho-syntactic development of Japanese children. As we found a strong correlation between DSSJ and MLUm for all age groups including the 5-year-olds, the question arises why one would ‘bother’ to calculate DSSJ if one can obtain the same result with the less complicated MLUm. If both measures are similarly sensitive to the morpho-syntactic development, what is the advantage of DSSJ? Compared with MLU, DSSJ scores grammatical items directly, and besides the overall score, more detailed results for the different grammatical sub-areas can be obtained. While taking a 100-sentence sample for DSSJ might require time and effort considered too excessive to be practical in clinical assessment, the more detailed analysis provided by DSSJ will be beneficial for research on the characteristics of different types of children including children with language disorders.
The current study examined the newly developed morpho-syntactical measure DSSJ in comparison to MLU. Based on speech data from 84 Japanese-speaking children between 3;2 and 5;2, we could show that DSSJ is a reliable measure at least up to age 4;8. The DSSJ scores of the oldest age group (5;2) did not increase in comparison to the group which is 6 months younger, but the correlation with MLU was equally high. The reason for this lower performance of the oldest group is not clear. It is possible that our method of data elicitation (free-play child–adult conversation) was less effective for the oldest children, and that narratives might yield better results for older children. An examination of the sample size (50 vs. 100 sentences) revealed a low temporal reliability for 50-sentence samples for all age groups, which indicates that DSSJ should be used with samples larger than 50 sentences. Research concerning the elicitation method and the optimal sample size is needed.
Footnotes
Acknowledgements
We like to thank all the participating children and their parents. Without their support this research would not have been possible.
Funding
This research was supported by a research grant from the Ministry of Education, Science, Sports and Culture through the Grant-in-Aid for Scientific Research (A)(2)(1999-2001) No. 11694009 to Hidetosi Sirai, (B)(1)(2001-2003) No. 13410034 to Kiyoshi Otomo, (C)(1) No. 12610567, and (B)(2006-2008) No. 18330141 to Susanne Miyata.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
