Abstract
This article reports on an experiment investigating the effects of featural Relativized Minimality (Friedmann et al., 2009) on the representation and processing of relative clauses in the second language (L2) English of Mandarin speakers. Object relatives (ORCs) are known to cause greater problems in first language (L1) acquisition and in adult processing than subject relatives (SRCs). Featural Relativized Minimality explains this in terms of intervention effects, caused by a DP (the subject of the ORC) located between the relative head and its source. Intervention effects are claimed to be reduced if the relative head and the intervenor differ in features, such as number (e.g. I know the king who the boys pushed). We hypothesize that L2 learners will show intervention effects when processing ORCs and that such effects will be reduced if the intervenor differs in number from the relative head. There were two tasks: picture identification and self-paced reading. Both manipulated relative clause type (SRC/ORC) and intervenor type (±plural). Accuracy was high in interpreting relative clauses, suggesting no representational problem. Regarding reading times, ORCs were processed slower than SRCs, supporting an intervention effect. However, faster reading times were found in ORCs when intervenor and head noun matched in number, contrary to hypothesis. We suggest that our more stringent stimuli may have resulted in the lack of an effect for mismatched ORCs, in contrast to some earlier findings for L1 acquirers.
Keywords
I Introduction
The usefulness of a competence performance distinction, such as that proposed by Chomsky (1965, and subsequently), has been much debated. Such a distinction has been adopted, implicitly or explicitly, in many different domains of language acquisition, particularly in research coming from the generative linguistic perspective. The distinction is often formulated as a representational versus processing issue, namely the unconscious linguistic knowledge that speakers attain and their access to that knowledge in real-time sentence comprehension or production. In terms of methodology, offline measures are typically considered to reflect linguistic representations, whereas online measures are taken to indicate processing. In fact, one cannot make such a rigid distinction between what offline and online measures show: sometimes online measures provide a better reflection of linguistic representation while offline measures show the effects of processing (e.g. Chondrogianni and Marinis, 2012; Hopp, 2009; Juffs, 2004; Slabakova et al., 2017; Sorace and Filiaci, 2006).
In the field of second language (L2) acquisition, there has been considerable discussion of the extent to which second language learners and speakers (henceforth L2ers) are similar to or different from native speakers in terms of the nature of their linguistic representations and/or their processing resources. A question has been whether or not any differences that are found reflect non-convergence as far as linguistic competence is concerned. In fact, it has been suggested by a number of researchers that, while L2ers’ linguistic representations may converge with those of native speakers, their processing or computational capacity, or working memory span, can lead to performance differences (see, amongst others, Felser and Cunnings, 2012; Hopp, 2010; Slabakova et al., 2017; Sorace, 2011).
In this article, we investigate how L2ers represent and process English relative clauses in comprehension, using offline and online measures. We will revisit the well-known observation that L2ers find object relative clauses (ORCs) more difficult than subject relative clauses (SRCs), but considered in the framework of Relativized Minimality (RM) (Rizzi, 1990). We expect the difficulties that adult L2ers have with ORCs to be similar to the difficulties shown by adult native speakers, reflecting processing problems, and different from younger child first language (L1) acquirers, for whom relative clauses impose a greater computational burden.
1 Relative clauses and RM
The types of relative clauses at issue are SRCs and ORCs, the difference being the role of the relative head within the clause. In (1a), the head noun (the king) is interpreted as the subject of the relative clause, whereas in (1b) it is the object.
(1) a. SRC: The king who _ pushed the boy. b. ORC: The king who the boy pushed _.
ORCs are known to cause greater problems than SRCs in L1 acquisition (see, amongst others, Adani et al., 2010) and in adult processing (see, amongst others, Gordon et al., 2001), as discussed below. To account for these differences between relative clause types, Friedmann et al. (2009) and Villata et al. (2016) advance featural Relativized Minimality (see (2)), which incorporates revisions to the original Relativized Minimality (RM) proposal put forward by Rizzi (1990).
(2) Featural Relativized Minimality In a configuration X . . . Z . . . Y . . . a local relation between X and Y is disrupted when: a. Z structurally intervenes between X and Y (i.e. Z c-commands Y and Z does not c-command X); b. Z matches the specification in morphosyntactic features of X.
RM (including featural RM) is a syntactic locality principle, governing the behaviour of various kinds of dependencies, including the A’ dependencies involved in relative clauses. A dependency relationship must be established between the relative head (= X) and the position (the gap) from which movement has taken place or where the head is interpreted (= Y), in other words, the subject or object position, depending on the type of relative clause. A local dependency can be disrupted by intervening material, the disruption being worse if the intervenor shares certain kinds of features with the elements that it intervenes between. In the case of ORCs, intervention effects occur because the NP subject of the ORC (= Z) is located between the relative head and its source, making it harder to establish the appropriate dependency (see 3a). In the case of SRCs, there is no intervening NP, as shown in (3b).
(3) a. The king who X Z Y b. The king who _ pushed the boy. X Y
A further claim is that intervention effects are reduced if the relative head and the intervenor differ in certain features, such as number; the idea is that a mismatched intervenor is less likely to be interpreted as a potential local filler for the gap; compare (4a) and (4b). However, only features that participate in the computation of Aʹ dependencies, such as phi-features, can have this kind of effect (e.g. Belletti et al., 2012; Friedmann et al., 2009).
(4) a. Feature match (sg sg): I know b. Feature mismatch (sg pl): I know
2 Previous research
Investigation of relative clause acquisition and processing is of long-standing. 1 Here, we concentrate on research that is relevant to potential intervention effects, particularly the role of featural differences. A number of studies on child L1 acquisition and adult L1 processing have been used to support the claim that ORCs are harder to acquire/process than SRCs, and that acquisition/processing of ORCs is easier when there are featural differences between an intervenor and the elements that it intervenes between.
The work of Gordon et al. (2001, 2004) provides examples from adult processing, although not conducted within the RM framework. In both of these studies, processing (reading times) of English relative clauses by adult native speakers was investigated, by means of a series of self-paced reading tasks. Stimuli included SRCs and ORCs; the NP occurring in the relative clause was sometimes of the same category as the head noun and sometimes different (NP/pronoun/proper name). See (5a) (SRC) and (5b) (ORC).
(5) a. The barber that admired the lawyer/you/John climbed the mountain. b. The barber that the lawyer/you/John admired climbed the mountain.
The results show that ORCs take longer to process than SRCs (as measured by reading times at the critical region) when the same type of NP occurs as the relative head and within the relative clause (e.g. the barber / the lawyer). When the NPs were mismatched (e.g. the barber / you or the barber / John), there was no longer an advantage for SRCs. In other words, the effect of a mismatch was to increase the speed of processing of ORCs.
As for L1 acquisition, several recent studies have looked at relative clause acquisition in different languages from the perspective of featural differences between the head noun and potential intervenors. A number of differences have been examined, including NPs versus pronouns (similar to the adult studies discussed above), as well as features like gender and number. For these comprehension studies, a general finding is that accuracy is lower on ORCs than SRCs, with ORCs sometimes being misinterpreted as SRCs. When there is a mismatch between certain characteristics of the intervenor and the head, accuracy on ORCs improves significantly.
Friedmann et al. (2009) look at the comprehension of Hebrew relative clauses by children aged 3;7 to 5 (3 years and 7 months to 5 years). Tasks involved matching orally presented sentences to pictures or to scenarios acted out by the experimenter. Children performed above chance on headed SRCs but at chance on headed ORCs. When the relative clause included an arbitrary pro (the cow / someone) instead of an NP similar to the head (the cow / the chicken), there was dramatic improvement in performance on ORCs, suggesting beneficial effects of a mismatched intervenor.
Belletti et al. (2012) compare effects of gender in L1 Hebrew and L1 Italian children, aged 3;9 to 5;5. Once again, the task involved picture matching. For Hebrew, while 14 out of 31 participants performed at chance on ORCs when the gender of head and intervenor matched, eight of the 14 showed significantly better performance on ORCs when the gender was mismatched. For Italian, on the other hand, it made no difference whether gender was matched or mismatched, which the authors explain in terms of syntactic differences in how gender works in the two languages; only in Hebrew is gender an inflectional feature in the relevant sense, as it shows agreement on the verb. Adani et al. (2010) similarly found that gender mismatches in Italian had no effect.
We turn now to number, the feature we are concerned with in the present study. Several studies have shown an advantage for comprehension of ORCs when the intervenor differs in number from the head. Adani et al. (2010) look at Italian-speaking children aged 5;3 to 10;1, while Adani et al. (2014) investigate English-speaking children with specific language impairment (SLI) (ages 9;5 to 16) compared to typically developing children matched for grammatical development (ages 6;0 to 7;5). In both studies, the task involved picture identification, administered on a computer screen. In both studies, a number mismatch between intervenor and head significantly improved performance on ORCs, for children with SLI as well as typically developing children of all ages tested. The five year olds in Adani et al. (2010) were particularly inaccurate, with 41% accuracy on matched ORCs, and 64% accuracy on mismatched ORCs. The SLI group in Adani et al. (2014) performed even less accurately, while the typically developing children were at around chance on matched ORCs; both groups nevertheless showed significantly better performance on mismatched items.
Contemori and Marinis (2014) look at the processing of relative clauses by English-speaking adults and children (ages 6;0 to 8;11), looking at the effects of number mismatches on accuracy and speed in online interpretation of ORCs. The task was a self-paced listening task. Participants listened to sentences presented phrase by phrase, pressing a key after each phrase, while looking at two pictures on a computer screen. At the end of each sentence, they had to select which of the pictures matched the sentence. This provided an accuracy measure. Their response times (RTs) after each segment provided a measure of processing. Children were quite accurate overall (78% or higher) but nevertheless more accurate in the mismatched cases than the matched cases. As for RTs, these authors found an effect for number but not for match vs. mismatch. The RT results suggest an effect for plurality at different segments, with plural (pl) nouns generally faster than singular (sg) at the NP1 position (head noun) and the NP2 position (noun within the relative clause).
Researchers have also investigated other filler-gap dependencies, including wh-questions and clitic left dislocation, and have reported that a number mismatch assists children in processing (e.g. Contemori et al., 2018; Durrleman et al., 2016; Manetti et al., 2016). It is worth pointing out that stimuli used in these L1 studies (for both Italian and English) in fact provide two overt cues to number within the relative clause, hence introducing a possible confound (as also noted by Adani et al., 2010). That is, number is indicated not only on the NP subject of the relative clause (the intervenor) but also on the verb, as shown in (6) (test items from Adani et al., 2014).
(6) a. The hippo that the rhino is washing . . . b. The hippo that the rhinos are washing . . .
One could argue that such items do not provide a pure test of the effects of featural RM, since the verb is not a potential filler for the gap but nevertheless provides an extra indication of the difference between matched and mismatched cases. Since inflectional features are implicated in relativized RM, it is not surprising that they show up overtly on the verb as well as the subject. Bentea and Durrleman (2017) controlled for this issue by looking at relative clauses in French-speaking children (ages 5 to 8 years). In French, verbal agreement can be overt or covert, depending on the conjugation class of the verb in question, as shown in (7) (example from Bentea and Durrleman, 2017).
(7) a. Le garçon lave le chat. The boy washes the cat. b. Les garçons lavent le chat. The boys wash the cat.
In verbs of the first conjugation, like laver ‘wash’, the singular and plural verb forms in the present tense, lave / lavent, are pronounced in exactly the same way (namely, [lav]). For verbs of the other conjugation classes, singular and plural verb forms are phonologically distinct. Note also that the plural inflection on the noun garçons in (7b) is not pronounced; it is the plural article les that indicates that this NP is plural.
These researchers found that French-speaking children, like Italian- and English-speaking children, had difficulties with ORCs but not SRCs in an aurally presented picture-matching task. As far as number match versus mismatch was concerned, mismatches did not improve the performance of the 5 year olds but did yield better performance in the older children. Crucially, the mismatch effect in the older children was observed regardless of whether inflection was audible on the verb. Bentea and Durrleman suggest that overt inflection on the verb is not crucial; rather, covert inflection yields the same effects. Alternatively, their study could be taken to show that it is the number marking on the NP (via the article in the case of French) that is crucial and that the additional verbal inflection plays no role. We return to this issue in the discussion.
As far as L2 acquisition is concerned, there has been considerable research on L2 acquisition of relative clauses (e.g. Doughty, 1991; Eckman et al., 1988; Gass, 1982), focusing on aspects that are harder or easier to learn. Much of this early research investigated whether the Noun Phrase Accessibility Hierarchy (AH) (Keenan and Comrie, 1977) predicts the difficulty of learning different kinds of relative clauses. The AH places relativization on an implicational hierarchy, such that the least marked type of relative clause is one with a relativized subject. Relativized objects are more marked, and other positions are yet more marked. In other words, on this kind of account, ORCs are expected to be harder than SRCs, though for a different reason from that advanced by the RM approach. Featural mismatches are not discussed in the context of the AH and would, presumably, not be expected to make a difference.
Previous research has not examined potential intervention effects in L2 comprehension of relative clauses. However, there has been some preliminary research on intervention effects on L2 production. A pilot study reported in Belletti and Guasti (2015) used an elicitation task to elicit relative clauses in spoken production of adult learners of Italian. Earlier studies with L1 acquirers of Italian as well as adult native speaker controls found that participants avoided producing ORCs while having no problems with SRCs (e.g. Contemori and Belletti, 2014). Instead, older children produced passives, as did adults. 2 In a passivized relative clause, intervention effects are avoided (compare the king who the boy pushed with the king who was pushed by the boy). Like the children acquiring Italian as L1 and the adult native speakers, advanced proficiency L2 learners of Italian were much more likely to produce a passivized relative clause than a non-passivized ORC, again avoiding intervention. However, this was not the case for intermediate proficiency learners who produced ORCs to a greater extent than passives; Belletti and Guasti suggest that this is because passives are harder to acquire.
On similar lines, Suzuki (2019) and Suzuki and Hirakawa (2019) look at relative clause production by adult Japanese-speaking learners of English, at a low proficiency level. Both studies elicited written production of relative clauses. As in the studies reported for Italian, relatively few problems were found in the production of SRCs by either group. In contrast, when ORCs were elicited, passives were produced instead of ORCs. Suzuki (2019) manipulated animacy as a feature, on the assumption that clauses in which the ORC head and the intervenor match in animacy (the boy / the girl) would be harder than those in which they are mismatched (the ball / the girl). 3 A higher level of production of passives was found in the cases of animacy match than mismatch. Suzuki and Hirakawa (2019), on the other hand, manipulated number; however, they found no effects for number match versus mismatch.
The above production studies on L2 Italian and L2 English report avoidance of ORCs, suggesting an intervention effect, although not always an effect of featural differences.
To summarize, intervention effects, relating to processing times, comprehension and production, have been reported in children and adults, L1 and L2 acquirers. ORCs cause more problems than SRCs; mismatched intervenors sometimes help to reduce problems caused by the presence of an intervenor.
3 Representation and processing
Before going on to present our experiment, it is important to discuss whether the reported problems with relative clauses reflect problems of linguistic representation, problems in online processing, or both. We assume that the parser and the grammar are distinct and that the parser draws on the grammar in online L2 processing (see, amongst others, Hopp, 2010; Omaki and Schultz, 2011; Phillips and Ehrenhofer, 2015). Thus, complex computations required by the grammar may translate into processing difficulties. Adult native speakers are accurate on ORCs whether intervenors are matched or mismatched, suggesting that ORCs in particular, and RM more generally, are part of the adult grammatical system. Nevertheless, adults sometimes experience processing delays, particularly in cases where the intervenor in an ORC matches the relative head in some feature(s). For children, on the other hand, the problem is arguably not simply a processing issue, since, at least for the younger children reported in the experiments above, accuracy on ORCs in offline tasks is around 50%, especially when features match.
Rizzi and colleagues (Belletti and Rizzi, 2013; Belletti et al., 2012; Friedmann et al., 2009; Rizzi, 2018; Villata et al., 2016) argue that it is not RM as such that is at issue. They propose a distinctness hierarchy, relating to the relationship between the target (X), in this case the relative clause head, and the intervenor (Z), in this case the subject of an ORC. They advance four possibilities: disjunction, intersection, inclusion and identity, defined as in (8). 4
(8) a. Disjunction: X and Z share no relevant features +R, +wh, . . . . . . +NP. . . . . . < +R, +wh >
5
whoever the boy pushed _ b. Intersection: X and Z share some relevant features but not all +R, +NP, –Pl . . . . . . +NP, +Pl . . . . . . < +R, +NP, –Pl > the king who the boys pushed _ c. Inclusion: The features of Z are properly included in the features of X +R, +NP . . . . . . +NP . . . . . . < +R, +NP > the king who the boy pushed _ d. Identity: X and Z share all relevant features *+R, +NP . . . . . . +R, +NP . . . . . . < +R, +NP > who who pushed _
Feature mismatch is a case of intersection, whereas feature match is a case of inclusion. Total identity of X and Z is never possible. In adult grammars, disjunction, intersection and inclusion are all possible, since adults produce and comprehend ORCs, regardless of feature match/mismatch. However, featural RM can result in a processing disadvantage for adults in cases of inclusion. Belletti and Rizzi (2013) suggest that syntactic computations involving inclusion are more complex than those involving non-inclusion (disjunction or intersection) and that this complexity difference shows up in online processing: cases involving inclusion take longer to parse. In other words, the adult grammar allows both inclusion and non-inclusion; processing speed in cases involving inclusion can nevertheless be affected by computational complexity.
Rizzi and colleagues propose that children have a different cut-off on the hierarchy in (8), such that inclusion is not permitted in their grammars. According to Friedmann et al. (2009: 84), ‘children adhere to a somewhat stricter version of RM, requiring not just a distinct featural specification of the target with respect to the intervenor, but imposing the stronger requirement of a disjoint specification.’ We take this to mean that the child’s representation differs from the adult’s. In the course of acquisition, children change from a more restrictive system allowing only disjunction or intersection to a less restrictive one. In fact, some of the researchers who have tested child participants have argued that, for older children at least, the problem is one of parsing rather than the underlying grammar (e.g. Adani et al., 2010). Given that older children (6+ years) in several of the experiments were relatively successful, they might be past the age of acquisition of structures implicating RM, their processing instead being affected in the same way as adults.
In the case of L2 acquisition, we assume that the grammars of adult learners, like adult native speakers, are constrained by RM. We hypothesize that featural differences between a relative head and an intervenor may prove advantageous in adult L2 parsing, as is the case in adult L1. There might, then, be differences in offline relative clause accuracy and online processing speed, both for SRCs versus ORCs and for mismatched versus matched items. Before turning to our experiment, we consider the extent to which featural RM effects might be derivable from the L1.
4 Crosslinguistic differences: Properties of relative clauses in Chinese
In the present study, we look at the acquisition of English relative clauses by native speakers of Mandarin Chinese. Chinese relative clauses differ from English in that they are prenominal rather than postnominal. This means that predictions concerning the relative ease or difficulty of SRCs versus ORCs are potentially different for the two languages, depending on whether linear or hierarchical effects are considered.
Consider the sentences in (9) and their corresponding structures in (10), from Suzuki and Hirakawa (2018). (9a) and (10a) represent SRCs, while (9b) and (10b) show ORCs. From a linear point of view, SRCs should be more difficult than ORCs, since nuhaizi (‘girl’) in (9a/10a) intervenes between the gap and the head noun, whereas nanhaizi (‘boy’) in (9b/10b) does not. This, then, is the opposite of the situation in English. However, if one considers hierarchical structure rather than linear order, then Chinese and English are alike: nuhaizi (‘girl’) in (9a/10a) does not c-command the gap, so does not intervene, whereas nanhaizi (‘boy’) in (9b/10b) does c-command it, so is a potential intervenor in terms of the definition of featural RM provided in (2). Researchers differ as to which type of relative clause is considered more difficult in Chinese. Mandarin speakers are argued to have greater difficulty with SRCs on some accounts (Chen et al., 2008; Gibson and Wu, 2013; Hsiao and Gibson, 2003) and ORCs on others (Hu et al., 2016a, 2016b; Lin and Bever, 2006). Hu and colleagues argue that a hierarchical account in terms of RM is implicated.
(9) a. [zhui nuhaizi de] nanhaizi (SRC) chase girl DE boy ‘The boy who chased the girl’ b. [nanhaizi zhui de] nuhaizi (ORC) boy chase DE girl ‘The girl who the boy chased’ (10) a. b.
Taking hierarchical structure into account, Chinese and English are more alike than they appear on the surface. This means that effects of L1 transfer cannot be ruled out: intervention arises in the case of ORCs in both languages. However, there is another relevant difference between the two languages, given that we will be examining the role of featural mismatches involving number. Unlike English, Chinese does not have obligatory number marking on nouns; in fact, number marking is quite restricted. This means that there is no equivalent mismatch effect and that any potential advantage for number-mismatched NPs in the L2 could not be derived directly from the L1.
II Experiment
An experiment was conducted to investigate the status of SRCs and ORCs in the comprehension of English relative clauses by Chinese speakers. We hypothesize that adult L2ers will be able to interpret SRCs and ORCs appropriately but that they will show intervention effects when parsing ORCs, particularly in cases of matching features.
1 Participants
Participants were tested in Montreal and Ottawa, Canada, and were compensated for their time. Proficiency was determined by means of a cloze test completed after the experiment; only people scoring at least at the intermediate level were retained for the study. 6 All participants were Mandarin-speaking learners of English, the majority from the People’s Republic of China. They were first immersed in English as (young) adults, aged 17 years or older, principally by moving to an English-speaking country (usually Canada). Many participants reported non-immersive childhood exposure to English, which is not uncommon in Chinese-speaking countries.
There were three groups: L2ers of intermediate proficiency (n = 13), L2ers of advanced proficiency (n = 26), and adult native speaker controls (n = 16). The intermediate group ranged in age from 19 to 56 years (median age 23 years). Their age of first exposure to English ranged from 5 to 14 years (median 10 years). Their age of immersion in English ranged from 17 to 46 years (median 19 years). The advanced group ranged in age from 18 to 29 years (median age 21 years). Their age of first exposure to English ranged from 3 to 13 years (median 6.5 years). Their age of immersion in English ranged from 17 to 26 years (median 18 years).
2 Pretest
A production task was included in order to determine whether plural marking was present in the interlanguage grammar, in other words, whether the L2ers would be sensitive to a number distinction between singular and plural in L2 English. As is well known, L2ers often omit inflectional morphology; nevertheless, many researchers have suggested that underlying features are present (e.g. Haznedar and Schwartz, 1997; Lardiere, 1998; Prévost and White, 2000). Although absence of overt plural marking does not necessarily indicate absence of number in the underlying grammar, we assume that appropriate suppliance of overt morphology is an indication that the relevant feature is represented.
Participants were given pictures to describe containing multiple items, designed to elicit plural count nouns. Accurate production of plural marking on count nouns in plural contexts was generally high, with a mean of 90% for the advanced group, and a mean of 78% for the intermediate group. We take this to indicate that number is represented in the grammar, and hence that it is reasonable to explore the role of featural mismatches in their interpretation of relative clauses. 7
3 Sentence types
The two main tasks (picture selection and self-paced reading) involved 24 test sentences and 12 fillers, presented in an order randomized for each participant, with the same wording used in the complex NPs (head nouns and relative clauses) in each task. Test items manipulated relative clause type (subject/object) and intervenor (number match/mismatch). In the picture selection task, the words Show me preceded the complex NP. In the self-paced reading task, the words I know preceded the complex NP.
All sentences were semantically reversible and contained only human NPs as heads and within the relative clause. There were 24 test sentences, which were divided into 6 sets, involving male and female characters. Within any particular sentence, the characters were either both male or both female. Note that, unlike the test items used in the L1 studies described above, there was only one cue to number, namely morphology on the noun. The verbs were in the past tense in order to avoid giving additional information on match/mismatch via verbal inflection. A full list of the test items is provided in the Appendices 1 and 2.
Each set of test items contained 4 sentences: an SRC match, an SRC mismatch, an ORC match and an ORC mismatch. Within each set, the 4 sentences used identical vocabulary. Every sentence was 8 words long (Show me / I know followed by the 6-word complex NP). All sets used two singular NPs (sg sg) for the matched sentences (see 11a, 11c, 12a and 12c). For the mismatched sentences, 3 of the sets used a singular plural NP (sg pl) order (see 11b and 11d) while 3 used a plural singular NP (pl sg) order (see 12b and 12d). 8 This made for a total of 6 SRC match sentences (sg sg), 6 SRC mismatch (3 sg pl and 3 pl sg), 6 ORC match (sg sg) and 6 ORC mismatch (3 sg pl and 3 pl sg).
(11) a. SRC match: . . . the king who pushed the boy. (sg sg) b. SRC mismatch: . . . the king who pushed the boys. (sg pl) c. ORC match: . . . the king who the boy pushed. (sg sg) d. ORC mismatch: . . . the king who the boys pushed. (sg pl) (12) a. SRC match: . . . the girl who hugged the queen. (sg sg) b. SRC mismatch: . . . the girls who hugged the queen. (pl sg) c. ORC match: . . . the girl who the queen hugged. (sg sg) d. ORC mismatch: . . . the girls who the queen hugged. (pl sg)
The 12 filler sentences (6 with singular subjects, 6 with plural) involved intransitive verbs in the relative clause, as shown in (13).
(13) a. Filler: . . . the queen(s) who slept. b. Filler: . . . the girl(s) who walked to school.
4 Tasks
The two main tasks (picture identification and self-paced reading) were administered via E-Prime 2.0 (Schneider et al., 2002). For the picture identification task, participants were presented with sentences, displayed on a laptop computer. Each sentence was accompanied by four pictures; participants had to press a key to indicate which picture illustrated the meaning of the sentence. This task was untimed. An example is shown in Figure 1.

Sample item from the picture identification task.
For the moving window self-paced reading task, each sentence was initially presented as a series of dashes with no words or punctuation visible. Participants pressed a key to uncover the first region of a sentence while the other regions remained covered. Pressing the key a second time uncovered the next region of the sentence, with the previous region replaced by dashes. Thus, participants read the entire sentence region by region, with only one region visible at any time. The regions in the self-paced reading task are shown in (14). Each sentence was followed by a comprehension question, requiring an answer of True or False (see 15). Reading times were recorded for each region and for the entire sentence.
(14) (15) a. I know the king who pushed the boy.
The boy pushed the king. (True/False)
b. I know the girls who hugged the queen. The girls hugged the queen. (True/False) c. I know the musician who the waiter touched.
The waiter touched the musician. (True/False)
d. I know the athlete who the doctors kicked.
The athlete kicked the doctors. (True/False)
SRCs and ORCs contained 5 regions. Region 4 is the point at which it becomes clear whether an SRC or an ORC is involved (see (14)). In the case of ORCs, region 4 is the critical region, namely the point at which it can be seen whether the features match or mismatch. For SRCs, the match/mismatch issue is only resolved at region 5. For the fillers, those with PPs contained 5 regions, while those without PPs contained 4. All test items ended with a period, which was a separate region from the rest of the text. 9
To prevent effects of task order or familiarization with test items, order of presentation of the picture identification and self-paced reading tasks was alternated, with the production pretest always taken between them. 10 In total, the three tasks took around 40 minutes to complete.
5 Predictions
As discussed above, we assume that adult L2 learners are constrained by RM as far as their grammatical knowledge is concerned, and that they permit disjunction, intersection and inclusion. In other words, we predict accuracy in both types of relative clauses (SRCs and ORCs) on both tasks (picture identification and self-paced reading). We also hypothesize that adult L2ers may experience processing difficulties in the case of ORCs and that such effects will be reduced if the intervenor differs in number features from the relative head, in accordance with featural RM. Thus, as far as the self-paced reading task is concerned, response times to ORCs are expected to be slower than to SRCs; faster response times are expected for ORCs with mismatched intervenors than with matched intervenors. In general, we expect L2ers to behave similarly to adult native speakers, revealing possible processing effects. Our predictions are summarized in (16).
(16) a. All groups will show accuracy on SRCs and ORCs in both tasks, regardless of match or mismatch. b. Response times will be faster to SRCs than ORCs for all groups. c. Response times will be faster to mismatched items in ORCs than to matched items for all groups, whereas no such difference is expected for SRCs.
III Results
1 Accuracy
We first consider the results relating to accuracy in interpreting SRCs and ORCs on the picture identification task and the self-paced reading task. 11 In the picture identification task, accuracy was determined by the choice of pictures. In the self-paced reading task, accuracy was determined by means of responses to the comprehension questions that followed the stimuli. On each task, participants received a score of 1 for each accurate response and 0 for inaccurate. Accuracy on the two tasks was modeled with logistic regressions (one per group) in R (R Core Team, 2019), using the glmer() function in the lme4 package (Bates et al., 2015), which implements contrast coding by default.
The statistical models contained the maximal number of random effects possible. For the picture identification task, the maximal converging model for all groups includes Type of RC and Match as main effects, as well as a by-participant random intercept. For the self-paced reading task, the maximal converging models include Type of RC and Match as main effects, and by-participant and by-item random intercepts. All models were compared with equivalent models including an interaction between Type of RC and Match via ANOVAs (likelihood ratio tests). As there is no significant statistical difference between models with and without interaction, 12 we report the results for the models without interaction.
Results from the picture identification task show that all groups interpret relative clauses appropriately, with a high level of accuracy, as shown in Figure 2. Across both the L1 and L2 groups, inaccurate responses were largely for ORC sentences, with about half the errors resulting from misinterpreting NP number, and the other half resulting from misinterpreting the ORC as an SRC. These results suggest that all groups are overall more accurate with SRCs than ORCs, and that the L2 learners are less accurate with mismatched sentences. The models confirm that the intermediate learners are significantly more accurate with SRCs than ORCs (

Accuracy on the picture identification task.
On the self-paced reading task, overall accuracy was also high, as shown in Figure 3. Inaccurate responses for both native speakers and L2ers again largely consisted of interpreting ORCs as if they were SRCs, rather than interpreting SRCs as ORCs. Examination of the proportions in Figure 3 suggests that both intermediates and native speakers are more accurate with SRCs than ORCs, while advanced speakers perform similarly with both. The models confirm that intermediates are significantly more accurate on SRCs (

Accuracy on the self-paced reading task.
2 Reading times
We turn now to reading times, tested only on the self-paced reading task. Items are retained for the analysis only if the answers to the comprehension questions were accurate; outliers (more than 2 SDs above/below the mean response times for any individual participant) have also been removed. Out of a total of 1,980 responses, 191 have been removed (86 outliers and 105 wrong answers), which corresponds to 9.6% of the data. Response times (RTs) were modeled via linear regressions (one per group) in R using the lmer() function in the lme4 package; p-values were obtained with the lmerTest package (Kuznetsova et al., 2014). Response time data were log-transformed in the statistical models.
Total RTs (as measured at the end of the sentence, excluding the period) and RTs at specific regions in the target sentences were modeled separately. As in the accuracy models, the RT models contained the maximal number of random effects possible. The maximal converging models for Total RTs included Type of RC and Match as main effects, as well as by-participant and by-item random intercepts. Like the accuracy results, the Total RT models were also run with an interaction between Type of RC and Match. ANOVAs (likelihood ratio tests) comparing models with and without interaction indicate no statistical difference between them. 14 We thus report the results for the models without interaction in this section.
Regarding RTs at specific regions in the sentence, we examined the effect of Match for ORC and SRC in regions where mismatch is observed: region 4 for ORC and region 5 for SRC (see (14)). We also examined the effect of Match for ORC at region 5, the spillover region. The maximal converging models for regions 4 and 5 included Match as main effect and a by-participant random intercept. (Since the material in regions 4 and 5 differs for SRCs versus ORCs (see (14), it is not possible to compare these RC types at the same region.)
Figure 4 shows the results for Total RTs, which suggest that responses overall are faster with SRCs than ORCs, with no effect of Match. The models for Total RTs indicate that neither Type of RC (

Total response times in the self-paced reading task.
Figure 5 shows participants’ RTs for Match at region 4, for ORC sentences only. Region 4 is the critical region to examine intervention effects in ORCs, as this is the region where NP2 is encountered. Examination of the bars in the figure suggests that learners, particularly those in the intermediate group, are faster with sentences where the two NPs match in number. This is contrary to expectation, as an advantage for the mismatched conditions was predicted; see (16c). The statistical models for Match with ORCs at region 4 confirm this: intermediates are significantly slower with mismatched sentences (

Response times for object relative clauses (ORCs) at region 4.
Participants’ RTs at region 5 for ORCs were also examined, to check whether the effect found for Match in the critical region is also observed in the spillover region. The statistical models indicate that Match is not significant at region 5 for any of the groups (intermediates:
We also probed whether the same effect of Match is found for SRCs at the region where NP2 appears, namely, region 5. Match is not statistically significant for any of the groups (intermediates:
The possibility arises that the (unexpected) results found for Match in ORCs at region 4 might not be the result of intervention effects, but, rather, a more general effect of number. Previous studies have obtained different responses for singular and plural NPs, regardless of the match/mismatch issue. For example, Contemori and Marinis (2014) found that the head noun of the relative clause (NP1) was processed faster if it was plural than if it was singular.
To determine whether this might be the case for our participants, we looked at RTs at the point where NP1 is encountered (region 2). We examined all sentence types together (ORCs, SRCs and fillers), as all of the sentences have the same profile at this point (I know the N who . . .). Figure 6 shows RTs for singular and plural NPs at region 2. Each group’s responses were modeled with a linear regression including Number as main effect as well as a by-participant random intercept. Number at region 2 is not statistically significant for any of the groups (intermediates:

Response times for singular vs. plural NPs at region 2.
In summary, the results concerning SRCs and ORCs largely confirm the hypotheses, in that participants were generally accurate on both relative clause types, with the intermediate group and the native speakers being significantly more accurate on SRCs in both tasks. Similarly, SRCs were processed faster than ORCs by the advanced group and the native speakers. The results relating to performance on matched versus mismatched ORCs, on the other hand, are contrary to what was predicted, with matched items responded to faster than mismatched by both learner groups, at least at region 4, an effect which disappeared at region 5. The implications of the results are discussed in the next section.
IV Discussion
To summarize the results, as far as differences between SRCs and ORCs are concerned, we have found evidence for intervention effects in both tasks, as indicated by accuracy results in the picture identification and self-paced reading tasks, as well as by response times on the self-paced reading task, all of which show an advantage for SRCs over ORCs. Consistent with prediction (16a), all groups were very accurate in interpreting relative clauses of both types, suggesting that SRCs and ORCs are appropriately represented and that adult L2ers observe all aspects of the distinctness hierarchy in (8) in the same way as adult native speakers, in other words permitting both types of relative clauses. Recall that in many of the studies on child L1 acquirers, discussed above, children performed at only around 50% accuracy on ORCs involving inclusion. It is noteworthy that the adult L2ers did not show any such difficulty. Nevertheless, accuracy on SRCs was significantly higher than ORCs in almost all cases, indicating a processing effect: parsing of ORCs is computationally more complex than SRCs, in that the relative head has to be retained in working memory longer before resolution is possible (see, amongst many others, Belletti and Rizzi, 2013; Gordon et al., 2001). This additional complexity leads to occasional inaccuracies in interpreting ORCs, for native speakers and L2ers alike. 15 Consistent with prediction (16b), response times to ORCs were significantly slower than to SRCs for the advanced L2ers and the native speakers, again suggesting a processing effect when intervention is involved. However, the effect was not significant for the intermediate group.
The status of featural differences, on the other hand, is less clear. The feature we have manipulated is number. Recall the claim that ORCs where the intervenor and head mismatch in features, as in (4b), will be easier to acquire/process than cases where there is a match, as in (4a). However, we obtained the opposite result, with match showing an advantage over mismatch, or no difference between them. As far as accuracy is concerned, on the picture identification task, L2 participants were significantly less accurate on stimuli involving mismatched features, although there was no effect on accuracy in the self-paced reading task. As far as processing is concerned, prediction (16c) was that mismatched features would give a processing advantage; in other words, that participants would respond faster to ORCs where the intervenor mismatched the head in number (sg pl or pl sg versus sg sg). In fact, RTs at the critical region (region 4) showed that mismatched items were responded to more slowly by both L2 groups. It is well known that L2 participants are often delayed in giving expected responses. At region 5, the spillover region, the mismatched cases no longer resulted in longer RTs. At best this provides weak support for the idea that the mismatch was in fact having a beneficial effect. (Gordon et al. (2004) make a similar argument involving the disappearance of an effect to motivate advantages for mismatched NPs.) However, we acknowledge a problem with our stimuli in that the sentences involving SRCs did not include a spillover region. In future research, it would be desirable to add a phrase (such as last week) to both SRC and ORC stimuli, in order to control more precisely for potential spillover effects.
As for the advantage for matched items in ORCs (accuracy in the picture identification task, RTs at region 4 in the self-paced reading task), it is possible that the L2ers pay more attention to – and take more time over – a number mismatch, as their L1 does not have obligatory number marking on nouns. In other words, the L1 is indirectly implicated after all, not in the sense that L2ers fail to represent number, but rather in the sense that overt number stands out for them, leading to minor inaccuracies or delays. Consequently, we might expect to see greater inaccuracies or delays in pl pl sentences due to overt number on both the NPs. However, such sentences were not included in our stimuli. In future research, it would be prudent to use all four possible combinations of sg sg, sg pl, pl sg and pl pl.
As far as RTs are concerned, since no effect of Number on NP1 was found, the significant difference in learners’ RTs for match and mismatch at region 4 does not seem to be a consequence of taking longer to process singular or plural as such. It is possible that, once presented with a mismatch, L2ers scan the sentences again in their minds (since the earlier parts of the sentence are not visible on the screen), which results in longer RTs at the region where the mismatch is observed. In order to determine whether absence of inflection in the L1 is playing a role here, it would be advisable to add L2ers whose L1 has richer inflection; such L2ers might be expected to show the predicted benefit for mismatched cases. 16
The absence of a mismatch effect is not in principle problematic. According to the hierarchy in (8), disjunction, intersection and inclusion are all permitted in adult grammars. In other words, it is not the case that effects of featural differences will inevitably be found. Rather, if featural differences are found, they are expected to favour the mismatched cases. No difference between matched and mismatched items was in fact observed for the native speakers on all measures, for the advanced L2ers on most measures, and for the intermediates on some measures. An issue, then, is why we obtained no effect for certain groups on certain measures whereas others have reported mismatch effects. We note, however, that there are other studies which have failed to find a mismatch effect for number. Using an online listening task, Contemori and Marinis (2014) found an effect for number (in that plural marking speeded up processing) but not for match vs. mismatch in the response times of English-speaking children and adults. As far as L2 learners are concerned, Suzuki and Hirakawa (2019) found no effects for number match versus mismatch in elicited production of relative clauses, which were in fact replaced by passives regardless of match/mismatch, thus circumventing the intervention problem.
Several studies on L1 acquisition, on the other hand, do find an effect for number (e.g. Adani et al., 2010, 2014). As noted above, in these studies, there were two overt cues to the match or mismatch, namely number marking on the noun as well as verbal agreement, as shown in (6). It is therefore not possible to determine whether a mismatch on the intervenor alone would have had an effect or whether what was observed was not solely an RM effect. The only study to address this issue experimentally is Bentea and Durrleman (2017), who manipulated presence or absence of inflection on the verb in French. They found that children aged 6–8 years showed an advantage in mismatched ORCs in French, regardless of presence or absence of overtly pronounced verbal inflection, whereas 5 year olds showed no effect for mismatch. Number marking on NPs in French is pronounced only on the article; it could be that the difference between le and les is more salient (at least for the older children) than the difference between singular and plural inflection on a noun, as is the case for English. 17
In order to determine the extent to which overt marking of inflection on both noun and verb explains the results appearing to support featural RM in L1 acquisition, it is important to replicate these studies using verb forms that do not indicate number, such as past tense verbs in English, as used in our study. Additionally, our experiment could be rerun with stimuli in the present tense, so that number distinctions are realized on the verb as well as the NPs involved. It remains, then, an open question as to whether the effects of featural RM have in fact been truly tested in many previous studies, since the verb is not a potential filler for the gap but nevertheless provides an extra indication of differences between matched and mismatched items. We suspect that double marking of inflection is in fact necessary (but not sufficient; see Contemori and Marinis (2014)) for the mismatch effect to show up as far as language learners (L1 or L2) are concerned but this remains to be investigated.
In conclusion, we have found differences between SRCs and ORCs in native speaker and L2 learner performance, and we have suggested that these are consistent with featural RM and are attributable to processing rather than reflecting a representational problem. Differences between SRCs and ORCs would also be consistent with other theories, such as the Noun Phrase Accessibility Hierarchy (Keenan and Comrie, 1977) or the similarity-based interference approach of Gordon and colleagues (Gordon et al., 2001, 2004), amongst others. Many L2 researchers have proposed that working memory capacity impinges on L2 performance (e.g. Cunnings, 2017; Hopp, 2010). An account in terms of working memory would also expect better performance on SRCs than ORCs.
Belletti and Rizzi (2013) point out that the featural RM account makes different predictions from more general accounts attributing problems to working memory. In particular, the RM account expects difficulties under very specific circumstances, involving certain kinds of morphosyntactic features. An account in terms of working memory would not, presumably, expect differences between disjunction, intersection and inclusion; that is, it would not predict performance differences depending on the properties of certain morphosyntactic features.
In other words, the more crucial test case for featural RM involves match versus mismatch of features and here our results are contrary (match better than mismatch) or neutral (no effect of mismatch). Even though we did not get the expected mismatch effect, we did get an effect (match preferred) in certain cases, which is also presumably not consistent with a general working memory account. We reiterate the point that featural RM does not in fact predict that featural differences must result in performance differences; rather it anticipates that if there are differences they will be in one direction rather than the other. We have suggested that some of the earlier research that has reported effects in the predicted direction for number in L1 acquirers may be attributable to the kinds of sentences that were used as stimuli, where number marking was not restricted to the intervenor. This is an issue that clearly requires additional research on Italian and English, along the lines initiated by Bentea and Durrleman (2017) for French.
Footnotes
Appendix
Stimuli: Fillers – Intransitives.
| NP1 | Sentence |
|---|---|
| sg | Show me / I know the chef who laughed. |
| sg | Show me / I know the queen who slept. |
| sg | Show me / I know the soldier who cried. |
| sg | Show me / I know the girl who walked to school. |
| sg | Show me / I know the boy who ran from the hospital. |
| sg | Show me / I know the athlete who climbed up the stairs. |
| pl | Show me / I know the chefs who laughed. |
| pl | Show me / I know the queens who slept. |
| pl | Show me / I know the soldiers who cried. |
| pl | Show me / I know the girls who walked to school. |
| pl | Show me / I know the boys who ran from the hospital. |
| pl | Show me / I know the athletes who climbed up the stairs. |
Acknowledgements
We would like to thank the Second Language Research reviewers for their helpful comments, as well as the audiences at EUROSLA 2018, GALANA 2018 and SLRF 2018. Thanks to Erica Brandao for assistance with recruitment and testing and to Guilherme Garcia for advice on the statistical analyses.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by research grant #2016-SE-188196 from Le Fonds de Recherche du Québec – Société et Culture (to Lydia White, PI) and by the ARIA program of the Faculty of Arts, McGill University (to Vera Xia).
