Abstract
Considerable reason exists to view the mind, and language within it, as modular, and this view has an important place in research and theory in second language acquisition (SLA) and beyond. But it has had very little impact on the study of working memory and its role in SLA. This article considers the need for modular study of working memory, looking at the state of common approaches to the subject and the evidence for modularity, and then considering what working memory should look like in a modular mind. It then sketches a research program to explore working memory within a modular mind and particularly its role in SLA. This is followed by a brief look at the way that the Modular Online Growth and Use of Language (MOGUL) approach can serve as a framework for such a program.
I Introduction
Working memory (WM) is the means of making information temporarily available to processes that require it. Serious development began with Baddeley and Hitch (1974), followed by the very extensive work of Baddeley (e.g. 1986, 2007, 2012) and countless others, mostly within the Baddeley and Hitch approach. This approach is probably being eclipsed now, at least among specialists, by ‘state-based’ theories (see especially D’Esposito and Postle, 2015). A common feature of both these approaches is the assumption that the mind is largely homogeneous, i.e. non-modular. This assumption has carried over into work in second language acquisition (SLA), where the role of WM has become a popular research topic (e.g. Juffs and Harrington, 2011; Linck et al., 2014; Szmalec et al., 2013; Wen et al., 2015; Williams, 2012). As this assumption is contrary to a great deal of thinking in SLA and beyond, a need exists for an alternative approach to studying WM in SLA, one that takes modularity seriously.
I will first briefly review current WM theories and modularity, emphasizing modularity of language, and consider the nature of WM in a modular mind. This is followed by discussion of what a modularity-based research program would look like and a brief look at the Modular Online Growth and Use of Language (MOGUL) framework as a promising framework for such a program.
II Theories of working memory
1 The Baddeley and Hitch model
The Baddeley and Hitch model initially hypothesized three components: a visuospatial sketchpad, a phonological loop, and a central executive directing attention and providing storage of a nebulous sort. Baddeley subsequently reduced the executive to its attention function and added an episodic buffer as a place where everything comes together to form a coherent conscious episode. My concern is with the sketchpad and the loop.
The sketchpad is not a genuine entity. At one point, Baddeley and colleagues described ‘visuospatial’ as a ‘blanket term’ covering any short-term memory that is not verbal (Della Sala et al., 1999). He concluded long ago that visual and spatial WM are distinct in terms of both storage and the processes that manipulate the stored representations (see Baddeley, 2007); in other words, two distinct modules are involved. More recently, he tentatively incorporated haptic (including kinesthetic and tactile) information in the ‘visuospatial sketchpad’, but research indicates that this is in fact a distinct WM domain (e.g. Katus et al., 2015; Pasternak and Greenlee, 2005).
The notion of a unified ‘phonological loop’ is also problematic. Considerable evidence exists that we must posit both a phonological store and an auditory store to explain the phenomena for which the loop was hypothesized (see especially Cowan, 2005). Penney’s (1989) review emphasized the observed importance of modality effects in short-term recall of verbal stimuli: Visually and auditorily presented information are both encoded phonologically, but recall is strongly influenced by the original modality.
Many domains are left out of this picture. Recently, Baddeley (2007, 2012; Baddeley et al., 2011) tentatively incorporated music, lip-reading, sign language, and ‘sound’ in the loop (in addition to haptic in the sketchpad), but he was very tentative and offered little or no evidence that a shared WM exists.
These considerations all point to a more fractionated WM, including at least visual, auditory, haptic, spatial, and phonological, and possibly a great many more. In other words, they point in the direction of a genuinely modular conception of WM.
2 State-based theories of WM
State-based theories reject the idea of WM as a location or a store, in favor of an activation approach: Working memory is the set of items in long-term memory (LTM) that are currently active. The central theme for research is thus the nature of the activation, with attention taken as its primary source. These theories abandon the distinctions among the various components of the Baddeley and Hitch model and are thus in a sense even less modular. But they are actually very amenable to a modular interpretation. Cowan et al. (2014) concluded from their experiments that, contrary to their previous view, most of working memory capacity (WMC) is peripheral; i.e. domain-specific, available only for processing within a given domain. This is a large step in the direction of a modular view of WM.
III Modularity
1 In general
Modularity can mean different things, but the central idea is functional specificity (Barrett and Kurzban, 2006; Coltheart, 1999; Pinker 1997). If distinct mental functions are carried out by distinct units, the mind is modular. The logic of a modular design is that a system composed of specialist sub-systems can operate more accurately and efficiently than a general-purpose system, and that damage or change to one function has limited effect on others. Carruthers (2006) argued, for these and other reasons, that a modular design makes more sense from an evolutionary perspective than does a more homogeneous system.
Some clarifications are needed. First, modularity does not imply absence of interaction, just constrained interaction. Second, neural plasticity is not problematic. The innate portion of a module includes only the basic processor and the primitives it works with. The overwhelming majority of a module’s neural range is likely to be connections among these primitives, constituting new representations formed through experience, guided by the processor. Finally, modularity does not assume a one-to-one mapping between genes and features of the module; the brain’s development is far more complex and interesting (Marcus, 2004). One question that I will leave open for the general case (see below for language) is how many modules exist and how fine-grained the modularity is.
The modular approach is closely related to the popular dual-processes idea (Evans, 2008; Kahneman, 2011). On this view, the mind includes two kinds of processes: specialist (S1) and non-specialist (S2), the latter able to perform much the same functions as the former but typically less efficiently and successfully. Workings of the non-specialists, but not the specialists, are strongly associated with consciousness. The specialists can be identified with modules while the non-specialists can be understood either as a general, non-modular system or as interactions among modules.
Much of the motivation for a modular approach comes from cognitive neuroscience, where modularity represents standard thinking (e.g. Bertolero et al., 2015; Dehaene, 2011; Gazzaniga, 2011; but see also Barrett and Satpute, 2013). Bertolero et al. produced extensive evidence for a close correspondence between function and location in the brain and cited genetic evidence consistent with this conclusion. Neural research has found that LTM is functionally fragmented, its visual aspects in visual areas, motor aspects in motor areas, etc. (e.g. Fuster, 2009; Jonides et al., 2008; Rissman and Wagner, 2012). If WM is activation of LTM representations, it should show a parallel fragmentation: it should be modular. And neural research suggests that it is. WM tasks involving a given function specifically activate areas associated with that function (e.g. D’Esposito and Postle, 2015; Druzgal and D’Esposito, 2001; Zaksas et al., 2001).
2 Modularity and language
The modularity of language is highly controversial (critics include, amongst others, Fedorenko et al., 2006; Goldberg, 2009). The case for modularity starts with the poverty of the stimulus: Given the extreme complexity of natural language, the limited and sometimes misleading input, and the generally limited learning ability of small children, the universal success of acquisition is mysterious – unless we hypothesize a substantial innate foundation, universal grammar (UG). Children’s success is more striking in that it occurs in widely differing circumstances, involving both cultural differences and various conditions that should be expected to get in the way (see Bishop and Mogford, 1988), and even in children whose general learning ability (intelligence) is extremely weak (e.g. Flavell et al., 1993; Smith and Tsimpli, 1995). Further evidence comes from deaf children’s acquisition of sign languages, which shows striking parallels with acquisition of spoken language (Goldin-Meadow, 2003; Lust, 2006). In extreme cases children receive input from an impoverished version of the language, due to limitations of the speakers or signers around them, and go beyond this input, developing a full grammatical system. The success enjoyed by nearly all children under nearly all conditions contrasts with the problems experienced by a small population of otherwise normal children, even under the most favorable conditions: problems that have a clear genetic basis (Bishop, 2006). All these considerations point to an innate, domain-specific basis for language, UG.
This leaves the question of whether UG is involved in SLA. The standard argument for non-involvement is that learners typically have limited success. But this observation can be explained in any number of ways, including such factors as motivation, context, and cognitive maturity, and interactions among them. Perhaps more importantly, the existence of an established language inevitably influences processing and therefore learning of the L2. The high level of success achieved by many learners argues against any fundamental change from first to second language acquisition. Continuity is also the more parsimonious hypothesis.
IV Working memory in the modular mind
Modularity of mind has important implications for how information becomes available to cognitive processes, i.e. for WM. Each set of processes (module) is largely independent of the others, using information in a format largely useless for the others. The natural conclusion is that each module has a distinct WM, making relevant information available to the processor of that module, in the encoding of that module. For the case of language, I will follow the conclusions of Jackendoff (1997, 2002; also Sharwood Smith and Truscott, 2014) that several modules are involved, including phonological and morphosyntactic for specifically linguistic processing and conceptual, motor, affective, and perceptual for associated functions.
But information availability – i.e. WM – must be seen more broadly as well, because the information can consist of the combined products of multiple modules. Perceptual information in particular constitutes the general input to the system, including linguistic modules, and so must be very widely available. This availability should be in the form of a unified picture of the world, the kind of picture that makes up our conscious experience; compare the ‘images’ of Damasio (2010), and ‘scenes’ of Edelman (2004). As this is a matter of making information available to processors, it is a form of WM, very close in fact to the sense of ‘working memory’ that guides most research, in that it is domain general, conscious, and effortful. Neurally, it can be considered a synchronization of activity in the various perceptual systems and beyond, as consciousness is commonly understood (see Truscott, 2015a, 2015b). This global form of WM is crucial for SLA because it represents the input to learning (I will consider an additional function below).
V A research program
In this section I sketch a program for investigating modular WM and its role in SLA. A research program of this sort requires constant interaction of theoretical and empirical work. Partly for this reason, it cannot provide simple prescriptions for doing experiments, only preliminary ideas. It includes study of modularity in general and study of both local and global WM, as described above, particularly regarding their role in SLA.
1 Studying modularity
Studying modularity means first identifying the modules that make up the mind. Little disagreement exists regarding the modular character of the various sensory modalities. The modularity of spatial functions also faces little controversy, at least in the context of working memory. Other functions, including language, are more controversial. For understanding modularity specifically in relation to language, Jackendoff (1997, 2002) is probably the best starting point.
One general criterion for judging modularity is evolutionary justification: A proposed module should serve a function that is significant in natural selection and could plausibly have evolved. The extensive literature in evolutionary psychology (e.g. Cosmides and Tooby, 2013; Pinker, 1997) has much to say on these matters. Behavioral evidence plays a key role as well, particularly dual-task research looking at degree of interference between tasks carried out by two putative modules. No less important is neural work, identifying brain locations that are especially active in particular types of tasks.
2 Studying modular WMs
Within a modular conception of the mind, the function of making information available to cognitive processes implies that a distinct WM exists for each module, and this must be taken into consideration in empirical research and its interpretation. For the case of language, this includes modules that are specifically linguistic and others that process language-related information. An important aspect of a research program is thus study of the individual WMs for each of these modules, along with the interactions among them.
Probably the most important tool for this job is dual-task research, more or less as typically carried out but with a broader scope, recognizing all the modules that are involved. The published literature offers some highly sophisticated methodology (e.g. Cowan et al., 2014), which is fully applicable to modular SLA research. The main adjustment is that task design and selection should be based on hypothesized modules, with interpretation of results following. A ‘pure’ task relying on just one WM is unlikely, so comparison between results of related tasks is essential. The load on each putative WM can also be varied and resulting effects on performance observed, again as in familiar WM research. A challenge for task selection is that a single task can often be carried out in more than one way, as described above.
Neural research is also essential, looking for activation patterns occurring during WM tasks in regions that have been identified with particular modular functions. A crucial cautionary note for neural research and its interpretation is that there can be no simple equation of cognitive activation (WM) with neural activation. The former is simply the availability of a representation to its processor. Neurally, this availability can be in the form of sustained activation but can also be a transient change in relevant synapses, establishing a state of increased readiness for activation: a ‘quiescent working memory’ (Fusi, 2008; Mongillo et al., 2008). Thus, the absence of sustained (neural) activation does not imply that a representation is not cognitively active or is not ‘in working memory’. Here again task selection is crucial, and potentially challenging.
3 Studying global WM
Global WM is, again, essentially what is normally treated as WM. Distinguishing it from local WMs is essentially the same issue as distinguishing central and peripheral capacities, a prominent theme in state-based WM research (e.g. Cowan et al., 2014). The sophisticated methodology developed there is thus available for study of global working memory (GWM). But it should be applied with awareness of the distinct roles played by the various modules, including perceptual, phonological, syntactic, and conceptual. One important caution comes from the observation, based on Carroll (1999), that the input to learning (the content of GWM) is not a sentence or an utterance but rather a perceptual representation.
Neurally, GWM is realized as broad patterns of synchronized activity over wide areas of the brain. Study of these patterns is again a common part of current WM research and so the methodology and interpretation tools are readily available. In the modular framework these broad patterns should include local patterns representing activity of the individual modules. This then is another way to approach the issues of local and global WM.
Study of individual differences in working memory capacity, as they are commonly studied, is primarily about global WM, as measurement of WMC involves deliberate, explicit tasks. But the role of individual modules and their WMs should be considered. Whether individual differences exist in linguistic WMs is unclear and constitutes another worthwhile research topic.
In state-based theories, attention is usually taken as the source of the domain-general activation that constitutes WM. So the vast literature on the topic is useful here. The concept of attention has its shortcomings, though (see Sharwood Smith and Truscott, 2014), so it may be better to focus directly on elements that determine what is attended: value, goals, context (as internally represented), and sensory input (i.e. active representations in the perceptual modules).
4 Studying the two types of linguistic knowledge and processing
The proposed WM research program connects to one of the most important and difficult topics in SLA, and beyond: the long-recognized distinction between two types of linguistic knowledge and use (e.g. R Ellis, 1988; Krashen, 1981; Lightbown, 1985; Long, 1977; Schwartz, 1986), its most popular current form being the implicit–explicit distinction (e.g. N Ellis, 2005). In each case the primary type is unconscious and automatic, the other largely conscious and playing a supporting role (compare the specialist–nonspecialist contrast).
The modular framework offers a useful way to understand and study this crucial distinction. The linguistic modules constitute the primary seat of (unconscious) linguistic knowledge, while the more conscious variety is conceptual, i.e. found in the conceptual module (see Truscott, 2015a, 2015b). We should expect the work of specialist modules performing their own function to be automatic; if knowledge outside these modules is to influence language-related activity, this use is likely to require GWM involvement. The role of local and global here can be studied, again, with familiar WM methods, notably dual-task studies and neural research. The main issue is task design.
Addressing this issue means recognizing the wealth of relevant findings in the SLA literature. The most important but perhaps least recognized finding of Norris and Ortega’s (2000) seminal meta-analysis on the effects of formal instruction was that very different results come from measurements of declarative (conceptual) linguistic knowledge and measurements that discourage the use of such knowledge, measuring instead the unconscious workings of the linguistic modules. This point can also be seen in meta-analytic reviews on the effects of error correction (e.g. Kang and Han, 2015; Li, 2010; Truscott, 2007; for discussion, see Truscott, 2016). These observations provide useful guidelines for task design. Their limits must be recognized, though. Many types of metalinguistic tasks can be carried out by intuitions, possibly based on knowledge in the linguistic modules. Conceptual knowledge can be used to some extent in spontaneous production, in the form of either deliberate focus on explicit knowledge or use of automatized conceptual knowledge.
This last point raises another significant research topic within the modular approach. While the distinction between modular and conceptual linguistic knowledge is closely associated with the standard implicit–explicit distinction, the two cannot be equated. Truscott (2015a, 2015b) offered an account of the distinction between implicit and explicit (and subliminal) in a modular perspective, providing a basis for study of this issue. The hypothesis is that knowledge/processing in the specialized linguistic modules is entirely implicit while its conceptual counterpart is typically explicit, consistent with Williams’ (2012) observation that SLA research has typically found explicit but not implicit knowledge and learning to be associated with WMC (which, again, is largely about GWM). This hypothesis also fits well with findings that the strongest relation between WMC and success occurs in the early stages of learning, especially in formal classroom learning (Gathercole, 2006; Hummel, 2009; Masoura and Gathercole, 2005; Winke, 2005). This association is readily explained by the use of conceptual knowledge, and therefore GWM, as a substitute for poorly developed modular knowledge in the early stages.
Probably the most important implication of this discussion is that we have to recognize, again, that two distinct kinds of processes are at work in language processing. In practical terms, this means that research requires a more refined and more theoretically based notion of task type than is often assumed and, no less important, greater concern with the implications that are to be drawn from experimental findings.
VI MOGUL as a useful framework for a modular research program
A research program has the greatest potential if carried out within an appropriate, established framework. The MOGUL framework (Sharwood Smith and Truscott, 2014) is a good candidate for this role.
First, the framework offers a modular architecture, based on Jackendoff (1997, 2002) but with an eye to SLA. It hypothesizes two specifically linguistic modules, syntactic structures and phonological structures, and additional modules that are involved in language but are not specifically linguistic, including conceptual, affective, motor, and the various perceptual modules. As such, it provides a natural way to study the distinction between modular and conceptual linguistic knowledge. Processing consists of innately specified processors within each module constructing representations, according to their in-built rules, from the representations currently active in their stores.
Activation is thus central in the framework. I suggested above that the heart of WM is the mechanisms that activate representations and maintain that activation, and that these mechanisms are about value, goals, and context. Each of these has an established place in the MOGUL framework. Global WM is also strongly associated with consciousness, an association that is straightforwardly accommodated in MOGUL. The same means (synchronization of activity of perceptual WMs) explains the availability of unified perceptual experience (global WM) and its association with consciousness. Finally, the framework has received extensive application to SLA, facilitating establishment of connections between WM research and other areas of SLA.
VII Conclusion
A need exists for serious exploration of working memory from a modular perspective and of the implications for SLA. I have offered some discussion of what a research program of this sort should look like and suggested that the MOGUL approach provides a useful framework for it. This discussion is of course preliminary and in some respects sketchy, requiring a great deal of further development.
Footnotes
Declaration of conflicting interest
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research benefited from a grant (
