Abstract
The experiment reported here tests the Lexical Orthography Hypothesis, that is, the notion that the output of the lexical phonology is the most promising phonological depth for an exhaustive representation of tone by means of diacritics in the orthography of a tone language. We conducted a controlled classroom experiment with 97 secondary school pupils learning written Kabiye, a Gur language of northern Togo. After testing their baseline skills in writing the standard orthography, the pupils participated in an eleven-hour transition course spread over three weeks in four parallel groups:
1 Introduction
Throughout the developing world, there are thousands of languages that do not yet have a written form. In many of these, tone plays a contrastive role in the lexicon and/or the grammar. The World Atlas of Language Structures plots not only a dense concentration of tone languages in sub-Saharan Africa and South-East Asia, but also a broad scatter worldwide, including North, Central, and South America, the Indian sub-continent, China, Papua New Guinea, and Oceania (Maddieson, 2013). By some estimates as many as 60–70% of the world’s languages are tonal (Yip, 2002, p. 1). Moreover, their geographical distribution loosely coincides with the very areas of the world in which mother-tongue illiteracy is most endemic. So orthography developers in these locations are frequently faced with the challenge of how to represent tone in writing.
In the course of the twentieth century, numerous strategies were developed for marking tone in newly developed orthographies, including word-final silent letters in Romanizations of Asian languages (e.g., Heimbach, 1969), word-initial and -final punctuation in Ivory Coast (e.g., Bolli, 1978, 1991), and superscript numbers in Mexico (e.g., Rupp & Rupp, 1996). But the classic, widespread convention for marking tone orthographically, particularly in sub-Saharan Africa, is the use of diacritics, and that method is the focus of this study.
The choice of symbolization entails an equally important but perhaps less obvious choice about which orthographic depth is most suitable: shallow or deep? The relative depth of different orthographies has been a key concept in the linguistics of writing ever since it was first introduced by Katz and Feldman (1981, 1983) and it remains the focus of ongoing research (e.g., Benuck & Peverly, 2004; Caravolas, Lervåg, Defior, Málková, & Hulme, 2013; Snider, 2014; Ziegler et al., 2010). Some orthographies, such as Spanish, are relatively shallow, mostly maintaining a one to one phonographic correspondence; others, such as French, are much deeper, the link between speech and spelling being less obvious.
Numerous researchers have debated what the most promising depth for a tone orthography should be (Bernard, Mbeh, & Handwerker, 1995, 2002; Bird, 1999a, 1999b; Koffi, 1994; Kutsch Lojenga, 2008, 2014; Mfonyam, 1989, 1990) but as yet there is no consensus. This article seeks to shed new light on the subject by evoking the theory of lexical phonology. One of the most significant and lasting contributions this theory has made to our understanding of phonology is that there is a fundamental distinction between lexical and post-lexical linguistic processes and that native speakers are more psychologically aware of the former than they are of the latter. Although the theory itself has largely been superseded, some more recent theories, such as stratal optimality theory (Bermúdez-Otero, in preparation; Kiparsky, 1998, 2000), continue to formally recognize the fundamental lexical/post-lexical distinction, so its applicability to orthography development remains intact. The current study reports on a classroom experiment that tested the Lexical Orthography Hypothesis, that is, the notion that the output of the lexical phonology is the most promising depth for an exhaustive representation of tone by means of diacritics. The experiment focuses on the orthography of Kabiye (ISO 639-3 kbp), an Eastern Gurunsi Gur language spoken in northern Togo.
We begin with some definitions. We will use the term “exhaustive” in this paper to describe any tone orthography that marks one less tone than the number of contrastive level tones in the language, that is, one tone in a language with two level tones and two tones in a language with three level tones. We consider such a representation to be exhaustive, or maximal, because no contemporary tone orthography literature advocates the writing of all tones everywhere. According to this definition then, all three experimental orthographies presented in this paper are exhaustive, because in each case H tone is marked by the presence of a diacritic and L tone by its absence. And from a methodological point of view, having all the experimental orthographies marked exhaustively kept the outcomes more comparable.
The term “downstep” also needs defining. In this paper, we use the term to describe a drop in tonal register. In Kabiye, as in many African languages, any H tone following a L tone is pronounced at a lower register than the preceding H, and this pattern reiterates until the end of the phonological phrase. Following Stewart’s (1965, 1983, 1993) terminology, there are two manifestations of downstep: automatic (where the lowering of register is triggered by a pronounced L tone) and non-automatic (where the lowering of register occurs after a floating L tone, that is, one that is not pronounced, but is present in the underlying structure). While some researchers, especially in the past, have used the terms downstep and downdrift to refer to these processes, we prefer Stewart’s terminology, which has gained wide acceptance, because it draws attention to a single process with two possible causes (for further terminological discussion, see Connell, 2001).
The article is organized as follows. Section 2 provides a brief overview of the theory of lexical phonology and its implications for orthography development. Section 3 presents the Kabiye orthography: first some information about the sociolinguistic background, then a summary of the phonographic correspondences. Section 4 provides a sketch of the Kabiye tone system, and describes in detail the two tonal processes that are the focus of the experiment: lexical L tone spreading, and post-lexical HLH plateauing. It also summarizes the experimental predictions. Section 5 reports the experiment itself: the preparatory phase, the pre-test, the intervention, and the post-test. Section 6 presents the test results, treating the pre-test, an analysis of group equivalence, and each of the six error types in the post-test. Section 7 offers a summary and interpretation of the results, followed by some discussion of the implications of the experimental results for the development of tone orthographies for previously unwritten tone languages. Section 8 offers some concluding remarks.
H tone is written with an acute accent and L tone is written with absence of an accent in all three experimental orthographies and in phonetic transcriptions. In the latter, non-automatic downstep is written according to the IPA convention with a downward-pointing superscript arrow [↓] placed before the affected tone bearing unit (TBU). In the experimental
2 The Lexical Orthography Hypothesis
To understand the Lexical Orthography Hypothesis, a brief overview of lexical phonology and its implications for orthography development is necessary.
It is generally accepted that phonetic transcriptions of surface allophonic variations are not suitable for orthographic representation because such transcriptions represent sounds of which native speakers are almost totally unaware. For example, if the English orthography were to write both the aspirated and unaspirated allophones of /p/ (e.g., <spɪt> ‘spit’ vs. <phɪt> ‘pit’), it would unnecessarily over-differentiate, since native speakers are more or less unaware that the two variants of /p/ are actually different. The time-honored way of avoiding this extreme has been to employ phonemic representations, which avoid representing low-level allophonic differences like those in ‘spit’ and `pit.’ But while orthographies based on phonemic representations have often enjoyed relatively good success, consistently writing phonemic representations is also not without controversy. 1
For example, native speakers are very aware of the outputs that result from lexical morphophonemic processes, and they have no problem representing the phonemic outputs of those processes orthographically. Take, for example, the alternations of the English negative prefix /iN-/. While phonemic representations like <im-probable>, <ir-reverent>, and <il-logical> are not problematic for the orthography, morphophonemic representations like <in-probable>, <in-reverent>, and <in-logical> no doubt would be because native speakers are very aware that they pronounce these allomorphs differently. In fact, they are so aware of the differences that many do not even realize they are the same prefix.
On the other hand, native speakers are much less aware of phonemic outputs that result from post-lexical morphophonemic processes, and they have no problem representing the outputs of those processes morphophonemically. Take, for example, the alternations between [s] and [z] of the English plural suffix in words like cat-s and dog-z. Even though /s/ and /z/ are phonemes in English (cf. sip vs. zip), native speakers are not very aware that they pronounce these phonemes differently in words like cat-s and dog-z, and they would probably not be very favorable to writing them differently in the orthography. Again, see Snider (2014, pp. 36–40) for a discussion of problems associated with consistently representing the morphemic level orthographically.
The theory of lexical phonology (Goldsmith, 1976/1990; Kiparsky, 1982a, 1982b, 1985; Mohanan, 1982, 1986), together with its more recent evolution into stratal optimality theory (Bermúdez-Otero, in preparation; Kiparsky, 1998, 2000), makes a formal distinction between lexical and post-lexical phonological processes, with the former applying prior to the latter, in serial fashion. Following the lexical phonology model, Figure 1 illustrates the interaction between morphosyntax and phonology, on the one hand, and the serial nature of the lexical and post-lexical phonologies, on the other hand. We invert the classic diagram since, in terms of the linguistics of writing, it is metaphorically more intuitive for surface forms to appear at the top and deep forms at the bottom.

Lexical phonology model.
Starting from the bottom of Figure 1, underived lexical entries are first subject to lexical phonological processes. (The root of the example word “divinities” is subject to stress assignment in the phonology side of the figure.) Subsequently, lexical entries may undergo any required lexical morphological processes. (The example word undergoes derivational /–iti/ suffixation.) If lexical entries do undergo any morphological processes, the new, now morphologically complex form is again subject to any applicable lexical phonological processes. (The example word now undergoes trisyllabic laxing, a process that shortens and laxes a long vowel when it is followed by two syllables, the first of which is unaccented.) This reciprocal interaction between morphology and phonology takes place repeatedly until all required lexical morphology has been affixed to the word and all applicable lexical phonological processes have applied to each new form. (The example word, morphologically, undergoes further suffixation of the plural suffix. Since this new form is not subject to any more phonology rules in the lexical component, nothing further happens to it phonologically in the lexical component.) The resultant output of all morphological and phonological processes in the lexical component is a “word.” This (now derived) word is then subject to whatever phrasal syntax is appropriate to the communication intended by the speaker. Post-lexical phrases may be as simple as single-word utterances, or as complex as the syntax allows. (The example word is a single word pronounced in isolation, so no syntactic processes apply.) However simple or complex, once the phrase is assembled syntactically, it is subsequently subject to all applicable post-lexical phonological processes. (The example word is now subject to the post-lexical voicing assimilation processes). For further discussion of how data is processed in the lexical phonology model, the interested reader is referred to Kenstowicz (1994, p. 196ff) and Kiparsky (1982a).
According to Mohanan (1982), native speakers are aware of lexical phonological processes and relatively unaware of post-lexical ones (Snider, 2014, pp. 30-36). If this is true, the output of the lexical phonology could be a promising level of phonological depth upon which to base an orthography.
Regarding post-lexical processes: (a) they can result in non-contrastive sounds (e.g., aspirated consonants in English), (b) native speakers can demonstrate a complete lack of awareness of phonological alternations that result from their application, and (c) they can apply across word boundaries, with the result that some words may sound one way in one environment but another way in a different environment. Lexical processes, on the other hand: (a) can have lexical or categorical exceptions (e.g., single words or whole classes of words that do not undergo the processes), (b) can lack phonetic motivation, and (c) can apply only word internally, including across morpheme boundaries.
Again, to the extent that native speakers are truly aware of phonological alternations that result from the application of lexical processes, and they are truly unaware of alternations that result from the application of post-lexical ones, the output of the lexical phonology promises to be an ideal beginning point for developing an orthography.
The theory of lexical phonology is usually applied to utterance-based linguistics, but we will apply it to the linguistics of writing by making the following comparisons, from the shallowest (least abstract) to the deepest (most abstract) representations. In Table 1, the first column lists the terminology used in lexical phonology for each stage of the derivation, starting with the input at level 4 and ending with the pronunciation at level 1. The second column refers to the corresponding orthographic level, three of which (
Relationship between phonological and orthographic levels.
Level (1) is the output of the post-lexical phonology, and corresponds to a phonetic transcription of the surface form. It includes non-contrastive phenomena such as automatic downstep and the falling pitch of L tones pre-pausally. This level is excluded from our study, because there is general agreement that an IPA transcription is not a valid candidate for an orthography. The bottom level (4) is the input of the lexical phonology. It corresponds to what is often called a
Between these two extremes—surface and deep—are two other levels that are included in the experiment. They are often confused, because they are both relatively shallow, intermediate representations. First, there is the
The other intermediate possibility is the
Lexical Orthography Hypothesis
The lexical level (i.e., the output of the lexical phonology) offers the most promising level of phonological depth upon which to base a phonographic tone orthography that marks tone exhaustively.
This hypothesis is well known to some field linguists, but of course the only way to confirm or refute it is to formally test it. That is what motivated us to set up a quantitative classroom experiment, examining the exhaustive representation of tone by means of diacritics in the Kabiye orthography. We pitched three levels of orthographic representation against each other:
3 The Kabiye orthography
3.1 Sociolinguistic background
Kabiye is spoken by over one million people in Togo. 2 The Kabiye homeland consists of two mountain ranges to the north of the town of Kara, but the twentieth century saw considerable emigration to the center and south of the country, so by now about two thirds of the population live outside the homeland.
The origins of the written form of Kabiye can be traced to the pioneering efforts of Rev. Antoine Brungard of the Société des Missions Africaines (1932). But it was not until the 1980s that the Comité de Langue Nationale Kabiyè (CLNK), 3 mandated by the Togolese government, devised the standard orthography that is still in use today. Standardization led to the development of a bilingual dictionary (CLNK & SIL-Togo 1999) and various government sponsored pedagogical materials (e.g., MAS, 1984, 1995). The catalogue of published literature now exceeds 200 titles. The national newspaper Togo-Presse contains a daily half page in Kabiye, and the new Kabiye Wikipedia—born in June 2014—already has 150 articles in it. 4
Various literacy initiatives exist in both the formal and non-formal sectors, including the option for pupils in state schools to take written Kabiye as an exam subject in grades nine and ten (Roberts, 2011b). However, very few of these pupils ever actually encounter a Kabiye text, even in class. Togo is an overwhelmingly oral culture, and any literacy tends to be in French, the official language. In such a context, most readers can only be considered semi-literate at best, and this is certainly true of the pupils involved in our experiment.
Since there was early resistance among members of the CLNK and other literacy stakeholders to marking exhaustive tone with diacritics, the committee made only a single concession to tone marking: the disambiguation of a subject pronoun minimal pair (CLNK, 1988, p. 9). However, as the years went by, the CLNK grew increasingly aware of the problems of zero tone marking, and has addressed this question in its meetings and its biannual journal (CLNK, 1995, pp. 4–5, 16–17). The principal author has participated in the debate over several years, contributing short, non-technical articles to the CLNK’s journal, running a series of regional seminars with the aim of encouraging dialogue between orthography stakeholders, and leading a five-day extraordinary session of the CLNK during which various proposals were presented and discussed.
3.2 Phonographic correspondences
Kabiye is an SVO language with ten noun classes (Lébikaza, 1999, pp. 366–370) and vowel harmony (Lébikaza, 1999, pp. 65–69). It has 18 consonant phonemes /p, f, t, ʈ, s, t͡ʃ, k, k͡p, d, z, l, y, w, h, m, n, ɲ, ŋ/. The obstruent phonemes /p, f, t͡ʃ, k, k͡p/ are represented in the orthography either by the voiceless series <p, f, c, k, kp>, or by the voiced series <b, v, j, g, gb>, depending on various conditioning factors that are beyond the scope of this article but which have been treated elsewhere (Lébikaza, 1999, pp. 135–140). As for the phonemes /j, ɲ/ they are represented by the graphemes <y, ñ> respectively.
There are nine basic vowel phonemes: /i, e, ɪ, ɛ, a, u, o, ʊ, ɔ/. The phonemes /ɪ, ʊ/ are represented respectively by the graphemes <ɩ, ʋ>. Long vowels are written as a sequence of two letters (<aa, ee> etc.), and extra-long vowels as a sequence of three (<aaa, eee> etc.). There is also a series of five long back unrounded vowels /ɯ̘ɯ̘, ɯ̙ɯ̙, ɤɤ, ʌʌ, ɑɑ/ that are contrastive with their back rounded and their front unrounded counterparts, but only occur at morpheme boundaries as the result of morphophonemic conditioning. They are written respectively as <iɣ, ɩɣ, eɣ, ɛɣ, aɣ>.
Finally, the hyphen <-> is used in the orthography to avoid homographs. It is placed between the possessive pronoun and the noun in the associative noun phrase (example 1), and between the verb and the object pronoun (example 2), to distinguish them from each other and from verb phrases (example 3):
4 The Kabiye tone system
4.1 Overview
Among Gur languages, Kabiye is relatively well described, with published research including not only two reference grammars (Delord, 1976; Lébikaza, 1999) but also dozens of other titles (see the thematic bibliography in Roberts, 2013, pp. 309–319). The tone system, too, is relatively well understood. All researchers agree that Kabiye has two contrastive level tones, high (H) and low (L), and automatic and non-automatic downstep (Lébikaza, 1999, p. 183). The TBU is the mora (Lébikaza, 1999, pp. 170, 267; Padayodi, 2010, p. 249), which means that all vowels and all pre-consonantal and word final nasals are capable of bearing tone. Contour tones are not licensed on single TBUs. However, there are differing views about what constitute underlying forms. This paper takes the position that there are six tone patterns on noun roots—H, L, HL, LH, LHL and HLH (Roberts, submitted)—and three on verb roots—H, L, HL (Roberts, 2002). We will discuss the implications of differing tone analyses for orthography development in Section 7.2.
Kabiye has tonal contrasts in both the grammar (examples 4–6) and the lexicon (7 and 8).
This experiment focuses on two tonal processes: lexical L tone spreading (Section 4.2) and post-lexical HLH plateauing (Section 4.3). There are numerous others, but these were chosen because they are frequent in natural contexts, and because the distinction between their status as lexical and post-lexical processes is clear and unambiguous.
4.2 Lexical L tone spreading
The first tonal process that will be targeted for testing is lexical L tone spreading. In the Kabiye verb phrase, the L tone of a prefix spreads rightwards onto a H verb root until it is blocked by a singly linked H tone.
5
In example 9, the imperative (i.e., the base form of the verb) carries a H melody which associates to all three TBUs. In example 10, the L tone of the subject pronoun causes the H tone on the first two tone bearing units of the verb root [wele-] to be pronounced as L, but has no effect on the third tone bearing unit [-sí]. In example 11, the L tone of the negative prefix has exactly this same effect on its environment. Being a lexical process, the
L tone spreading in Kabiye is considered to be a lexical process for three reasons. First, it applies internally, within the phonological word, never across a word boundary. Second, it is limited to a particular morphological context: not all L tones in the language spread to adjacent H tones, only those associated to verbal prefixes. 6 Third, classroom observations reveal that L1 speakers are often more aware of it than they are of post-lexical tonal processes – they hear it straightaway and learn how to write it more easily.
4.3 Post-lexical HLH plateauing
The second tonal process that will be targeted for testing in this experiment is post-lexical HLH plateauing. In Kabiye, a singly linked L between two H tones delinks, and the second H—with its downstepped register following a L—spreads left (Lébikaza, 1999, pp. 57–58). In example 12, the melody of the word in isolation is [HL]. In example 13, the melody is [HH]. When these two words are juxtaposed in example 14, this creates a HLH sequence, so the phrase is not pronounced [HLHH], but [H↓HHH]. Being a post-lexical process, the
An anonymous reviewer has suggested that the output of HLH plateauing is not phonemic but rather allophonic (with [HLH → H↓HH] and [HLLH → HLL↓H] as allophonic variants of under-lying /HLH/), and that we are therefore not testing a phonemic orthography at this point. This, however, is not the case. From a structuralist point of view, the result of /HLH/ plateauing is [H↓HH], which contrasts phonemically with [HHH] and with [HLL]. Whether the floating L responsible for non-automatic downstep is underlying or is generated lexically or post-lexically is not a distinction structuralist theories are able to make, because they pre-date the theory of lexical phonology. Neither are they able to distinguish between singly and doubly linked L tones, because they pre-date the theory of autosegmental phonology.
HLH plateauing, and the non-automatic downstep that it generates, is considered to be a post-lexical process because it occurs in all environments, 7 within words and across word boundaries. It often occurs sequentially too, with the second H tone of one HLH sequence becoming the first H tone of the next HLH sequence. In the following four-word sentence (15), the HLH melody occurs five times, each time overlapping with the neighboring HLH pattern on either side.
HLH plateauing is a generalized phenomenon throughout the language wherever the tonal context permits it, but classroom observations reveal that L1 speakers are often unaware of it – they do not necessarily hear it straight away, and learning to write it is a challenge. Another hallmark of HLH plateauing that identifies it as a post-lexical process is that non-fluent readers, whose reading is characterized by long hesitations between words, do not pronounce it across word boundaries, whereas fluent readers do.
4.4 Experimental predictions
Our experiment seeks to obtain empirical data providing evidence for or against the kinds of predictions made by the Lexical Orthography Hypothesis. For this purpose, we targeted the two tonal processes described in Sections 4.2 and 4.3 and developed three experimental orthographies to write them. Figure 2 summarizes the relationships between the two tonal processes and the three experimental orthographies, and our predictions about the relative ease of writing such forms.

Experimental predictions: two tonal processes, three experimental orthographies.
Our predictions are that:
those who learn the
those who learn the
The
5 The experiment
5.1 Overview
The experiment took place on non-consecutive days from 26 January to 12 March 2011 at Lama-Kolidè secondary school in northern Togo. 8 Ninety-seven tenth-grade pupils participated in the experiment, all of whom had chosen written Kabiye as an optional exam subject. As one reviewer noted, written Kabiye may attract pupils who are naturally stronger in the language arts, so the sample was not necessarily a representative cross-section of Kabiye pupils.
First, the pupils participated in two pre-intervention dictation tasks to establish their base-line skills in written Kabiye and French, after which we randomly assigned them to three parallel groups and a control group. Each group followed an eleven-hour course spread over three weeks. Finally, the pupils participated in a post-intervention Kabiye dictation test designed to assess their newly acquired skills. We also included a French dictation in the post-test, but it did not reveal any results relevant to the experiment.
5.2 Teacher training
We recruited eight mother-tongue teachers to teach the four courses in pairs. These were picked from a larger group of candidates for their demonstrated teaching ability and their facility with written Kabiye. The pedagogical materials were written, revised and introduced to the teachers by means of several rounds of pilot tests conducted with a small group of volunteers for several weeks in the run-up to the experiment. We closely observed the teachers’ performance during this phase to ensure that they were well paired, and that the pairs would teach equally effectively across the three experimental groups. All the teaching was in Kabiye. The principal author was discretely present throughout the experiment, but only to ensure that it ran smoothly.
5.3 Preparatory phase
At the time of the experiment, the pupils had already completed approximately 40 hours of instruction in written Kabiye with a volunteer teacher who knows the standard orthography. We calculated this figure on the basis of 4 terms, 12 weeks per term, 1 hour a week, plus an estimated adjustment for the frequently interrupted school timetable. The pedagogy in such classes is characterized by rote learning and copying lessons directly from the blackboard into exercise books. There is a high pupil-teacher ratio (48 to 1). We considered this to be insufficient exposure to the written language to enable the pupils to participate in the experiment. So we supplemented their normal timetable with an additional two-hour lesson once a week for the five weeks preceding the experiment. During these lessons, the pupils benefitted from a much lower pupil-teacher ratio (11 to 1). Specially developed pedagogical materials gave them plenty of opportunity to practice writing and oral reading skills in the standard orthography. In addition, part of each lesson focused on applying the four basic conjugations (imperative, imperfective, unbound perfective, and bound perfective) to a list of 20 verbs that would later be used in the dictation tasks.
During these preparatory weeks, each pupil completed a questionnaire in French enabling us to glean sociolinguistic data. In the frequent cases where the written responses proved to be insufficiently clear, a trained research assistant interviewed individual pupils in Kabiye, greatly increasing the accuracy of the data. This information was classified according to six independent variables (Table 2). 14
Independent variables gathered from the sociolinguistic questionnaire.
5.4 Pre-test
On the first day of the experiment, in a plenary session, we tested pupils’ competence in the Kabiye standard orthography and the French orthography using dictation tasks composed of 20 short, semantically unrelated sentences. The inclusion of the French pre-test served primarily to identify any differences in pre-intervention skills that the pupils brought to the experimental setting. It also allowed us to control for any inequalities between the groups in terms of general writing ability. The variable of measurement was the number of individual spelling mistakes, quantified by comparing each pupil’s written work letter by letter against two master texts (Kabiye and French) and counting insertions, substitutions and omissions. We recorded their error rates by means of two independent variables (Table 3).
Independent variables gathered from the pre-test.
5.5 Intervention
After the pre-test, the pupils were randomly divided into four parallel groups. Three of them learned the experimental orthographies, while a control group continued learning the standard orthography. These parameters were recorded by means of the grouping variable associated with the intervention, which we labeled
Process variable
For an analysis and discussion of group equivalence see Section 6.2. Each group followed an eleven-hour training course spanning three weeks. The course consisted of eleven, one-hour lessons interspersed with practice dictation exercises.
5.6 Post-test
On the final day, we tested acquired skills in a Kabiye dictation task, with each group putting into practice whichever experimental orthography they had learned. The dictation contained 20 SVO sentences, similar in difficulty to those used in the pre-intervention task and the lessons, but different in content. The twenty sentences contained seven cases of lexical L tone spreading (see Section 4.2) and nine cases of post-lexical HLH plateauing (see Section 4.3). These sentences alternated with others that contained neither of these processes.
We recorded the results by means of six dependent variables. Table 5 explains each error type, describes how it was quantified and gives a concrete example of each type compared against a sentence written in the
Dependent variables—individual error types on the Kabiye post-test.
The counting of error rates related to vowel length (
6 Analysis
6.1 Pre-test
Using the Minitab software, we ran ANOVAs for statistical variance to test whether any of the sociolinguistic independent variables (Tables
2
and
3
) were predictive of pre-test performance in Kabiye or French.
10
The only one that was statistically significant was
Pupils write the Kabiye standard orthography with fewer errors if both parents are Kabiye. The high mean and low standard deviation for the six pupils who had one non-Kabiye parent suggests consistent non-usage of Kabiye in these households. The much larger standard deviation but lower error rate for the pupils who have two Kabiye parents reflect a high level of variation in use of Kabiye, but a lot of general familiarity with the language. None of the other independent variables—
6.2 Group equivalence
Random assignment does not guarantee equivalence among groups especially when the number of participants in an experimental design is relatively small. Accordingly, further analysis was carried out to establish the level of group equivalence and to investigate the possible effects of any distributional anomaly that might have occurred. The results are captured in Tables 6 and 7.
Distributional characteristics of the four groups resulting from the random assignment of pupils.
Investigation of the effects of independent variables on performance.
Table 6 defines the makeup of each group with respect to the distribution of the independent variables for which data was gathered. The four groups in the design are listed with demographic and performance characteristics by group shown by rows. P-values appear in the final column indicate whether the grouping characteristic under consideration was statistically significant.
The only characteristic that had a statistically non-random distribution with respect to
This was carried out by (a) examining the impact of each sociolinguistic variable on pre-test performance, and (b) looking for evidence that the demographic variable in question impacted performance on the post-test in a manner independent of its impact on the pre-test. We included each variable in turn in a multivariate model (using GLM) with
Table 7 shows the results of this investigation, with the key data in the final column. (For an explanation of each of the variables, see Table 2). The columns to the left, which have specific labels on each line according to the values for each variable, give raw scores. For example, pupils who reported speaking only one language had a mean score on the Kabiye pre-test of 63.8 while those reporting speaking two languages had a mean score of 52.5. The penultimate column contains p-values indicating whether variations in the variable in question can be considered statistically significant with respect to the Kabiye pre-test. The final column shows p-values for each of the variables when included in a multivariate model with the Kabiye pre-test as a covariate. A significant p-value in this case would indicate that the variable in focus impacted performance on the post-test independent of any impact on the Kabiye pre-test.
The penultimate column reminds us that only one variable—
In Table 6, it was noted that the
6.3 Post-test
The aim of the post-intervention dictation task was to compare proficiency in writing the three experimental tone orthographies. We did this by means of an analysis of each of the six individual error types against
6.3.1 Error type 1: failing to mark an acute accent on a TBU that should have one
An ANOVA analysis testing
6.3.2 Error type 2: marking an acute accent on a TBU that shouldn’t have one
An ANOVA analysis testing
But here we hit a measurement problem. If a pupil writes a TBU with no accent, is it a failure to write a H tone, a deliberate choice to mark L tone, or is it simply the result of fatigue, boredom, confusion or indecision? For this reason, it is misleading to measure
So we examined the data from another angle, measuring how many sentences were written with at least one accent (Table 8). We reasoned that a sentence written with no accents is most unlikely to represent a deliberate choice to write the whole sentence as L tone, since such sentences are almost non-existent in Kabiye. We also isolated the results of the first half of the dictation from those of the second half, to see if there was any decline in performance. The first line in Table 8 indicates, for each experimental orthography, the percentage of sentences written without any accents. The second and third lines contrast the average number of accentless sentences written in sentences 1–10 and sentences 11–20. The fourth line records the statistical probability of the results.
Performance degradation measured by counting sentences written with no accents.
We observe that the
Of course, viewing the data from this angle sheds no light at all on accuracy. Ironically, those in the
6.3.3 Error type 3: Writing a vowel where there shouldn’t be one
Running ANOVA to test
6.3.4 Error type 4: Failing to write a vowel where there should be one
Running ANOVA to test
We should also note that both pre-tests are strong indicators of performance on Error4 (Kabiye Pre-test: correlation = .7; p < 0.001. French Pre-test: correlation = .226; p = 0.024), indicating that literacy skills in either language are a good basis for mastering the spelling of vowel length in Kabiye. This is the only case where we found a statistically significant correlation between the two pre-tests and one of the error types. 12
6.3.5 Error type 5: failing to write lexical L tone spreading correctly
The correct rendering of L tone spreading by the
6.3.6 Error type 6: failing to write non-automatic downstep in the context of post-lexical HLH plateauing correctly
The correct rendering of non-automatic downstep in the context of post-lexical HLH plateauing by the
Until now, examining error types 1–5, the results of the
7 Discussion
7.1 Summary and interpretation of results
Let us summarize the results of the analysis from the point of view of the pupils writing the
Data supporting the Lexical Orthography Hypothesis.
In addition, we should add the observation—not mentioned until now—that, when all error types are collated in the Kabiye post-test, the
7.2 The profile of the Deep orthography
One of the major difficulties in developing an adequate
Nouns present a similar scenario. Lébikaza (1999, pp. 205–215), applying eight morphotonological rules, obtains seven underlying melodies: H, L, LL, LLL, HL, LH, LHL. Padayodi (2010, pp. 259–274) identifies five underlying melodies: H, L, HL, LH, and toneless. Roberts (submitted) on the other hand, obtains six underlying melodies: H, L, HL, LH, HLH, LHL. According to his analysis, the noun root melodies spread across the entire word and this is how the noun class suffix receives its tone. In other words, the input and output of the lexical phonology are identical for noun roots. Since the experiment followed Roberts’ analysis, the
Such conflicting analyses—by no means uncommon in tone research—are a reminder to us of one of the major hazards of developing
7.3 Depth and graphic density
All three tone orthographies developed for this experiment were “exhaustive” or “maximal” representations because they mark one less than the number of contrastive tones in the language. But we should also note that the choice of depth in a tone orthography may lead to representations of greater or lesser graphic density, even though all of them are exhaustive. Diacritic density is quantifiable by calculating the number of tone diacritics as a percentage of the number of tone bearing units in a natural text of about 400 words (Bird (1999b, p. 89) uses 164 words, but our own recent research on several languages indicates that this is insufficient). In our experiment, the diacritic density of the
7.4 Accurately measuring “absence of an accent”
Continuing on the topic of exhaustive tone marking, the binary strategy of marking H tone with an acute accent and L tone with absence of an accent created an unforeseen measurement challenge (Section 6.3.2). Every time a pupil wrote a TBU without an accent, it was unclear whether this represented failure to write a H tone or a deliberate choice to mark L tone. Two of the experimental groups (
What is clear is that only countable choices can be measured in a quantitative experiment. That is why we suggest that it may sometimes (depending on the nature of the research question, of course) be preferable to develop experimental orthographies that mark all tones everywhere. The purpose of such experiments, after all, is often to test a general orthographic principle, not a specific experimental orthography for its own sake. The disadvantage of such an experimental strategy of course—as any literacy specialist will immediately point out—is that it is never advocated in the literature on developing orthographies for unwritten languages. It would be unfortunate if practitioners dismissed the lessons to be learned from an otherwise soundly designed experiment because the orthography used was deemed too far from field realities. There is a potential tension, then, between the quest to develop accurate and precise measurements, and the need for experiment designs to be ecological, capturing performance of an ability in way that is as true to real-life as possible.
7.5 Functional load and learner motivation
Languages differ enormously in the relative functional load carried by tone. Some Mande, Kru, and Kwa languages in Côte d’Ivoire and some Omotic languages in Ethiopia are at the high end of the continuum (Wedekind, 1985). These languages typically have four or five level tones, contour tones, and many monosyllabic words, all of which contribute to a high level of written ambiguity if tone is not marked. Previous research in Kabiye indicates that although the functional load of tone is not so low that it can be ignored, it is clearly not a language that warrants an exhaustive representation of tone.
Readers might legitimately ask, then, why we targeted Kabiye for investigation at all. The reason for the choice of language was purely pragmatic. Two of the authors had recently conducted a previous tone orthography experiment on the same language (Roberts & Walter, 2012), so the professional contacts had already been nurtured in the language community, the background linguistic research had already been done, and we could recycle many of the existing pedagogical materials.
But pragmatism apart, the relative functional load of tone in any given language is a crucial consideration for experimental research because it has implications for learner motivation. The tone orthography of a language with an extremely high functional load of tone may be more difficult to learn, but the learner has a correspondingly high motivation to overcome the obstacles because tone markings leads to comprehension. In a language with an extremely low functional load of tone, on the other hand, learner motivation is correspondingly low, because most of the diacritics are unnecessary and mastering the system pays no real dividends.
7.6 Methodological limitations
Our primary concern about the experimental methodology is that pupils’ base-level skills in the standard orthography are weak. Since most of them have inadequate mastery of the segmental phonographic correspondence rules, they cannot be expected to add an extra graphic layer reliably. So the results show what performance is like when elementary pupils learn the simplest possible structures after minimal training, and should be understood with this in mind.
The experiment did not seek to assess oral reading skills. Yes, the results demonstrate that the
8 Conclusion
Fieldworkers engaged in developing orthographies for unwritten languages have many pressures on their time, and tone is often perceived as being of secondary importance, especially if the linguist in question is not an L1 speaker of a tone language. Many embark on tone orthography development with minimal training and support, unaware that the output of the lexical phonology offers a third option between the two extremes of
But at a more fundamental level, this article carries no assumption that tone must necessarily be written exhaustively in all tone languages. On the contrary, we advocate that this strategy should only ever be used as a last resort, in languages where it has been proven that the functional load of tone is so high that it is unavoidable. Such proof comes not through autosegmental theory—which is indispensable for analyzing the tone system itself—but from other less exploited angles such as analysis of written ambiguities in natural contexts and analysis of oral reading errors.
Who stands to benefit from the results of this experiment? Since we believe that the functional load of tone in Kabiye does not warrant an exhaustive representation of tone by means of diacritics, the results are almost certainly not going to be of much use to the CLNK as they debate how to mark tone beyond, perhaps, sending a clear signal to them about the complexities of orthographic depth inherent in exhaustive tone marking. In any case, the only members who advocate this strategy are one or two university trained linguists; most other members are intuitively opposed to it.
So is the experiment of purely scholarly interest to linguists versed in the intricacies of lexical phonology? No, because from the outset, it was conceived as responding to a global rather than a local concern. It looks beyond the borders of Togo and contributes to a wider debate about how tone should be written in orthographies throughout the world. With this in mind, we should bear in mind how vastly different tone systems can be from one another. It is clearly unwise to generalize about whether, how much, and by what means tone should be marked. This experiment provides clear evidence from one language that the output of the lexical phonology is the most promising level for an exhaustive diacritic tone orthography. This finding now needs to be further tested with multiple reading and writing experiments, characterized by a variety of imaginative aims and designs, in diverse languages from around the world.
Footnotes
Acknowledgements
We are deeply indebted to the principal author’s research assistant, Emmanuel Pidassa, without whose efforts this research would never have been completed. We would like to thank Mike Cahill, Steve Parker and four anonymous reviewers for their stimulating and helpful comments on a draft of this article. We are indebted to the CLNK for the interest they have taken in this research since it began. The article also benefitted from the discussion following its presentation in English at the 41st Colloquium on African Languages and Linguistics (University of Leiden, Netherlands, 29-31 August 2011) and in French at the LLACAN-CNRS Journée Scientifique, (Villejuif, France, 18 November 2011).
Abbreviations
BP bound perfective
CLNK Comité de Langue Nationale Kabiyè (now the Académie Kabiyè)
CND conditional
COM comitative
DIS distant
H high toneI
MP imperativeI
PF imperfective
L low tone
NC noun class
NEG negative
PP possessive pronoun
OP object pronoun
sg singular
SP subject pronoun
SVO subject, verb, object
TBU tone bearing unitAcknowledgements
Funding
This research was made possible thanks in part to a twelve month postdoctoral research fellowship awarded to the principal author by the City of Paris to work at the LLACAN-CNRS research laboratory.
