Abstract
The purpose of this study was to investigate effects of a preparatory contextual singing pattern on melodic dictation test scores. Forty-nine undergraduate music education majors took melodic dictations under three conditions. After hearing an orienting chord sequence, they (1) sang a preparatory solfége pattern in the key, meter, and tempo of the target dictations in the first condition; (2) prepared themselves silently during an equivalent time interval in the second condition; and (3) took the dictations immediately in the third condition. A repeated measures ANOVA and post hoc analysis revealed that participants scored significantly higher when they heard the dictation immediately following the chord sequence than when they sang the preparatory pattern first. Participants may have been distracted by the additional task of singing, interfering with their focus on the ensuing dictation. They reported a variety of preparatory strategies during the silent interval condition, suggesting that dictation students may benefit from learning multiple strategies and choosing what works best for them. Future research might investigate the relationship between strategies used during dictation and strategies used just prior to dictation. Implications for music educators include the need for careful decisions regarding when and how to combine musical tasks for student learning.
Melodic dictation is a widely accepted method of assessing the aural skills of music students in high schools and colleges (College Board, 2012a; National Association of Schools of Music, 2012). Dictation tests typically focus on accurate perception of pitches and rhythms, demonstrated through standard music notation on the staff. Successful dictation requires coordination of multiple skills, including inner hearing, melodic memory, and transfer from aural comprehension to accurate notation (Foulkes-Levy, 1997). Limited repetitions and limited time in which to process and notate target melodies require dictation students to work efficiently (Karpinski, 2000).
Students may employ any of a broad range of strategies when taking dictation (Karpinski, 2000), including “chunking” of information for efficient memory storage and retrieval (Madsen & Staum, 1983; Oura, 1991; Potter, 1990; Rogers, 2004), applying contextual solfége syllables or numbers to percepts for identification (McClung, 2001), engaging kinesthetic or visual imagery to decode pitch information (Mikumo, 1994; Thompson, 2004), attending to broad melodic characteristics to reinforce context for the target melody (Buonviri, 2014), and employing flexible working memory to analyze the melody both forward and backward in chronological sequence (Berz, 1995; Margulis, 2005). Some instructors teach these strategies to boost students’ learning and to assist them in dictation tests (Karpinski, 2000; Paney & Buonviri, 2013).
Research has demonstrated important similarities among underlying skills required for dictation, sight-singing, and error detection (Byo, 1997; Killian, 1991; Larson, 1977; Norris, 2003; Sheldon, 1998). Greater understanding of these skills and their relationships may have implications for both music students and teachers in a variety of applications, including performance (Duke & Simmons, 2006; Morrison, Treviño, & Sielert, 2008), critical listening (Cassidy, 2001; Gromko & Russell, 2002), composition (Younker & Smith, 1996), and conducting (Sheldon, 1998).
Paney and Buonviri (2013) found that many aural skills teachers strive to maintain parallel progress between dictation and singing materials in curriculum development and practice. One of the primary objectives of singing in aural skills course work is to help students develop internal context for musical elements such as pitch and rhythm, typically through syllable, number, or letter systems (Karpinski, 2000). Aural skills students practice singing aloud what they hear internally or what they read from notation. This skill may reciprocally reinforce dictation ability as students reverse the process: They attempt to turn what they have physically heard in dictation into internal understanding and written notation.
Some instructors encourage students to use their singing voices as an intermediate step during dictation practice. Pembrook (1987) warned, however, that this approach may be counterproductive because most students cannot sing back a full melody correctly. In his study, participants’ singing actually hindered their accuracy. Attempting to sing the entire melody before notating it may have confused their memory of it.
Although caution should be exercised in teaching singing as a melodic memory tool, it may be useful to orient dictation students to important parameters of the task at hand, such as key and meter. Research suggests that pitch and rhythm processing are dissociated neurological functions (Hodges & Nolker, 2011). However, students must integrate these two elements to succeed at dictation. Their final product demonstrates an ability to recognize how pitch and rhythm contexts coincide through a specific example, the dictation melody. The singing voice may be useful for preparing and reinforcing this integration of contexts (Schellenberg & Moore, 1985).
During dictation, chunking strategies for melodic memory may be improved when understood in context, as Deutsch (1977) described:
The differences between attempting to recall a musical sequence in a familiar tonal system as compared with a set of notes chosen at random, is equivalent to the difference between trying to recall a sentence as compared with a set of nonsense syllables. (p. 114)
Chunking may increase students’ melodic memory during dictation and can improve their musical comprehension if they clearly attend to the contexts in which the melody sounds (Klonoski, 2006; Rogers, 2004).
The moments just prior to, or intervening between, singing tasks appear to influence auditory processing. Demorest and Clements (2007), for example, found that providing a tonal context may improve performance on pitch-matching tasks. Yarbrough, Orman, and Neill (2007) found that a tonal context, established through movable-do fluency, improved choral sight-singing. Students’ readiness to perceive a melody accurately and convert it to notation may benefit from establishing a similar tonal context through singing.
In practice, some aural skills instructors teach orienting patterns and warm-ups to establish key, meter, pitch, and rhythm contexts before singing or dictation tasks begin (Henry, 2008), while others insist that students maintain silence just prior to dictation. Some instructors may also try to eliminate any distractions, internal or external, by setting up metric and tonal contexts through, for example, the piano and moving immediately to the dictation melody as one musical gesture. Systematic investigation of effects of these approaches on students’ dictation performance is missing in the literature.
Formal investigation of these approaches could reveal important information not only about performing well on dictation tasks but also about how the mind processes aural percepts in light of prior preparation in general. Accurate perception of aural stimuli depends at least partially on the relationship between mental focus and distractions, both internal and external (Abril & Flowers, 2007; Flowers, 2001; Flowers & O’Neill, 2005; Madsen & Geringer, 2000). Mental engagement just prior to aural skills tasks must be taken into account when considering relative levels of success on those tasks. The purpose of this study was to investigate effects of a preparatory contextual singing pattern on melodic dictation scores of undergraduate music majors. The primary research question was this: Does singing a solfége pattern in the key, meter, and tempo of a dictation melody, just prior to hearing that melody, improve dictation scores?
Methodology
Participants
Participants (N = 49) were sophomore, junior, and senior music education majors, ages 18 to 21, at a large university. Participants represented a broad range of concentrations, vocal and instrumental, within the major. All participants had passed at least the first two semesters of music theory at the collegiate level, ensuring that they were familiar with melodic dictation and possessed the notational skills to complete the experimental tasks. Potential participants who confirmed they had absolute pitch were excluded from the study.
Design
This study was based on a within-subjects design. The independent variable was experimental condition, with three levels: preparatory singing pattern prior to dictation, equivalent silent interval prior to dictation, and neither singing pattern nor silent interval prior to dictation. The dependent variable was melodic dictation test scores. Each participant took the test under all three conditions in one sitting. Three sets of dictation melodies were employed—one for each condition—to avoid melody memorization and potential testing effects. Further threats to internal validity were addressed by counterbalancing melody sets and order of test conditions.
Testing Instrument
The researcher-designed testing instrument consisted of one audio file and one answer sheet for each of the three experimental conditions (three files and three sheets total). Each audio file and answer sheet contained a practice item and three target items, yielding 12 test items total across the three conditions. Each test item consisted of an orienting I–V7–I chord progression and a two-measure melody played by the grand piano patch in GarageBand.
Two experienced aural skills instructors reviewed all target melodies to establish external content validity by ensuring they were typical of tonal melodic dictations in content and difficulty level (see Figure 1). Face validity was also a particularly important characteristic of this instrument; the answer sheet staff prompted students to notate just as they would in standard dictations in a high school or college theory course. Reliability of the instrument was measured through internal consistency analysis. Results were well within acceptable limits: Melody Set 1, α = .93; Melody Set 2, α = .78; and Melody Set 3, α = .82.

Test melodies.
For two of the conditions—preparatory singing and equivalent silent interval—the audio files contained a two-measure silent interval between the orienting chord progression and the target melody of each item. In the preparatory singing condition, participants sang the preparatory pattern during this interval (see Figure 2), and in the silent interval condition, participants prepared themselves silently. In the third condition (no singing or silent interval), the audio file items continued directly from the orienting chord sequence to the target melody. In all conditions, the first item was for practice, enabling participants to familiarize themselves with the procedures for that condition.

Preparatory singing pattern.
Procedures
Each participant completed the experimental tasks under all three conditions in one sitting in a small, quiet office. Audio files were played directly from GarageBand on a MacBook Pro laptop computer, through Sony MDR-NC7 Noise Canceling Headphones.
Participants heard the instructions for each condition at the beginning of the audio file for that condition and were able to read the instructions for themselves simultaneously. For the preparatory singing condition, participants were coached through rote learning of the pattern (see Figure 2) by a brief audio tutorial prior to beginning that section of the test. The pattern was modeled by a recorded male voice in two keys, C major and F major, each of which the participant echoed while referring to the printed pattern. Because the pattern was generically notated with appropriate contour and movable-do syllables, participants could easily transpose it to any key; movable-do solfége had been the primary syllable system employed in their prior theory coursework. The tutorial continued by establishing only the chord sequence in two more keys, D major and G major, after each of which the participant practiced the pattern without any model.
In all conditions, after the instructions the audio file announced the practice melody, played the orienting sequence, silent interval (in Conditions 1 and 2), and melody, and remained silent for 45 seconds while participants notated the melody. The decision to provide 45 seconds of work time was determined through review of dictation procedures on the AP Music Theory Exam (College Board, 2012b), averaging the work time provided on that exam (30 seconds or 60 seconds, depending on repetitions of the target melody), and pilot testing. Under all conditions the participants’ objective was to write the target melody in standard notation on their answer sheet. The audio then proceeded to announce the next melody. Participants heard each melody only once. A single playing of these short melodies enabled completion of all experimental tasks under all conditions in one sitting, without potential for testing fatigue, a possible threat to internal validity.
In the preparatory singing condition, participants heard the orienting chord sequence and then sang aloud the preparatory pattern during the two-measure interval. They were permitted to use the Preparatory Singing Pattern sheet for reference while singing. Then they heard the melody and attempted to write it in standard notation. In the equivalent silent interval condition participants were instructed to “prepare yourself silently for two measures” during the two-measure interval following the chord sequence, and then they heard the melody and attempted to notate it. In the final condition, participants heard the chord sequence, followed immediately by the melody, and attempted to notate it. The researcher was in the room at all times to ensure functionality of the testing instrument and participants’ successful compliance with procedures.
Analysis
One of the original 50 participants was unable to sing the preparatory pattern consistently correctly; his answer sheets were not scored, leaving 49 tests for data analysis. All melodies consisted of two measures of 4/4 meter and were scored per quarter-note beat for correct rhythm and pitch notation. Each beat contained a total possible score of two points (one for correct pitch and one for correct rhythm), each melody contained a total possible score of 16, and each condition (three melodies) contained a total possible score of 48. This scoring system is an adapted version of the AP Music Theory scoring system (College Board, 2012b) in which melodic dictations are scored by two-beat increments. Scoring each individual beat was important for precision in the current study because dictations were only two measures in length and heard only one time each.
Results
This study employed a within-subjects design in which all participants completed the experimental tasks under all three conditions. Mauchly’s test indicated that the assumption of sphericity was not violated, χ2(2) = .175, p > .05, and a one-way repeated measures analysis of variance was used to analyze participants’ test scores by condition.
The repeated-measures ANOVA revealed a significant difference in test scores, F(2, 96) = 3.70, p = .03, partial η2 = .07, based on conditions of preparatory singing (M = 34.29, SD = 9.76), equivalent silent interval (M = 34.78, SD =11.24 ), and no interval (M = 36.37, SD = 10.14). Post hoc pairwise comparisons with Bonferroni correction showed that participants scored significantly higher in the no interval condition than in the preparatory singing condition. There were no other significant pairwise comparisons. Participants scored significantly higher when they heard the dictation immediately following the opening chord sequence than when they sang the preparatory pattern in the intervening measures. Differences between other condition pairings were not significant.
Discussion and Implications
Participants scored significantly higher when they heard the dictation immediately compared to when they sang. However, there was no difference between when they heard the dictation immediately and when they experienced the equivalent silent interval. The difference between means in the significant pairwise comparison was only slightly more than two points, but statistical results suggest that the preparatory singing pattern did detract from students’ success with the task. This finding has important implications for aural skills teachers considering whether to pair activities for mutual reinforcement of skills.
Participants’ singing may have introduced some form of distraction into the process. Although production tasks related to the target task may be aimed at sparking cognitive processing, mental attention to the “complementary” task may actually usurp potential processing. Waggoner (2011), for example, found that college music majors were less accurate with error detection when conducting than when listening to recordings.
Participants in the current study were enrolled at a large university with multiple music theory faculty members. None of the three conditions in the current study is utilized more than any other on a regular basis in class, though singing with movable-do solfége is the norm. All participants sang the preparatory pattern correctly on all four items of the preparatory singing condition. They were able to complete the task correctly, which seemingly would have mentally “paved the way” for success on the dictation task in that condition. In other words, they appeared to be correctly reinforcing for themselves the basic key and meter information the preparatory pattern was designed to represent.
Pembrook (1987) found that attempting to sing an entire dictation melody from memory before notating it was generally counterproductive for students because they couldn’t sing the melody correctly. In that study, the incorrect singing seems to have interfered with mental processing and subsequent notation. In the current study, however, the singing task was short and manageable, it occurred before participants heard the dictation, and it was sung with consistent success. Although participants were able to execute the pattern correctly, it appears that the preparatory singing still had some contrary effect on participants’ performance.
Although singing and dictation seem generally to be mutually reinforcing aural skills long term (Paney & Buonviri, 2013), they may not be complementary within specific tasks such as those employed in this study. Instructors who routinely lead students through a preparatory singing pattern on, for example, movable-do solfége may be trying to trigger internal aural experiences of common patterns of pitch and rhythm relationships just prior to dictation. They may believe that rehearsing these patterns aloud just prior to dictation will prepare students for efficient and accurate perception of dictation materials and successful transfer to contextualized understanding and notation. Based on the results of this study, however, such an approach may actually hinder students’ work; the additional task may be distracting them (Henry, 2008).
Instructors who insist on silence prior to dictation may be convinced that each student needs individual time to focus mentally without aural distractions. They may be trying to allow students the aural space to perceive the target melody clearly and fully, without any residual external or internal noise. In this study, participants were instructed to “prepare yourself silently for two measures” in the equivalent silent interval condition. After testing, they were asked what they did internally during that silent interval. Responses were quite varied, including singing do, singing the notes of the tonic triad, singing the first five notes of the major scale, maintaining internal silence, replaying the chord sequence, trying to predict which note the melody might start on, planning their strategy, counting beats carefully, and “nothing.” This variation in strategies suggests that prior dictation experiences likely guided them to focus on whatever would be most helpful to them for the ensuing task, a finding consistent with Killian and Henry’s (2005) study of preparation for sight-singing.
Research has shown that dictation students employ a number of different strategies to decode pitch and rhythm efficiently and accurately (Mikumo, 1994; Thompson, 2004). It may be helpful to students to explore and define their own strategies (Buonviri, 2014) and to draw connections between what they do during dictation and how they prepare themselves just prior to dictation. For example, for someone who listens for implied harmony while dictating a melody (Povel & Jansen, 2001), rehearsing typical tonal chords, externally or internally, might be helpful. For someone who focuses on metric placement of important landmarks in a dictation (Buonviri, 2014), counting carefully and internalizing the meter and tempo may be important preparations. In other words, actively bringing personally chosen strategies to the forefront of one’s mind just prior to dictation may help to engage those strategies during the task.
Finally, those instructors who prefer to play a dictation immediately following an orientating chord or meter prompt may intend to help students avoid distractions and remain optimally focused. The results of this study suggest that participants performed equally well, statistically, whether the dictation was played directly following the orientating chord sequence or after two measures of intervening silence. It may be that participants were able to effectively eliminate distractions and maintain focus in both cases: in the no interval condition, through lack of time for distractions to arise, and in the silent interval condition, through customized mental focus. Since the only significant difference occurred between the preparatory singing and no interval conditions, it appears that the interval combined with the required singing task was what distracted participants.
Employing a particular routine just prior to taking dictation should aim to prepare students to take dictation purposefully, instead of just getting to the right notes and rhythms one by one. Students need to comprehend what they are hearing, but the successful notation of a dictation melody does not necessarily prove that (Klonoski, 2006). They need to arrive at the correct answer through understanding of the musical “big picture” (Buonviri, 2014), not just by patching together unrelated bits of information. Each student may understand the musical big picture in different ways.
Transfer between visual and aural domains is crucial for teaching and learning any notated music. Critical listening, music reading, sight-reading, error detection, performance, improvisation, composition, and conducting all depend to some extent on fluency across the two domains. Greater understanding of factors affecting performance on aural skills tasks will help teachers make wise decisions about how and when to help students create transfer across tasks and skills.
Recommendations for Further Research
This study compared effects of three preparatory conditions on melodic dictation success. Future research might investigate effects of the same or similar conditions on dictation scores of other populations. For example, high school students enrolled in theory courses may have less experience overall with dictation and singing than college students, providing additional insights into the efficacy of preparatory singing as a singular approach encouraged by their teacher, and the influence of singing ability on the effectiveness of this approach. Since high school theory experience appears to play an important role in college music majors’ theory readiness (Livingston & Ackman, 2003), research comparing dictation skills and strategies across the college transition is crucial. Researchers might also further examine effects of juxtaposing aural skills activities (e.g., singing and error detection) in light of the discussion in this study about possible interference among seemingly “complementary” tasks. Finally, future research might explore the relationship between students’ chosen strategies while taking, and just prior to taking, melodic dictation.
Footnotes
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
