Abstract
Describing the qualities of a singer’s voice is a challenging task, even for experts. Voice pedagogues usually evaluate vocal qualities through auditions, after which they make subjective judgments. As a means of communication, language has limitations in expressing concrete sound sensations, and sometimes such a description can be abstract for the singer. Our work aims to design and validate a new educational approach for vocal assessment and training, using the Mental Images for Singing Education (MISE Method). In addition, we analyze whether this pedagogical methodology favors the improvement of vocal technique and the learning of concepts according to the level of experience of the students and their executive performance. Our results show that the introduction of mental imagery produces beneficial effects in the teaching and learning of vocal techniques, regardless of the initial performance level. This makes the MISE Method an effective assessment and training tool, allowing the formalization of a communication language hitherto used intuitively in the singing classroom. However, our results were inconclusive at the cognitive level, so future research is needed to expand the study of executive processes according to the type of mental images used.
The voice is the main human means of communication. Nonetheless, its description is conditioned by the auditory perception capacity of each individual and their knowledge of the vocal field. The difficulty is greater in the case of the singing voice, where the fusion of oral and musical language increases the variety and breadth of parameters, which must be considered. Singing is a complex phenomenon, in which auditory control over the sound material is essential (Cámara Izaguirre, 2004). Being able to explain in words the quality, characteristics, and technical skills can be abstract for the student, mainly due to the limited capacity of language to express concrete sound sensations. In this sense, Mitchell (2014) states that “to describe a sound, listeners focus on the most easily perceivable technical and visual aspects of the performance, which are simpler to state and generally avoid describing the overall sound of the singer” (p. 187).
Externally, the vocal and technical characteristics of singers can be evaluated through two methods:
Subjective judgments using audition to assess the vocal quality and interpretative capacity of the singer. In the process of learning to sing, the teacher’s clear perception of any changes in the quality of the voice is essential, offering the student technical solutions through enriched feedback that allows them to adapt a teaching program that is suited to their needs (Mitchell, 2014).
Acoustic analyses, which appear to be more reliable (Larrouy-Maestri et al., 2013; Stadler Elmer & Elmer, 2000).
The singer can also self-evaluate through two techniques: self-observation (with mirror) and self-listening (recording). Southcott and Mitchell (2013) confirmed the considerable pedagogical value of high-fidelity digital recordings as a means of feedback for the student singer and as a tool for voice perception.
Various didactic resources can be employed in the teaching–learning process of singing, such as associating sounds with body gestures, usually centered on the hands (Bustos Sánchez, 2003), along with metaphors (Sousa et al., 2010), and visual images. Recently, it has been shown that the use of the latter improves teacher–student communication, student performance, and increases concentration for the interpretation of the musical work (Merzero et al., 2017; Merzero-Moreno et al., 2018). An example of this is shown in the research conducted by Paney and Tharp (2021), in which they tested the positive effect of visual feedback on vocal practice and vocal perception of the potential as a singer through the computer game SingingCoach. The authors explain that visual feedback may improve singing, but this progress stops in the short term if the feedback is removed. Nonetheless, participants’ ratings of their own singing increased significantly over the 10 weeks, probably because the computer game SingingCoach encouraged them to practice more frequently.
Another didactic resource traditionally used in teaching singing is mental imagery (MI), which has been scarcely studied in the specialized literature, but it has a wide corpus of evidence, for example, in sport setting (Lindsay et al., 2021). Since singing is a highly cognitive task, it is valuable for singers to implement imagery in their training, as this improves vocal technique, character development and reduces performance anxiety (DeSantis et al., 2021). Recent research in the field of choral music has also revealed the effectiveness of the use of verbalized imagery by choral directors to communicate singers how to create and change vocal responses (Black, 2022). There is no predetermined program, as each pedagogue employs those MI that they consider useful for the learning process.
Another noteworthy aspect is that musical imagery is multimodal, that is, MI in musicians are not necessarily specific to the motor, somatosensory, auditory or visual aspects of the images, but musicians integrate them all (Lotze, 2013). In this same line, Morales-Villar and Sáez-Zea (2019) have suggested that these images have an added value, since they can generate a more complex sensory experience, both of a visual and proprioceptive or kinesthetic nature. The latter, in addition, have proven to be an effective complement that helps improving instrumental practice (Lotze, 2013).
At the same time, after verbal instruction, MIs are generated by the singer themselves from previous experiences, setting in motion complex cognitive processes that are fundamentally of an executive nature (Tirapu-Ustárroz et al., 2011). This imagery is intended to transmit to the student the physical-acoustic sensations they will perceive when working on certain vocal aspects. The singer’s self-perception of the sound sensations linked to breathing, phonation, emission, articulation, and resonance of the voice are very important aspects for the development and effective progress of vocal technique.
In recent years, there has been considerable interest in studying the effect of music training on cognitive function, that is, improving cognitive ability, especially at the executive level (Meyer et al., 2020). Specifically, Roden and colleagues (2014) have shown a transfer effect of training music to specific executive domains, such as the phonological loop and central executive components of working memory. However, there is no evidence whether the level of executive performance of the singers favors the use of MI in teaching singing.
The main objective of this work is to develop and validate the Mental Images for Singing Education (MISE) method as a pedagogical tool that allows:
The external evaluator to assess five parameters of the singer’s vocal technique (breathing in singing, voice support, sound concentration in phonation, voice flexibility, and resonance).
The singer to have strategies and technical resources that favor their learning process through MI, thus allowing them to self-evaluate and carry out their learning in an autonomous way.
The teacher and student to establish a protocolized communication strategy that favors understanding between them.
In particular, the following specific objectives are addressed:
O1. To quantify improvement in the five aspects of vocal technique considered by applying a scale specifically designed for this method (MISE instrument).
O2. To investigate the effectiveness of the MISE method for improving the five aspects of vocal technique as a function of the singer’s level of experience.
O3. To analyze whether executive functions favor the use of MI in teaching singing and whether there are significant differences according to general cognitive level.
Method
Participants
The sample was composed of 55 singers with three educational levels:
Students of the Degree in Primary Education-University of Jaén, subject: Vocal Interpretation and Creation.
Students of the Master’s in Research and Aesthetic Education: Arts, Music, and Design- University of Jaén, itinerary: Research and teaching musical language in conservatories.
Expert students from the international program of high performance in opera singing “Jaén Ópera Joven” (VI and VII editions).
The undergraduate and master’s degree students were selected using non-random sampling, while the expert students were selected from among 99 applicants using an artistic/training curriculum and audition (audiovisual recording of an opera aria of their choice) by three expert judges. These three judges are recognized professionals with more than 20 years of experience in the artistic and academic field. Expert 1 is a pianist specializing in the accompaniment of lyrical singers, a regular in theaters and festivals of the highest international level, a repertoire and vocal technique coach, and a university doctor. Expert 2 is a speech therapist, singing teacher and a professional operatic singer on an international level. Expert 3 is a singing teacher and lyrical singer, a university professor with a doctoral thesis in singing teaching.
Procedure/instruments
The training process was individual, with a duration of 5 days in each edition. The training consisted of five lyrical interpretation masterclasses and three workshops delivered by experts.
Daily masterclasses (40 min)
These were delivered by an internationally renowned opera repertoire pianist in which opera arias were performed. The teacher made corrections in vocal interpretation, pronunciation, voice projection, and used other stylistic resources. On the last day of training, the teacher assigned each student one of the works that had been used during the course, to be performed in a final public concert.
Workshop on neuropsychological aspects of singing (90–120 min)
In this workshop, an expert neuropsychologist administered a cognitive battery that included the following three widely studied and validated executive tests:
Raven’s Progressive Matrices Test (Raven et al., 1994, 1996). This visual test measures intellectual capacity and general mental ability. We applied the advanced version, the APM (Advanced Progressive Matrices) scale designed to assess people with above-average ability.
Similarities of the WAIS-IV (Wechsler Adult Intelligence Scale-IV) (Wechsler, 2008/2012a, 2008/2012b). This test assesses verbal comprehension, specifically the ability to express relationships between two concepts, associative thinking, and verbal abstraction ability.
Wisconsin Card Sorting Test (WSCT) (Heaton et al., 1993). This test evaluates the capacity for abstract reasoning, concept formation, and the switching of cognitive strategies in response to changes in environmental contingencies.
Workshop on body awareness in lyrical singing (60–80 min)
This was conducted by two physiotherapists specialized in artistic posturology.
Group workshop on career management for opera singers (120 min)
This was delivered by a specialist critic from an international opera magazine.
All study participants were individually administered with the MISE method in the presence of experts. The objectives of the method are as follows:
To allow the expert to evaluate (through specified criteria) the characteristics and technical resources of the singers.
To help generate feedback between the pedagogue and the singer and awareness and self-perception of their voice.
To provide, through MI, strategies that the singing student or singer can use to favor the improvement of their learning process.
To provide the singing teacher with didactic tools and resources that they can use while training the voice of their students.
This method works on five specific vocal aspects that belong to the following basic areas of vocal technique (Table 1).
Aspects of Vocal Technique that are Trained and Evaluated with the MISE Method.
Note. MISE = Mental Images for Singing Education.
Administration of the method consisted of the following seven phases:
Obtaining prior information about the participant concerning their official or private musical training, specifically their singing studies, level of training, and professional experience as a soloist or in vocal ensembles (choir).
Execution of the vocal exercise by the singer, related to each technical aspect proposed by the expert judges (Pre-Test).
Double evaluation of the level of execution of the vocal exercise using the MISE instrument, designed using a Likert-type scale from 0 to 5:
(a) On behalf of the singer. Copy for the participant (Appendix A in Supplementary Materials online)
(b) On the part of the expert judge. Expert evaluator’s copy (Appendix B in Supplementary Materials online)
Previously, the researchers who designed MISE trained the three expert judges on the Likert-type scale, who independently and simultaneously evaluated all the participants.
MI elicitation associated with the vocal exercise: an expert judge gives a verbal command to the learner.
Repetition of the vocal exercise by the singer (post-test).
Second assessment of the performance level of the vocal exercise using the MISE instrument.
(a) On behalf of the singer. Copy for the participant (Appendix A in Supplementary Materials online)
(b) On the part of the expert. Expert evaluator’s copy (Appendix B in Supplementary Materials online)
Singer’s self-assessment of the changes experienced while using MI through a brief written description.
This process was repeated for each vocal exercise with an estimated administration time of 20–30 min.
Figure 1 describes the vocal exercises and the MIs associated with each aspect, which fall into different sensory (visual and tactile) and motor categories.

MI and Vocal Exercises Included in the MISE Method.
Ethical issues
The purpose of the research was explained to all participants, after which they were asked to give their verbal informed consent to take part in the study and for the treatment of the data, following current legislation on personal data protection. The data were stored in an anonymous database, with the assignment of a registration number so that it was not possible to identify the participants. All data, both in physical and digital format, were kept by the research team. Furthermore, the research has followed the Declaration of Helsinki (World Medical Association, 2013) in its design and implementation.
Results
Evaluation rubric
The MISE methodology was validated in terms of content validity (American Educational Research Association et al., 1999; Eignor, 2013). An evaluation rubric was designed to ascertain the content validity, representativeness, and appropriateness of the MIs.
The quality and consistency of the questionnaire were rated very positively by seven professionals/experts, with mean scores exceeding 3.57 points and standard deviations of less than one point. Thus, the overall quality reached 4.62 points, and the adequacy of the MIs for the aspects studied was 4.45 points (Table 2).
Evaluation Rubric used by Professionals/Experts.
Sample profile
The sample consisted of 55 singers who were Bachelor’s (20%), Master’s (36%), or Expert (44%) students with an average age of 27.3 years (age range, 21–40 years).
In the sample, all the participants have studied music, 19 (34.5%) have studied singing (official or not); 13 (23.6%) have completed official singing studies and 40 (72.7%) have obtained an official music degree in another instrument (other than singing) and 53 (96.3%) reported playing an instrument.
Sixty percent were women (28 sopranos and 5 mezzo-sopranos) and 40% were men (10 tenors, 9 baritones, and 3 basses) of various nationalities, and all were members of choirs/vocal ensembles. In addition, the group of experts had extensive professional experience as soloists (Table 3).
Professional Experience.
MISE instrument
We studied the validity and reliability of singing assessment. As we expect that MISE method provides a significant improvement, we have tested whether self-improvement and improvement scores match between evaluators. We calculated the interclass correlation coefficients (ICC-absolute agreement) to determine the inter-rater reliability. This also allowed us to explore the credibility and validity of the self-evaluation by comparing the agreement with the expert judges’ evaluations.
ICC output values above .7 suggested an acceptable agreement among participants and expert judges, indicating than evaluators provided close ratings, except for breathing category with ICC reaching .695, near to this threshold (Table 4).
Interclass Correlation Coefficients of Improvement by the MISE.
Note. MISE = Mental Images for Singing Education; ICC = intraclass correlational coefficient.
Next, we analyzed the scores achieved by the participants and the evaluations made by the expert judges.
Concerning subjective self-perception/evaluation, the scores obtained by the sample of singers participating in the study are shown in Tables 6 and 7. The singers showed initial mean scores below 2 points in all aspects except for breathing. However, after the use/introduction of the MISE, the mean scores were around 3 points and reached 3.70 points for Resonance (Table 5).
Singer’s Perception Before and After Introducing the MISE.
Note. MISE = Mental Images for Singing Education.
Concerning the objective scores given by the expert judges proposing and evaluating the task, the mean scores were slightly lower (Table 6).
Expert Judges’ Assessment Before and After Introducing the Mental Images.
Perceptions given in the Singer’s self-assessment of the changes experienced while using MI through a brief written description, were analyzed. The responses were subjected to a categorization process (Zhang et al., 2014). To unify and standardize the information, similarity measures were used, according to criteria of lexical proximity and/or synonymy. Perceptions were re-stated in one of the following forms: Improved physical sensation; Improved sound; Overall improvement; No improvement; Other. Results mainly showed a notable improvement in physical sensation and sound, although in support and flexibility 21.2% declared that they did not feel improvement after the introduction of the MI (Table 7).
Singer’s Perceptions.
Improvement
To determine the influence of an individual training session with MISE on improving the five proposed vocal aspects (O1), we considered subjective and objective dimensions, that is, the levels of self-perceived improvement made by the individual and the levels of improvement considered by the expert judges.
When the MIs were introduced, the levels of self-perceived improvement reached mean values between 1.11 and 1.95 points. The expert judges assessed these levels of improvement, awarding mean values ranging between 1.16 and 2.02, showing similar variability in their criteria among categories (Table 8).
Overall Descriptive Results of the Improvement Obtained: Subjective and Objective Dimensions.
The distribution of “improvement” was considered normal (Kolmogorov–Smirnov test, p > .05), and we used parametric methods for comparisons. In all cases, there were significant differences in the mean values of the pre- and post-test in favor of the scores after the MISE (t-Student, p < .05).
We studied these differences in detail and found that self-perceived improvement and improvement perceived by the expert judges were significantly higher than one point (t-Student Ho: µ ⩽ 1, p < .05), except in 1- Breathing given by expert judges, where no significant evidence of improvement above than one point was found (p = .14 > .05).
There were no significant sex differences in the self-perceived improvement reported by the participants (Mann–Whitney U test, p > .05).
Concerning levels of expertise (O2), there were significant differences in self-perceived improvement in Breathing and Resonance (Kruskal–Wallis test, p < .05), with the source of this difference being the group of expert participants (Figure 2).

Self-Perceived Improvement: Mean Levels.
Finally, to give response to the O3 objective, first we analyzed whether the level of executive functions favored the use of MI in the teaching of singing, and second, we assessed whether there were significant differences according to the level of cognitive capacity, for the group of expert students.
The results obtained through the MISE instrument revealed self-perceived improvement levels above 1.5 points (Table 9). It should be noted that, after the use/introduction of the MI, the students reported mean levels of self-perceived improvement in Concentration and Resonance, with scores of 1.92 and 2.50 points, respectively. Thus, the expert judges perceived the improvement with greater homogeneity.
Self-Perceived and Objective Improvement Scores in the Expert Group of Students.
After the analysis and correction of the three neuropsychological instruments/tests, the main descriptive measures obtained by the participants were calculated (Table 1A—Appendix C in Supplementary Materials online), along with the correlation between the neuropsychological variables of the Wisconsin test that were of interest for our study (Table 2A—Appendix C in Supplementary Materials online). Again, no statistically significant inter-questionnaire correlations were found, confirming that they evaluate executive processes of a different nature.
According to Raven’s test scores, capacity was enhanced in 50% of the participants, although no significant differences in improvement as a function of capacity were found (Mann–Whitney U test, p > .05).
Next, the correlation was analyzed between various executive aspects, measured through neuropsychological tests, and self-perceived improvement in “singing,” quantified through the MISE instrument (Table 10).
Correlations (with p-Values in Parentheses Below) Between the Neuropsychological Variables Considered and Self-Perceived Improvement in the Five Aspects of the Singing Voice.
Note. DS = direct score; WAIS-III = Wechsler Adult Intelligence Scale-III; WCST = Wisconsin Card Sorting Test.
p < .05 level.
A moderate negative correlation was found between WAIS-IV Similarities and improvement in Breathing, Concentration, and Flexibility. The presence of negative correlations indicates that, the lower the score on the WAIS Similarities test, the greater the perceived improvement with the use of MI. Overall, we found no significant correlations between the various parameters considered in the Wisconsin Test (Table 10).
Discussion
After analyzing the results, we observed that the MISE instrument has high content validity. The evaluation rubric used by professional experts indicates that our instrument has outstanding quality and reliability. We have confirmed that the instrument provides an adequate evaluation of the five vocal aspects considered, being especially useful for the items related to concentration, flexibility, and resonance. In addition, the experts rated very positively the adequacy of the MIs for the aspects studied.
The sample, although limited in number, is highly representative. It is characterized by a high level of education, musical training, and extensive vocal experience in choral ensembles. The sample represents the various maturing processes of the singer’s voice (wide age range), the different technical levels of vocal performance (from people with no formal education and little experience in singing to experts with high levels of training and experience as professional soloists), and the main vocal types for both sexes (although this was not homogeneous, reflecting real-life circumstances with a greater frequency of sopranos and tenors compared with bass voices).
The MISE method, using MI, provides a reliable and consistent measure of overall vocal quality by analyzing, through student self-perception, the main aspects of the singing voice: breathing, support, phonation, flexibility, and resonance. This makes it an innovative strategy for communication between the teacher and the singer. This is also the first didactic methodology with a dual utility. On one hand, it provides the teacher with an objective tool to evaluate and train the singing voice, and, on the other hand, it allows the students to carry out self-evaluation and self-management of their learning.
Our results showed that, following administration of MI, there was a significant improvement of more than 1 point in the self-perceived ratings of the singers in the five aspects evaluated by the MISE method, particularly resonance. Moreover, a reasonable level of improvement was also observed in terms of the objective ratings given by the expert judges.
Currently, there is little research on the application of MI in singers. Keller (2012) analyzed the role of mental images in musical performance, although not specifically for singing, an area studied in depth by DeSantis et al. (2021). Also, it is revealing the recent research by DeSantis et al. (2022), that determined that professional singers do not use imagery more than singing students. However, the authors conclude that women singers can use images for characterization to a greater extent than men.
Merzero-Moreno et al. (2018) proposed the use of visual imagery as a didactic resource in the singing classroom and demonstrated that these can improve the quality of the teaching–learning process. We consider that MIs have an added value. Visual images require only the perception and recognition of the image. At the same time, MIs, after verbal instruction, are generated by the students themselves from previous experiences, setting in motion complex cognitive processes, mainly of the executive type. In addition, generating an MI may evoke a visual experience along with a proprioceptive or kinesthetic one, thus creating a more complex sensory experience.
Concerning the neuropsychological results, after analyzing the Raven’s Progressive Matrices Test-APM scale, we found, as expected, that cognitive capacity was outstanding in 50% of the participants. However, we did not find significant differences in the improvement produced in singing when using MI according to this capacity.
We have observed a moderate negative correlation between the scores obtained on the WAIS-IV battery Similarities subtask and self-perceived improvement in the Concentration and Flexibility (MISE) aspects. Those students who, after the administration of MI, improved their vocal performance were those who obtained lower scores on this test of verbal reasoning and abstraction. We postulate, therefore, that these results could be because both tasks involve the implementation of cognitive processes of a substantially different nature; the creation of MIs is a visual process, while the performance of concept abstraction in the WAIS-IV subtask is verbal.
Globally, we also found no significant correlations between the various neuropsychological variables analyzed using the WSCT and self-perceived improvement (MISE).
Conclusion
It is evident that the introduction of MI through the MISE method produces beneficial effects in teaching and learning vocal techniques, regardless of the starting level of performance. In addition, this method provides a great deal of relevant information on aspects of vocal technique, is easy to administer, and requires relatively little time to complete (20–30 min). All this makes it an effective educational resource, formalizing a language of communication that until now has been used intuitively in the singing classroom, facilitating both the teaching process, and enabling the student’s self-learning.
One of the main limitations of our study is the small sample size. This is due, mainly, to the fact that the expert students had to have a high qualification at the vocal level. In addition, in this group the selection of participants was not done randomly but using an artistic/training curriculum and audition (audiovisual recording of an opera aria of their choice) by three expert judges. We are aware that the Likert-type rating scale has a strong subjective component. To solve this possible problem, we analyzed the degree of concordance between the scores awarded by expert judges and participants, since our study aimed to quantify the degree of improvement in the five technical parameters proposed, rather than assess the level achieved.
In future research, it is intended to apply the MISE method and continue evaluating its effectiveness by expanding the study sample, as well as replicating this research with professionals of the non-singing voice and vocal rehabilitation to study the improvement that the MISE method produces in these cases. In addition, given that from a neuropsychological standpoint, the results were inconclusive and we recommend increasing the battery of neuropsychological tests administered to assess other cognitive processes of an executive nature. In addition, it would be of special interest to extend the study to analyze the perceptions declared by the participants, applying qualitative methods (content analysis) and network visualization to display the relationships among them.
Supplemental Material
sj-docx-1-pom-10.1177_03057356221146813 – Supplemental material for The MISE method: A new communicative approach for evaluating and training singing through mental imagery—Executive implications
Supplemental material, sj-docx-1-pom-10.1177_03057356221146813 for The MISE method: A new communicative approach for evaluating and training singing through mental imagery—Executive implications by Mª del Coral Morales-Villar, Carmen Sáez-Zea, Rosaura Fernández-Pascual, Francisco José Comino-Crespo and Mercedes Vélez-Toral in Psychology of Music
Footnotes
Acknowledgements
The authors thank all the students and experts for voluntarily participating in this study.
Author contributions
Conceptualization: M.d.C.M.-V., C.S.-Z., and F.J.C.-C.; Formal Analysis: R.F.-P.; Methodology: R.F.-P.; Supervision: M.d.C.M.-V., C.S.-Z., and M.V.-T.; Writing—original draft: M.d.C.M.-V., C.S.-Z., R.F.-P., and F.J.C.-C.; Writing—review & editing: M.d.C.M.-V., C.S.-Z., R.F.-P., and M.V.-T. All authors have read and agreed to the published version of the manuscript.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
