Abstract
As the field of evaluation continues to expand, so do evaluation course offerings. However, little is known about the backgrounds of evaluation educators, how they go about planning their courses, challenges experienced, or how they attain competency as teachers. This study offers an initial understanding of novice evaluation educators’ experience in developing and teaching introduction to evaluation courses within a university setting. Factors that influence decisions around course content and pedagogical approaches are also examined. Data collection methods include dialogic conversation notes, reflective journals, course syllabi, and student evaluative feedback. Results indicate that perspectives on the purpose of evaluation and how evaluation should be taught are influenced by evaluation educator professional affiliation, experience with evaluation, and student audience. Implications for teaching and training in evaluation are discussed.
Overall accountability demands have spurred the significant expansion of the field of evaluation (Chouinard, 2013). In recent years, the need for evaluation as well as qualified program evaluators has increased markedly given global commitments to accountability in education, health, and science fields (National Academies Press, 2005, 2010). The need for evaluation and qualified program evaluators has also led to growth in regional, national, and international evaluation societies around the world (Rugh & Segone, 2013). An overwhelming majority of evaluators “stumble” into the field of evaluation (Christie, 2003), having begun their professional career journey in fields such as psychology, public health, sociology, and education. As such, a considerable portion of evaluation practitioners do not hold advanced degrees in evaluation and are motivated to seek formal training in order to expand their knowledge and competencies (Christie, Quiñones, & Fierro, 2014). While evaluation is becoming a distinct area of study, with over 48 university-based training programs currently offered (LaVelle & Donaldson, 2010), professional development sessions offered by universities, evaluation societies, and private entities have become increasingly popular as well.
The beauty and challenge of the field’s exponential growth lies in the notion that essentially anyone can be an evaluator. While many evaluation societies, including the American Evaluation Association (AEA; 2017), are adopting statements on necessary evaluator competencies, there are no standard requirements of competency attainment through training or practice to enter the field (Imas, 2010; Stevahn, King, Ghere, & Minnema, 2005). Further, scholarship on the teaching and training of evaluation continues to garner significant traction. Existing research on training in evaluation has focused on the number and content of training programs (Davies & MacKay, 2014; LaVelle & Donaldson, 2010), evaluator course taking and professional practice (Christie et al., 2014), and novice evaluators’ fieldwork experiences (Chouinard et al., 2017; Gredler & Johnson, 2001; Hurley, Renger, & Brunk, 2005). Prior work on teaching in evaluation studies has examined general teaching techniques (Conner, 2001; Oliver, Casiraghi, Henderson, Brooks, & Mulsow, 2008), the use of role playing (Alkin & Christie, 2002; Lee, Wallace, & Alkin, 2007), the need for long-term student evaluation projects (Trevisan, 2002), and techniques for designing evaluation courses and practicums (Levin-Rozalis & Rosenstein, 2003; Morris, 1994).
While there has been a surge in empirical effort to understand how, where, and what is being taught within evaluation courses, less is known about who is teaching, for what purposes, and how. To that end, this article answers the call for more empirical research based on systematic reflection for the purpose of disseminating knowledge gained (Becker & Renger, 2017; Mark, 2008). The purpose of this article is to explore the varied experiences of two novice evaluation educators developing and implementing two separate introduction to evaluation courses. Specifically, this comparative case study includes dialogic conversation notes, course syllabi, reflective journals, and course evaluation reviews as data sources. The following research questions guide this inquiry: (a) How and in what ways did novice evaluation educators make meaning of the development and implementation of introductory evaluation courses? (b) what factors influenced the decisions and pedagogical approaches associated with course design and implementation? and (c) to what extent did educational background, perspectives on evaluation, and research interests have an impact on the novice evaluation educator experiences? In the following sections, we offer a review of the literature on teaching and training in evaluation, thereby underscoring the importance of understanding the factors that influence the decisions and pedagogical approaches associated with course design. Next, we describe the study’s research methodology. The remainder of this article discusses the five themes that emerged during data analysis, a discussion about evaluation teaching and training, and implications for novice and expert teachers of evaluation alike.
Training and Teaching in Evaluation Literature
The most recent literature on evaluation training has focused on categorization and classification of evaluation course offerings, courses taken by evaluators, and training needs of evaluators. For instance, Lavelle and Donaldson (2010) provided evidence for what they purport is a dramatic increase in the number of evaluation trainings within university settings. They also found that while most universities only offer two to three specific evaluation courses, novice evaluators also take a range of quantitative, qualitative, and mixed methodological courses. Christie, Quiñones, and Fierro (2014) have categorized evaluator course-taking patterns into four distinct types of classes including quantitative, evaluation/qualitative, general research/quantitative, and general research. Within this investigation, the researchers revealed that very few evaluators have taken evaluation theory courses or advanced qualitative or advanced quantitative methods courses. An examination of topics covered in introductory evaluation courses affirmed that instructors spent the most time on evaluation approaches, practical issues of conducting evaluations, and evaluation planning (Davies & MacKay, 2014). Less than an hour or no time during courses were spent on the history of evaluation, metaevaluation, and contracts and budgeting. When evaluators were asked to identify competency areas that warranted their additional training, Galport and Azzam (2016) noted that 75% listed metaevaluation and 50% listed quantitative methods. These results are not surprising considering that so little time is spent on metaevaluation in introductory courses and that few evaluators take advanced quantitative methods courses. As a field, we should continue to explore the implications of the lack of variety of courses evaluators take prior to entering the profession.
Since 2004, evaluation scholars have reflected on and made suggestions about evaluation course pedagogy, values, and content. Oliver, Casiraghi, Henderson, Brooks, and Mulsow (2008) argued for the need to discuss pedagogy within program evaluation courses. They concluded that pedagogical choices such as cooperative learning, active learning, and self-directed learning were useful strategies to guide course activities to ensure learning outcomes are met. Thomas and Madison (2010) asserted that social justice should be included as a fundamental value within evaluation courses and field experiences. We agree with their argument that “evaluation students also must be inspired to challenge the status quo, to care about the interests of the disadvantaged, and to uncover weaknesses within the system that contribute to inequities within society” (Thomas & Madison, 2010, p. 571). Morris (1994) called for the need for evaluation educators to focus on teaching future evaluation practitioners both ethical knowledge and ethical practice. He urged instructors to employ multiple strategies that include providing tangible examples, having students keep reflective journals, and attempting to instill moral courage when discussing these issues. While courses are important ways for students to begin their training in evaluation, scholars have long recommended that evaluation students receive hands-on, practical training experiences (Hargreaves, Attkisson, Horowitz, & Sorenson, 1978).
Fieldwork experiences are often incorporated into graduate-level training in evaluation (Darabi, 2002; Trevisan, 2004). While adapting classroom knowledge or theory to the worksites/practice can be challenging for novice evaluators (Boyce & Chouinard, 2017; Chouinard & Boyce, 2017; Hurley et al., 2005), practical experiences for evaluation students including simulations, single course projects, practicums, and role playing (Trevisan, 2004) can have the potential to positively impact evaluation student learning. Long-term evaluation project participation is likely to have the most impact on the development of evaluation abilities (Gredler & Johnson, 2001; Trevisan, 2002). Three key components to successful fieldwork experiences are close communication between students and instructors regarding fieldwork projects (Gredler & Johnson, 2001; Levin-Rozalis & Rosenstein, 2003); the establishment of clear time lines and course expectations; and the development of trust, credibility, and project expectations with clients/agencies (Hurley et al., 2005).
While there have been some studies of the experience of university instructors (Endrizzi, 2011), there are no studies within the literature that examine the experiences of evaluation educators or factors that influence the decisions and pedagogical approaches associated with course design. Scant attention has been paid to backgrounds of evaluation educators, how they go about planning their courses, or challenges experienced. To our knowledge, nothing is known about the experiences of novice educators of evaluation. The current study is pertinent as the field continues to expand and professionalize because more evaluation educators, with varying backgrounds and perspectives on evaluation, will be called upon to teach evaluation courses and training seminars.
Method
Study Design
A comparative case study research design was employed to assess similarities, differences, and patterns across two or more cases (Campbell, 2010; Stake, 2006). Merriam (1998) defines case study as “an intensive, holistic description and analysis of a bounded phenomenon such as a program, an institution, a person, a process, or a social unit” (p. xiii). The two cases in this study were of our experiences in developing and implementing two evaluation courses, each by a different faculty member. We employed a constructivist epistemological approach, thus framing the construction of reality through meanings created both socially and experientially (Merriam, 1998). We developed our own understandings and assigned meanings across multiple sources of evidence. This research design was appropriate as it allowed data to be collected about each course; the design also allowed us to use our own experiences and reflections to assemble in-depth personal views. Doing so helped us better understand the larger issue of novice evaluation educators’ experiences with introduction to evaluation courses and factors that influence course design, implementation, and evaluation. Our ultimate goal for this research was to present case study findings in a reflective narrative format as they are useful when describing lessons learned for the improvement of practice (Becker & Renger, 2017).
Cousins and Chouinard (2012) described the emergence of reflective case narratives as an increasingly prevalent mode of empirical research on evaluation. Unfortunately, lack of attention to narrative structure, such as unclear prioritization, excessive length, and loose organization, often undermines the potentially rich insights a case study can offer (Brandon, 2014); this is especially true of reflective case narratives (Becker & Renger, 2017). As such, scholars have sought to establish clearer guidelines for the organization of reflective case narratives within the field of evaluation (Becker & Renger, 2017; Brandon, 2014; Cousins & Chouinard, 2012). To that end, we used tenets from Becker and Renger’s (2017) framework for structuring the remaining components of our methods section including (a) background and context; (b) motivation, scope, and review of subjectivity; and (c) study strategy, data collection methods, and analysis. Their framework also guided the Lessons Learned section of this article.
Background and Context
This study began as we both accepted and commenced with our tenure-track faculty positions in fall 2015, at the University of North Carolina at Greensboro (UNCG), a midsize public doctoral university with an R2 Carnegie Classification (Carnegie Classification of Institutions of Higher Education, 2016). During an informal conversation at the UNCG new faculty orientation program, we each acknowledged that we were teaching introductory evaluation courses in separate departments to different graduate student audiences within the School of Education (SOE). This was the first time either of us had taught an introduction to evaluation course. Upon this realization, we started sharing ideas about how we planned to approach our respective courses. Furthermore, we constructed a scholarly space where we openly shared research interests, teaching ideas, and professional aspirations. We then reviewed literature on teaching and training in evaluation and did not see our experiences reflected in these scholarly conversations. Consequently, as newer scholars in the academy, we viewed this discovery as a unique opportunity to fill this gap and collaboratively designed a research study to explore how we made meaning and factors that influenced the design and implementation of our respective introductory evaluation courses.
UNCG
The UNCG was originally established in 1891 as the first public college for women in North Carolina. The first African-American students were admitted in 1956 and the university became coeducational in 1962. Women comprise 67% of the student body with ethnic minority students making up 40% of the undergraduate and 36.5% of the graduate student body (UNCG, 2017). The UNCG is the most racially diverse of the UNC system’s historically White campuses and is designated as a minority serving institution. The UNCG SOE’s six departments, Counseling and Educational Development, Educational Leadership and Cultural Foundations, Educational Research Methodology (ERM), Library and Information Studies, Specialized Education Services, and Teacher Education and Higher Education (TEHE), are committed to advancing access to life’s opportunities by providing transformative learning, leading innovation and discovery, engaging communities, and promoting equity and diversity (UNCG SOE, 2017).
ERM 642: Evaluation of educational programs
Ayesha Boyce is an assistant professor in ERM. Over the course of 5 years, program evaluation has gained significant traction within UNCG’s SOE, particularly within the ERM department. Boyce is the second tenure-track assistant professor hired within the department whose research agenda has a focus on evaluation methodology. She was hired to work with ERM colleagues to build a program evaluation concentration for the department’s MS and PhD degree programs.
In fall 2015, Boyce taught Evaluation of Educational Programs which is housed in the ERM department. Graduate students within the SOE and from departments across the university enroll in this service course. For students enrolled in the Educational Research, Measurement, and Evaluation MS and PhD programs, this course is also the first in a sequence of evaluation and research methodology courses. The purpose of the course was to introduce students to program evaluation, evaluation theories, and overview the main commonplaces (Greene, Boyce, & Ahn, 2011) or components of an evaluation. A total of 25 students were enrolled in the course. For all but one student, this course was truly their introduction to program evaluation. Of the 25 students, there were 13 White women, 3 Black women, 4 White men, 2 Black men, 2 international women, and 1 international man. Thirteen students were from the ERM department, three students were from TEHE, three students were from Specialized Education Services, and two students were from Counseling and Educational Development. Four students were not formally enrolled in a UNCG department. Countries of origin represented were Nigeria, Saudi Arabia, Guyana, Bosnia, Turkey, and the United States. Both master’s and doctoral-level students were represented. In addition to articles, the required books for the course were: Alkin’s (2011) Evaluation essentials: From A to Z and Alkin’s (2013) Evaluation roots: A Wider Perspective of Theorists’ Views and Influences, Second edition. Course assignments included a reflective paper, a position paper about a controversial topic within the field of evaluation, an interview with a current evaluator about his or her theoretical and practical leanings, and a group presentation on an assigned data collection methodology. The final project was to complete an evaluation plan and presentation based on a fictional request for proposal (REP) and was completed in groups of four.
Higher education 663: Planning and evaluating programs in higher education
Brian McGowan is an assistant professor in TEHE. In fall 2015, he taught a course titled, Planning and Evaluating Programs in Higher Education, which is housed in the TEHE department. This course fulfills a core requirement for masters’ students in the student affairs administration and higher education (SAAHE) program. The purpose of the course was to provide students with choices available for and processes involved in conducting program evaluation by providing them with hands-on experience conducting a small-scale evaluation study relevant to their interests. In 2015, two professional associations, American College Personnel Association (ACPA)-College Student Educators International and NASPA-Student Affairs Administrators in Higher Education revised their professional competency areas for student affairs educators (ACPA/NASPA, 2015). One of the competency areas focuses on assessment, evaluation, and research, and this course was designed based on the foundational outcomes within this competency area. Twenty-eight students in the SAAHE program were enrolled in the course. With the exception of three students, this course was their first experience with program evaluation in any capacity. Of the 28 students, there were 5 White men, 14 White women, and 9 Black women. In addition to articles, the required book for the course was: Boulmetis and Dutwin’s (2011) The ABCs of Evaluation: Timeless Techniques for Program and Project Managers. Course assignments included a reflective paper, an ongoing reflective evaluation journal, and a small-scale evaluation study that included a prospectus, proposal, class presentation, and a final report.
Motivation, Scope, and Review of Subjectivity
Within the context of reflective case narratives, evaluators are encouraged to be attentive to the underlying impetus for the study, to account for their own subjectivities, and be attentive to how organizations and memberships can shape professional ideologies (Becker & Renger, 2017). We both employ similar philosophies to teaching and motivations for entering the academy. We are both passionate about issues related to diversity, equity, and access, as well as developing a climate of inclusion of historically marginalized populations within contexts of evaluation and higher education. Collectively, being in positions to assist the creation of change at an individual, institutional, and systemic level factored into our decisions to pursue the professoriat. Furthermore, given both of our field’s continued expansion into racially, ethnically, linguistically, economically, politically, and culturally diverse settings, we both believe that it is essential that educators and methodologists alike have a foundation in program evaluation techniques. We both have a social justice orientation that is reflected in the courses we teach and our research interests. While we share similarities and overarching professional beliefs, our experiences, education, and scholarship are markedly unique. We offer a brief overview below.
Boyce earned her PhD in educational psychology with an evaluation specialization from the University of Illinois at Urbana–Champaign. She is committed to developing the technical skills students need for evaluation practice, while also encouraging the adoption of a critical lens (Everitt, 1996; Fay, 1987) and attention to social justice (Thomas & Madison, 2010). She decided to pursue an academic career because she is committed to research on evaluation and to the teaching of the next generation of evaluators. Her hope is that students who complete her courses will have a strong methodological foundation and conduct evaluations that are based in democratic principles; engage with values; and promote equity, fairness, and diversity. She has over 10 years of professional evaluation experience and identifies herself as an evaluator who also conducts research and teaches. She is an active member of AEA.
McGowan earned his PhD in higher education administration from Indiana University and completed some coursework in evaluation during his doctoral training. His motivation to pursue an academic career stemmed from his prior administrative experiences in higher education coupled with research and teaching opportunities acquired in his doctoral program. His interest in teaching an evaluation course in his higher education program stemmed from a doctoral course on program evaluation where he conducted an evaluation of a mentoring program serving students of color. He identifies as a higher education researcher who has conducted evaluations and is not a member of AEA.
Given our understanding of our respective fields, we believe that our portfolios are likely similar to others who have taught introduction to evaluation for the first time. Some will be located with educational psychology, educational research, or other departments within an SOE and will have considerable experience practically and theoretically with program evaluation, the field of evaluation, and their primary allegiance and professional identification will be rooted in an evaluation society. Others will be housed within higher education, economics, psychology, or public health departments and will have affiliations first to their field and corresponding organizations and second to program evaluation. As previously mentioned, little is known about who is teaching introduction to evaluation courses and for what purposes. Moreover, to our knowledge, virtually nothing is known about populations unaffiliated with AEA that teach these courses. Our motivation for this study was to gain an in-depth understanding of similarities and differences in our experiences, thought process, and pedagogies, as well as the content and implementation of our introduction to evaluation courses.
Study Strategy, Data Collection, and Analysis
Data collection methods
Course syllabi, dialogic conversation notes, reflective journals, and course evaluations were collected to improve interpretability and meaningfulness of our novice evaluation educator experience. This approach was used to capitalize on strengths and counteracting biases in each of these data sources (Greene, Caracelli, & Graham, 1989). Data collection took place over the course of 6 months, August 2015–March 2016. Authors were the sole data collectors and reviewers. A description of each data collection method is included below.
Course syllabi
In August 2015, we began the study by sharing syllabi with one another to gain a sense of the content we were planning to cover in our courses. Both syllabi included standard content such as class format; course objectives and expectations; grading criteria; assignments; and statements about academic integrity, civility, and special needs.
Reflective journals
In September 2015, we began reflective journals. Skolits, Morrow, and Burr (2009) have suggested that “a meaningful strategy for self-reflection would be a systematic review of each activity of the evaluation, the demands that resulted, and the effectiveness of the associated role responses” (p. 291). To make the reflections as systematic and useful as possible, we wrote in our individual reflective journal once a week throughout the duration of the fall 2015 semester and specifically reflected upon each weekly class. Prompts were used with the following purposes to (1) gather information throughout the semester relevant to how the course was planned, designed, implemented, and evaluated with particular attention to logistics, pedagogy, and strategies; (2) capture information related to challenges and triumphs during the course; and (3) capture the experience of implementing an introduction to evaluation course for the first time from the instructor’s perspective. We also wrote final reflections in March 2016 that included us making suggestions for the next iterations of our courses.
Dialogic conversation notes
Also beginning in September 2015, we began our weekly dialogic conversations. During these conversations, we were able to voice our experiences, share resources, provide sources of support, express concerns, and offer constructive feedback as we prepared for the following week. For instance, there were a few topics where McGowan would share his lesson plans with Boyce for feedback prior to teaching that week. There were also moments where we cocreated and shared potential classroom engagement activities and would modify them to fit our respective audiences.
Course evaluations
In February 2016, we shared our feedback from student course evaluations with one another. These data allowed us to compare student perceptions of the course, course activities, and learning outcomes with our own and to develop an understanding of the quality of the courses.
Data analysis
We analyzed our respective reflective journals, dialogic conversation notes, and course evaluation feedback throughout the data analysis process. In case study research, data collection and analysis often happen simultaneously (Merriam, 1998). Therefore, we were purposeful in beginning data analysis while data collection was still underway. For instance, throughout the semester, we independently coded notes from the dialogic conversations before completion of data collection. We then came together to discuss our themes for the notes. Furthermore, we planned future dialogic conversations according to what we found in previous conversations. For example, our weekly dialogic sessions served as an opportunity for us to write memos, connect our experiences back to the literature, and begin noticing patterns across our data. However, we did not begin data analysis in earnest until summer 2016.
Initial independent analysis
Throughout the month of June 2016, we reviewed all individual data sets (course syllabi, reflective journals, and course evaluations) separately. We utilized Ryan and Bernard’s (2003) article as a guide as we searched for themes. We began by independently checking the dialogic conversation notes for accuracy and met briefly to discuss and summarize two dialogic conversation meetings where the notes were lacking. We then individually checked our own structured reflections and syllabi for accuracy in dates, events, referenced time frames, and other markers of origins. After accuracy checks, we manually analyzed each set of data individually.
This study also used analysis techniques that are consistent with case analysis research, specifically, searching for patterns, constructing themes, and identifying frequencies within the data (Stake, 1995). Stake (1995) notes “two strategic ways that researchers reach new meanings about cases are through direct interpretation of the individual instance and through aggregation of instances until something can be said about them as a class” (p. 74). Data were examined and searched for certain words, phrases, events, and themes that stood out. This was a three-step process to (a) search through data for regularities and patterns, as well as topics represented in the data, (b) write down topics and phrases to categorize these patterns/themes, and then (c) search for disconfirming data. Additional information about the coding and analysis process can be found within the respective sections below.
Reflective journal themes
Following the individual data set analyses for accuracy, all themes across all data sets were reviewed for our cases individually. We each began with the weekly reflective narratives because they served as our primary data source. We individually reread them line by line to generate initial codes. For Boyce, codes such as “running out of time,” “over planning,” “too much time on activities,” and “need to find other ways to fill class time” were themed as time constraints and course content. For McGowan, codes such as “following up with a peer about potential activities,” “contacting peers to guest lecture,” and “appreciating unsolicited advice” were themed as usefulness of informal/peer support. Codes from our reflective narratives were compared and recurring patterns within the data were sought out, which led to initial themes. Boyce’s themes from her reflective narrative were: need for formal support, anxiety about course implementation, time constraints and course content, and concerns about course management and peer review. McGowan’s themes from his reflective narrative were: usefulness of informal/peer support, concerns about lack of experience with evaluation, evolution of course content throughout the semester, formal mentoring, and semester time line and higher education student needs.
Dialogic conversation notes
We met to analyze and discuss the conversation notes together. After the discussion, we came to consensus on the following themes for the dialogic conversation notes: decisions around the amount of reading and reading content, reflections on syllabi changes during future iterations of the course, appropriateness of course texts and assignments, working through teaching and classroom management challenges, and aligning content to our respective field’s competencies and standards.
Course evaluation themes
We individually came up with themes for our own course evaluations. Boyce’s course evaluation themes included instructor enthusiasm, in-class storytelling, and changes for next iteration. McGowan’s course evaluation themes were: interaction with student, successful activities, and changes for next course iteration. We then each wrote up a brief summary of our individual cases based on the aforementioned individual themes.
Final themes
We then met in July 2016 to compare syllabi, course evaluations, and individual case study summaries including themes from the course evaluations and reflective narratives. Initial interrater reliability was high and consensus was achieved through dialogue. Course syllabi were compared and notes about similarities and differences in activities, content, and objectives were made together. We also reviewed and compared course evaluation feedback and scores and noted similarities and differences in students’ recommendations and comments. Including our previously constructed dialogic notes themes, we then constructed, refined, and combined themes across both cases and all data. Criteria for final themes included repetition, or topics that occur and reoccur (Bogdan & Taylor, 1975), searching for similarities and differences by making systematic comparisons across units of data (Glaser & Strauss, 1967), and linguistic connections or searching for words and phrases (i.e., “since” and “as a result”) which often indicate causal relations (Ryan & Bernard, 2003). The final themes were refined by combining themes with related meanings and deleting themes that had little data to support them or that had multiple counterexamples or disconfirming data. We finally reordered them into one cohesive final set of five themes. The findings are outlined in the Findings section.
Findings
Our analysis yielded five broad themes related to our experience as novice evaluation educators with developing and implementing introduction to evaluation courses: While we both were apprehensive about teaching an introduction to evaluation course, the source of our anxiety was dependent upon our educational backgrounds and professional experiences. There was a great deal of variation between objectives and content across courses. This variation stemmed from differences in the fields of higher education and program evaluation, student needs, and professor expertise and interest. Pedagogical decisions and course content were restricted by semester time constraints. It was decided that both courses would undergo significant changes in their second iteration. Informal and formal structures were key in supporting both professors throughout the semester.
Each of these themes is presented in greater detail below.
Apprehension to Teaching Evaluation From Divergent Sources
We both had different apprehensions as we prepared our respective courses for the first time. The source of our anxiety was dependent upon our educational background and professional experiences. Having never served as the sole instructor of a course, Boyce was concerned about her limited experiences in the classroom. With over 9 years of practical experience, including 25 evaluations conducted and formal training in evaluation theory, she considered herself a competent evaluator. Additionally, she had never developed or assessed student knowledge for program evaluation. She commented in her reflective journal: I am incredibly nervous to begin teaching Evaluation of Educational Programs next week. This is pretty much the fulfillment of a dream I envisioned for myself seven years ago. I have never taught a course from start to finish before. I endeavor to bring the field of evaluation to life in a way that engages and excites my students. I hope I can cultivate understanding with a small bit of passion. While I consider myself an experienced evaluator, ardor and content knowledge can’t replace strong pedagogy. (Boyce, reflective journal entry, one week before the start of the semester) I have to openly acknowledge that this course [Planning and Evaluating Programs in Higher Education] is a growth area for me personally. I view myself as a higher education researcher who has conducted a few evaluations. It is one thing to conduct evaluations; it is another thing to teach students how to do the same. After being hired, I was asked to revamp this course to focus more on evaluation. I am entering my third year as a tenure-track university professor and have no qualms about teaching; however, I know this specific course is going to really keep me on my toes. (McGowan, reflective journal entry, one week before the start of the semester)
Objective and Content Variation Between Courses
There was a great deal of variation between course content and objectives across our courses. While we both sought to ensure that our students understood the foundational process of how to plan and implement an evaluation, we went about doing so differently. Our variation stemmed from differences in our fields and department, student needs, and our own expertise and interests. For example, the ERM MS and PhD program offers (a) introduction to evaluation, (b) practicum in evaluation, (c) advanced evaluation theory, (d) policy in evaluation, (e) collaborative approaches, and (f) culturally responsive approaches evaluation courses. The “introduction to evaluation” course is one of the six evaluation courses offered within the department. It is meant to serve as a foundational course with the expectation that over three fourths of the enrolled students, no matter the department, will likely also at least take the practicum in evaluation course where they spend the entire semester conducting an evaluation—from meeting their stakeholder, to developing a plan, collection of data, and reporting findings via a written and oral report. Within the SAAHE program, “Planning and Evaluation Programs in Higher Education” is the only evaluation course and must fill a number of objectives within one semester including program evaluation, program planning, and assessment. As such, we each had slightly different objectives and expectations for our courses. During our fourth dialogic conversation, we each shared our overarching goals for our courses. Our review of course syllabi and corresponding dialogic conversation notes led us to the following reflections: I would like my students to be able to do the following by the end of my course: (1) compare and contrast major theories/approaches of program evaluation, (2) begin to determine appropriate approaches and questions for an evaluation based on purpose, context, and stakeholders, (3) be able to construct logic models and evaluation framework matrices, and link evaluation questions to appropriate data collection methods and (4) become sophisticated consumers of program evaluations reported by others. (Boyce, reconstruction of dialogic conversation based on notes and course syllabus) I want my students to do the following: (1) identify and understand principles, models, and applications of program planning and program evaluation, (2) develop an understanding of program evaluation practice by designing and conducting an evaluation for a higher education and student affairs functional area, (3) develop an understanding of ethical, political, and cultural issues facing program evaluators, and (4) write an evaluation report to meet stakeholder information and decision making needs. (McGowan, reconstruction of dialogic conversation based on notes and course syllabus)
Semester Time Constraints
Pedagogical decision-making based on course content and student feedback was a recurring theme in both of our narratives throughout the semester. In reflecting on the nexus between time and content, we oftentimes compromised content more than we would have liked given time constraints. Embedded within our narratives was a constant reflection of having to make pedagogical decisions based on time constraints. Each of us reflected on how we felt restricted given the vast amount of content that needed to be covered in the following ways: I am unsure that there is enough time in the semester to cover all of the content in this course in a meaningful and thorough way. The first half of the course is spent defining terms and concepts, and framing evaluation. Each of Alkin’s (2013) branches (methods, use, and, values) only get one class period with an additional week for the Mertens and Wilson’s (2012) social justice branch. I could really spend a whole semester examining, comparing, and contrasting major theories. The second half of the class is more about the technique of evaluation planning including stakeholder-evaluator relationships, question development, data collecting, and logic models. It’s really a great deal to cover in a short period of time. (Boyce, reflective journal entry, four weeks into the semester) I just finished doing a mid-semester check in with students and it seems like they are really enjoying the evaluation project. While gathering informal feedback, they mentioned wanting more time in class to work with their group members. As a result, I am now going to build in time to work on their evaluation project groups. I struggled with this decision as I am now compromising other course content in a few places. I want to fit everything in and give them a chance to have some group time in class. Definitely a fine line for sure. (McGowan, reflective journal entry, nine weeks into the semester)
Significant Course Changes for the Second Iteration
At various points throughout the semester, we documented ways in which we could improve our respective courses. Various mentors had encouraged us to jot down notes about specific activities and how they went throughout the semester. For example, in Boyce’s course, it was evident that her students struggled with the concepts of ontology, epistemology, axiology, and methodology and their relationship to evaluation practice. After her initial lesson on these “ologies,” she felt that students needed additional time and discussion around this topic so ultimately she came back to those concepts in subsequent weeks. Her class on logic modeling went very well. Students were enthusiastic and engaged. It was evident that the technical skills needed to construct logic models had resonated with them. During a dialogic conversation, Boyce decided to adapt her logic model exercise and implement it in McGowan’s course. End of semester dialogic notes echoed Boyce’s concerns about the balance between evaluation theory and technical practice. Further, her final reflective journal entries captured the decision to address this issue in subsequent iterations of the course: Having had some time to reflect on the course, I believe it makes sense to change it for the upcoming year. Six weeks is not enough time to do the evaluation theory and approaches content justice. We could also probably spend two whole class periods on logic models. So while I believe that evaluation is much more than a technocratic endeavor, I believe it is my responsibility to ensure that students are able to think well and become consumers of evaluation. Having 14 weeks dedicated to the technical aspects of evaluation should prove to be a valuable foundation for incoming evaluation MS and PhD students and enough about the technique of the practice of evaluation for students who are taking this course as an elective. There is a piece of me that is hesitant to let theory go, but I will spend some time thinking about how to talk about values, cultures, bias in knowledge production and different approaches throughout the course. I will also suggest the development of an advanced evaluation theory course for evaluation students in our PhD program. (Boyce, reflective journal, final entry) I plan to move this course in a different direction to align with the field of higher education’s professional competencies. I spent a significant amount of time on program evaluation and not enough on program planning this semester. Program planning and program evaluation could be two separate courses so I need to figure out a way to balance the content a little more. The competency area focuses on the ability to design, conduct, critique, and use various methodologies and the results obtained from them, to utilize processes and their results to inform practice. Although this current course accomplishes some of these tasks, I want to be more intentional about making these connections moving forward. (McGowan, reflective journal, final entry) I am not too interested in theory, but I understand the class needed to cover it. (ERM 642 Student) It may be helpful to spend more time on practical components of evaluation. Theory is important but it felt like the majority of class. (ERM 642 Student) At the beginning it was a challenge to put pieces together. I didn’t know how to “use” theory or how it fit with evaluation practice. (ERM 642 Student) I would have liked to learn more about program planning. (HED 663 Student) I do not feel like I learned the “program planning” piece of the title of the course. (HED 663 Student)
Necessity of Informal and Formal Support Structures
Given our varied experiences and background with conducting evaluations and teaching graduate-level courses, it was important to have both informal and formal support structures in place. Fortunately, we both were assigned formal mentors within our respective departments. These mentors offered advice on all aspects of our professional journey including teaching, research, and service. Further, we both were observed halfway into the semester and were given formal formative feedback about our teaching style, pedagogy, course activities, and student engagement. In addition to formal support, we also informally developed the professional relationship that lead to this article. Both types of support were important for our growth as novice evaluation educators. We offer examples of these support mechanisms in the reflections below: Only two weeks left in the semester and I am ready to hesitantly cheer and sigh a breath of relief. The weekly meetings with McGowan have proved to be incredibly useful. He provided me with tips on classroom management and student engagement that have proved to be invaluable. I was also able to meet regularly with the course’s previous instructor and she provided me with suggestions for activities and potential discussion points. I am not sure what I would have done without them. I also had support from my department chair who met with me monthly to talk about teaching, scholarship, and service. Finally, the associate dean for Academic Affairs and Student Services observed my class, conducted a focus group with my students, and wrote a mid-term semester report on my teaching strengths and weaknesses. All of the previous have allowed me to develop and improve throughout the semester. (Boyce, reflective journal entry, 12 weeks into the semester) This is my third year teaching and I have never had to ask for this much help. I do realize this is the first course that I have ever taught outside of my primary focus area. Boyce’s knowledge of evaluation proved beneficial particularly when brainstorming ways to deliver some of the course content. This research process served as a unique opportunity to improve my teaching, expand my knowledge of evaluation, and to bond with a new faculty colleague [Boyce] as we support each other through the promotion and tenure process. Going through this reflective process was extremely valuable and its impact cannot be underestimated. (McGowan, reflective journal entry, final entry)
Lessons Learned and Implications for Practice
Becker and Renger (2017) state “a significant contribution to the field are the lessons learned as they help move evaluation theory and practice forward” (p. 147). We reflected on two key lessons learned from engaging in this process: (1) balancing course content and (2) support structures.
Balancing Course Content
Our experiences with course development and implementation were directly impacted by the extent to which we were familiar with evaluation theory and practice and needs of our intended audiences. Initially, we both had conversations about what content and literature should be included within each of our courses. At the outset, Boyce was concerned that well-known evaluation theorists were not included within McGowan’s course. McGowan countered that it was necessary for him to teach the course in such a way that it was most useful to future higher education leaders (not professional evaluators), who may one day be called to conduct internal evaluations of their programs or who will have to read external evaluation reports. McGowan and many of Boyce’s students felt that less time within her course should have been spent on evaluation theory. Further, she ultimately realized that half a semester is not enough time to include meaningful engagement with evaluation theory while trying to teach foundational evaluation methodology. Additionally, McGowan’s students would have liked a little bit more information about different approaches to evaluation and program-planning techniques. Thus, for both courses, it ended up being a matter of balancing content with time constraints, student needs, and instructor interest and expertise.
Evaluation is conducted as a service to something else and is born of the need to rate quality, systematically analyze outcomes, and provide formative feedback (Shadish, Cook, & Leviton, 1991). As such, evaluation in and of itself is an interdisciplinary endeavor. While it is important that as a field we continue to professionalize and distinguish ourselves, there is much that can be learned from other fields about the training of practitioners. While evaluation is not solely a technocratic endeavor (Schwandt, 2009), we believe it is useful for evaluation educators to first begin with focusing on the development of sound practice, that is meaningfully informed by evaluation purpose, stakeholder voice, underlying values, and method choice instead of trying to squeeze in evaluation theory. Then, as students of evaluation continue to grow in expertise, they can more deeply engage with a variety of evaluation theories and approaches.
Support Structures
Research in other professional fields has found that most new faculty members are not adequately prepared or mentored (Austin, 2003; Siler & Kleiner, 2001). We found this to be true, however, our lack of expertise differed. Similar to findings from other fields, we primarily learned how to teach our introduction to evaluation courses by doing it (Knight, Tate, & Yorke, 2006). Our experience was unique in that we both had a number of formal and informal support structures available to us that we utilized as necessary. It would be ideal if all novice evaluation educators had practical support for pedagogy as well as content. Those who pursue a PhD in an evaluation-related field will likely have little teaching experience as those programs are primarily for graduate students. As such, there are few formal teaching opportunities. While Boyce was comfortable with evaluation theory and practice, she found support for her pedagogical skills through formal observation of her course and meetings with her department chair. Meetings with McGowan also assisted in the development of her class management skills, as he was able to give her feedback during their informal dialogic conversations.
Those instructors who are similar to McGowan, and we suspect that many faculty (e.g., tenure-seeking, adjunct, and clinical) within higher education, economics, psychology, or public health departments similar to him, have foundational pedagogical skills but may lack deep practical experience in program evaluation. McGowan’s pedagogical skills were stronger than his evaluation content knowledge. As such, Boyce offered support as he developed his syllabus and class activities throughout the semester. His course was also observed midway through the semester, and his assigned department mentor provided him with feedback. By partaking in this study, we were able to not only become better teachers of evaluation but also formed a colleagueship that will assist us in the navigation of the promotion and tenure process together. As faculty and instructors with diverse backgrounds are tasked with developing and teaching introduction to evaluation courses in a variety of programs and departments, adequate support structures that attend to their unique needs are essential. Given that not all introduction to evaluation instructors are members of AEA, the field will need to be innovative in strategies for how such professionals can be encouraged, understood, and supported.
Study Limitations
Becker and Renger (2017) have argued that reflection is key to articulating experiences that are often difficult to express. Further, Stake (1975) has shown that case study methodology, which often utilizes reflection about experiences, can be useful to extract tacit knowledge or “unexpressible associations which give rise to new meanings” (Stake, 1975, p. 6, as cited in Becker & Renger, 2017). We have employed reflection and narrative in this study to provide a first-person view of two novice evaluation educators as we developed, implemented, and evaluated our introduction to evaluation courses. This study is not generalizable in the traditional sense nor is it meant to be. We recognize that our two experiences are not all-encompassing or representative of all novice evaluation educators. Nevertheless, we have provided an initial synopsis of how educational background, perspectives on evaluation, and research interests have had an impact on our experiences. We hope that this article will be a catalyst for further research on evaluation educators, especially novices.
Future Research and Concluding Thoughts
As the field of program evaluation continues to grow, it is imperative to understand who and what perspectives are shaping the next generation of evaluators. Previous research has demonstrated that there is a breadth of diversity in the training new evaluators receive. Little is known about training that new evaluation educators receive, their perspectives, or opinions. Recent studies have begun to broach these topics with AEA members (Davies & MacKay, 2014). However, studies are needed that include participants similar to McGowan, who do not consider AEA their “professional-home” organization. Finding instructors of evaluation courses from university websites and contacting them directly could be an initial first step. Future studies should also examine the difference and similarities in perspectives on the purpose of evaluation and how evaluation should be taught from the viewpoint of evaluation instructors. While parallel research examines what should be taught, employing future research as outlined above will allow the field to better understand those who are developing and implementing such curricula and how to reach them so that we can continue to examine best practices for evaluator education and professional development.
Reflection is an honored tradition in evaluation and evaluators are encouraged to reflect upon our practice (Kallemeyn, Hall, Friche, & McReynolds, 2015). Novice faculty have found that reflection, included within larger narrative inquiry research, can assist with the venture of grooming scholars into faculty (Cole, McGowan, & Zerquera, 2017). We encourage novice and seasoned evaluation educators alike, to not only familiarize themselves with teaching evaluation literature but also to systematically reflect upon their experiences as they develop and implement evaluation courses.
Footnotes
Authors’ Note
Second Resubmission to the American Journal of Evaluation, Teaching and Learning in Evaluation Section on December 9, 2017.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
