Abstract
Much of the recent scholarship about ethnicity in comparative politics has focused on why ethnicity becomes a salient cleavage. Yet opinions still diverge as to how ethnicity matters. This article tests three hypotheses relevant to this question. Building on recent arguments, it first hypothesizes that voters take the ethnicity of parties and candidates into consideration. Second, it hypothesizes that wherever ethnicity is politically salient, it matters beyond coethnicity—that is, voters’ decisions are guided by ethnicity even when they are choosing among non-coethnics. Third, it argues that the advantage conferred by ethnicity is mediated by non-ethnic factors. A large vignette experiment carried in 2013 in the Indian state of Uttar Pradesh provides strong support for each of these hypotheses. These results imply that the influence of ethnicity on voting behavior may be subtler and more complex than what the main theories of ethnic politics usually assume.
Introduction
Over the past decade, an important literature has sought to explain why ethnicity becomes a salient cleavage in diverse democracies. Authors have explored why certain ethnic cleavages become salient when others do not (Posner, 2005), the conditions under which voters are likely to choose ethnic parties (Chandra, 2004; Pepinsky, Liddle, & Mujani, 2012), and the motivations of ethnic parties and ethnic voters (Chandra, 2004; Ferree, 2011). This literature has changed the way we interpret ethnic voting. According to these authors, voters’ preference for parties perceived to be favorable to their coethnics has less to do with “expressive voting” (Horowitz, 1985) than with the conditions under which elections take place. Ethnic parties win not because voters feel strongly about their identities but rather because the ethnic labels attached to parties constitute useful informational shortcuts—or heuristic devices—for patronage-seeking voters. The convergence toward this conclusion has been remarkable.
Yet no such convergence exists as to how ethnicity affects voters’ choices. As a result, key works in this literature rely on assumptions that remain disputed and/or untested. This article empirically evaluates these assumptions. Namely, it tests three interrelated hypotheses. The first one has to do with the level at which the influence of ethnicity takes place. The most cited arguments about ethnic voting focus on voters’ comparative evaluations of different parties (Chandra, 2004; Horowitz, 1985). Yet, in single-member districts, the idea that voters solely focus on party-level “ethnic labels”—that is, on whether a party is overall perceived to be “inclusive” or representative of a given ethnic group—may not be entirely credible. If, as already articulated by several authors (Ferree, 2011; Madrid, 2012), “ethnic labels” evolve over time as a result of parties’ strategic diversification of their leadership, we should not expect voters to be indifferent to the identity of the specific candidates fielded by parties in their constituency. Building on these arguments, I accordingly hypothesize that voters’ ethnic considerations are not entirely located at the party level and that voters also take the ethnicity of the specific candidates on the ballot into account when evaluating candidates. My second hypothesis has to do with the aspect of ethnicity that drives ethnic preferences. The comparative literature has so far mostly thought of ethnicity-based preferences in terms of a preference for coethnics and in terms of a binary coethnic/non-coethnic alternative, potentially overlooking more sophisticated reasoning about ethnic categories by voters. This may be limiting, insofar as voters may take ethnicity into account including where none of their coethnics are competing. Accordingly, I hypothesize that voters are likely to sort out between different types of non-coethnics. Finally, my third hypothesis relates to the malleability of ethnicity-based preferences. Most studies in this subfield imply that coethnicity confers an absolute rather than a conditional advantage in elections. By contrast, I hypothesize that the advantage conferred by ethnicity is merely conditional and that voters’ ethnicity-based preferences are mitigated by other factors.
To test these three hypotheses—and better understand how ethnicity matters—this article relies on a large vignette experiment that was implemented in two separate locations across one of the most famous cases in the recent literature on ethnic parties: the Northern Indian state of Uttar Pradesh (UP). In this experiment, likely voters rated fictional candidates whose key characteristics (ethnicity, party label, perceived probity, and perceived record) had been randomly manipulated. Insofar as real-world elections imply a degree of strategy that was not present in the experiment, these ratings are not equivalent to voting decisions. They nonetheless provide us with important and causally credible insights about voters’ evaluation of candidates and, more generally speaking, about the psychology of the median voter in North India.
These ratings provide support for all three aforementioned hypotheses. Because this design isolates the relative weight of party and candidate ethnicity in voters’ evaluations, it allows me to show that the ethnicity of candidates matters in addition to their party affiliations. Second, as the experiment exposed respondents to different types of non-coethnic candidates, it provides evidence that ethnicity matters beyond coethnicity (i.e., whether voters make a difference among different types of non-coethnics). Finally, because factors other than ethnicity (perceived probity and perceived record of the candidates) are also manipulated, I am able to show that ethnicity-based preferences are not absolute.
This research contributes to the burgeoning literature about the effect of various candidate characteristics in India (Aidt, Golden, & Tiwari, n.d.; Berenschot, 2011; Michelutti, 2008; D. Tiwari, n.d.; Vaishnav, 2012; Ziegfeld, in press). Beyond India, it more generally contributes to a growing experimental comparative literature evaluating the relative weight placed on candidates’ (ethnic and non-ethnic) characteristics by voters (Banerjee, Green, McManus, & Pande, 2014; Beaman, Chattopadhyay, Duflo, Pande, & Topalova, 2009; Conroy-Kurtz, 2013; Ziegfeld, in press). Its main contribution, however, is to the large and expending comparative literature on ethnic politics and ethnic parties. In line with the evolution of theoretical models of political selection—from the simple Downsian model of competition to more multi-dimensional models, where the identity of specific candidates also matters (Akerlof & Kranton, 2005; Besley, 2005; Chattopadhyay & Duflo, 2004)—these results imply that ethnicity affects the preferences of voters in ways that are both more complex and subtler than has so far been suggested in the comparative literature on ethnic politics. They imply that models focused on the effect of ethnicity at the party level (Chandra, 2004; Horowitz, 1985) in part misrepresent how ethnicity matters. They also support the idea that ethnic parties can attract new voters by fielding new types of candidates (Chandra, 2004; Ferree, 2011; Madrid, 2012), a finding which more generally supports the idea that the “ethnic labels” of parties evolve over time (Birnir, 2007). Third, these results show that the impact of ethnicity—in electoral politics and beyond—cannot be reduced to a simple preference for coethnics, insofar as individuals sort out among non-coethnics at least as much as they do between coethnics and non-coethnics. Fourth and finally, these results suggest that the effect of ethnicity is often mediated by other factors, contrary to what much of the comparative literature on voting behavior in diverse societies (Chandra, 2004; Ferree, 2011; Horowitz, 1985; Pepinsky et al., 2012) has so far implied.
Theorizing Ethnic Preferences in Comparative Politics
Over the past decade, leading arguments in comparative politics (Birnir, 2007; Chandra, 2004; Ferree, 2011; Pepinsky et al., 2012; Posner, 2005) have explored the motivations of ethnic voters. Why do voters prefer to vote for ethnic parties? Although these explanations have relied on a variety of data and cases, they have converged toward a remarkably consistent explanation. In low-information contexts, coethnicity allows rational, self-interested voters to make crucial inferences about patronage networks.
The most important models in this literature have made strong assumptions about the way in which ethnicity enters voters’ preferences. While a handful of observational studies have hinted that the influence of ethnicity may be less strong or less automatic (Bratton & Kimenyi, 2008; Lindberg & Morrison, 2008), much of the recent literature has implied that voters are not affected by the identity of the specific candidate who competes under these party labels, or by the non-ethnic characteristics of this candidate. Furthermore, these models have also implied that voters do not differentiate among non-coethnics. Yet each of these assumptions appears problematic.
Beyond Party Labels
There are, first, good reasons to believe that ethnicity matters both at the party level and at the candidate/constituency level. In countries in which ethnic cleavages are salient, it is not surprising that voters’ evaluations of candidates would derive, at least in part, from the ethnic labels attached to their parties (Chandra, 2004; Horowitz, 1985). But it seems unrealistic that voters would not simultaneously pay attention to the ethnicity of the specific candidates fielded by these parties in their constituency. Prominent works based on a variety of contexts (Chandra, 2004; Ferree, 2011; Madrid, 2012) note that ethnic parties strategically distribute party tickets to leaders from other ethnic groups when they hope to change their image or broaden their potential bases. As a result, these works suggest that the ethnic labels attached to parties derive, at least in part, from the ethnicity of the various candidates fielded by these parties over time. If this is the case, it is especially likely that voters’ individual-level evaluations are also, in part, influenced by the ethnicity of the specific candidate they are choosing from. If voters perceive different ethnic makeups at the party level to be more or less conducive to patronage, it is likely that they also perceive variations in the ethnicity of each specific candidate they are evaluating to have similar consequences, regardless of the ethnic label they perceive to be attached to that candidate’s party. Voters’ evaluations should thus be a function of the existing label attached to the party as well as of the ethnicity of the specific candidate fielded by these parties.
Insofar as the strength of partisanship tends to increase around elections (Clarke & Stewart, 1998; Lupu, 2013; Michelitch & Utych, 2015), we should expect the relative weights voters place on party labels and on candidate ethnicity to fluctuate over the course of the electoral cycle. It is, however, difficult to think of contexts or timings in which voters would be expected not to pay any attention to the ethnicity of the specific candidates they are evaluating. This might be the case wherever voters believe that their representative plays no role in the distribution of state-related benefits, or where representatives’ decisions do not directly affect the livelihood of voters. In most ethnically diverse democracies—including India, South Africa, Zambia, and other important cases featured in this literature—this is, however, not the case. Individual representatives are usually in charge of disbursing some funds to their districts. More importantly, voters need the mediation of their representative to access the state and its many benefits. Given the life-or-death character that this form of constituency service can take in “patronage-democracies” (Chandra, 2004), it is unlikely that voters who otherwise perceive ethnic favoritism to be a structuring force would overlook the ethnicity of candidates. 1
Beyond Coethnicity
The second problematic assumption relates to the way in which voters evaluate different types of non-coethnics. To the best of my knowledge, all aforementioned works in this literature imply that voters perceive ethnic differences in strictly binary terms: Coethnic parties or candidates are preferred to all others (hereafter referred to as “non-coethnics”), and voters do not distinguish among those non-coethnics.
Yet, in most contexts in which ethnicity is politically salient and in which more than two salient groups exist, there is reason to believe that voters do sort out among non-coethnics. In fractionalized settings (i.e., settings wherever there are more than two salient groups, none of which accounts for a majority of the population), a model based on a form of valence for coethnics should prove a limited tool to explain the role of ethnicity in the formation of political preferences. In such contexts, a better—if slightly more complex—model might consider the additional existence of a type of negative valence directed at some non-coethnics, which implies that different types of non-coethnics may be evaluated differently.
This broader, non-binary type of ethnicity-based preference may result from several dynamics. Coalition politics could first prompt voters to differentiate among non-coethnics that are seen as being allied to their coethnics within a political organization and non-coethnics that are seen as being opposed to coethnics in the current party system. More generally speaking, the history of intergroup relations may be expected to be different with different groups of non-coethnics. Antagonisms may exist between Group A and Group B, but not between Group A and Group C. In the United States, it is, for instance, likely that Whites would not evaluate Latino politicians and Black politicians similarly; in Kenya, a Luo voter is unlikely to have the same baseline attitude toward a Kalenjin politician and a Kikuyu politician, even though both are perceived as non-coethnics. In India, it seems even more unlikely that voters would evaluate politicians from the several dozens of locally salient caste-based and religious groups that do not qualify as coethnics.
This non-binary model of ethnicity-based preferences should be especially important if voters react to candidates more than they react to parties, as suggested in the previous subsection. Where a large number of salient groups coexist, for instance in India, it is in fact likely that many voters do not have a coethnic candidate on the ballot; in this configuration, large groups of voters effectively have to rank non-coethnics to choose a candidate.
An Absolute Advantage of Ethnicity?
A third problematic assumption common in the ethnic politics literature relates to the relative importance of ethnicity in voting preferences. Because they mostly focus on why ethnicity matters, or because they strive to present parsimonious models, arguments about ethnic voting often appear to suggest that ethnicity is the only or the most significant factor of voters’ preferences, even if evidence suggesting the contrary (Bratton & Kimenyi, 2008; Lindberg & Morrison, 2008) has emerged. According to many oft-cited models, the ideologies, probity, and record of candidates are either unimportant (Ferree, 2011; Horowitz, 1985) or irrelevant (Chandra, 2004). 2
Assuming that voters’ preferences are established at the party level, Chandra (2004) develops a reasonable rationale for this overwhelming influence of ethnicity. In many settings in which party-voters linkages are ethnic in nature, ideology and programs are unlikely to play an important role. As such polities are likely to be low-information polities, voters likely struggle to learn something useful about competing parties (see also Birnir, 2007 on this point). Information about the overall quality of candidates fielded by said party, about their probity, or about the overall record of the party may be difficult to access, compile, or comprehend. By contrast, the ethnic makeup of parties sends a simple, visible, and unambiguous message to the electorate. Following this reasoning, ethnicity should play a central role wherever information is scarce and voting decisions are party centric.
When voters also take candidate characteristics into account, this is, however, unlikely to be true. If voters pay attention to candidates, it is not clear why they would merely focus on their ethnicity and disregard their non-ethnic characteristics. Although it is difficult for voters to obtain aggregate information about a party’s record, program, or probity, it is easier for them to obtain some information about the specific candidate competing in their constituencies. Patronage-seeking voters are, for one, expected to have a sense of the record of their incumbent. Besides, information concerning the reputation or the probity of that candidate should be available at the local level (such information is not readily available about parties only because it is difficult for voters to aggregate the information across many constituencies). If voters pay attention to candidate-level characteristics when forming their political preferences, and if they have all this additional information when evaluating candidates, then it is unlikely that they would only take ethnicity into account. If that was the case, we should expect the effects of ethnicity and coethnicity to be mitigated by other, non-ethnic candidate-level factors, such as perceived record or perceived probity.
Summary of Hypotheses
This brief review of the recent comparative literature on ethnic voting in sum suggests that three interrelated hypotheses should be tested to clarify how ethnicity matters:
Research Design
Although two of these hypotheses are not entirely novel—H1 and H3 build on existing arguments—we lack causally credible tests for all three of these hypotheses. To generate such tests, I implemented a vignette experiment in which likely voters rated randomly manipulated profiles of candidates in the Northern Indian state of UP.
The Case: Ethnic Politics in UP
Given the diversity of cases described in the comparative literature on ethnic politics, several locations would have been appropriate to test these hypotheses. Focusing on the Northern Indian state of UP, however, made sense for a combination of reasons. For one, UP—with a population of more than 200 millions—is the largest Indian state. Second, it is the case on which much of the argument of Chandra (2004), a key reference in this literature, relies. Third, even if UP is often presented among Indian states as a state in which “ethnic politics” has triumphed, it is in no case the only state in India in which ethnicity factors into political preferences. Recent accounts of politics in settings as diverse as Mumbai (Hansen, 2001) and rural Rajasthan (Chauchard, in press) suggest that the politics of most Indian states—including many Southern Indian states—is structured around a “caste-based mosaic” that sees caste groups forge and break alliances with various political parties (Jaffrelot, 2010). Besides, as I show below and in the Online Appendix E, recent survey data suggest that the preferences of UP voters are altogether not that different from the preferences of other Indian voters. In that sense, trends among UP voters should, at the very least, not be entirely irrelevant to trends among other Indian voters. Insofar as one agrees with Chandra that UP and India shares key characteristics with a number of other “patronage-democracies”—relatively uninformed voters, political control over the distribution of state benefits, the persistence of a large state—the UP case should in addition be relevant to a number of other cases of ethnic politics across the post-colonial world, such as Nigeria, Zambia, or Senegal. Finally, voters in UP—as elsewhere in India and in many representative democracies—choose among several candidates in single-member constituencies, making the state an especially appropriate location to test the aforementioned hypotheses. 3
Ethnicity plays a major role in UP politics (Chandra, 2004; Pai, 2002), as religion and caste-based divisions have over the past decades become the most electorally relevant cleavages in the state (Jaffrelot, 2010; Pai, 2002). The two locally dominant parties—the Samajwadi Party (SP), currently in power at the state level, and the Bahujan Samaj Party (BSP), in power before that, until 2012—openly appeal to various and fast-changing combinations of groups within the state. 4 As a result of these partisan dynamics, all dominant political parties in the state have in recent times considered the ethnicity of candidates vying for their tickets. The two locally dominant parties—the SP and the BSP—have engaged in “social engineering” in an effort to build around their original caste base (Yadavs for the SP, Chamars for the BSP). But other players—including two national parties, the Indian National Congress and the Bharatya Janta Party—the latter of which won 71 out of 80 constituencies in the 2014 general elections—have also strategically recruited leaders and pandered to specific groups during electoral campaigns. News reports from recent electoral campaigns in UP suggest that the ethnicity of candidates is expected to play a central role in voters’ choices (“UP Assembly Elections,” 2012; Khan, 2012; Srivastava, 2012; M. Tiwari & Pandey, 2013), independently of these candidates’ party labels. 5 Besides, some candidates are described in these accounts as “polarizing,” and voters from different groups often appear to be coordinating against a candidate from a particularly disliked group, hence suggesting the existence of a form of negative valence.
Importantly, the operative ethnic categories on which these parties have based their strategies in recent times do not correspond to the larger categories to which some recent works on ethnic voting in India (Banerjee et al., 2014) refer, but rather to smaller, locally salient groups, organized on the basis of subcastes (jatis) among Hindus, and on the basis of religion among non-Hindu minorities (Suryanarayan & Huber, 2013; M. Tiwari & Pandey, 2013). This nuance is important: As parties appeal to these more specific subunits instead of appealing to larger caste group categories, they frequently target specific communities within one of these larger categories while ignoring, or failing to target, others. 6
Given the complex setup of the experiment—described at length below—and given the large size of UP, data collection could only realistically take place in a handful of locations across the state. Because I wanted rural voters to constitute the majority of the sample—the population of UP remains 76.79% rural—I first randomly selected Gonda district among the list of districts whose headquarter city counted fewer than 150,000 inhabitants. I subsequently selected the district of Faizabad as an additional case to evaluate across-district variation. 7 While limiting myself to two locations allowed for better monitoring of the experimental procedure of the study—hence enhancing the internal validity of the study—this by definition limited the representativeness of my results. In light of this, it seems important to highlight that Gonda and Faizabad districts are not obvious outliers within UP. As can be seen from the broad range of aggregate statistics and public opinion data available on the two districts (Online Appendix A), the two districts are not particularly dissimilar to the rest of the state.
The Vignette Experiment
To evaluate voters’ preferences, I exposed a sample of likely voters to profiles of fictional—but realistic—politicians presented as likely candidates in the (then upcoming) general elections. Relying on fictional politicians provided me with an important advantage: It allowed me to randomly vary several of their characteristics, which would have been impossible and unethical with real politicians. Because many voters know the identity of likely candidates in their district, these fictional politicians were presented as likely contenders in other districts of UP. 8 Because some voters may have known the identity of candidates from other districts immediately before the election, the experiment was run in between elections, that is, in the summer of 2013, 10 months before the general elections of May 2014. 9
Although the analyses presented below focus on a subset of these, no fewer than eight manipulations were inserted in each vignette. As noted by Hainmueller, Hopkins, and Yamamoto (2014), complex vignettes allow for richer profiles and for a more realistic experiment. This appeared especially important in the context of this study, insofar as the amount of information revealed can change respondents’ propensity to develop ethnicity-based preferences (Conroy-Kurtz, 2013). 10
Four of these pieces of information were not of any direct theoretical interest to this study: the face of the politician (i.e., one of the pictures presented in Figure 1), his previous experience in politics, his family status, and the district in which he planned to run. Although these manipulations were randomly varied across the three profiles viewed by each respondent, I did not expect them to have an effect on the outcomes of interest—the results presented in the Online Appendix B confirm that they did not. 11 Accordingly, I do not comment on the effect of these manipulations in the rest of this article. I instead focus on the effect of the four remaining manipulations, which—by contrast—were directly relevant to the three hypotheses outlined above. These manipulations concern the following:
The politician’s ethnicity: Vignettes exposed respondents to a fictitious candidate that belong to one of three possible groups: (a) the respondent self-identified community, (b) a group that the respondent had previously identified as a “friendly outgroup,” or (c) a group that the respondent had previously identified as a “rival outgroup.” 12
His party affiliation: Each fictitious candidate presented in these vignettes also belonged to only one of two possible types of political parties: either their favorite party or their third favorite party (from a ranking of the four main parties active in UP: the SP, the BSP, the India National Congress [INC], and the Bharatiya Janata Party [BJP]). 13
His record in terms of development: The politician was either presented as having significantly contributed to the development of his constituency during his term in office or as having done little. 14
The number of serious crimes mentioned on his affidavit: 15 The criminal reputation of politicians took one of two possible values: The politician either had a “clean” reputation (which was conveyed by the fact that “no serious crimes were reported on his affidavit”) or a “criminal” one (“four serious crimes” were reported on his affidavit).

The three fictitious politicians.
Restrictions were implemented to avoid or limit repetitions on each of these manipulations across the three vignettes consecutively evaluated by each respondent. When the choice set was binary, each choice could only appear on two out of the three vignettes. When there were three possible options, each of the three choices appeared only once across the three vignettes. In keeping with attempts by UP’s main parties to co-opt elites from different groups, all possible combinations of caste and party were allowed, to the exception of one (Muslim and BJP). 16
A Two-Stage Survey
Because this design required the research team to have information on the respondent’s ethnicity, on his or her attitudes toward a variety of non-coethnics, and on his or her partisan preferences prior to the vignette experiment, this research took place in two stages.
The first stage of this design implied interviewing a random sample of respondents at their residence. In this face-to-face “recruitment interview,” respondents were asked a number of background questions. As in most surveys in India, they were asked a series of demographic questions, including an open-ended question about their self-perceived ethnicity. This interview also included a number of questions about respondents’ attitudes toward non-coethnics. Namely, respondents were asked to cite two communities with whom their own community has “traditionally maintained good relations” as well as two communities with whom their community has “traditionally had tenser relations with.” These perceptions were later retrieved to determine ethnic groups that each respondent considered as coethnics, as “friendly outgroups,” or as “rival outgroups.” 17 Finally, the instrument also included a question that required respondents to rank the four main parties of the state in order of preference. As detailed below, this information was used to determine each respondent’s preferred and third-ranked party.
At the end of this interview, respondents were invited to participate, a few days later, to a “completely unrelated” study. They were given the address of a community center in which the research team was based and told about the financial incentives for participating to the study. 18 Upon visiting this location a few days later, they participated to a study during which they evaluated three randomly varied profiles.
To maximize the diversity of respondents that would visit our “lab,” “recruitment interviews” took place in 40 polling booths randomly sampled in a perimeter of 20 kilometers around each of the two “labs” in which the research team was established (there were two “labs,” one in Gonda and one in Faizabad). Using random walk techniques from the center of each polling booth area, an average of 98 likely voters were then sampled and interviewed within each polling booth. 19 A total of 7,767 interviews were completed in this fashion (4,030 in Gonda and 3,737 in Faizabad) over a period of 5 weeks. A total of 2,860 of these 7,767 persons recruited at their residence subsequently visited one of our “lab” and took part to the lab-based vignette experiment.
As labs were located on the outskirts of each city, and stood in each case very close to a well-known bus station, both rural and urban voters participated to the study (24% of participants to the lab study lived in an urban polling booth). More importantly, these locations allowed the research team to sample from three different assembly constituencies in each of the two districts, hence ensuring that the respondents in our sample were represented by a diverse set of politicians.
Both the initial random sample of respondents and the subset of these respondents that subsequently made it to the lab were diverse and roughly representative of their district. Table 1 presents descriptive information for both sets of respondents. As can be seen from the table, the subset of respondents that made it to the lab was very similar to the random sample of the initial “recruitment survey.” This unusual two-stage procedure thus does not appear to have led to major self-selection bias.
Comparing the Characteristics of Respondents in the Recruitment and Lab-Based Surveys.
Note. SP = Samajwadi Party; BJP = Bharatiya Janata Party; INC = India National Congress; BSP = Bahujan Samaj Party; MLA = member of legislative assembly; MP = member of parliament.
Lab Procedure
The procedure used to elicit respondents’ attitudes at our “lab” limited the potential for misreporting or for influence by both interviewers and bystanders (Chauchard, 2013). Eliciting respondents’ attitudes in a lab-like setting first allowed the research team to isolate respondents. Each “lab interview” took place in a separate room, which ensured that no bystanders were present during the interview.
20
Second, the procedure minimized respondents’ awareness that partisan and caste-related attitudes were being assessed. Respondents were told that they would participate to an experiment on “how much they know about politics in UP.” They were quickly shown a series of pictures of extremely prominent UP politicians from all major parties and asked whether they recognized them (they overwhelmingly did). They were then presented with “a more difficult task.” At this point, they were presented the photocopies of bogus news articles (Figure 2), each of which profiled “a likely candidate in the upcoming parliamentary elections in UP.” In keeping with the previous task, respondents were not invited to read the article but rather simply asked whether they knew whom the pictured politician was. As they could not recognize the man portrayed,
21
nor decipher the article,
22
the interviewer summarized the content of the article to the respondent. At this point, the interviewer provided respondents with the following description (random manipulations italicized): Here is what the article is saying: this politician is currently/previously was a Member of the State Legislative Assembly. He was elected in (district name). He is married and has (two/three) children. His name is (group-specific name) and he is the (caste/community name) face for the (party name). The article also made a thorough investigation of what this politician achieved while in office and it says that villages of his district received (a lot/very little) in terms of development when he was in office. Finally, it reports the information provided on the affidavit that candidates have to fill in order to run, and it says that (he is accused of no major crime/he has four cases pending for major criminal violations).
23

Example of a vignette.
Once he had read this description to the respondent, and summarized it a second time, the interviewer used two survey questions—none of which referred to ethnicity—to elicit the respondent’s evaluation of the profiled politician. The first item sought to measure respondents’ general perceptions of the quality of the politician represented in the vignette (a dimension I hereafter refer to as general expectations): Politicians’ job is to address their constituents’ problems. On a scale from 1 to 5, how good a representative do you think this politician would be in that respect?
The second question aimed to tape into respondents’ perceptions of the politician’s willingness to help them personally, over other groups and individuals: Politicians sometimes help some of their constituents more than they help others. On a scale from 1 to 5, how good a representative do you think this politician would be for you personally?
In both cases, responses were on a scale from 1 (worse) to 5 (best). Overall, the procedure described in this section was repeated three times—that is, each respondent was successively exposed and reacted to three different vignettes.
Tests
To provide intuitive and visual tests of my hypotheses, I take advantage of the randomized nature of these manipulations—which I check in the Online Appendix C 24 —and compare the average rating of different types of profiles. 25 In each case, I focus on responses to the first vignette rated by each respondent, which implies that I rely on the most conservative standard errors possible. 26 In addition, as responses to these two questions were highly correlated (.82), 27 my analyses in the body of this article simply focus on responses to the first item (general expectations). 28
Testing H1: Does the Ethnicity of Candidates Matter?
To test whether respondents took the ethnicity of candidates into account, Figure 3 compares the average rating of all profiles in which the candidates were coethnics to the average rating of profiles in which the candidates were not coethnics. Figure 4 subsequently breaks these results down by party. These analyses provide clear support for H1: As can be seen from Figure 3, coethnic candidates were rated much more favorably than non-coethnic candidates. This was true regardless of the party label attached to these candidates, as can be seen in Figure 4. In both figures, the 95% confidence intervals around the mean ratings for coethnics and non-coethnics do not overlap. This implies that respondents took the ethnicity of candidates into account and that they did so even when the candidate in the vignette belonged to their favorite party.

Average general expectations by ethnicity (binary).

Average general expectations by ethnicity (binary) and party.
These results are remarkable insofar as the design of this experiment did not particularly prompt respondents to do so: The amount of information they were presented with did not obviously drive them to react to the ethnicity of candidates. Yet, when asked to rate a candidate about whom they were given no fewer than eight pieces of information, respondents strongly reacted to this specific manipulation about ethnicity. Besides, the experiment was not presented as an experiment about ethnicity-related attitudes, and respondents could not have guessed that the research team was interested in testing this specific hypothesis. The two questions to which they responded after they were exposed to this first vignette were the first questions in the entire lab-based interview that related to political preferences (most of the interview until this point had been about “political knowledge,” that is, about how much people knew about politicians in UP).
Testing H2: Do Respondents Differentiate Between Non-Coethnics?
To test whether voters differentiate between non-coethnics (H2), Figure 5 breaks down the non-coethnic category into two subcategories and compares the mean rating of all profiles in which the candidates were coethnics with the mean rating of profiles in which the candidates were from “friendly outgroups” or from “rival outgroups.” These analyses provide clear support for H2: While both types of non-coethnics were rated lower than coethnics, non-coethnic candidates from “friendly outgroups” were rated much higher than non-coethnic candidates from “rival outgroups,” and none of the confidence intervals around my estimates for these three categories overlap. This confirms that the ethnicity of candidates matters, but it also suggests non-coethnics were not all equivalent in the eyes of respondents. While evaluating non-coethnic candidates, ethnicity-related considerations played a big role in the deliberations of respondents. This was true regardless of the party label under which these candidates competed, as shown in Figure D1 (Online Appendix D).

Average general expectations by ethnicity.
Altogether, Figures 3 to 5—as well as the corresponding average marginal component effects reported in the Online Appendix B1—suggest that the ethnicity of candidates strongly affected evaluations. The size of these candidate-ethnicity effects was large in absolute terms. As can be seen from the estimates presented in Table B1 (Online Appendix B), candidates from “rival outgroups” were, for instance, rated about .85 point lower than coethnics. This is a large and significant difference, suggesting that participants paid a great deal of attention to the ethnicity of candidates. While both types of non-coethnics were rated significantly lower than coethnics, the large difference between them in turn implies that the category “non-coethnic” may be substantively meaningless, insofar as the difference between “friendly outgroups” and “rival outgroups” (.54) was significantly larger than the difference between coethnics and “friendly outgroups” (.32). Finally, it is remarkable that the size of these effects was large in comparison with the size of the effects of the other manipulations included in the vignette experiment. This suggests that respondents sometimes weighed the ethnicity of candidates more than they weighted the party labels under which those candidates ran. While candidates from a “rival outgroup” were rated .85 point lower than coethnics (Table A1 in the Online Appendix A), candidates described as belonging to the respondents’ “third preferred party” (i.e., a disliked party) were only rated .56 point lower on average than candidates from the respondent’s favorite party (Table A1). This suggests that the ethnicity of some candidates mattered at least as much as the party labels under which they reportedly ran.
Testing H3: Do the Non-Ethnic Characteristics of Candidates Matter?
Finally, to test whether the non-ethnic characteristics of candidates matter, I break down the average rating of profiles associated with each type of ethnicity (coethnic, “friendly outgroup,” “rival outgroup”) by their perceived record and perceived criminality (both of which were randomly manipulated in the vignette experiment). Focusing on the interaction between candidate ethnicity and perceived record, Figure 6 shows that participants’ ratings evolved as a function of the perceived record of the candidate and that this was true regardless of the ethnicity of said candidate. For both coethnic and non-coethnic candidates, the ratings were significantly lower if the politician was presented as having a bad record (the confidence intervals do not overlap). Focusing on the interaction between ethnicity and criminality, Figure 7 displays a comparable pattern: Candidates presented as being currently charged in four criminal cases (not an uncommon situation in UP) were rated significantly lower than candidates presented as facing no charge at all (the confidence intervals do not overlap). 29 These graphs, as well as row 7 and 8 of Table B1 provide clear support for my third hypothesis: the non-ethnic characteristics of candidates strongly affected voters’ preferences. Altogether, these results imply that factors additional to ethnicity did drive respondents’ evaluations.

Average general expectations by ethnicity and perceived record.

Average general expectations by ethnicity and perceived criminality.
Discussion
These analyses challenge some of the key assumptions of the comparative literature on ethnic parties, as they show that ethnicity was—at least in the context of this study—factored into voters’ preferences in a way that is both more complex and less automatic than what frequently cited arguments in this literature (Chandra, 2004; Horowitz, 1985) so far suggest. To better explore the implications of these findings, one, however, needs to assess their validity beyond this context. Accordingly, the following subsections discuss the extent to which the protocol of the study might have affected its conclusions, before exploring the relevance of its findings for other contexts.
Do These Preferences Reflect UP Voters’ Preferences?
Insofar as they were expressed in the context of a lab-based survey-experiment, the preferences expressed here cannot be equated to the preferences on which voters rely in actual elections. Several elements, however, suggest that the preferences captured by this study should resemble real-world preferences.
It is, first, important to note that the experiment did not prompt respondents to react as strongly as they did to the ethnicity of candidates. Because respondents were faced with a wide range of information about politicians, it is difficult to argue that they strongly reacted to ethnicity because this was the only information about candidates available to them (Conroy-Kurtz, 2013). However, social desirability concerns suggest that the effects of ethnicity I detect should be seen as underestimates, especially when it comes to the effect of “rival outgroup” candidates. This is due to a practical aspect of my design: When respondents were asked as part of their recruitment interview to name groups with whom their own group has maintained tense relationships with—a sensitive question asked in a face-to-face interview—efforts to limit response editing were limited. As a result, it is likely that some respondents misrepresented their views and named groups other than those they might have named in private, or with a more confidentiality-enhancing administration mode. This implies that some of the “rival outgroup” politicians likely do not belong to groups that respondents truly consider to be “rival outgroups,” which in turn implies that the results above likely underestimate the effect of ethnicity. This, importantly, means that the lab-based survey constituted a hard test for H1 and H2.
These results are, in addition, unlikely to be the product of two other features of my research design. The fixed order in which manipulations appeared in the vignette is unlikely to have inflated my results. This is, first, because the order was in practice not as fixed as it might have been if respondents had self-administered the survey. After interviewers described the vignette to respondents a first time, interviewers checked that respondents had heard and understood the information included in the vignette. As they asked respondents to summarize what they had heard, the information about candidates was often repeated in a different order, as respondents often struggled to remember some of the seven randomly varied pieces of information contained in the vignette. Second, and more generally speaking, it is not clear that “row-order effects” always exist in complex experiments of this type: Hainmueller, Yamamoto, and Hangartner (2015), for instance, do not detect any. 30 Third, potential row-order effects would likely run against my main finding. In the vignette, respondents heard about the ethnicity of candidates first. This should lead us to expect that they were more strongly primed by the other treatments than by the candidate-ethnicity one. If that were the case, it would imply that my design potentially underestimates the effect of candidate ethnicity and that support for H1 and H2 might have been even larger than it already is in the results presented here.
These findings are similarly unlikely to owe to the structure of my dependent variable. Vignette experiments can be of two types: “rating tasks” involving one profile at a time (as in this experiment) and “choice tasks” requiring respondent to pick one of several randomly varied profiles presented simultaneously. While choice tasks are seen by some as being closer to real-world decision making, in which respondents must cast a single ballot between competing candidates who vary on multiple dimensions, they do not encompass any of the strategic aspects of voting in the real world. Besides, recent works (Hainmueller et al., 2014; Hainmueller et al., 2015) suggest that the estimates retrieved from either type of tasks are often comparable. This may be seen as evidence for the fact that preferences retrieved using either type of tasks do not differ and/or that they both significantly differ from preferences in real-world decision making.
It is, finally, unlikely that respondents reacted to the record and the criminal status of candidates because they felt compelled to reject “bad types” in front of an interviewer. Several elements point in that direction. The way respondents reacted to the ethnicity treatments suggests, first, that respondents may not have been exceedingly worried by social desirability as they rated these candidates. Caste is a sensitive topic in India and one that is prone to editing by respondents (Chauchard, 2013). Yet my results show that respondents had no problem endorsing coethnics and rejecting non-coethnics in front of their interviewer, who usually had a sense of their own caste, based on their name and appearance. Second, the large number of treatments included in the experiment would have made it easy for respondents to endorse criminals without necessarily appearing to do so, as interviewers would not necessarily have been able to deduce which characteristics (among a total of eight) drove the respondent’s evaluation. Because interviewers specifically did not ask why respondents were rating these candidates the way they were rating them, it is not clear that respondents should have felt any pressure at all to reject candidates with a criminal background. Third, it would have been relatively unproblematic for respondents to disregard information about criminality and record, as neither the veracity of this information nor the impartiality of these charges was clear. Yet they overwhelmingly reacted to these prompts. These estimates may be seen as conservative, insofar as the effect of a positive development record might have been larger had respondents had a personal experience with these politicians—which they did not—or had respondents been able to verify that the information in the newspaper article was accurate—which they were not able to do. 31
Implications
While I see no obvious reason to believe that the protocol of the study would have exacerbated respondents’ reactions to candidate ethnicity, or prompted them to penalize politicians with unattractive pedigrees, two important caveats are in order. The timing of the study may first have helped increase respondents’ emphasis on candidate-level characteristics. Because the study took place in between elections, party preferences likely were in motion at the time the study took place. It is thus possible that the study underestimated the effect of party labels and/or that the voters interviewed in the study weighed candidates more heavily than they might have closer to the actual election. Further studies would accordingly need to measure the evolution of the weights voters place on candidate-level characteristics and party preferences over the course of the electoral cycle. As party preferences crystallize, and as the “labels” of parties become clear to voters, it is possible that voters weigh candidates less. Given the size of the effects of candidate ethnicity detected here, it is, however, unlikely that these effects would entirely disappear as elections get closer. Second, because I chose to focus on two mostly rural districts within the state of UP (a logical choice given that the state’s population is 76% rural), the preferences retrieved in this study may not be entirely representative of the preferences of urban voters, who tend to be more educated, more informed, and overall less influenced by ethnicity in elections.
In spite of these limitations, these results are relevant to a range of contexts. The voters who participated to this study do not constitute outliers in India. The state of UP—the largest Indian state, with a population of more than 200 millions—has over the last decades acquired the reputation of a state riddled with high levels of “ethnic politics” and with particularly polarized intergroup relations. Yet this does not necessarily imply that ethnicity is not relevant to elections taking place in other Indian states (Chandra, 2004; Jaffrelot, 2010), nor that other Indian voters would place more emphasis on party-level factors than the voters of UP. As can be seen from the Online Appendix E, data from the 2009 National Election Study (the most recent available data based on a national sample of voters 32 ) show that the attitudes of UP voters were altogether not that different from the attitudes of other Indian voters in these respects: They similarly declared weighing parties more than candidates (Table E1 in the Online Appendix E); they similarly did not feel particularly close to any political parties (Table E2); besides, as can be seen from Tables E3 to E6 (and even more clearly from Figures E1 to E4), UP voters were not the only Indian voters to endorse openly anti-minority or “casteist” statements; 33 finally, as shown in Tables E7 and E8, UP voters were as likely as others to declare that they would pick a corrupt or a criminal representative. Altogether, this additional data suggest that UP voters are unlikely to weigh the various characteristics of candidates in a way that radically differs from the way the median Indian voter weighs them. While this is limited data, and while further studies will need to confirm this, this suggests that the aforementioned results might not have been radically overturned had I sampled across North India—rather than in just two locations.
These results also likely inform the behavior of voters from a variety of ethnically diverse democracies beyond India. Ethnicity often becomes a heuristic device when relatively uninformed voters hope to access state benefits controlled by elected politicians (Chandra, 2004; Posner, 2005). In such contexts, voters similarly select representatives on the basis of single-member constituencies—this is the case in most commonwealth nations. The logic articulated above is likely to hold in these contexts: “ethnic voters” are likely to weigh both party-level and candidate-level factors as they evaluate candidates. Moreover, because the fragmented ethnic context of North India is not unique—many African nations are for instance as ethnically fragmented—these voters are as likely to differentiate between different types of non-coethnics as Indian voters.
In light of these elements, these results have implications for the comparative literature on ethnic voting and ethnic parties. While popular theories of ethnic voting (Chandra, 2004; Horowitz, 1985) focus on the influence of ethnicity at the party level, voters may react to the ethnicity of candidates as much as they reacted to the party labels under which these candidates run. This implies that ethnicity-related considerations enter voting decisions in several ways and that current models focused on the effect of ethnicity at the party level misrepresent how ethnicity matters. This further suggests that a change of candidate at the constituency level can change the electoral fortunes of a party, a finding coherent with arguments about African and Latin-American politics suggesting that ethnically identified parties can alter their image and attract new voters by fielding new types of candidates (Ferree, 2011; Madrid, 2012).
Second, these results show that the impact of ethnicity is not limited to a simple preference for coethnics. Voters do prefer coethnics to other types of candidates, but they also clearly differentiate between non-coethnics, rating some of them nearly as positively as their own coethnics and strongly penalizing others—suggesting that ethnicity matters positively as well as negatively. This has far-reaching implications—beyond electoral politics—for how we should think of intergroup relations in ethnically fragmented societies: In the absence of a simple, binary dichotomy between coethnics and non-coethnics, we should expect voters’ decisions and more generally speaking citizens’ decisions, to rely on ethnic considerations in a much larger set of situations, including those in which none of their coethnics is involved.
Finally, contrary to what both “primordialist” (Horowitz, 1985; Rabushka & Shepsle, 1972) and constructivist models (Chandra, 2004) have often implied, these results suggest that ethnicity is unlikely to provide candidates with an absolute advantage: When coethnic candidates have problematic personal characteristics, voters—ceteris paribus—rate them more negatively, hence suggesting that non-ethnicity-related candidates-level factors matter, as for their American counterparts (Campbell, Green, & Layman, 2011; McDermott, 1997, 1998; Popkin, 1991). Altogether, these findings thus imply that the voters of new and post-colonial democracies engage in complex, multi-dimensional calculations about their political representatives. While models of ethnic voting should remain parsimonious, they should better acknowledge this degree of sophistication.
Footnotes
Acknowledgements
Major thanks to Thibaut Marcesse and to MORSEL for research assistance on this project. Thanks to Neelan Sircar, Devesh Kapur, Brendan Nyhan, Yusaku Horiuchi, Josh Kertzer, Marko Kasnja, Harish S.P., Francesca Jensenius, Adam Ziegfeld, Scott Abramson, Carlos Velasco Rivera, Evan Lieberman, Sunita Parikh, Yael Zeira, and Kanchan Chandra for insightful comments.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
