Abstract
Counting repressive events is difficult because state leaders have an incentive to conceal actions of their subordinates and destroy evidence of abuse. In this article, we extend existing latent variable modeling techniques in the study of repression to account for the uncertainty inherent in count data generated for this type of difficult-to-observe event. We demonstrate the utility of the model by focusing on a dataset that defines ‘one-sided-killing’ as government-caused deaths of non-combatants. In addition to generating more precise estimates of latent repression levels, the model also estimates the probability that a state engaged in one-sided-killing and the predictive distribution of deaths for each country-year in the dataset. These new event-based, count estimates will be useful for researchers interested in this type of data but skeptical of the comparability of such events across countries and over time. Our modeling framework also provides a principled method for inferring unobserved count variables based on conceptually related categorical information.
Introduction
Recording repressive events is integral to the scientific study of peace and conflict. Doing so accurately, however, is complicated by the fact that state leaders often have strong incentives to conceal these events from the international community and destroy evidence associated with abuse. Even when monitors, activists, and journalists have complete access, resource constraints may limit their ability to observe or record state violence. The lack of access, and constraints on monitoring resources, combine to potentially bias counts of repressive events (Brysk, 1994; Davenport & Ball, 2002).
Researchers recognize that differences in information sources may lead to divergent inferences and have spent considerable time seeking to resolve these problems by integrating data derived from multiple sources (e.g. Hendrix & Salehyan, 2015; Krüger et al., 2013). These approaches provide a promising means of validating inferences from the study of repressive events, but they are seldom applied to the time-series cross-sectional analyses that are central to much of the empirical human rights literature. Remaining concerns over the validity of repressive events count data contributed to a movement away from these data in human rights research (Poe, 2004). Yet standards-based indicators are subject to other forms of bias (Fariss, 2014, 2019) and are less well suited to precisely track the patterns of repressive events. Much could be learned with counts of these events but only if the political and operational processes that make obtaining such data problematic are conceptually identified and empirically addressed.
We take up this challenge by developing and validating a solution to the problem of bias in count data within the context of estimating one-sided government killings (Eck & Hultman, 2007; Pettersson, Högbladh & Öberg, 2019). We demonstrate how latent variable modeling techniques can be used to triangulate information from a variety of data sources to improve and expand upon existing estimates of whether and how many individuals were killed by their governments. Our approach builds on existing latent variable models of human rights, which assume that a government’s underlying level of repression is not observed directly, but can be estimated based on observable pieces of information captured through a wide variety of human rights monitoring sources. Specifically, we leverage a useful property of latent variable models: the ability to generate predictive distributions for all input variables, regardless of whether they are observed or missing in a particular country-year. The result is the creation of new estimates of one-sided-killing that account for and reduce bias stemming from resource constraints and the incentive to conceal repression.
We use the Eck & Hultman (2007) data from the Uppsala Conflict Data Program (UCDP) as our benchmark for our model. These data are central to conflict research and have been deployed in hundreds of published articles. 1 We nevertheless endeavor to make several improvements. Eck & Hultman (2007) code one-sided-killings as absent in country-years where there is no direct and reliable evidence of at least 25 individual deaths. The risk is that these data undercount repressive events. We empirically assess this possibility by using information conveyed in other human rights indicators to identify instances where killings were likely to have occurred, but for which insufficient documentary evidence exists. Determining which regimes are within this category is itself of substantive interest to scholars interested in concealed or otherwise unobserved abuse. In addition, while existing data begin in 1989, our approach is able to generate predicted counts extending back as far as 1946, allowing researchers to expand the temporal range of empirical tests. 2 This is particularly useful for testing theories of slow-changing and/or system-level changes of repression. Finally, we improve estimates of uncertainty around these events by moving beyond the provision of ‘low’, ‘best’, and ‘high’ estimates and instead generate full probability distributions for the number of individuals killed in any particular country-year.
Our latent variable model allows us to expand the empirical testing ground for the scientific study of political violence by providing a principled means of combing biased, incomplete, or otherwise imperfect pieces of information. This approach is critical for scholars because of the increasing volume and breadth of information about repression. The transformation of information from analog to digital is allowing scholars of peace and conflict studies to generate counts of important events like killings but also other forms of political events and patterns of communication in and around conflict (e.g. Steinert-Threlkeld, 2017).
Beyond these substantive contributions, we also innovate methodologically by extending latent variable modeling techniques to account for zero-inflated count processes. Zero-inflation occurs in count processes when zeros are observed (perhaps excessively) for one of two reasons: (1) the event being counted did not, and could not have occurred; or (2) the event being counted was not observed but could have happened. Within our context, this corresponds to instances where: (a) no killings are observed because none occurred; and (b) killings occurred, but were not recorded due to reporting biases driven by a regime’s attempt to conceal these events or because of capacity limitations on the monitoring organizations that collect information about repression. Though used within the context of human rights, we expect these techniques will be useful for a wider body of research linking political event-counts to unobserved concepts of interest.
Below, we introduce our modeling innovations in further detail and validate our estimates of one-sided killing. In so doing, we highlight several instances where we believe one-sided-killings occurred, despite their absence from existing data sources. Leveraging the expanded temporal domain of our data, we track a reduction in one-sided-killing over time, corroborating evidence of more widespread reductions in other forms of political violence (Goldstein, 2011; Lacina, Gleditsch & Russett, 2006). We close with a discussion of the promise of latent variable models generally, with suggestions for addressing the limitations of our model and improving the measurement of repressive events.
Existing measurement models of repression
All latent variable models share the assumption that an underlying concept of interest cannot be observed directly but can be approximated through observable manifestations. For latent variable models of human rights, the underlying concept is broadly defined as respect for physical integrity rights, which comprises a unidimensional spectrum ranging from complete and widespread abuse of these rights at one extreme, to complete respect of the rights at the other (Eck & Fariss, 2018; Schnakenberg & Fariss, 2014; Fariss, 2014, 2019). Where a state sits along this spectrum is determined by data collected from a variety of sources on different types of abuse and repression: the observed manifestations, which are broadly classified as either ‘events-based’ indicators or ‘standards-based’ indicators.
Events-based indicators capture extreme rights violations, identifying the scope and scale of repressive events. Typically, these indicators are collected and periodically updated through the continual evaluation of primary and secondary sources. By contrast, standards-based indicators capture both less extreme abuse and widespread abuses. These categorical variables rely on contemporaneous human rights reports. Though we focus on reducing bias in event-counts, both types of data sources are subject to a variety of biases. A latent variable approach lessens these biases by combining and aggregating information through a principled and transparent measurement model.
The Online appendix contains descriptions of the specific datasets and the sources for the standards-based indicators and event-based indicators that we use for our updated model. The standards-based indicators are almost all derived from Amnesty International and US State Department reports (Hathaway, 2002). The event-based indicators are: massive repressive events (Harff & Gurr, 1988); genocide and politicide (Harff, 2003); genocide and democide (Rummel, 1994; Wayman & Tago, 2010); one-sided government killing (Eck & Hultman, 2007); and political executions (Taylor & Jodice, 1983). Fariss (2014, 2019) treats all of these variables as dichotomous indicators that identify whether each type of event occurred. The definitions of genocide, politicide, and massive repression variables capture human rights violations at the extreme end of the repression spectrum. The measurement of one-sided government killing captures instances in which more than 25 individuals (non-combatants) are killed, but excludes extra-judicial killings that occur inside a prison and combatant deaths that occur during civil conflicts (Eck & Hultman, 2007). Extra-judicial killing more generally is captured by both the political execution data (Taylor & Jodice, 1983) and several of the variables derived from the human rights reports described above.
The models constructed by Schnakenberg & Fariss (2014) and Fariss (2014) are outlined in Table I. These models assume there is an unobserved latent trait,
For each physical integrity item, the model estimates an ‘item discrimination’ parameter
Fariss (2014) extends this model by allowing the difficulty cut-points for some of the items to vary over time, changing
Existing latent variable models of repression
A latent variable model for binary, ordered, and zero-inflated count processes
We can extend this latent variable model to take advantage of the event-counts from some of the event-based data sources that have been coarsened to binary indicators in existing models. This extension allows for the incorporation of more information into latent variable estimates.
The new model requires us to assume a parametric form for event-counts that fits the underlying data generating process. Specifically, we need to account for the fact that killings may be unobserved either because they did not occur, or because they did, but evidence of the event was unobserved. This second category likely features cases where killings were concealed due to information scarcity, potentially driven by deliberate concealment. We therefore use a zero-inflated, negative binomial probability distribution to link the latent repression variable with the event-count data.
As with the above models, we again are assuming that the observed indicators (here the amount of one-sided-killing) is a function of the underlying latent trait. We parameterize this with α,
where
The negative binomial likelihood also incorporates a rate parameter, r. This accounts for the degree of ‘over-dispersion’ in the count data by allowing the variance to increase. The variance is equal to
Extending the latent variable model of repression
Next we must integrate this negative binomial framework into the broader model of human rights so that UCDP Eck & Hultman (2007) data can be incorporated. Though our framework could be used to accommodate all count data, we chose to use the UCDP data as our primary data source for two reasons. First, these data have been widely used, widely scrutinized, and have relatively expansive spatial and temporal coverage. Second, as we discuss in more detail below, this is one of the only data sources that has categorical estimates of uncertainty, providing researchers with ‘low’, ‘best’, and ‘high’ fatality estimates. 5
To integrate count data, we construct a model with the following likelihood function:
where vj
and cj
are indicator variables that determine which portion of the likelihood function a particular manifest variable should be passed through. For standards-based indicators
When constructing the model, one important choice was how to treat the ‘low’, ‘best’, and ‘high’ variables. One option would have been to treat these as three independent indicators, and assign each their own difficulty and discrimination parameters. That is, we would assume that they are conditionally independent and only a function of the latent variable
We therefore used an alternative, and potentially more realistic model parameterization that considers these values as the result of one coder or set of coders deliberately attempting to generate estimates of an unknown, true count of one-sided-killing,
This assumption is reflected in the notation for count-indicators, where single
Along with generating improved estimates of the latent trait, this model specification generates two additional substantive quantities of interest. First, as detailed above, the model can be used to estimate
Often, these predictions are used as a form of model-checking (Gelman & Hill, 2007). Yet, relaxing the assumption that the manifest indicators are measured without error, these posterior predictions are also a useful means of approximating uncertainty around the indicators themselves. A desirable feature of this modeling framework is that predictions for
Uncertainty around the number of killings is also quantifiable because the prior distribution of each of the model parameters and the latent variable allows for the approximation of the posterior distribution of each country-year distribution of one-sided government killing counts. Country-year heterogeneity is driven by either increased uncertainty in
Priors and estimation
The parameters for the binary and ordered data are given the same distributions as in Fariss (2014) (Table 1) with one exception. Recent work has applied robust-modeling techniques as a means of improving latent variable model estimates (Reuning, Kenwick & Fariss, 2019). Specifically, the conventional assignment of a standard normal prior to the latent trait is substituted with a Student’s T distribution using the following prior specification on the latent trait and innovation parameter:
The wider tails of the Student’s t-distribution allow estimates of the latent trait to experience sudden changes within a given time series. This is a desirable modeling innovation as repression levels may change rapidly because of regime change, military coups, or the onset of rebellion.
We now need to assign priors to the parameters for our count data.
Results and validation
Validating estimates of respect for human rights and one-sided-killings is difficult because we cannot observe the ‘true’ values with complete certainty (Adcock & Collier, 2001). In our application, we are particularly concerned with whether our model fits the data reasonably well and whether the estimates of one-sided-killing produced are valid. 6
We assess the validity of our model using several types of criterion-related validity checks, which validate a measure based on its relationship with existing measures and determining whether each behave in theoretically plausible ways (Trochim & Donnelly, 2008: 59). First, we compare the model’s estimates to previous latent estimates of repression, which is one type of convergent validation check. For convergent validity, the latent variable estimates should closely relate to other measures that are known to be valid measures of the concept of interest. Second, we conduct a posterior predictive check of how well our model’s estimates correspond to the data used to generate the model (Gelman & Hill, 2007). Specifically, we examine the correlation between model predictions of one-sided-killing and the original UCDP variables. A strong correlation would be evidence that our model fits the data relatively well, which itself functions as a form of predicted validity check (Trochim & Donnelly, 2008: 57). Third, we take advantage of the expanded temporal Comparing estimates of latent respect for physical integrity rights
Latent variable estimates of the respect for physical integrity rights
Figure 1 displays the mean estimates of the latent trait ( Predictions counts of one-sided-killing along values of the latent trait
Model predictions of one-sided-killing
Figure 2 displays predictions across values of
One-sided-killings reduce to zero at approximately the mean value of latent repression estimates. The UCDP dataset begins recording frequent observed instances of one-sided-killing at approximately one standard deviation (–1.0) below the mean value of the ‘true’ level of repression. The magnitude of the predictions increases as the latent variable decreases. Though only Rwanda (1994) nears the maximum observed value, the model makes predictions that accord with earlier episodes of domestic Predictions of one-sided-killing for country-years omitted from the UCDP data
There are several observations where the UCDP data do not identify one-sided-killings – reflected as zeros for the low count in these country-years – but our model generates non-zero predictions. Figure 3 reports predictions for this subset of observations. For most country-years, the model produces predictions tightly clustered around 0, which is consistent with the decision to exclude them from the UCDP data. For observations that are otherwise low on the latent trait, however, the model predicts non-zero values for one-sided-killings.

Predicted one-sided-killing for worst country-years with no reported killings in UCDP data
Next, we focus in on the cases where UCDP reports no one-sided-killing while our model predicts high one-sided-killings. In Figure 4 we show the top ten country-year units for unobserved one-sided-killings. Afghanistan in 1989 is the most extreme case. We estimate a median of 1,098 killings taking place, with a 33% to 66% range of 649 to 1,778. In 1989 the USSR withdrew from Afghanistan starting a civil war that would last until 1992. Iraq in 1989 is the next with a median estimate of 705 and a range of 470 to 1,024. This was the last year of the Anfal genocide where the Iraqi government systematically massacred Kurdish Iraqis. Although the bulk of the killings occurred just prior to this date, documentary evidence suggests active repression campaigns persisted, and our estimates suggest that one-sided-killings remained probable. Eight of ten of the cases we identify are between 1989 and 1991, and almost all of these cases were countries caught between the United States and the Soviet Union during the end of the Cold War. The only cases that are not from this period are Sudan in 2009 and 2010, where we have median one-sided-killing estimates of 190 and 179 respectively. Reports from Human Rights Watch mention civilian bombing in Sudan during these years in addition to reports of 200 individuals that were ‘disappeared’ between 2008 and 2009.
Figure 5 displays the Spearman correlations between the model’s predictions of one-sided-killing and the original UCDP ‘low’, ‘best’, and ‘high’ counts. The median Spearman rank correlation for the low estimates is 0.69, while for the best estimates it is 0.80 and for the high it is 0.82. The correlations along with the other examinations of the predictions of one-sided-killing tell us that the model does a good job of fitting the observed and unobserved data.
Changes in government killing over time
Because of the temporal coverage of the other human rights variables, our model produces estimated counts of one-sided-killings beginning in 1946, allowing us to conduct a convergent validity test by examining whether our measure can corroborate existing findings that the level of violence has declined over time. Figure 6 displays the total number of one-sided-killings each year. Each annual count is created by taking draws from the posterior of each country’s predicted one-sided-killing for a given year and then summing across all countries. Readers should keep in mind that our predictions of one-sided-killing are based primarily on our country-year estimates of the latent respect for human rights ( Correlations between model predictions of one-sided-killings with UCDP low, best, and high estimates
The model suggests that the total number of one-sided-killings was relatively low starting in 1946 before increasing in the mid 1950s. This increase is driven in part by the independence of states like Sudan, who had violent entries into the international system. Estimates remain high throughout the Cold War; more than a million one-sided-killings occurred each year. The number dropped into the high thousands during the 1990s (other than during the Rwandan genocide in 1994) and most recently to just below 1,000 (these deaths do not include extra-judicial killings that occur in custody). These estimates corroborate the results from other studies that find a decline in fatalities during war (Goldstein, 2011; Lacina, Gleditsch & Russett, 2006), a decline in the level of violence more generally (Pinker, 2011), and improvements in respect for human rights (Fariss, 2014, 2019). All of these authors point out that the decline in violence has not been steady – a fact thrown into stark relief as recent conflicts in Ukraine, Venezuela, and Syria presage heightened violence.
Country example: The democratic Republic of Congo (1993–96)
Documenting repressive events in any country is difficult because of limited resources and limited access to areas in which repressive acts take place (e.g. Brysk, 1994). This is especially true in places such as the Democratic Republic of Congo, which, over the last Model based estimates of the yearly number of one-sided government killings beginning in 1946 and ending in 2017
Though the best open-source information – much of which is provided by journalists and monitors on the ground – is used by the UCDP coders, the estimates given are still just that: estimates. Figure 7 displays distributions of the number of one-sided-killings for the Democratic Republic of Congo (1993-1996). Each plot contains the simulated distribution of potential values, the median prediction from our model, and the original UCDP ‘low’, ‘best’, and ‘high’ counts. 1994 is notable because it is the only year for which UCDP was unable to uncover direct evidence of 25 killings or more. Nevertheless, an absence of killing is unlikely, given qualitative evidence, the values on the other observed repression variables, and the UCDP estimates from 1993 and 1995.
According to the US State Department Human Rights reports, in the Democratic Republic of Congo, Country-year distribution of the number of one-sided government killings
Though the frequency of the events was less than prior years, security forces engaged in the extrajudicial killing of civilians in 1994. Because the information environment for this case was poor, sufficient reliable information was not available for this case to enter the UCDP database. Our latent variable model produces a distribution of potential estimates for this case, with a median estimate of 61 deaths.
Conclusion
The measurement model and validation tests presented in this article contribute to a growing research area on using measurement models like this to improve the validity of the variables used to study peace and conflict (e.g. Anders, 2020; Barnum & Lo, 2020; Clay et al., 2020; Cordell et al., 2020; Huddleston, 2020; Krüger & Nordås, 2020; Marquardt, 2020; Meserve & Pemstein, 2020; Montal, Potz-Nielsen & Sumner, 2020; Terechshenko, 2020). Our research makes important improvements in the measurement and understanding of repressive events by linking together count data and categorical variables of repression. In particular, our modeling strategy leverages disagreements in event-counts within data sources, which allows us to generate country-year distributions of estimates of one-sided-killings as part of the latent variable model of repression. Disagreements between event-count estimates exist because of the reporting incentives of monitoring organizations and a lack of transparency or resources with which to completely observe all repressive events. This framework allows us to bring together different sources of information about repression and assess how well each piece of information works together and then, based on assumptions about the way the information was produced, modify, validate, and update the model. We conclude by noting three remaining threats to measurement validity. For each point, the expanding set of human rights data and increasing adoption of new measurement modeling techniques promise to yield additional insights about counting repression.
First, additional data collection for cases where UCDP does not report reliable counts above 25 deaths would allow us to further refine the latent variable model estimates. There is a general challenge in the study of repressive practices because scholars and activists tend to focus monitoring capacity and attention on the most violent cases (Brysk, 1994; Eck & Fariss, 2018). Additional knowledge of the cases not likely to have involved one-sided-killing would improve the performance of the model. Relatedly, our model assumes that the standard of accountability for UCDP one-sided-killing data does not change. The UCDP data, like other event-based variables, are created using many sources and updated over time. These features of the coding process help to account for bias from particular sources, which makes the event-based variables suitable to act as a baseline for comparison with the standards-based variables that do not share this feature.
Second, and building on the first point, scholars are beginning to acknowledge and quantify disagreements between different sources of information. Such efforts should assuage concerns about models that use event-counts from disparate sources of information. Recent research has exploited multiple systems evaluation and capture-recapture models as a promising means of leveraging disagreements between sources to produce more accurate accounts of repressive events (e.g. Hendrix & Salehyan, 2015; Krüger et al., 2013). These analyses are limited to a smaller number of spatial and temporal units. As source-specific information becomes increasingly available, our latent variable measurement strategy provides a principled, model-based approach for incorporating information from new count-based estimation procedures. Linking information from multiple-systems estimation and latent variable models is an important area of new methodological research that directly confronts this challenge.
Third, researchers and activists want to make inferences about more than just country-year units. New data collection efforts are beginning to acknowledge and understand the roles of different state actors who commit human rights violations and the different groups that are targeted. To date, the ITT data project (Conrad, Haglund & Moore, 2013) and the UCDP data project (Eck & Hultman, 2007; Pettersson, Högbladh & Öberg, 2019) are the only data efforts that systematically collect repression data about targets, agents, or non-state actors for all states. Other event-based data collection efforts exist and are also beginning to provide some of this information for specific regions (e.g. Saleyhan et al., 2012). The models presented in this article are capable of systematically linking diverse sources of information and multiple levels of information in one model (e.g. country-year, country-year-actors, country-year-victims, country-year-regions). Each of these measurement challenges represents opportunities for new theorizing, new data collection, and new measurement modeling.
Footnotes
Replication data
The estimates from this article, along with the code necessary to implement the models in STAN and R, are publicly available at a dataverse archive: https://doi.org/10.7910/DVN/7C7KPU. The Online appendix can be found at
.
Acknowledgements
We thank James Lo, Zhanna Terechshenko, and three anonymous reviewers for many helpful comments and suggestions.
Funding
Fariss acknowledges research support from the SSK (Social Science Korea) Human Rights Forum, the Ministry of Education of the Republic of Korea, and the National Research Foundation of Korea (NRF- 2016S1A3A2925085).
