Abstract
While U.S. homicide rates remain at near historical lows, large percentage increases in homicide in 2015 have generated much speculation about its causes among policy makers, academics, criminal justice practitioners, media representatives, and other social commentators. In this article, we show how two data visualization tools, funnel charts, and time series fan charts can show the typical volatility in homicide rates in different cities over time. Many of the recent increases are not out of the norm given historical patterns, and so one need not rely on various ex ante hypotheses to explain homicide spikes occurring in many U.S. cities.
Introduction
In the aftermath of several incidents involving police officer shootings of minority citizens, the media has focused on increases in homicide rates in several cities including St. Louis, Chicago, and Baltimore. These increases have been linked to particular events occurring within those cities such as civil unrest and a reduction of police activity (Dewan, 2017; Fagan & Richman, 2017; Morgan & Pally, 2016; Rosenfeld, 2015; Towers & White, 2017; University of Chicago Crime Lab, 2017). To explore the validity of such claims, this article introduces two graphical methods, funnel charts, and time series fan charts to evaluate whether homicide rates are atypical between cities in any particular year or within cities over time. The application of these techniques allows us to advance the understanding of homicide trends and to offer researchers better tools for analyzing crime data. Both of which may serve to better inform the media, public, and policy makers.
Here, we show that homicide rates are a volatile measure, and prior attempts to identify extreme changes over time, in particular percent changes in homicides, produce exaggerated evidence of change. Homicides are rare events, thus rates in cities with smaller populations will tend to have more volatile fluctuations over time. Using funnel charts, we will show that several cities, such as Chicago, Baltimore, and St. Louis, have had historically high homicide rates compared with the cities of similar population sizes consistently since at least the 1980s. Generating prediction intervals using a random effects binomial regression model will also show that recent increases in those cities are not atypical given historical homicide rates over time.
Identifying Changes in Homicide Rates Over Time
One of the most common ways to identify recent increases in homicides over time is to calculate a percent change. This approach is common both in the popular press, 1 as well as in academic articles (Baumer & Wolff, 2014; Parker, Mancik, & Stansfield, 2016; Rosenfeld, 2016; Stamatel, 2014). For example, given a homicide rate of 10 (per 100,000) in the year 2000 and a homicide rate of 12 in the year 2010, the percent change over the 10-year period would be (12 − 10) / 10 = 0.2, or 20%. Although percent change is simple to calculate and interpret, it has particularly poor properties when examining low-rate homicide trends. First, the variance of the metric is not defined (Wheeler, 2016), so an actual outlying value is difficult to identify. Second, the variance of the metric is influenced by the denominator, so places with lower homicide rates will have more volatile changes from year to year. Third, the variance of the metric is asymmetric; a change of a homicide rate from 4 to 5 is a 25% increase, whereas a change from 5 to 4 is only a 20% decrease. This subsequently will result in larger percentage increases than percentage decreases.
Highly varying percent changes can be seen in reported samples of homicide rates. Stamatel (2014) shows decreases of 406% to increases of 100% for homicide rates in various countries from 1956 through 1998. Baumer and Wolff (2014) show percent decreases in large U.S. cities of up to 80% from 1993 to 2001. Parker et al. (2016) show percent changes of −90% to +200% in a sample of 199 U.S. cities for more recent crime trends from 2007 through 2011. Rosenfeld (2016) shows percent changes in a selected sample of large U.S. cities of +120% to −40% for changes in homicides from 2014 to 2015. Such ranges suggest that even percent changes of over 100% for large polities are not uncommon and should not per se be considered unexpected simply by chance. Because of this year-to-year variability researchers often aggregate homicide rates over multiple year spans and analyze those (see Beaulieu & Messner, 2010, for one example).
A more formal, statistical approach researchers have used to identify changes in homicide trends is to estimate structural breaks in trends over time (Cook & Cook, 2011; Messner, Deane, Anselin, & Pearson-Nelson, 2005; Piehl, Cooper, Braga, & Kennedy, 2003; Pyrooz, Decker, Wolfe, & Shjarback, 2016). For example, Pyrooz et al. (2016) in trying to identify a Ferguson effect, 2 use random effect growth models and test for linear change in the pre to post crime rates in 81 cities exceeding a population of 200,000. Using 12 months of pre and post data, they identify several cities with large increases in homicide rates.
The approach in Pyrooz et al. (2016) relies on specifying the exact intervention date where the break would be associated and also uses a restrictive functional form of homicide rates over time (a linear trend in the 12 months before and after). Messner et al. (2005) use spline regression techniques, where the knots of the splines are not specified but estimated (using a model selection technique, stepwise regression) to identify the location of change points in annual homicide rates in cities of over 250,000 between 1979 and 2001. They related the different timing of structural breaks to epidemics and suggested that these breaks were largely identifying booms and busts for homicides related to the crack epidemic.
These techniques are limited in their ability to identify whether recent increases in homicides are atypical. First, to test for a structural break, one needs multiple data points before and after the break (Piehl et al., 2003). One cannot identify whether one observation represents a structural break. Second, none of the prior research listed used models that take into account how homicide rates for smaller cities are more variable. Although the majority of cited research uses selection criteria of only examining cities of over 100,000 or more population, we will show that there is still a large amount of expected variation between cities of less than a million population as compared with higher population cities. That is, one would expect much larger year-to-year changes in the homicide rate for a city with a population of say 100,000 versus a city with a population of 1 million.
Data Sources
For this analysis, we use the Federal Bureau of Investigation’s (FBI) Uniform Crime Reporting (UCR) reported homicides and population estimates between 1960 and 2015 at the agency level. The UCR data from 1960 through 2015 were obtained from Interuniversity Consortium for Political and Social Research (ICPSR) UCR data series. As of the analysis, the full 2016 UCR data have not been published by the FBI, only the half-year 2016 estimates.
Although errors in UCR crime reporting data are well documented (Loftin & McDowall, 2010; Maltz & Targonski, 2002), they are sufficient for the present purposes. Nonetheless, we take several steps to alleviate potential errors. First, we only include cities and years that have reported for the full 12 months. We do not rely on imputed information, and the models presented in the article do not rely on contiguous years of reporting. Second, the use of the graphs in the article easily identifies erroneous high data entries or outliers (Maltz, 2010). Although we have undoubtedly missed some errors, we have provided an appendix that describes all the data anomalies spotted and corrected in the historical agency-level UCR data provided by ICPSR.
To identify outliers, two different data analysis and visualization techniques are shown. The first will use funnel charts to identify outliers between cities in any one particular year, and the second will use forecasts and fan charts to identify outliers in changes over time.
Funnel Charts for Between-City Comparisons
It is inappropriate to compare homicide counts between different cities that have very different population sizes; therefore, we calculate homicide rates per capita, typically in terms of per 100,000 population. Less appreciated by researchers is that there are different underlying variabilities in homicide rates between cities. For example, if the national homicide rate is 5 per 100,000, and the observed rates in different cities are simply due sampling error (where the sample is the number of individuals within a city), cities of a population size of 100,000 would be expected to have a homicide rate within 1 and 10 per 100,000 95% of the time, 3 whereas cities with a population size of 1 million would be expected to have a homicide rate within 3.7 to 6.4 per 100,000 95% of the time. Thus, in this example, a homicide rate of 9 for the city of 1 million is a larger outlier than a homicide rate of 9 for a city of 100,000 relative to the national average.
Wainer (2007) describes several examples where parties made erroneous conclusions based on identifying outliers in data with varying errors around the measurements. For example, the push to create smaller schools was based on the observation that the majority of schools with high average-standardized test scores had smaller enrollments. However, this is simply an artifact because smaller schools had more sampling error about the mean, and so many small schools also had very low standardized test scores.
Another example with similarities to homicide rates would be if one mapped areas with high or low rare cancer rates, one would mostly identify counties in rural areas for both (Gelman & Price, 1999). This again is because the rate is more volatile for places with fewer individuals, and even in large populations, the standard error around those rates is quite large because cancer is a rare event.
A chart to display this information in a more succinct fashion is a funnel chart (Spiegelhalter, 2005). Academics may be most familiar with funnel charts from meta-analyses, in which one plots the estimated effect size of the study against the size of the sample (or some other measure of standard error). When considering homicide rates though, the standard error is directly a function of the population size for a city. In particular, if one has an estimate of the centralized value (such as the national homicide rate), one then estimates a measure of standard error around that proportion. Here, we use the exact binomial Clopper–Pearson confidence interval to denote standard errors on the chart. One way to write the interval is (Agresti, 2002):
where B is the beta distribution at quantiles
Cities with smaller populations are expected to have greater differences in homicide rates compared with cities with larger populations, and one draws these confidence limits as lines on a chart to denote when cities are outside of the expected bounds. Figure 1 displays a funnel chart for homicide rates for 3,279 agencies of over 10,000 population in 2015. The gray error bands signify a 99.9% confidence interval around the overall homicide rate within the sample of 4.9 per 100,000.

Funnel chart for homicide rates.
Figure 1 displays several cities outside of the funnel chart ranges, but the majority of locations follow the spray pattern according to their population. It also demonstrates that many, but not all, larger cities are outside of the error bar ranges. The largest city in the sample, New York City, with nearly 10 million residents, falls within the narrow band for the overall homicide rate for the entire nation. In general, larger cities have homicide rates significantly higher than the national average.
The graph also highlights small agencies that have exceptionally high reported homicide rates, although the expanding intervals do a good job of capturing the majority of small populations in the graph. East St. Louis (Illinois), Chester (Pennsylvania), Gary (Indiana), and Prichard (Alabama) stand out among the smaller cities.
Finally, the graph also highlights a grouped set of outlier cities that have very high homicide rates given their populations of more than 100,000 but less than 1 million. This set of cities includes St. Louis and Baltimore at the top, with Flint, Detroit, New Orleans, Birmingham, Kansas City, and Jackson slightly below.
Evidence against idiosyncratic hypotheses about recent homicide increases in those particular cities can be assessed by viewing such funnel charts over the past six decades. Figure 2 displays the same 99.9% confidence intervals for each decade from 1960 through 2010 and highlights many of the same cities as being exceptionally high over the historical time period for which we have measurements. For example, St. Louis has had high homicide rates compared with cities of similar sizes since the 1970s. The same is true for Baltimore since the 1980s. Any reasonable explanation for the current homicide trends in any of these particular cities must take into account the historically very high homicide rates in those places, dating back to at least the 1980s.

Funnel charts from 2010 to 1960 by decade.
In addition, the funnel charts display the homicide increase and decrease over the five decades, with very few cities being clearly above the funnel in 1960, whereas more and more cities accumulate into the high homicide ranges from 1970 through 1990. The year 2000 then displays fewer outlying cities in the 10,000 to 100,000 population range. In 2010, one will recognize that several of the smaller cities, East St. Louis and Chester, have consistently had high homicide rates relative to other cities of similar size going back over 5 years. In the case of Gary, Indiana, one can track the decline of population over time as well, as it was a high outlier with more than 100,000 population in 1980 but has declined to less than 100,000 population (as of 2006) while maintaining a relatively high homicide rate.
Although examining funnel charts over time provides one way to assess change, it is difficult to see the natural progression of cities over time, and it is not easy to assess whether recent increases should be seen as outliers or signify a meaningful change in homicide rates. We address this in the next section by formulating a model to predict future homicide rates and displaying the models prediction intervals using fan charts.
Time Series Fan Charts for Within-City Comparisons
The prior funnel charts are only sufficient to examine variation between cities. Such that one can identify if a particular city has a high homicide rate while taking into account the variability between the different population sizes between two cities. But one is often interested if changes within one city over time are sufficient to identify whether homicide rates are substantively increasing. To identify those outliers, one needs a model of what would be expected in the future given historical homicide rates in a particular city.
To accomplish this, we fit a binomial random effects model 4 :
This model estimates the homicide rate in City i and Year t as a probability
This model is equivalent to a logistic regression, where each row is an individual in the population and each is assigned 1 if they were murdered and 0 otherwise (Britt, Rocque, & Zimmerman, 2017). But the model can be fit with aggregated data, so there is no need to expand the data set to many millions of individual cases. Although interpreting homicide rates as a probability may seem strange, in terms of providing future predictions, it is not logically any different than modeling the logarithm of the homicide rate per 100,000 population (McDowall, 2002). The homicide rate per 100,000 is simply one way to express a percent. In our experiments, this model provided the best coverage of future predictions, compared with negative binomial regression models and/or using fixed effects instead of random effects.
Although this on its face appears substantially different than the more typical Poisson regression of homicide counts and using the population count as an offset (Osgood, 2000), it is not. This is because the Poisson distribution is a special case of the binomial distribution when the number of trials is very large (Kingman, 2005), which is the case for cities with populations over 100,000. But the binomial regression model has the advantage that the intrinsic variance due to differing population counts is taken into account for future predictions.
Although the model can undoubtedly be improved, we take this as a simplified baseline model, not unlike fixed effects models used in prior research (McDowall & Curtis, 2015; McDowall & Loftin, 2009). The random effects model has the added benefit over the fixed effects model of shrinking future predictions. The idea behind shrinkage is synonymous with regression to the mean. If a location has a homicide rate of 20 one year, and the national level homicide rate is 5, the best prediction going forward into the future is somewhere between 5 and 20 (Efron & Morris, 1977). These shrunk estimates tend to provide better forecasts than fixed effect estimates (Savitz & Raudenbush, 2009).
Next, we generate future prediction intervals by simulating 1,000 posterior draws from the random effects model (Gelman & Hill, 2007). Prediction intervals are substantially easier to interpret than confidence intervals. For example, given a 90% prediction interval, if correct, that interval should cover future observations 90% of the time. 5 Here, we fit the model with data from 1960 through 2014, 2015 homicide rates are not used at all in the fitting process. Then, we check coverage for the model by comparing posterior simulations for the prior year with the future year, for example, posterior simulations for Chicago in 1970 are used to determine whether they cover the homicide rate in Chicago in 1971. We generate 50% prediction intervals based on the 25th and 75th quantiles of the posterior simulations, as well as 80% prediction intervals based on the 10th and 90th percentiles. These are displayed as varying shades in the graphs, using darker gray for the 50% interval and lighter gray for the 80% interval (Maltz & Zawitz, 1998). These are sometimes called fan charts (Spiegelhalter, Pearson, & Short, 2011).
Figure 3 displays an example fan chart for the city of New Orleans based on the random effects binomial model. The forecasted intervals are in gray, and the actual observed homicide rates are displayed as black circles. In the historical data for New Orleans, year-to-year changes of 10 to 20 in the homicide rate (up or down) are not uncommon. There is also a large spike in 2007 (the year in which Hurricane Katrina occurred), up to a homicide rate of over 90 and subsequent regression to more usual homicide rates hovering around 50. To be outlying based on this model, New Orleans would have needed a homicide rate of over 70 per 100,000 in 2015. 6

Fan chart for New Orleans homicide rate predictions.
To evaluate the overall effectiveness of the models’ prediction intervals, we calculate how well the 50% and 80% prediction intervals cover the observed data. Table 1 displays those coverages for the historical (in sample data) and the out-of-sample 2015 data. For both, we can see that the model is not perfect but has close to the expected coverage.
Coverage Rates for the 50% and 80% Prediction Intervals.
To read Table 1, the lower 25th percentile from the posterior simulations should have around 25% of the observed values lower than predicted. 7 We see that in the historical sample, lower values than predicted occur 30% of the time. So the lower bound appears to be slightly too high across the samples. We also see that the high interval has slightly too few outliers, at 20%. Examining the wider 80% interval, the same pattern occurs, and the lower interval has 19% below the predictions, but the upper interval is very close to nominal coverage with only 9% above the 90th percentile.
In the out-of-sample 2015 data, we see that the coverage is very near to what is expected. Twenty-five percent of the cases are above the 75th percentile, and 10% of the cases are above the 90th predicted percentile. Again, the lower bounds are not low enough, but as most interest is in identifying cities with rising homicide rates the model appears to be effective.
Tables 2 and 3 display the agencies that have higher (n = 53) than expected homicide rates, and Tables 4, 5, and 6 display those cities that have lower (n = 104) than expected homicide rates in 2015. Each is based on the 80% prediction interval. The majority of the cities with high homicide rates are smaller and have around 100,000 population, with the exception of Milwaukee 8 (population 600,400) and Nashville (population 658,029). There are several instances of state and county police agencies as well, such as the Connecticut State Police and St. Louis County Police. The tables are arranged so that the cities with the largest difference from predicted are sorted to the top. In Table 2, Portsmouth, Virginia, was the largest high outlier, with an observed homicide rate of 28 per 100,000, which is far outside the 80% prediction interval of 7.5 to 13.7. For the low outlier cities, the majority are cities that experience zero homicides. The largest low outlier is Lowell, Massachusetts (the first row of Table 4), which experienced zero homicides but had a lower prediction interval of 2 homicides per 100,000.
Fifty-Three Cities That Have Higher Homicide Rates Than Predicted in 2015 (Based on 80% Prediction Intervals; 1-26).
Fifty-Three Cities That Have Higher Homicide Rates Than Predicted in 2015 (Based on 80% Prediction Intervals; 27-53).
One Hundred Four Cities With Homicide Rates Below Predictions (Based on 80% Prediction Intervals; 1-35).
One Hundred Four Cities With Homicide Rates Below Predictions (Based on 80% Prediction Intervals; 36-70).
Note. Sp = State police
One Hundred Four Cities With Homicide Rates Below Predictions (Based on 80% Prediction Intervals; 71-104).
Note. Sp = State police
Figure 4 displays a set of panel charts showing several additional cities and their forecasted intervals. Dallas and New York City 9 show the homicide decline, and given very little volatility over the prior years have subsequently smaller prediction intervals. In contrast to this, Chicago has shown larger fluctuations since 1960 and subsequently has larger prediction intervals. Based simply on the prediction interval for 2015, Chicago’s homicide rate in 2016, 28 per 100,000 (University of Chicago Crime Lab, 2017), would not be outside but near the upper limit of the 2015 80% prediction interval. Baltimore shows the widest prediction intervals compared with the other three cities, partly because it is a smaller city but also because it bucks the trend of the majority of other cities (it has not shown similar homicide declines as compared with the rest of the data set during the 1990s). The sharp rise in 2015 to a homicide rate of over 50 is outside the 75th percentile, but easily within the 90th percentile forecast.

Example fan charts for Dallas, New York City, Chicago, and Baltimore.
Figure 5 displays fan charts for four cities that showed outlying values in 2015. Portsmouth, Virginia, is quite high, but it appears in this case that the model does a poor job of estimating the historical volatility in that city, as similar jumps were observed in the early 1990s. Milwaukee’s homicide rate of 24 is only just above the predicted 90th percentile of 23.

Fan charts for example cities that have more homicides than predicted (Portsmouth and Milwaukee) and fewer homicides than predicted (Lincoln and Boise).
Lincoln and Boise demonstrate example cities that have lower homicide rates than predicted. This is due to examples of zero homicides when at least one would be expected. The upper intervals though appear to be reasonable. In 2016, Lincoln reported a total of 11 homicides, which results in a homicide rate of 4 per 100,000. This is just at the upper end of the predicted homicide rate in 2015.
Discussion
In this article, we introduce two ways to evaluate a city’s homicide rate. First, we introduce funnel charts to make more accurate comparisons between cities that have differing sizes of the population, and we introduce fan charts to show when cities have outlying values based on prior homicide rates in a particular city. With the funnel charts, we show that several cities with less than 1 million population have had very high homicide rates relative to the national average for 30 years or more. Second, we illustrate the use of fan charts with prediction intervals based on a random effects binomial regression models. The prediction intervals for our models have appropriate coverage both with historical in-sample data and out-of-sample 2015 data. Fan charts can be effectively used to identify outliers for a particular city based on past homicide rates.
Although homicide rates remain substantially lower than they were at the peak of the crack epidemic of the late 1980s and early 1990s, the 10.8% increase in the national homicide rate between 2014 and 2015 (and the expected 8% increase for 2016) represents some of the largest year-to-year percentage increases seen in homicide rates in decades. In response, a number of hypotheses have been advanced to explain these seemingly sudden increases. Two of the most popular are civil unrest as a result of prominent cases of police shootings (Desmond, Papachristos, & Kirk, 2016; Morgan & Pally, 2016) and depolicing in reaction to criminal and civil litigation after incidents of police brutality, also known as the Ferguson effect (Fagan & Richman, 2017; Morgan & Pally, 2016; Pyrooz et al., 2016; Towers & White, 2017; Wolfe & Nix, 2016). But, as we show in this article, many of the cities exhibiting increases in homicide rates are often within future predicted intervals and have had historically high crime rates compared with national averages. One need no special interpretation for many of the recent increases in homicide rates in cities besides simply knowing that such year-to-year changes are not unprecedented.
Many of the potential explanations popular in the media appear to be cherry-picked to fit a popular narrative. For example, while decreases in arrests and stops are frequently cited to explain the rises in homicides in Chicago and Baltimore, New York City has seen similar decreases in stops due to civil litigation since 2011 (Sweeten, 2015) but has not seen subsequent increases in homicides in that time frame. Disseminating information about the expected variance of homicide rates over time in fan charts is a simple way to prevent the media, policy makers, and the public from over interpreting by-chance fluctuations.
One can draw comparisons with the crime decline in the 1990s as well. There has been a plethora of theories about the mechanisms that resulted in the crime decline (Blumstein & Wallman, 2000; Levitt, 2004; Rosenfeld, Fornango, & Baumer, 2005). Despite accumulating evidence over time, there is relatively little consensus over what caused this decline (Farrell, Tilley, & Tseloni, 2014). Large scale trends are based on observational data, so such evaluations tend to have low internal validity (Berk, 2005; Bushway & McDowall, 2006). Much of the narrative about the homicide increase follows this same rhetoric, albeit with a much smaller temporal sample. The graphs presented here should effectively illustrate the typical volatility in homicide rates and prevent premature conclusions about whether some external shock is causing homicides to increase in any particular city. Future researchers should always consider both the size of the city and the historical levels of homicide when conducting analysis on homicide rates for any particular sample of cities.
What implications do our findings have for other researchers conducting analysis of homicide trends over time? Our research highlights two important insights. One, by identifying cities with very high levels of homicides relative to other cities of similar population size going back to the 1980s, the analysis demonstrates that whatever factors are responsible for cross-sectional differences in homicide rates between cities were established in the distant past, have remained relatively stable, and have continued to manifest themselves over the study period examined here. This suggests that long-term structural factors, such as residential segregation or concentrated disadvantage, may better explain homicide rates between cities over long periods of time (Pyrooz et al., 2016). Researchers focusing only on very recent homicide trends are likely to overestimate the effect of recent shocks compared with these factors. Second, we have shown that many cities have followed similar homicide trends over a long period of time. This suggests that national level explanations are important for understanding changes in homicide trends over time (McDowall & Loftin, 2009). More importantly, scholars attempting to explain idiosyncratic local factors for homicide changes, such as depolicing or civil unrest, need to establish that the city under study is truly unique in its increase or decrease in homicide trends. Many of the recent explanations for increases in homicide in specific cities do not stand up to scrutiny when examining trends in cities that have experienced similar exogenous shocks but have not witnessed the same increase in homicides.
As with all social science research, there are limitations to our study. First, funnel charts and fan graphs as presented do not incorporate any other demographic characteristics into the models. For funnel charts, these might include adjusting the homicide rate estimates per well established covariates of crime, such as poverty rates. However these covariates and their effects tend to be fairly temporally stable over time (Griffiths, 2013; Tcherni, 2011). As such, we would likely see similar patterns in the funnel charts over time. These covariates can also be incorporated into the time series fan charts; however, it is an open question as to whether they would reduce the size of the prediction intervals. That is a question for future researchers to examine.
Second, it is possible that the model could be improved by incorporating temporal aspects, such as an autoregressive term for the prior year’s crime or a nonlinear temporal trend component. But, if homicide rates are a random walk (McDowall, 2002), the best point estimate one can use to predict future rates is last year’s homicide rate. Although the model entails some simplification, we provide good evidence that the model is adequate for identifying outlying high values. We show this via the models having appropriate coverage both in sample and out of sample. In regard to the lower bound, examining Tables 4 to 6 show many cities with zero homicides (Loftin & McDowall, 2003), and zero-inflated models may improve the lower bound estimate.
Finally, the reliance on the UCR data, which has a long history of problems with data reporting, is also a limitation of our study (Loftin & McDowall, 2010; Maltz, 1977; Maltz & Targonski, 2002). Although the use of funnel and fan charts has the ability to spot erroneous high data entries, smaller cities regularly have zero homicides. If an agency had not reported and was mistakenly recorded as having zero homicides, such graphs are not as likely to uncover those data entry errors. Still, the same idea of funnel or fan charts can be applied to different sources of homicide data, such as at the county-level via the Vital Statistics reporting program, and can be applied in the same way.
Finally, what is an appropriate policy response to these models? If Baltimore and Chicago show large homicide increases but are within future prediction intervals, should there be a response by these cities? And if a city is outside of their prediction bands, should policy makers or criminal justice professionals “do something” in response? The fact that homicides are rare and volatile makes it challenging for researchers to identity whether the ups and downs are directly attributable to any particular exogenous factor. One should be cognizant of this limitation when attributing increases or decreases in homicide rates to any recent phenomena, as that logic is particularly susceptible to post hoc ergo propter hoc. To identify whether a recent homicide increase is tied to a recent factor, say perceptions of police legitimacy, one would need to determine whether historical measures of legitimacy explain prior homicide rates both within a city and between cities over time. In short, one needs good social science research to ascertain causes of any homicide increase.
This is true whether a city’s homicide rate is within the predicted interval or not. Prior exogenous shocks, as well as actual structural changes in a city’s homicide rate, will result in larger year-to-year changes and make prediction intervals going into the future wider. Subsequently, this does not mean that a city’s homicide rate being within the reported prediction interval falsifies any particular claim that homicide rates are increasing in that city or that some external force is causing homicides to go up. Such fan charts are however a reasonable tool to gauge the noteworthiness of any particular increase compared with historical patterns. This is especially true in relation to citing percent changes, which can be hyperbole and manipulated to ignore historical trends. Some recent increases in homicides are clearly noteworthy, such as Chicago (in 2016) and Baltimore (in 2015), even though they are within the future prediction intervals. But one needs to always keep in mind the inherent volatility in homicide rates as one spike may be indicative of a structural break in the a city’s homicide rate but does not necessarily guarantee a trend going into the future. See the fan chart for New Orleans for a recent example, which although observed a high spike in homicides the year hurricane Katrina occurred, returned to more typical levels in the years to follow.
Understanding causality is important before any reasonable policy response can take place. For example, increases and decreases in homicide rates in Chicago have been largely attributed to public interpersonal disputes. Domestic homicides in Chicago have steadily decreased since the 1960s (Block & Christakos, 1995) and more recent examination of these trends show that this continues to be the case (Wheeler & Block, 2017). Such patterns may not hold for all cities. If recent increases are due to domestic homicides, a much different policy response is required than if gun or gang violence are major contributors. Speculatively attributing any recent change in homicides to one factor and formulating a policy response is not likely to reduce future homicides.
Once a particular factor is identified, one can often draw on a wider array of evidence than homicide rates. For example, if gun violence is a major contributor to a recent increase in homicide, the researcher would examine whether other crimes involving guns, such as robbery and assault, also increased in the same time period. Given these crimes are more prevalent than homicides, the data may provide clearer evidence as to whether local factors related to firearms also contributed to a recent increase in homicides.
Evidence arguing against this approach though is the fact that assaults and homicides may differ in general trends (McDowall & Curtis, 2015). It is also the case that homicides are often examined because they are viewed as more reliable measures of violent crime due to differential reporting (Loftin & McDowall, 2010). Some speculated factors that influence crime trends, such as perceptions of police legitimacy, may also influence reporting of victimization to the police (Desmond et al., 2016). Thus making it necessary to rely on homicide statistics that are not potentially biased due to changes in reporting patterns or discovery by the police.
Subsequently, if one does want to make specific statements about increasing or decreasing homicide rates, it is essential to understand the typical uncertainty in those rates for cities of different sizes and the volatility of homicide rates over time. The graphs displayed in this article effectively accomplish both those goals.
Footnotes
Appendix
This appendix is provided to describe the data manipulations for the Uniform Crime Reporting (UCR) data series before modeling. When compiling the full UCR reported data from 1960 through 2015, this results in a total of 913,335 jurisdiction-years. For data analysis, first, all jurisdiction-years that did not report a full 12 months were eliminated (n = 261,515). Of those cases, the majority were listed as reporting 0 months (n = 214,790). Second, jurisdiction-years reporting zero for population served were also eliminated (n = 165,515 in the original database), as well as those with missing values for population served (n = 3,185). Zero-population jurisdictions occur regularly for agencies serving larger populations, such as state police agencies (Maltz & Targonski, 2002). Third, cases in which negative homicides were reported were eliminated (n = 35). After these operations, the original database of 913,335 jurisdiction-years was reduced to 552,555 cases.
The second stage of correcting data values occurred for records that were clearly anomalous. In 1993, three cities reported homicide totals of over 1,000 which were clearly in error—Newport News, Virginia; Atlantic City, New Jersey; and Athens, Georgia. These homicide totals were adjusted to 22, 10, and 13, respectively, which were taken from the online UCR data tool, https://www.ucrdatatool.gov/. The Interuniversity Consortium for Political and Social Research (ICPSR) UCR data series for New York City included the homicides occurring during the terrorist attack of 9/11. Subsequently, the homicide counts for New York City during 2001 were changed from 3,452 to 649, which is what the NYPD (New York City Police Department) lists currently at http://www.nyc.gov/html/nypd/downloads/pdf/analysis_and_planning/seven_major_felony_offenses_2000_2015.pdf.
The last data check concerned false originating agency (ORI) codes (the code used to designate each reporting jurisdiction) or changing ORI codes over time. The first two letters of the ORI correspond to the state reporting, and for the years of 1967 and 1970, Fairfax, Virginia, reported their ORI as “SC02901” instead of “VA02901,” although the agency name and the state were correct in the original data files (so could subsequently be distinguished from “SC02901” Lancaster, South Carolina).
For changing ORIs over time, initially the authors noticed that the ORI for the Alaska State Troopers changed from “AKASP00” from 1960 through 2013 to “AKAST01” in 2014. After seeing this error, the authors sorted the final data set by ORI and year and did a manual check to see whether any other agencies also had obvious changes in their ORI. The only other case found was Baltimore County, which was reported as “MD00300” for the years of 1967 through 1970 but was otherwise listed as “MD00301” from 1960 through 2015.
Inevitably, we will have missed some anomalous reporting. In particular, while the funnel charts and fan charts do an excellent job of spotting outlying high data values for homicides, because homicide is rare, they will not often highlight zero homicides which are actually missing data (Loftin & McDowall, 2010; Maltz, 2010). So if such jurisdiction-years were mistakenly reported as having the full 12 months, such cases would not be highlighted by our charts, unless the cities were very large (such as over 500,000 population).
Authors’ Note
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
