Abstract
Lyme disease has become the most prevalent vector-borne disease in the United States and results in morbidity in humans, especially children. We used historical case distributions to explain vector-borne disease introductions and subsequent geographic expansion in the absence of disease vector data. We used geographic information system analysis of publicly available Connecticut Department of Public Health case data from 1984, 1985, and 1991 to 2012 for the 169 towns in Connecticut to identify the yearly clusters of Lyme disease cases. Our analysis identified the spatial and temporal origins of two separate introductions of Lyme disease into Connecticut and identified the subsequent direction and rate of spread. We defined both epidemic clusters of cases using significant long-term spatial autocorrelation. The incidence-weighted geographic mean analysis indicates a northern trend of geographic expansion for both epidemic clusters. In eastern Connecticut, as the epidemic progressed, the yearly shift in the geographic mean (rate of epidemic expansion) decreased each year until spatial equilibrium was reached in 2007. The equilibrium indicates a transition from epidemic Lyme disease spread to stable endemic transmission, and we associate this with a reduction in incidence. In western Connecticut, the parabolic distribution of the yearly geographic mean indicates that following the establishment of Lyme disease (1988) the epidemic quickly expanded northward and established equilibrium in 2009.
Introduction
L
Typically the use of human case data to track disease progression is inconsistent because of problems associated with sensitivity (misreporting/misdiagnosis of cases), reporting fatigue, changes in case definition, and changes in surveillance techniques. These problems create variation around the true data and may obscure data trends. However, previous studies have shown the value of using human surveillance data to model spatial trends in vector-borne and zoonotic diseases (White et al. 1991, Kitron and Kazmierczak 1997, Glavanakov et al. 2001, Childs et al. 2007). Kitron and Kazmierczak generated a map delineating counties with the highest risk for Lyme disease according to the correlation between tick distribution and human case distribution by county of exposure (Kitron and Kazmierczak 1997). Glavanakov et al. (2001) estimated a regional correlation distance from Lyme disease human incidence reported by New York State counties from 1988 to 1996 using spatial autocorrelation methods. Connecticut has long-term datasets (Petersen et al. 1989) describing Lyme disease case distribution in 1984 and 1985, and the Connecticut Department of Public Health has historical Lyme disease records throughout the state from 1991 to 2012. However, Ertel et al. (2012) detailed potential problems with using the long-term datasets due to changes of surveillance methods.
Eisen et al. (2012) assessed potential reasons for the continued lack of success and control of Lyme disease in the northeastern United States. This study describes the spatial-temporal patterns of Lyme disease from 1984 to 2012 using human case reports in the state of Connecticut with a specific focus on how both eastern and western Connecticut have experienced Lyme disease epidemics (Fig. 1). Two spatially and temporally separate disease introductions are identified. The two epidemics have similar characteristics such as a northern direction of spread, rate of expansion, and trends in normalized incidence after the epidemic wave passes.

Yearly Lyme disease cases in eastern and western Connecticut. The total number of yearly Lyme disease cases in 91 eastern and 78 western towns in Connecticut for 1984, 1985, and 1991–2012. Yearly fluctuations in total Lyme disease cases are influenced by the reporting method, but the trends in yearly cases remain consistent with high case numbers during the epidemic phase and a transition to lower numbers during the endemic phase.
Materials and Methods
Case report data
Connecticut has conducted active Lyme disease surveillance since 1984; federally mandated reporting started in 1987, when a physician-based surveillance system was established. The surveillance system has undergone changes to improve the reporting quality, especially during 1998–2007 when Lyme disease was added to the list of significant laboratory findings (Ertel et al. 2007). All Lyme disease cases used in this study were selected according to the 1987 case report definition (Centers for Disease Control and Prevention [CDC] 1997). Due to reporting inconsistencies in earlier data, this research only used cases from 1984 and 1985 collected by Peterson et al. (1989) and 1991–2012 from the Connecticut Department of Public Health website (
Normalized incidence was calculated as the number of Lyme disease cases reported each year divided by the town population and normalized to 10,000 people. Rates were computed each year for all 169 towns in Connecticut, and geographic mean centers were identified according to incidence rates and locations.
Spatial analysis
After visualizing the yearly incidence for all the towns in Connecticut using ArcGIS 10 (the 24 yearly maps are not shown), and examining the total cases per town from 1984, 1985, and 1991–2012 (Fig. 2A), two disease clusters or epidemic clusters were apparent. After defining the clustered cases in eastern and western Connecticut, Global Moran's I values (p<0.05, Z>2.90 with Bonferroni correction) were used to test for significant spatial autocorrelation of the cases. Geographic structuring of the yearly cases indicates that townships with similar incidence of Lyme disease are more closely related than townships more distant. Global Moran's I values (Boots and Tiefelasdorf 2000) calculated by MATLAB (2009a) using inverse of distance between towns as weights and incidence rate in each town as data, are shown in Table 1.

(
Significant (p<0.05, Z<2.90 with Bonferonni correction) Z-distribution values are in italic and boldface for clarity.
The formula for calculating Global Moran's I values is:
where dij
is the distance between town i and j, ωij
=1/dij
, representing the weight between town i and j, Wi
(Wj
) is the incidence rate in town i (j),
where Xj is the longitude for town j and Yj was the latitude for town j. The yearly geographic mean represents the cluster of the case distribution in each year. Geographic means were calculated using MATLAB (2009a).
To estimate the rate and direction of spread of Lyme disease, the geographic origin was defined as the first spatially significant geographic mean identified by the earliest annual report of Lyme disease cases collected at the township level (Table 1). The latitude and longitude of each geographic mean were then plotted versus time from 1984 to 2012. The model parameters were determined using polynomial fitting for both latitude and longitude of the geographic mean centers. Second-order polynomials resulted in highest correlation values (R 2).
Results
Mapping the total number of cases per town for 1984, 1985, and 1991–2012 showed a high number of cases in eastern and western Connecticut and fewer total cases in the area between the two introductions (Fig. 2A). The map for yearly geographic mean centers showed only one significant cluster of normalized Lyme incidence in 1984 and 1985 centered in East Haddam, in eastern Connecticut. Global Moran's I values confirmed spatial autocorrelation of these clusters but no significant spatial structure in western Connecticut until 1991, when another cluster formed centered in Newtown (Table 1). After establishment, the yearly incidence remained significantly spatially autocorrelated in both eastern and western Connecticut until 2012 and 2009, respectively, indicating two significant clusters of cases. We analyzed the weighted geographic mean of only the significant yearly clusters using normalized Lyme disease incidence to account for differences between town population sizes, and indicated a northern direction and a parabolic geographic rate of range expansion for the two epidemics (Figs. 2B and 3A).

(
Despite two separate introductions, the yearly fluctuations in number of Lyme disease cases are similar in both parts of the state (Fig. 1). Yearly Lyme disease cases generally increased until 2002, at which point they began to decline. However, in 2004 cases started to increase again in both areas until 2009. The fluctuating number of yearly cases did not change the rate of spread as measured by the shifts in yearly weighted geographic means of Lyme disease incidence (Fig. 2B). After establishment, increased Lyme disease incidence at the spreading edge of the epidemic pulled both geographic means northward, although the majority of cases remained in the south where the higher population density resides (Fig. 3A). With few cases in central Connecticut there was little east or west spread in the yearly geographic means (Fig. 3B).
In the eastern Connecticut region, as the epidemic progressed, the yearly shift in geographic mean (rate of epidemic expansion) decreased each year until 2007, at which point the epidemic had encompassed the entire eastern portion of the state and reached equilibrium. The equilibrium correlates to a transition from spread of Lyme disease epidemic to stable endemic transmission and correlates to a reduction in human cases compared to the epidemic phase. The rate of geographic expansion fits a second order polynomial with latitude=−0.0004t 2+0.0189t+41.4147, R 2=0.8414 and longitude=−0.0003t 2+0.0151t −72.3871 (t=1, 2, 8,…., 29), R 2=0.5998 (Fig. 3A, B).
Similarly, in western Connecticut, the parabolic shape of the yearly geographic means indicates that the epidemic quickly spread northward after establishment in the south (Fig. 3A). However, the steep curve indicates a more rapid spread than in the east, and cases had spread throughout western Connecticut by 2003. After the epidemic spread to all of western Connecticut, the total number of yearly cases declined, the geographic mean shifted back to the south, and the case distribution was no longer significantly spatially correlated. Therefore, the equilibrium shifted back toward the center of the western region as the epidemic transitioned to endemic transmission. The rate of geographic expansion fits second order polynomials with latitude=−0.0021t 2+0.0539t+41.3316, R 2=0.8510 and longitude=0.0007t 2 − 0.0077t − 73.3209 (t=1,2,3,…19), R 2=0.8998 (Fig. 3A, B).
Discussion
This analysis of Lyme disease case data in Connecticut revealed strong spatial structuring of two disease clusters, indicating two spatially and temporally independent foci of origin and a recent shift from rapid epidemic Lyme disease spread throughout the state to lower and more stable endemic transmission. Lyme disease has spread uncontrolled throughout the state, and understanding the spatial and temporal spread may aid in predicting current and future risks to guide interventions and mitigation strategies.
In 1977, Lyme disease was first identified in Lyme, Connecticut (Steere et al. 1978) and firmly established before the 1984 survey determined significant spatial clustering around East Haddam. In contrast, the western Connecticut cluster did not become spatially autocorrelated until 1991, and the cluster centered in Newtown. The original sources of I. scapularis for both Connecticut epidemic clusters are likely white-tailed deer introduced from a refuge on Long Island after their extermination from the mainland during the 19th century (Halls 1984). Prior to 1984, I. scapularis was abundant only in the region of Lyme, Connecticut (Wallis et al. 1978, Petersen et al. 1989), and these likely arrived from deer swimming in the Long Island Sound (Halls 1984). In contrast, the western Connecticut epidemic likely originated during the 1980s from the expanding Lyme disease epidemic in Westchester County, New York, which is immediately adjacent to Newtown, Connecticut (Williams et al. 1986, White et al. 1991). Furthermore, white-tailed deer examined in western Connecticut for the presence of I. scapularis or antibodies to B. burgdorferi yielded negative results prior to 1988 (Anderson and Magnarelli 1980, White et al. 1991).
Two separate introductions of Lyme disease explain the case distribution and rapid spread throughout the state better than a single introduction in the eastern half of the state because the Connecticut River running north to south through central Connecticut separates the eastern and western clusters. Peterson et al. (1989) hypothesized the river stopped the spread of deer and the disease east and west. Furthermore, this physical barrier was previously shown to impede the easterly spread of diseases such as raccoon rabies to townships lying west of the river (Lucey et al. 2002, Smith et al. 2002, 2005). For Lyme disease to appear in an area, three elements are necessary: Lyme disease bacteria (Borrelia), ticks that transmit the disease (I. scapularis), and mammalian hosts for the pathogen and ticks (such as mice and deer) (Turney et al. 2014). The river may impede the spread of mice and small mammals, which are alternative hosts.
The yearly case distributions also indicate a northward expansion during the epidemic phase, which is characterized by high incidence and geographic expansion. The large population in southeastern Connecticut obscured the expansion of the disease north in previous studies because the majority of cases remain in the more populated areas. However, by using the weighted geographic mean analysis with incidence data, various population sizes of each town are accounted. The end of the epidemic phase is characterized by a decreasing rate of spread indicated by the movement of the weighted geographic mean and a reversion of the geographic mean toward the middle of the state. During the endemic phase, the number of Lyme disease cases is more proportional to the population. Therefore, more cases are reported in the more heavily populated southern areas of the state. In western Connecticut where the epidemic was more recent, the geographic rate of expansion was greater as indicated by the steeper slope. In contrast, in eastern Connecticut, the epidemic was slower and more stable due to establishment of Lyme disease from an earlier introduction in Connecticut. The epidemic to endemic transition was observed in both the eastern and western epidemics.
All surveillance activity is subject to human variation and reporting bias, as well as other factors such as changing case definitions and fluctuations in reporting of a novel condition or during an epidemic, rather than after the transition to endemic disease transmission. Throughout this study, the data were gathered using passive surveillance, active surveillance, enhanced laboratory, and mandatory laboratory surveillance by the Connecticut Department of Public Health. The reporting requirements and methods greatly influenced the number of cases reported each year (Connecticut Health Department 2005). However, the geographic distribution of the cases and the trend in weighted geographic means remained consistent in both eastern and western Connecticut during the epidemics and transitions to endemic transmission. Therefore, spatial analysis of epidemiological data for vector-borne zoonosis, such as Lyme disease, provides valuable insights into underlying processes of disease spread such as disease vector distributions and normalized incidence of epidemic versus endemic transmission. Last, spatial and temporal risk analysis based on epidemic expansion rates can help predict current and future risk. In at risk areas, prompt recognition and treatment of the disease and prevention measures, such as personal protection, habitat manipulation, and vector control can be implemented to reduce the hazard.
Conclusions
Historical case records were used to identify two geographically and temporally distinct introductions of Lyme disease into Connecticut. The two introductions have high statistical clustering and a uniform direction and rate of spread until equilibriums were established in both eastern and western Connecticut. This study confirms the importance of the Connecitcut River dividing the state and the introductions from Long Island. Furthermore, the results demonstrate Lyme disease spread rapidly throughout the state during an epidemic phase of high incidence and then transitioned to an endemic phase in recent years, which is characterized by a lower incidence.
Footnotes
Acknowledgments
Funding and support had no influence on the findings or the investigators. This research was funded by US Department of Agriculture (USDA), the Agricultural Research Service with Specific Cooperative Agreement 58-5430-1-0356. The funding agency had no input in study design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication. The Lyme disease case data are publically available either in the publication of (Petersen et al.
) or on the Connecticut Department of Public Health (CDPH) website. As such, Lee W. Cohnstaedt, PhD, at the USDA Agricultural Research Service had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. The data were analyzed by Ling Xue and Caterina Scoligo, PhD, at Kansas State University and Lee W. Cohnstaedt. The manuscript was written and reviewed by Ling Xue, Caterina Scoligo, David S. McVey, PhD, Rebecca Boone, and Lee W. Cohnstaedt.
Author Disclosure Statement
No competing financial interests exist.
