Abstract
Regular physical activity is strongly linked to prevention of costly chronic health conditions. However, there has been limited examination of the impact that level of participation in physical activity promotion programs has on health care costs. This study examined a fitness reimbursement program (FRP) offered to small employers. FRP participants received $20 reimbursement every month they visited their fitness center ≥12 days. Visits were recorded electronically. Participants were assigned to 4 mutually exclusive cohorts by mean monthly fitness center visits: low (<4 visits); low-moderate (≥4 and <8 visits), high-moderate (≥8 and <12 visits), and high (≥12 visits, which qualified for reimbursement). Cohorts were matched by inverse propensity score weighting on demographic, health status, health care supply, and socioeconomic characteristics. Between-cohort differences in propensity score-weighted health care costs, starting from FRP program sign-up, were examined with a generalized linear model. Analyses were conducted with and without high-cost outliers during the pre- and post-FRP period. A total of 8723 participants (mean follow-up: 11.1 months) were identified during October 2010-June 2013. With high-cost outliers removed (n = 226), a pattern of lower per-member-per-month health care costs was observed with increasing participation: compared with the low cohort, monthly savings were: $6.14 (2.6%) for low-moderate (P = 0.60), $16.40 (6.9%) for moderate-high (P = 0.16), and $20.01 (8.4%) for high (P = 0.08). With high-cost outliers included, significant monthly cost savings were observed for the moderate-high ($43.52, P < 0.01) and high ($52.66, P < 0.001) cohorts. These results indicate directionally positive cost outcomes associated with increasing level of fitness center participation.
Introduction
T
Employers have a vested interest in their employees' health to improve productivity and control rising health care costs. In 2014, 74% of US employers who provided health insurance to their employees also offered at least 1 health and wellness program. 11 Programs aimed at encouraging physical activity, such as gym membership discounts or on-site exercise facilities, are a common component of health and wellness programs. 11 Health care savings have been reported for several employer health and wellness programs 12 –15 and a rigorous meta-analysis found savings of $3.27 in medical costs per program dollar spent. 16 However, these studies are based largely on multifaceted programs with a comprehensive range of offerings (eg, disease management, health risk screening, healthy lifestyle promotion) and thus lack the ability to assess which specific components of the program are the most cost-effective. Some analyses of comprehensive health and wellness programs indicate that return on investment is driven primarily by a single component such as disease management. 17,18
There has been limited study of the impact of employer-sponsored physical activity programs on health care costs with a specific focus on the association between level of physical activity participation and health care savings. 19,20 In a study of a single large employer program that provided an incentive-based initiative for frequent fitness center visits, program participants who visited fitness centers more often had lower health care costs compared with nonparticipants. 20 The present study sought to extend these findings by examining a similar program among members of a large national health plan. The objective was to examine the relationship between the number of monthly fitness center visits and health care costs among participants in a program that offered an economic incentive for more frequent fitness center visits. The return on investment for the incentive feature also is reported. This study expands on prior research by focusing on program participants only. To minimize the likelihood of selection bias, the study population was limited to health plan members who agreed to participate in the program, with the lowest participation cohort serving as the comparator group, rather than nonparticipants. Fixed and random effects modeling procedures were used, which are designed to help avoid selection bias related to unmeasurable factors that might influence program participation and outcomes.
Methods
Intervention
In October 2010, a large national health plan began offering the Fitness Reimbursement Program (FRP) to small business employers (2–99 employees). Health plan members who enrolled in FRP selected a fitness center from participating facilities eligible for reimbursement within their respective area. At each fitness center visit, FRP participants swiped a card or key fob that recorded each visit; each visit earned 1 credit up to a maximum of 1 credit per day. FRP participants who visited a fitness center 12 or more times in any given month were reimbursed $20 for each month when utilization was that high. Reimbursements were credited at the end of each month in which reimbursement was earned, or at the beginning of the following month.
Design and data source
This was retrospective cohort study with data collected during April 1, 2010, through December 31, 2013. Member characteristics and health care claims were captured from an administrative claims database affiliated with Optum. Claims data included medical (facility and professional) and pharmacy claims with both health plan and patient paid amounts.
Member selection and cohort definition
An index date for each sample member was defined as the date the member enrolled in FRP during the participant identification period (October 1, 2010, through June 30, 2013) (Table 1). Inclusion in the study required continuous medical coverage in the health plan for at least 6 months pre index (starting as early as April 1, 2010, for participants who enrolled in the FRP on October 1, 2010) and at least 6 months post index (ending as late as December 31, 2013, for those who enrolled on June 30, 2013). Medical cost differentials between study cohorts were assessed using data from 6 to 12 months after the FRP began, with 3 additional months of claims run out that were examined to account for any delay in claims processing (eg, if a member's FRP enrollment date was June 30, 2013, then claims reflecting his or her health care service dates from June 30, 2013, to December 31, 2013, were used in the analysis). Claims that were paid through March 31, 2014, were reviewed to allow for the 3 months of run out needed to make sure all services used in the few months of 2013 could be accounted for in the analysis, because sometimes it takes a few months for all claims to be paid after services are incurred.
Participation cohorts defined according to number of PMPM fitness center visits during post-index period: low, <4; low-moderate, ≥4 and <8; high-moderate, ≥8 and <12; high, ≥12.
Maternity-related claim or other pregnancy-related condition during the pre- or post-index period; complex medical condition, diagnostic or procedure claims for any of the following during the study period: cancer, end stage renal disease, HIV/AIDS, organ transplant, trauma, or mental disorder.
High-cost outlier defined as monthly total health care costs greater than the 97.5th percentile of the study sample.
FRP = fitness reimbursement program; PMPM = per member per month.
FRP participants were excluded from the study if they were not active employees aged 18–64 years as of the index date (ie, retirees younger than age 65, spouses and dependents of active employees), if they had a maternity-related claim or other pregnancy-related condition during the pre- or post-index period; or if they had any medical claim containing a diagnosis or procedure code for cancer, HIV/AIDS, organ transplant, trauma, or mental disorder during the pre- or post-index period.
FRP participants were assigned to 4 mutually exclusive cohorts according their average level of FRP participation, defined as the mean number of monthly visits to a fitness center during the post-index or follow-up period. FRP cohorts were labeled as describing: low participation (<4 visits per month on average); low-moderate participation (≥4 and <8 visits per month on average), high-moderate participation (≥8 and <12 visits per month on average), and high participation (≥12 visits per month on average). Members in any cohort received the $20 incentive for any month in which their utilization met or exceeded 12 visits, but cohort assignment was based on their average utilization.
Measures
Demographic, health status, health care supply, and socioeconomic characteristics of FRP participants were collected to account for factors other than the amount of FRP participation that also influence health care costs. Demographic characteristics were: age as of the index date, sex, US census region of residence, and a binary variable (yes/no) indicating residence in a metropolitan statistical area. Health status during the 6-month pre-index period was assessed by count of inpatient stays, count of emergency room visits, total health care costs, and Charlson comorbidity score. 21 Indices of health care supply that have been shown to influence health care expenditures 22 were determined according to the member's zip code of residence. These included the number of primary care providers per 100,000 residents, number of medical specialists per 100,000 residents, and number of hospital beds per 1000 residents. Socioeconomic characteristics were determined at the zip code level using the 2010 US Census as the reference: these included minority ratio, categorized as high, medium, or low according to the percentage of minority residents, and income level grouped as high (1st quartile), medium-high (2nd quartile), medium-low (3rd quartile), and low (4th quartile). Other characteristics included the season of the index month and a binary indicator of continuous pharmacy benefits during both 6-month pre-index and post-index periods. A retrospective risk score derived from 12-month pre-index health care costs, centered around 1.0, was used to predict whether allowed charges for the member were expected to be higher or lower than the population's average for the following year.
The outcome was total health care costs during follow-up, starting from the month of the index date and continuing for at least 6 and up to 12 months post index. Because of variable length of follow-up, costs were determined on a per-member-per-month (PMPM) basis. Total costs were calculated as the sum of all medical and pharmacy costs, including both health plan and member paid amounts.
Statistical analysis
Inverse propensity score weighting 23,24 was used to minimize differences in demographics, health status, health care supply, and socioeconomic characteristics among the 4 cohorts of FRP participants. To apply the weighting technique, one must first generate a propensity score for each sample member. The propensity score was derived by logistic regression modeling, using the FRP participant's demographic, health status (including pre-index health care costs), and other characteristics that were already defined as adjustment variables. The propensity score for each member was obtained from the logistic regression output as his or her predicted probability of being in the FRP participant cohort that he or she actually was in. The propensity score for each member was then transformed into a case weight for subsequent regression analysis; the weight was defined as 1.0 divided by the member's propensity score value.
To estimate the impact of FRP participation on health care expenditures in a way that would help avoid selection bias, propensity score weighted health care costs during follow-up were modeled with a fixed effects generalized linear model (GLM), with a gamma distribution and log-link. The GLM model included the FRP cohort as the primary independent variable and participant characteristics as adjustment variables. Multicolinearity of adjustment variables was assessed by reviewing the variance inflation factor and condition index values for each variable. Variables with a variance inflation factor ≥10 or condition index ≥30 were removed from the regression models. High-cost outliers, defined as FRP participants with follow-up monthly total health care costs greater than the 97.5th percentile of the study sample, were removed from the primary analysis as well. Two sets of sensitivity analyses also were conducted for total health care costs. The first sensitivity analysis was the same as the primary analysis except that high-cost outliers were retained. The second set of sensitivity analyses employed a random effects, rather than fixed effects, GLM, with and without outliers excluded.
Return on investment (ROI) analyses
PMPM health care cost results were used to estimate the financial ROI associated with the FPR incentive feature. Differences in health care expenditures between higher users and the low user group were estimated via the regression modeling described. These differences were then contrasted with the cost of the incentive feature to estimate savings PMPM, and the associated ROI ratio (ie, savings divided by incentive costs).
Results
Member sample and characteristics
The initial sample members included 23,103 health plan members who enrolled in the FRP during October 1, 2010, through December 31, 2013, (Table 1). After applying exclusion criteria and removing high-cost outliers, 8723 participants (37.8% of initial study sample) were retained. The percentage of participants retained was similar across FRP cohorts. The primary reasons for exclusion were lack of continuous health plan enrollment with medical benefits during both the 6-month pre-index and 6-month post-index period (26.5% excluded) and FRP enrollment date occurring after June 30, 2013 (21.7% excluded).
High-cost outliers were determined as the highest cost 2.5% of the final study population. The percent of outliers was similar in each of the 4 groups with 2.6% in the low user group, 2.2% in the low-moderate group, 2.7% in high-moderate group, and 2.6% among high users. Outliers averaged 370 annual emergency room visits per 1000 and 337 annual inpatient admissions per 1000 while the non-outliers had 102 annual emergency room visits per 1000 and 15 annual inpatient visits per 1000 (data not shown). All differences between outliers and non-outliers were statistically significant (P < 0.0001.)
The propensity score variables and FRP participant characteristics before and after propensity score weighting are shown in Table 2. Prior to propensity score weighting, large differences among cohorts were evident for age group, season of index month, and socioeconomic markers; this is evidenced by standardized difference values for these variables that were greater than 0.10. After propensity score weighting, participant characteristics were rendered similar across FRP cohorts (ie, all standardized difference values were less than 0.10).
Participation cohorts defined according to number of PMPM fitness center visits during post-index period: low, <4; low-moderate, ≥4 and <8; high-moderate, ≥8 and, <12; high, ≥12.
High participation cohort is the reference group for standardized differences.
Standardized difference value ≥0.1 indicates significant difference (bolded values).
Based on member zip code of residence; income categories are quartiles based on median income in the member's zip code according to 2010 US Census.
Derived from 12-month pre-index health care costs to predict whether allowed charges would be higher than an average for the following year.
ER = emergency room; FRP = fitness reimbursement program; PMPM = per member per month; STD = standardized.
Length of follow-up and fitness center visits
On average, participants were followed for 11 months after FRP enrollment (Table 3). The mean PMPM fitness center visits for each cohort ranged from 1.7 for the low participant cohort to 15.3 for the high participant cohort. In the low participation cohort, 2.29% of participants had 0 PMPM visits.
Participation cohorts defined according to number of PMPM fitness center visits during post-index period: low, <4; low-moderate, ≥4 and <8; high-moderate, ≥8 and, <12; high, ≥12.
2.29% had 0 PMPM visits.
PMPM = per member per month; SD = standard deviation.
Total health care costs
Propensity score-weighted total health care costs during follow-up are shown in Table 4. In the primary analysis using a fixed effects model with high-cost outliers removed, a trend for lower health care costs with increasing level of FRP participation was evident. Compared with the low participation cohort, PMPM total health care costs were lower by 2.6% in the low-moderate cohort (P = 0.60), 6.9% in the high moderate cohort (P = 0.16), and 8.4% in the high participation cohort (P = 0.08). In the sensitivity analysis with high-cost outliers included, the same trend for declining PMPM health care costs was apparent at the moderate-high and high participation levels, and the difference compared with the low participation cohort was statistically significant for the high participation cohort (P < 0.001). In the sensitivity analysis using a random effects model, lower PMPM costs also were observed at higher levels of participation: with outliers excluded, total PMPM costs were lower compared with the low participation cohort for low-moderate (P = 0.752), moderate-high (P = 0.791), and high (P = 0.132) participation cohorts. When outliers were included in the random effects model, total PMPM costs were higher for low-moderate (P = 0.919), lower for high-moderate (P = 0.129) and lower for high (P = 0.011) participation cohorts.
Participation cohorts defined according to number of PMPM fitness center visits during post-index period: low, <4; low-moderate, ≥4 and <8; high-moderate, ≥8 and, <12; high, ≥12.
General linear model adjusted for participant characteristics.
High-cost outlier defined as monthly total health care costs greater than the 97.5th percentile of the study sample.
PMPM = per member per month.
ROI analyses
Table 5 shows the results of the return on investment (ROI) analyses on the sample that excluded the high-cost outliers (n = 8723). This is the sample that yielded lower savings, so ROI results reported here may be conservative. Group membership in this study was based on the average number of fitness facility visits per month, but there were a few members who had individual months with 12 or more visits in each group, and they received the $20 incentive for each of those months. Thus, some incentive dollars were paid to members of each user group (see row A in Table 5). The average monthly incentive payments varied from $0.35 for the low user group to $17.28 for the high user group (row B). Savings in health care for members of each group are shown in comparison to the low user group (row C). The overall average health care savings was $9.50 PMPM.
Participation cohorts defined according to number of PMPM fitness center visits during post-index period: low, <4; low-moderate, ≥4 and <8; high-moderate, ≥8 and, <12; high, ≥12; outliers removed.
Row C reports the mean difference in health care expenditures PMPM for the group of interest vs. the low group.
Row E reports the average dollars in health care expenditures that were saved or lost, per dollar spent on the incentive. Only the cost of the incentive is included in the return on investment analysis. See Discussion section regarding the perspectives for and limitations of the analysis.
PMPM, per member per month.
Net savings per group also are reported in Table 5. Net savings are defined as the difference in health care savings vs. the cost of the incentive. For the entire study sample (excluding outliers) the average net savings was $2.76 PMPM.
Finally, ROI ratios for each group are reported in Table 5. The ROI for the entire study sample was estimated to be a savings of $1.41 in health care expenditures per dollar spent on the incentive.
Discussion
The results of this study suggest a trend for lower health care costs with increasing level of FRP participation. Compared with lowest level of participation, when a small number of high-cost outliers were removed from the sample, monthly health care savings ranged from $6.14 (2.6%) per member at the low-moderate level of participation to $20.01 (8.4%) per member at the highest level of participation. Although differences were not statistically significant at the conventional alpha ≤0.05 threshold, the difference in health care costs between the lowest and highest level of participation approached significance (P = 0.08) for the high users.
From a statistical perspective, P values higher than 0.05 do not imply that $0 savings would be a better estimate for any group of members. The best estimates are still the values reported herein, but the higher the P values are, the more uncertainty there is around these estimates. When outliers were included in the analysis, the difference between the lowest and highest participant groups was larger ($52.66) and significantly different from zero (P < 0.001). As shown above, results were similar in sensitivity analyses that employed a random effects model as another option to avoid selection bias.
With reference to other research, results from the present study are consistent with research by Abraham and colleagues, who examined a program offered by the University of Minnesota that provided the same incentive ($20/month) for at least 8 (vs 12 in the present study) monthly fitness center visits. 20 In that study, annual health care spending for participants with the highest level of participation—9 to 12 fitness center visits per month—was 18% lower than for than nonparticipants. However, removal of high-cost outliers (approximately 1% of the sample) impacted results: although a similar trend for lower health care costs was evident, the estimate of health care savings at the highest level of participation was appreciably smaller and nonsignificant after excluding high-cost outliers. 20
Two notable differences in methodology between the present study and the work by Abraham et al. 20 warrant consideration. First, the present study compared health care costs only among FRP participants, with the low participation cohort serving as the comparator group, while the previous study compared costs of program participants to nonparticipants. The present study selected the lowest participation cohort, rather than nonparticipants, as the comparator to minimize differential motivation for modifying behavior among participants vs. nonparticipants. The lowest participation cohort did include a small number of health plan members (2.3%) who signed up for FRP but had zero fitness center visits. These members were included in the low participation cohort because, unlike nonparticipants, they were motivated to enroll in FRP although they did not follow through. Second, in the present study the period for identifying high-cost outliers was also shorter and more contemporaneous: high-cost outliers were identified in the 6 months prior to FRP participation and during 6–12 months follow-up vs 3 years (including 2 years before the fitness program started and the 1-year follow-up period) in the previous study. 20
Studies in elderly populations also have found evidence of an inverse dose response relationship between health care costs and increasing level of group physical activity participation. 25,26 Among managed care Medicare recipients who participated in a health plan-sponsored Silver Sneakers program, participants who averaged 2 or more visits per week incurred approximately $1000 lower health care costs in the second year of the program than participants with less than 2 mean weekly visits. 25 Cost savings of approximately $3700 per year also were observed among older patients with diabetes who more regularly participated in a community exercise program sponsored by their health plan. 26
With regard to the financial ROI in the incentive program, there are multiple perspectives to consider and some limitations worth noting. Perspectives include those of the vendor (Optum), the health plan client who purchased the program from Optum, and members of the health plan who used the fitness facilities. Each of these entities may bear some financial risk, so each may regard the ROI methods and results differently. From Optum's perspective the risk involved is related to whether it can offer the program at a price that exceeds the cost of the incentive and other production costs, and whether the program would yield enough savings for the health plan client to exceed these costs. That was indeed the case.
From the program sponsor's (ie, the health plan client's) perspective, the intervention was the $20 payment it gave to individuals who visited a fitness center at least 12 times in any given month, plus other costs that influence the total price it paid Optum for the program. Prices are negotiated on a client-by-client basis, and vary depending on client size, locale, and whether the client chooses to pay for or spend their own resources on communications to promote the program (because such efforts are associated with higher use of the fitness facilities, which in turn yields more savings). Although the full price paid by the client for this study is not specified, the study does provide insight about whether ROI values would still be above 1.0 if those costs were included in the analysis.
Specifically, in the analysis that excluded outliers from the sample, the savings in health care expenditures were equal to $1.41 per dollar spent on the incentive. The savings were much higher when analyses included the full sample. In that case the ROI ratio reflects a savings of $3.13 per dollar spent on the incentives (ROI calculations for the full sample are available upon request). Depending on pricing variations for individual clients, ROI values based on analyses that exclude outliers may not exceed 1.0 for some clients, while for others ROI values may still exceed 1.0. However, for all of the clients Optum has done business with so far, ROI values would exceed 1.0 when the full price is accounted for, if savings estimates are based on the full sample, including outliers.
From the fitness facility users' perspective, participating in the fitness facility program was free, but the cost of the gym membership also might need to be accounted for in an ROI analysis. Gym membership costs would need to be included only if the incentive either motivated the user to buy a facility membership that she or he did not already have, or to extend a membership she or he was thinking of cancelling. The study team doubts either would occur very often. It seems more likely that the incentive would motivate those who already have a membership to exercise more without affecting gym membership costs. Unfortunately, data were not available to test this hypothesis.
Data also are lacking about fitness center utilization prior to the incentive program, and there are only limited data about whether the incentive program may have increased that utilization. The only data available about the potential impact of the incentive on fitness center utilization came from a subsequent satisfaction survey that was conducted among program participants. Among participants, 61.3% of the survey respondents agreed that their health improved since enrolling in the program and 95.6% of those respondents said an increase in exercise contributed to this improvement. Although these results may support the notion that the program influenced how much users exercised, they are inconclusive.
Finally, there may be additional savings beyond reduced health care expenditures to consider, such as those related to reduced absenteeism or improved productivity at work that are associated with exercising more. Data were not available to estimate productivity savings in this study, but it should be considered for future studies. Thus, ROI from the incentive feature cannot be reported with much certainty and the results should be considered in this context.
The results of this study add to the relatively small body of evidence suggesting lower health care costs with increasing level of participation in physical activity programs. 19,20,25,26 Physical activity is a potentially modifiable risk factor for preventing many common chronic health conditions. Conservative estimates indicate that 24% of working age adults have 1 common chronic condition, and 18% have 2 or more common chronic conditions. 27 Even modest reductions in chronic disease burden among employees may lead to large reductions in health care spending. 28 Although this study's results coupled with previous research indicate that health care savings may be correlated with increasing participation in fitness activities, additional research is needed to extend these findings. Future studies should examine both clinical and economic outcomes. Also, variability in the intensity and duration of physical exertion corresponding to observed fitness activities should be accounted for and the sustainability of benefits over longer periods should be evaluated.
The results of this study are based on a retrospective analysis of administrative data and there are inherent limitations associated with this approach. Number of fitness center visits was used as a surrogate for level of physical activity; however, the frequency of fitness center visits does not provide information regarding the overall intensity or duration of physical activity, which has been linked to improved health outcomes. In the future, it would be useful to ask fitness center users to also use wearables such as FitBit or other technologies 29,30 to gather hard data on the intensity of exercise.
Propensity score weighting and fixed and random effects models were used to mitigate socioeconomic and other differences in participant characteristics among cohorts, including pre-FRP enrollment health care costs and health risk scores. Although this approach was useful, data regarding frequency of pre-FRP fitness center visits was not available, nor is it known how many visits to the FRP would have been used by any sample member in the absence of the $20 incentive, so it is possible that the health benefits achieved through regular pre-FRP fitness center visits (or other pre-FRP physical activity programs) carried over into study cohorts. For example, high frequency FRP users already may have been participating at a high level in a fitness center or other physical activity program prior to signing up for the FRP and, thus, could have entered the program as healthier individuals.
Next, the mean length of follow-up was just under 1 year. Longer term evaluations are needed to determine if the apparent health savings observed change over time. Finally, this study focused solely on the relationship between fitness facility use and medical expenditures. Data were lacking on associated decreases in absenteeism, increases in productivity at work, and reduced disability expenditures, if any. Others may wish to investigate these outcomes. The fitness center literature is mixed with respect to the relationship between physical activity and work productivity, and results often vary with study design, type of intervention, or population characteristics. 31 –33
Conclusions
Among participants in an employer-sponsored fitness program that included a financial incentive for regular fitness center visits, a trend for lower total health care costs was observed with more frequent fitness center visits. Compared with the lowest level of participation (<4 visits/month) monthly savings in health care costs ranged from $6.14 (2.6%) per participant at the low-moderate level of participation (≥4 and <8 visits per month) to $20.01 (8.4%) per participant at the highest level of participation (≥12 visits per month). When outliers (those with very high health care costs) were retained in the sample, significant differences in costs were found when comparing those in the high participation group to those in the low participation group. These results suggest a dose-response pattern to FPR program use that may help save health care expenditures. Because of data limitations, one should view the ROI analyses with caution, noting, however, that they seem to indicate savings in health care expenditures that exceed the cost of the incentive program, especially if all sample members are included in the analyses.
Footnotes
Author Disclosure Statement
Ms. Navratil-Stawn, Mr. Hartley, and Dr. Ozminkowski declared the following conflicts of interest with respect to the research, authorship, and/or publication of this article: the authors are employed by and own stock in UnitedHealth Group. Stock or other compensation did not influence the content of this paper.
The authors received the following financial support for this article: this research was supported by UnitedHealth Group, Minnetonka, Minnesota.
Acknowledgments
The views expressed in this manuscript are the authors' and do not necessarily represent the views of Optum or UnitedHealth Group.
Sarah Peirce-Sandner, MS, and Virginia Rosen (Optum, Eden Prairie, Minnesota) provided medical writing support for this manuscript.
Prior Presentations
This article was presented at the Population Health Alliance Forum, December 2014, Scottsdale, Arizona.
