Assessing the Added Value of Blood Pressure Information Derived from Electronic Health Records in Predicting Health Care Cost and Utilization

Abstract

Health care providers are increasingly using clinical measures derived from electronic health records (EHRs) for risk stratification and predictive modeling. EHR-specific data elements such as prescriptions, laboratory results, and vital signs have been shown to improve risk prediction models. In this study, the value of EHR-based blood pressure (BP) values was assessed in predicting health care costs (ie, total, medical, and pharmacy) and key utilization end points (ie, hospitalization, emergency department use, and being among the highest utilizers). The study population included 37,451 patients of a large integrated delivery system in the mid-western United States with complete EHR data files, who were 18–64 years old, had continuous insurance at an affiliated health plan, and had eligible BP records. Both EHRs and insurance claims of the study population were used to extract the predictors (ie, demographics, diagnosis, and BP values) and outcomes (ie, costs and utilizations). Predictors were extracted from 2012 data, whereas concurrent and prospective outcomes were extracted from 2012 to 2013 data. Three base models (BMs) were constructed to predict each of the outcomes. The first BM no. 1 used demographics. The second BM no. 2 added the Charlson comorbidity index to BM no. 1, whereas the third BM no. 3 added the Adjusted Clinical Group Dx-PM case-mix score to BM no. 1. BP was specified as means, ranges, and classes. Adding BP ranges to BM no. 1 and BM no. 2 showed the greatest improvements when predicting costs and utilization. More specifically, adjusted R ² and area under the curve of BM no. 2 improved by 32.9% and 14.1% when BP ranges were added to predict concurrent total cost and hospitalization, respectively. The effect of BP measures on improving the risk stratification models was diminished when predicting prospective outcomes after adding the measures to BM no. 3 (ie, the more comprehensive diagnostic model), specifically when represented as BP means. Given the increasing availability of BP information, this research suggests that these data should be integrated into provider-based population health analytic activities. Future research should focus on subpopulations that benefit the most from incorporating vital signs such as BP measures in risk stratification models.

Introduction

Nearly a third of US adults are estimated to have hypertension.¹ Prevalence of hypertension increases with age, ranging from 7.5% among adults aged 18–39 years to 63.1% among those aged 60 years and over.¹ Hypertension continues to be a public health priority as it increases the risk for cardiovascular diseases.² Controlled blood pressure (BP) can decrease the incidence of stroke, myocardial infarction, and heart failure^3
–5; however, only 48.3% of adults with hypertension have controlled BP, which is defined as systolic (SYS) BP <140 mmHg and diastolic (DIA) BP <90 mmHg.^1,6

Hypertension is the primary reason for ∼43 million outpatient visits in the United States annually.⁷ In 2011, total costs associated with hypertension were estimated at 46 billion US$, which stemmed from health care services, medications, and missed days of work.⁸ The annual national medical expenditures associated with hypertension have been significantly increasing over time,⁹ where prescription medication costs have been the primary driver.¹⁰

In 2010, the National Academy of Sciences (NAS) recommended that health systems focus on population-based strategies to control hypertension.¹¹ NAS recommended strengthening hypertension surveillance and monitoring efforts, which would facilitate the determination of hypertension burden, assessment of BP changes over time, and evaluation of interventions.¹¹

Health systems are increasingly using both administrative claims and electronic health records (EHRs) to identify high-risk patients^{12

–19}; however, EHR-derived vital signs such as BP status are rarely incorporated in risk stratification models that guide population health management efforts.^19,20 Given the total medical expenditure associated with hypertension^9,10 and increased surveillance and availability of BP data on a population level,¹¹ research is needed to assess the value of BP measures in improving population-based forecasts of costs, inpatient hospitalization, and emergency department (ED) admissions.

Objective

The study objective was to determine whether adding BP markers derived from EHRs improves the performance of predictive models of utilization among individuals receiving ambulatory care. The research team hypothesized that BP risk markers would add value, relative to diagnosis-based risk markers derived from combined claims and structured EHR data, in improving predictive models of costs and utilization.

Methods

Data source

This study used data provided by HealthPartners, Inc., consisting of >1800 physicians practicing in 8 hospitals, 55 primary care clinics, and 22 urgent cares located in Minnesota. HealthPartners provides medical services and health plan financing and administration to 1.8+ million enrollees.²¹

The study included structured EHR data, medical claims, and pharmacy claims. EHR data captured complete outpatient information within the HealthPartners' network, but excluded information derived from inpatient encounters or those occurring outside of the network. BP records were derived from EHR data, whereas claims data contained demographics (eg, age and gender), diagnosis codes documented as the International Classification of Disease, Ninth Revision, Clinical Modification (ICD-9-CM codes), prescriptions as National Drug Code, date of services, paid amount by plan, and out-of-pocket amount by individuals.

Study design and subjects

This study is a 2-year retrospective cohort study (2012–2013). BP measures and control variables were derived from 2012 data, whereas outcomes of interest were constructed from 2012 and 2013 data.

The data preparation process started with 1,408,914 nonmissing BP records, but excluded (1) 872 records with SYS BP <70 or >250^22,23; (2) 16,596 records with DIA BP <45 or >150^22,23; (3) 5897 records with SYS/DIA BP <1.0625 + 0.00125*DIA or >3.0^22,23; (4) 3399 records with positions of measure not captured as “default” or “sitting”; (5) 589 records with site of measure not listed as “default,” “left arm” or “right arm”; and (6) 733,116 records not dated within 2012. The remaining 666,460 eligible BP measures were summarized by calculating the mean value of all records from the same date for each subject resulting in 329,964 daily BP measures (Fig. 1).

FIG. 1.

Denominator selection process (left: patients; right: BP records). *Totals are not equal to the sum of individual criteria as exclusions overlapped.

The study further excluded patients if they were not between 18 and 64 years old, had a pregnancy, or had incomplete data. Patients <18 years old were excluded as BP measurement and its effect on utilization in children are different than those in adults.^1,6,8 Patients >64 years old were excluded as Medicare claims data were not available in this study. Patients with pregnancies were excluded as health care utilization among those patients is often not due to comorbidities, and risk stratification studies recommend excluding deliveries in utilization predictions.^24,25 Hence, among 141,716 members with an EHR, the following members were excluded: (1) 629 members with data quality issues (eg, unmatched age and gender between EHRs and claims); (2) 33,285 subjects who were not 18 to 64 years old in 2012; (3) 26,826 subjects without continuous pharmacy enrollment in 2012 or 2013, and 0 subjects with noncontinuous medical enrollment; (4) 4065 subjects with pregnancy in 2012 or 2013; and (5) 39,460 patients lacking at least 2 valid BP records. The final study population included 37,451 eligible subjects associated with 162,575 BP measures (Fig. 1).

Independent variables

Four sets of yearly BP measures were generated: (1) mean SYS+mean DIA: the sum of SYS and DIA means for every individual was derived using 2012 records; (2) BP class: using the mean SYS and DIA values, individuals were classified into 1 of 4 classes of BP: normal, elevated, stage 1, and stage 2+ (ie, merged crisis level with stage 2)²⁶; (3) range of SYS and DIA: the range of SYS and DIA (maximum–minimum) was calculated for each patient across entire records in 2012; and (4) BP class range: calculated the range of changes for the BP class (ie, ranging from 0 to 3 classes; 0 representing normal with no changes).

The values of the 4 BP-derived measures were assessed for predicting utilization by adding them to multiple “base” risk stratification models. For each base model (BM), the performances of the following combinations were compared: (1) BM only, (2) BM plus the means of SYS and DIA, (3) BM plus the BP class, (4) BM plus the range of SYS and DIA, and (5) BM plus the range of BP class.

Information derived from 2012 health plan claims for each member of the study group was used to build 3 sets of base predictive models: (1) BM no. 1: BM consisting of gender and age groups (ie, 18–34, 35–44, 45–54 and 55–64 years); (2) BM no. 2: BM including the Charlson comorbidity index²⁷ in addition to the BM no. 1's variables; and (3) BM no. 3: BM including diagnosis-based predicted scores (Dx-PM score)²⁸ plus variables used in BM no. 1. The Dx-PM score makes use of the full range of available ICD codes to categorize all morbidities. It is a component of the widely used Johns Hopkins Adjusted Clinical Group (ACG) risk adjustment system.²⁸ The Dx-PM score has been shown to be a valid measure of morbidity in several studies.^24,25

Outcome variables

Three types of costs (ie, annual, medical, and pharmacy costs) and 3 binary utilization indicators (ie, any hospitalization [excluding deliveries], ED visit, and being in the subgroup with the highest 5% of total costs) were calculated separately in the concurrent (2012) and prospective (2013) years. Annual cost was the sum of paid and out-of-pocket amount derived from medical only, pharmacy only, or all claims. Costs were truncated at the top 0.5% as cost is highly skewed.²⁸ Outcome flags were also generated for being in the top 5% of cost, having any hospitalization, and having any ED visit in each period.

Statistical methods

Descriptive characteristics of study subjects, including age, gender, baseline morbidity, and medical utilization, were calculated in total and by 4 BP classes separately.

The impact of adding 4 sets of BP-derived variables was evaluated on the performance of 3 BMs (ie, BM no. 1, BM no. 2, and BM no. 3) in explaining costs and utilization. Linear regression was used to explain/predict costs as linear regression is considered the standard approach in risk adjustment studies.^28,29 Logistic regression was used to predict binary utilization outcomes such as any hospitalization, any ED admission, and being in the top 5% of total cost. The outcomes from the base year (2012) were used in the concurrent prediction and the subsequent year (2013) were used in the prospective analysis.

The performance measure for linear regression models included adjusted R ² and mean absolute prediction error (MAPE).³⁰ The adjusted-R ² measurement was used, instead of R ², as the adjusted-R ² measurement considers the number of variables included in the model in calculating the performance measurement, hence will not be inflated by adding additional variables to a model. MAPE is the average absolute difference between the predicted and the actual value across all subjects. MAPE of different types of costs is divided by their respective means so that results could be compared across various types of costs. In contrast to adjusted R ², MAPE is more resilient to outliers.³⁰ The performance of the logistic regressions was presented by the area under the curve (AUC) measure.

A bootstrap analysis of 300 runs was performed to provide point estimate and 95% confidence intervals (CIs) for all performance measures, including adjusted R ², MAPE, and AUC. Statistical significance at a 0.05 cutoff was determined when the BM's point performance measure was not contained in the 95% CI performance measure of the BM after adding a BP variable to the model.³¹

The institutional review board of Johns Hopkins Bloomberg School of Public Health approved this study.

Results

Population specifications

Out of the 37,451 patients, 41.7% had normal BP levels, whereas 17.0%, 27.7%, and 13.5% were in elevated, stage 1, and stage 2 BP classes (Table 1). Average age was higher in elevated BP classes, ranging from 42.2 years of age for the normal BP class to 50.2 years for stage 2 BP class. Overall, the percentage of female population was 60.9% in the study population, with higher rates in lower BP classes (eg, 73.7% in the normal BP class). Mean SYS and DIA BPs were 122.4 and 76.0 mmHg for the total population. The lowest SYS and DIA means were observed in the normal class (110.1 and 69.0), whereas the highest means were measured in the stage 2 class (144.9 and 89.1).

Table 1.

Study Population Specifications, Utilizations, and Costs in 2012

Specification	Normal	Elevated	Stage 1^a	Stage 2^a	All
Population, no. (%)	15,619 (41.71)	6381 (17.04)	10,382 (27.72)	5069 (13.54)	37,451 (100.0)
Demographics^b
Age in 2012, years (SD)	42.24 (13.16)	46.04 (13.48)	47.52 (11.68)	50.21 (10.42)	45.43 (12.82)
Age 18–34, % (SD)	29.86 (45.77)	23.65 (42.5)	16.5 (37.12)	10.22 (30.29)	22.44 (41.72)
Age 35–44, % (SD)	22.02 (41.44)	14.76 (35.48)	18.73 (39.02)	15.66 (36.35)	19.01 (39.24)
Age 45–54, % (SD)	26.6 (44.19)	26.41 (44.09)	30.09 (45.87)	32.06 (46.67)	28.27 (45.03)
Age 55–64, % (SD)	21.53 (41.1)	35.18 (47.76)	34.68 (47.6)	42.06 (49.37)	30.28 (45.95)
Female, % (SD)	73.71 (44.02)	52.7 (49.93)	52.41 (49.94)	49.08 (50)	60.89 (48.8)
Comorbidities^b
Chronic conditions, no. (SD)	2 (2)	2.44 (2.29)	2.56 (2.22)	2.88 (2.31)	2.35 (2.18)
Medication ingredients, no. (SD)	5.18 (4.38)	5.64 (4.71)	5.85 (4.69)	6.17 (4.78)	5.58 (4.59)
Dx-PM score^c, score (SD)	1.57 (1.86)	1.8 (2.19)	1.8 (2.05)	1.94 (2.37)	1.72 (2.05)
Chronic conditions^b
Type-2 diabetes, % (SD)	3.94 (19.45)	9.07 (28.73)	9.28 (29.01)	10.4 (30.52)	7.17 (25.79)
Hypertension, % (SD)	7.6 (26.5)	18.92 (39.17)	27.6 (44.7)	47.82 (49.96)	20.51 (40.38)
Congestive heart failure, % (SD)	0.4 (6.29)	0.45 (6.73)	0.29 (5.37)	0.55 (7.41)	0.4 (6.3)
Ischemic heart disease, % (SD)	1.72 (12.99)	3.13 (17.43)	2.42 (15.36)	3.14 (17.43)	2.34 (15.13)
Acute myocardial infarction, % (SD)	0.33 (5.71)	0.44 (6.61)	0.26 (5.09)	0.39 (6.27)	0.34 (5.79)
Cancer, % (SD)	4.56 (20.86)	5.22 (22.24)	5.39 (22.59)	5.74 (23.26)	5.06 (21.92)
BP measures^b
BP class = normal, % (SD)	100 (0)	0 (0)	0 (0)	0 (0)	41.71 (49.31)
BP class = elevated, % (SD)	0 (0)	100 (0)	0 (0)	0 (0)	17.04 (37.6)
BP class = stage 1, % (SD)	0 (0)	0 (0)	100 (0)	0 (0)	27.72 (44.76)
BP class = stage 2, % (SD)	0 (0)	0 (0)	0 (0)	100 (0)	13.54 (34.21)
SYS BP, mean (SD)	110.13 (6.56)	124.16 (2.81)	128.97 (6.8)	144.88 (10.06)	122.45 (13.77)
DIA BP, mean (SD)	69.01 (5.82)	73.58 (4.6)	81.72 (4.9)	89.11 (8.04)	76.03 (9.32)
Cost^b,d
Total cost, $ (SD)	7521 (14309)	8550 (15505)	8205 (14402)	8203 (15066)	7978 (14653)
Medical cost, $, (SD)	6211 (12913)	7008 (14124)	6693 (13030)	6728 (13716)	6551 (13272)
Pharmacy cost, $ (SD)	1217 (3293)	1470 (3603)	1440 (3566)	1412 (3573)	1348 (3464)
Utilization
Inpatient hospitalization, no. (SD)	0.07 (0.36)	0.09 (0.45)	0.09 (0.4)	0.09 (0.43)	0.08 (0.4)
Outpatient visits, no. (SD)	14.3 (16.5)	14.48 (16.56)	14.61 (16.36)	14.14 (15.66)	14.4 (16.36)
ED admission, no. (SD)	0.19 (0.59)	0.2 (0.66)	0.2 (0.56)	0.21 (0.57)	0.2 (0.59)
Any inpatient hospital, % (SD)	0.06 (0.23)	0.07 (0.25)	0.06 (0.24)	0.07 (0.25)	0.06 (0.24)
Any ED admissions, % (SD)	0.14 (0.35)	0.16 (0.36)	0.16 (0.36)	0.17 (0.37)	0.15 (0.36)
In top 5% of total cost, % (SD)	0.04 (0.21)	0.06 (0.23)	0.05 (0.22)	0.05 (0.22)	0.05 (0.22)

Stage 1: SYS BP 130–139 mmHg or DIA BP 80–89 mmHg; stage 2 (or higher): SYS or DIA BP higher than stage 1 thresholds.

Percentage, mean, and SD are calculated using the corresponding BP class denominators.

ACG Dx-PM case-mix score.

Truncated at top 0.5% of total cost.

ACG, Adjusted Clinical Group; BP, blood pressure; DIA, diastolic; ED, emergency department; SYS, systolic.

Patients in the higher BP classes had higher number of chronic conditions (eg, 2.88 conditions in stage 2 vs. 2.00 for the normal class) and were using higher number of medications (ie, count of medication ingredients was ∼20% higher in stage 2 than the normal class). In general, higher BP classes had a higher rate of select chronic conditions (eg, type 2 diabetes and congestive heart failure), except for acute myocardial infarction that was highest in the elevated BP class.

Inpatient hospitalization, ED admissions, and being in the top 5% of the cost had slightly additional per-person rates in higher BP classes (eg, 0.17 any ED admissions for stage 2 vs. 0.14 for the normal group); however, outpatient visits were most frequent in the elevated BP group. A statistical comparison using P-values generated P-values <0.001 for all utilization group comparisons; however, due to the considerably large size of the denominator in each BP group, P-values are deemed challenging to interpret for group comparison analysis.³²

Impact of adding BP measures on the adjusted R ² of costs

Overall, both concurrent and prospective adjusted R ² increased with statistical significance when the complexity of the BMs increased from BM no. 1 to BM no. 2, and to BM no. 3 (Table 2). For example, the adjusted R ² of concurrent total cost among BMs increased from 1.76 (95% CI 1.52–2.01) for BM no. 1 to 17.70 (15.98–19.37) for BM no. 2 and to 48.75 (46.83–50.71) for BM no. 3. The same trend was observed among BMs with comparable added BP variables. For example, among BMs that included the BP class range as a predictor, the adjusted R ² of concurrent total cost increased from 6.45 (6.01–6.92) for BM no. 1 to 20.21 (18.71–21.83) for BM no. 2 and to 49.36 (47.44–51.34) for BM no. 3 (Table 2).

Table 2.

Impact of Blood Pressure Measures in Predicting Medical, Pharmacy, and Total Costs (Adjusted R ²)

Period	Concurrent			Prospective
Type of cost	Total	Pharmacy	Medical	Total	Pharmacy	Medical
BM no. 1^a
BM no. 1: age, gender	1.76 (1.52–2.01)	0.99 (0.81–1.18)	1.40 (1.20–1.61)	1.88 (1.65–2.13)	0.88 (0.71–1.09)	1.54 (1.33–1.74)
BM no. 1 + mean SYS + mean DIA	1.92 (1.67–2.17)	1.10 (0.88–1.30)	1.52 (1.31–1.77)	2.00 (1.75–2.26)	1.00 (0.80–1.23)	1.62 (1.39–1.87)
BM no. 1 + SYS range + DIA range	12.15 (11.36–13.01)	2.60 (2.26–2.96)	11.69 (10.80–12.51)	5.38 (4.90–5.96)	2.28 (1.96–2.62)	4.58 (4.14–5.11)
BM no. 1 + BP class	1.79 (1.55–2.03)	1.03 (0.83–1.22)	1.42 (1.23–1.65)	1.91 (1.67–2.17)	0.93 (0.74–1.11)	1.57 (1.34–1.77)
BM no. 1 + BP class range	6.45 (6.01–6.92)	2.00 (1.71–2.30)	5.88 (5.43–6.31)	3.57 (3.24–3.94)	1.71 (1.46–1.99)	2.92 (2.63–3.25)
BM no. 2^a
BM no. 2: age, gender, Charlson index	17.70 (15.98–19.37)	11.55 (9.99–12.89)	13.57 (11.95–15.14)	11.20 (10.09–12.33)	10.71 (9.44–12.08)	7.35 (6.43–8.26)
BM no. 2 + mean SYS + mean DIA	17.76 (16.02–19.41)	11.57 (10.03–12.91)	13.62 (12.03–15.21)	11.21 (10.09–12.33)	10.72 (9.44–12.12)	7.37 (6.45–8.28)
BM no. 2 + SYS range + DIA range	23.53 (21.96–25.19)	11.88 (10.38–13.23)	19.92 (18.33–21.44)	12.71 (11.50–13.88)	10.95 (9.67–12.35)	8.92 (7.92–9.78)
BM no. 2 + BP class	17.75 (16.03–19.42)	11.57 (10.03–12.93)	13.63 (12.00–15.21)	11.20 (10.11–12.34)	10.72 (9.44–12.10)	7.37 (6.45–8.28)
BM no. 2 + BP class range	20.21 (18.71–21.83)	11.84 (10.28–13.18)	16.18 (14.64–17.65)	11.92 (10.81–13.03)	10.90 (9.65–12.29)	8.03 (7.02–8.89)
BM no. 3^a
BM no. 3: age, gender, ACG Dx-PM score	48.75 (46.83–50.71)	23.60 (21.92–25.32)	39.62 (37.51–41.92)	26.66 (24.79–28.49)	21.08 (19.41–22.85)	18.41 (16.56–20.10)
BM no. 3 + mean SYS + mean DIA	48.79 (46.85–50.75)	23.63 (21.91–25.33)	39.64 (37.54–41.94)	26.67 (24.78–28.49)	21.09 (19.44–22.86)	18.41 (16.57–20.12)
BM no. 3 + SYS range + DIA range	50.12 (48.31–52.02)	23.65 (21.93–25.39)	41.52 (39.53–43.71)	26.77 (24.91–28.61)	21.13 (19.47–22.95)	18.65 (16.80–20.39)
BM no. 3 + BP class	48.80 (46.87–50.76)	23.64 (21.92–25.34)	39.67 (37.54–41.94)	26.67 (24.79–28.49)	21.08 (19.42–22.85)	18.42 (16.56–20.11)
BM no. 3 + BP class range	49.36 (47.44–51.34)	23.61 (21.93–25.32)	40.37 (38.36–42.63)	26.75 (24.85–28.59)	21.08 (19.41–22.85)	18.52 (16.67–20.24)

Adjusted-R ² (95% CI) was generated using 300 bootstrapping runs, and cost was truncated at top 0.5% of total cost.

ACG, Adjusted Clinical Group; BM, base model; BP, blood pressure; CI, confidence interval; DIA, diastolic; SYS, systolic.

Adding the BP ranges, either the SYS and DIA range or the BP class range generally generated statistically significantly higher adjusted R ², especially when predicting total and medical costs (Table 2). For example, adding the SYS and DIA ranges to BM no. 2 statistically significantly increased the adjusted R ² of predicting concurrent total cost from 17.70 (15.98–19.37) to 23.53 (21.96–25.19), which is a 32.9% increase in performance. However, adding the SYS and DIA means to BM no. 2 only increased the adjusted R ² to 17.76 (16.02–19.41), which was an insignificant improvement for BM no. 2's predictive power (Table 2).

The effect of BP range variables in increasing the adjusted R ² declined when predicting prospective costs. For example, adding the SYS and DIA ranges to BM no. 2 improved the adjusted R ² of predicting prospective total cost from 11.20 (10.09–12.33) to 12.71 (11.50–13.88), which is a 13.4% improvement in predicting prospective cost, but markedly lower than the 32.9% improvement of the concurrent total cost prediction. Adding the SYS and DIA means to BM no. 2 showed no significant improvement in adjusted R ² when predicting prospective costs (Table 2).

The effect of BP range variables in increasing the adjusted R ² also declined when the complexity of the BMs increased. As an instance, adding the SYS and DIA ranges to BM no. 3 only increased the adjusted R ² of concurrent total cost by 2.8%, which is considerably <32.9% improvement of BM no. 2 when adding the same BP variables. The improvement of BM no. 3's adjusted R ² in predicting prospective costs was negligible when adding any types of the BP variables (ie, BP means, ranges, or classes; Table 2).

Impact of adding BP measures on the MAPE of costs

Similar to the trend of the adjusted R ² rates, but with an inverse direction, the concurrent and prospective MAPE values decreased as the complexity of the BMs increased from BM no. 1 to BM no. 3 (Table 3). For example, the concurrent MAPE of predicting total cost among BMs statistically significantly decreased from 98.33 (97.41–99.09) for BM no. 1 to 88.69 (87.73–89.58) for BM no. 2 and to 64.06 (63.19–64.98) for BM no. 3. The same pattern was identified within the same sets of BP variables across the BMs. For instance, among BMs that included the BP class range as a predictor, the MAPE of predicting concurrent total cost decreased from 94.60 (93.67–95.42) for BM no. 1 to 86.66 (85.87–87.53) for BM no. 2 and to 64.24 (63.37–65.05) for BM no. 3 (Table 3).

Table 3.

Impact of Blood Pressure Measures in Predicting Medical, Pharmacy, and Total Costs (Mean Absolute Prediction Error)

Period	Concurrent			Prospective
Type of cost	Total	Pharmacy	Medical	Total	Pharmacy	Medical
BM no. 1^a
BM no. 1: age, gender	98.33 (97.41–99.09)	122.42 (121.62–123.21)	103.31 (102.28–104.17)	105.25 (104.38–106.11)	126.38 (125.53–127.35)	110.31 (109.45–111.33)
BM no. 1 + mean SYS + mean DIA	98.23 (97.32–99.05)	122.36 (121.58–123.15)	103.21 (102.21–104.06)	105.15 (104.30–106.01)	126.28 (125.43–127.24)	110.26 (109.40–111.29)
BM no. 1 + SYS range + DIA range	91.10 (90.16–91.96)	120.13 (119.26–121.08)	95.94 (94.82–96.87)	102.29 (101.51–103.23)	124.17 (123.29–125.21)	107.56 (106.65–108.59)
BM no. 1 + BP class	98.31 (97.38–99.07)	122.35 (121.56–123.14)	103.30 (102.26–104.14)	105.19 (104.32–106.08)	126.33 (125.48–127.30)	110.28 (109.40–111.31)
BM no. 1 + BP class range	94.60 (93.67–95.42)	121.02 (120.22–121.88)	99.47 (98.49–100.34)	103.76 (102.96–104.67)	124.99 (124.14–126.02)	109.01 (108.13–110.00)
BM no. 2^a
BM no. 2: age, gender, Charlson index	88.69 (87.73–89.58)	110.36 (109.24–111.42)	96.30 (95.36–97.14)	98.12 (97.23–99.05)	114.18 (113.01–115.45)	106.25 (105.39–107.26)
BM no. 2 + mean SYS + mean DIA	88.72 (87.79–89.60)	110.37 (109.26–111.43)	96.32 (95.38–91.17)	98.11 (97.20–99.03)	114.19 (113.02–115.45)	106.23 (105.37–107.26)
BM no. 2 + SYS range + DIA range	84.84 (83.94–85.64)	109.75 (108.61–110.84)	92.41 (91.38–93.32)	96.73 (95.86–97.62)	113.61 (112.44–114.87)	104.78 (103.93–105.74)
BM no. 2 + BP class	88.69 (87.73–89.56)	110.33 (109.22–111.39)	96.31 (95.35–97.15)	98.10 (97.20–99.04)	114.17 (112.99–115.42)	106.23 (105.37–107.25)
BM no. 2 + BP class range	86.66 (85.87–87.53)	109.91 (108.74–110.96)	94.23 (93.31–95.10)	97.43 (96.52–98.29)	113.79 (112.55–115.00)	105.56 (104.69–106.56)
BM no. 3^a
BM no. 3: age, gender, ACG Dx-PM score	64.06 (63.19–64.98)	98.02 (96.62–99.62)	75.75 (74.85–76.68)	84.89 (83.70–85.91)	103.36 (102.01–104.58)	96.76 (95.63–97.76)
BM no. 3 + mean SYS + mean DIA	64.08 (63.22–64.95)	98.05 (96.63–99.63)	75.75 (74.87–76.68)	84.89 (83.70–85.90)	103.36 (102.01–104.54)	96.75 (95.64–97.76)
BM no. 3 + SYS range + DIA range	64.21 (63.43–65.06)	98.11 (96.71–99.66)	76.24 (75.38–77.10)	84.81 (83.65–85.81)	103.46 (102.16–104.74)	96.54 (95.42–97.56)
BM no. 3 + BP class	64.07 (63.23–64.96)	97.99 (96.65–99.60)	75.76 (74.89–76.68)	84.90 (83.71–85.91)	103.35 (102.01–104.55)	96.76 (95.63–97.75)
BM no. 3 + BP class range	64.24 (63.37–65.05)	98.01 (96.61–99.57)	76.06 (75.23–77.03)	84.87 (83.69–85.90)	103.36 (102.02–104.59)	96.68 (95.60–97.67)

MAPE (95% CI) was generated using 300 bootstrap runs, and cost was truncated at top 0.5% of total cost.

ACG, Adjusted Clinical Group; BM, base model; BP, blood pressure; CI, confidence interval; DIA, diastolic; MAPE, mean absolute prediction error; SYS, systolic.

Adding the BP ranges often generated lower MAPE in predicting concurrent costs, but such improvement in MAPE was reduced when predicting prospective costs (Table 3). For example, adding the SYS and DIA ranges statistically significantly decreased MAPE of predicting concurrent total cost from 88.69 (87.73–89.56) to 84.84 (83.94–85.64) in BM no. 2. The impact of the SYS and DIA ranges on predicting prospective total cost was much smaller with MAPE decreasing from 98.12 (97.23–99.05) to 96.73 (95.86–97.62) in BM no. 2 (Table 3).

Despite the effect of the SYS and DIA ranges in reducing MAPE in BM no. 2, including the SYS and DIA means to BM no. 2 did not statistically significantly decrease MAPE in predicting either concurrent or prospective costs. Indeed, adding SYS and DIA averages slightly increased MAPE in most predictions when added to BM no. 2 (Table 3).

None of the BP variables statistically significantly reduced MAPE for either concurrent or prospective costs when added to BM no. 3. Adding BP range variables (ie, the SYS and DIA ranges, or the BP class range) to BM no. 3 increased MAPE of concurrent total cost and decreased that of prospective total cost; however, none was statistically significant. Adding BP mean variables (ie, the SYS and DIA means, or the BP class) minimally changed MAPE with none being statistically significant (Table 3).

Impact of adding BP measures on the AUC of utilization

The AUC of BMs in predicting utilization outcomes (eg, hospitalization) increased after including the Charlson and ACG Dx-PM comorbidity scores in BM no. 2 and BM no. 3, respectively (Table 4). For example, the concurrent AUC of being in the top 5% of total cost statistically significantly increased from 0.613 (0.601–0.625) in BM no. 1 to 0.751 (0.738–0.763) in BM no. 2 and to 0.936 (0.931–0.941) in BM no. 3. The same trend was observed among BMs with similar added BP variables. For example, among BMs that included the BP class range as a predictor, the AUC of predicting the top 5% status increased from 0.723 (0.710–0.733) for BM no. 1 to 0.790 (0.779–0.800) for BM no. 2 and to 0.932 (0.927–0.937) for BM no. 3 (Table 4).

Table 4.

Impact of Blood Pressure Measures in Predicting Inpatient and Emergency Department Utilizations (Area Under the Curve)

Period	Concurrent			Prospective
Type of cost	Any IP	Any ED admission	Top 5%	Any IP	Any ED admission	Top 5%
BM no. 1^a
BM no. 1: age, gender	0.598 (0.587–0.608)	0.537 (0.528–0.546)	0.613 (0.601–0.625)	0.596 (0.583–0.610)	0.522 (0.515–0.532)	0.616 (0.604–0.628)
BM no. 1 + mean SYS + mean DIA	0.607 (0.595–0.619)	0.550 (0.542–0.559)	0.626 (0.612–0.639)	0.610 (0.598–0.625)	0.532 (0.523–0.540)	0.622 (0.608–0.634)
BM no. 1 + SYS range + DIA range	0.768 (0.759–0.778)	0.659 (0.651–0.667)	0.771 (0.760–0.781)	0.659 (0.645–0.672)	0.588 (0.579–0.597)	0.690 (0.679–0.703)
BM no. 1 + BP class	0.599 (0.588–0.610)	0.548 (0.540–0.557)	0.618 (0.606–0.631)	0.608 (0.595–0.620)	0.533 (0.525–0.542)	0.619 (0.607–0.631)
BM no. 1 + BP class range	0.712 (0.702–0.725)	0.614 (0.606–0.622)	0.723 (0.710–0.733)	0.632 (0.619–0.644)	0.567 (0.557–0.576)	0.666 (0.655–0.677)
BM no. 2^a
BM no. 2: age, gender, Charlson index	0.693 (0.682–0.703)	0.579 (0.570–0.586)	0.751 (0.738–0.763)	0.656 (0.644–0.671)	0.569 (0.561–0.578)	0.720 (0.708–0.732)
BM no. 2 + mean SYS + mean DIA	0.694 (0.683–0.704)	0.586 (0.579–0.594)	0.750 (0.738–0.762)	0.665 (0.653–0.677)	0.574 (0.565–0.582)	0.721 (0.709–0.733)
BM no. 2 + SYS range + DIA range	0.788 (0.778–0.798)	0.663 (0.654–0.670)	0.817 (0.806–0.826)	0.685 (0.672–0.698)	0.599 (0.591–0.608)	0.745 (0.734–0.755)
BM no. 2 + BP class	0.693 (0.682–0.704)	0.585 (0.577–0.592)	0.752 (0.739–0.763)	0.664 (0.653–0.678)	0.574 (0.564–0.582)	0.721 (0.709–0.734)
BM no. 2 + BP class range	0.749 (0.739–0.760)	0.625 (0.617–0.634)	0.790 (0.779–0.800)	0.670 (0.658–0.683)	0.587 (0.578–0.596)	0.737 (0.725–0.748)
BM no. 3^a
BM no. 3: age, gender, ACG Dx-PM score	0.844 (0.835–0.852)	0.650 (0.642–0.658)	0.936 (0.931–0.941)	0.720 (0.709–0.732)	0.616 (0.609–0.625)	0.843 (0.833–0.852)
BM no. 3 + mean SYS + mean DIA	0.844 (0.835–0.852)	0.651 (0.643–0.659)	0.936 (0.932–0.941)	0.724 (0.712–0.735)	0.617 (0.610–0.626)	0.843 (0.833–0.853)
BM no. 3 + SYS range + DIA range	0.851 (0.843–0.859)	0.681 (0.673–0.689)	0.934 (0.929–0.939)	0.721 (0.709–0.733)	0.620 (0.612–0.629)	0.843 (0.833–0.852)
BM no. 3 + BP class	0.844 (0.835–0.851)	0.651 (0.643–0.659)	0.936 (0.931–0.940)	0.724 (0.712–0.735)	0.617 (0.610–0.626)	0.843 (0.833–0.852)
BM no. 3 + BP class range	0.839 (0.830–0.847)	0.661 (0.652–0.669)	0.932 (0.927–0.937)	0.717 (0.706–0.730)	0.616 (0.609–0.624)	0.843 (0.833–0.852)

AUC (95% CI) was generated using 300 bootstrap runs, and cost was truncated at top 0.5% of total cost.

ACG, Adjusted Clinical Group; BM, base model; BP, blood pressure; CI, confidence interval; DIA, diastolic; ED, emergency department; SYS, systolic; IP, inpatient hospitalization; Top 5%, being in the top 5% of total use.

Adding the SYS and DIA ranges to BM no. 2 generated the highest increase in AUC of predicting any of the concurrent and prospective utilization markers (ie, any hospitalization, any ED admission, and being in the top 5% of total cost). For example, adding the SYS and DIA ranges increased AUC of predicting concurrent hospitalization from 0.693 (0.682–0.703) to 0.788 (0.778–0.798) in BM no. 2, which represents a 14.1% improvement in model performance. The impact of the SYS and DIA ranges on predicting prospective hospitalization was smaller with AUC increasing from 0.656 (0.644–0.671) to 0.685 (0.672–0.698) in BM no. 2 (Table 4).

Adding the BP ranges (ie, SYS and DIA ranges, and BP class range) to BM no. 3 statistically significantly improved the prediction of concurrent ED admissions. For example, the AUC of BM no. 3 improved from 0.650 (0.642–0.658) to 0.681 (0.673–0.689) after adding the SYS and DIA ranges to predict concurrent ED admissions. BP ranges, however, did not statistically significantly improve the AUC of the BM no. 3 to predict other utilization outcomes such as hospitalization or being in the top 5% of total cost. SYS and DIA averages and BP class did not improve the BM no. 3 AUC for any of the utilization outcomes, neither concurrent nor prospective (Table 4).

See the online (Supplementary Tables S1–S3) for a summary of ratio improvements of all performance measures (ie, adjusted R ², MAPE, and AUC) across all BMs (ie, BM no. 1, BM no. 2, and BM no. 3) and outcomes (ie, concurrent and prospective costs and utilizations).

Discussion

Health care providers are increasingly using EHR data, instead of or in conjunction with insurance claims data, to identify and manage high-risk patients.¹⁹ Prior research has shown the value of unique EHR data fields, not found in routine insurance claims data, that can improve the risk stratification and predictive models of health care utilization.¹⁶ For example, past studies have shown the value of EHR's prescription information,^14,18,33,34 EHR's laboratory results,¹³ EHR's unstructured data such as clinical notes,^15,35,36 and EHR's vital signs such as body mass index (BMI),¹⁷ in improving the overall prediction of health care costs and utilization. Despite the prevalence of hypertension, BP has not been assessed as a unique EHR data type in improving the risk stratification process. To address this gap, this study aimed to determine whether adding BP markers improves the performance of predictive models of cost or utilization among patients receiving ambulatory care.

The study results show that BP variables can improve common risk prediction models of health care utilization and cost (Tables 1–3). The added value of BP variables, however, was diminished with more sophisticated prediction models such as the ACG Dx-PM-based model (ie, BM no. 3). BM no. 1 showed the highest gain after adding BP variables, with some BP variables improving the prediction of concurrent total cost (ie, adjusted R ²) by ∼6-folds, and improving the prediction of concurrent inpatient admission (ie, AUC) by 29%. BM no. 2, which includes the Charlson comorbidity index, also showed benefiting from the BP variables; however, the gains were smaller than BM no. 1. Some of the BP variables improved the prediction of concurrent total costs (ie, adjusted R ²) by ∼33% in BM no. 2, while improving the prediction of concurrent hospitalization (ie, AUC) by 14%. These improvements were generally smaller in prediction prospective costs and utilizations. See the online supplemental material (Supplementary Tables S1–S3) for a ratio-based summary of added value of BP variables in improving the performance of the base risk prediction models.

In this study, the added value of BP variables in predicting health care utilization varied depending on the statistical approach used to summarize them in the specified timeframe. The results suggest that measurements that depict the variation of BP annually (eg, SYS and DIA ranges, or BP class ranges) can improve the common utilization prediction models considerably more than measurements that simply represent the overall status of BP (eg, SYS and DIA means, or BP class). The insignificance of BP means in improving the predictive powers of BM no. 2 or BM no. 3 may be due to the presence of the hypertension diagnosis in the common comorbidity scores, which highly correlates with BP means.^1,5,6 Conversely, the significance of BP ranges in improving the base predictive models may represent a different concept of BP measurement not already captured by the diagnosis of hypertension. Indeed, the BP ranges provide a measure to capture fluctuations in BP within a specific timeframe that may hint an uncontrolled BP status regardless of the diagnosis of hypertension, hence providing an added value in improving diagnosis-based predictive models of utilization and cost.

Past studies have mainly focused on assessing the added value of vital signs in improving the prediction of short-term clinical outcomes.³⁷ The study team found only 1 study that has explored the use of vital signs (ie, BMI) in predicting annual health care cost and utilization.¹⁷ The study found that adding BMI levels to a BM that includes the Charlson index improved the prediction (adjusted R ²) of concurrent and prospective total costs by 4.02% and 13.24%, with the latter being a statistically significant improvement. In contrast, this study found that BP ranges result in higher rates of improvement in adjusted R ² for predicting concurrent and prospective total costs (13.48% and 32.89%, respectively), with both being a statistically significant increase from the BM that includes the Charlson index (ie, BM no. 2). Nevertheless, when comparing BP ranges with BMI levels, assessing the higher rates of improvement in adjusted R ² is inconclusive as the underlying population of the BMI study differs from that of this study. Future studies should investigate the value of BP data versus or in combination with BMI data in improving risk stratification models using the same population of patients. Future research should also explore the use of ranges, instead of groups, levels, or means, for assessing the value of vital signs such as BMI in predicting health care utilization and cost.

The added value of BP ranges (ie, SYS and DIA ranges and BP class ranges) was considerable for BM no. 1, which only includes demographic information in the BM. This has significant implications for nontraditional settings to risk stratify newly enrolled patients and/or when clinical data such as diagnostic data are not readily available. For example, among newly enrolled patients of a telemonitoring program, clinical data may not be available at start, hence the vital signs (eg, BP) that are captured through the telemonitoring program can play a key role to improve the risk stratification of such patients. Another example is using the enhanced BM no. 1 (with BP ranges) in consumer health IT products that capture vital signs using mobile health solutions. Diagnostic data are often incomplete in such consumer health IT solutions, hence the addition of vital signs such as BP can greatly enhance risk stratification efforts for consumers.

Future studies should explore the value of BP measures in improving the predictive models of health care utilization in other population groups (eg, children and older adults) and different clinical settings (eg, inpatient and nursing home). Future research should also examine the simultaneous use of different vital signs, and their interactions, in improving the prediction of health care utilization and cost. Other potential research topics may include studying vital signs within different/shorter timeframes (eg, months instead of years), assessing the correlation of vital signs and missing rate of related diagnosis (eg, BP variables vs. completeness of hypertension diagnosis), and identifying subpopulations who will benefit the most from incorporating vital signs for risk stratification and case management purposes. In addition, as EHR data are becoming more standard for provider-based population health management, vital signs such as BP can be used for various population level management purposes beyond the predictive modeling. And, as providers are increasingly capturing data on social determinants of health in EHRs,^38,39 future research should also assess the potential disparities in using vital signs to improve risk stratification efforts among minority and vulnerable populations.

Limitations

This study has several limitations. First, the study population was limited to 18–64 years old patients. This was mainly due to the different effect of BP on utilization in the pediatric population and missing Medicare data in this study. Second, this study only focused on patients visiting ambulatory outpatient settings. The effect of BP on risk prediction may be different in other clinical settings, hence the generalizability of the study results should be assessed before adopting BP measures to improve risk stratification models in practice. Third, patients who missed the minimum data quality requirements (eg, missing BP data) were excluded in this study. Future studies should examine the underlying differences in utilization among patients without BP records and populations with BP data (eg, patients with no BP data may include more high utilizers). Fourth, the BP data used in this study were limited to data extracted from a single health provider EHR system. BP data captured and recorded in EHRs of providers practicing out of this health network may provide additional information. Future research should explore the value of EHR interoperability in improving the completeness of vital sign data for population health management. Finally, this study only assessed the value of BP data in BMs using the traditional linear and logistic regressions. The values of BP and other vital signs are yet to be measured when incorporated in nontraditional risk stratification approaches such as machine learning methods.⁴⁰

Conclusion

Adding BP measures to utilization prediction models improves their ability to predict health care costs and utilization; however, the added values of BP measures are attenuated when the base risk stratification models incorporate diagnosis-based comorbidity indexes. The values of BP measures are most prominent when represented as BP ranges in predicting concurrent costs and utilizations (eg, hospitalization and ED admission). Future research on risk stratification models should focus on subpopulations that benefit the most from incorporating vital signs such as BP measures in risk prediction and case management efforts.

Footnotes

Acknowledgments

The authors acknowledge the support of HealthPartners, Inc., (Bloomington, MN) in sharing the underlying data and providing the research team with technical support throughout the research. The authors also acknowledge the technical and management support provided by other Johns Hopkins team members (Tom Richards, Fardad Gharghabi [deceased], and Elyse Lasser).

Authors' Contributions

Dr. Kharrazi served as the principal investigator of the project. Dr. Kharrazi and Dr. Chang designed the study and selected the appropriate methodology for the evaluation. Dr. Chang and Dr. Kharrazi performed the analysis. All authors (ie, Dr. Kharrazi, Dr. Chang, Dr. Weiner, and Dr. Gudzune) reviewed the results and contributed to the interpretation of the results. Dr. Kharrazi and Dr. Chang drafted the article. All authors (ie, Dr. Kharrazi, Dr. Chang, Dr. Weiner, and Dr. Gudzune) reviewed the article before submission.

Author Disclosure Statement

No competing financial interests exist.

Funding Information

No external funding supported this study. All authors are Johns Hopkins employees. The Johns Hopkins University receives royalties for nonacademic use of software based on the Johns Hopkins Adjusted Clinical Group (ACG) methodology.

Supplementary Material

Supplementary Table S1

Supplementary Table S2

Supplementary Table S3

References

Fryar

, Ostchega

, Hales

, Zhang

, Kruszon-Moran

Hypertension prevalence and control among adults: United States, 2015–2016. NCHS Data Brief, 2017:1–8.

Office of Disease Prevention and Health Promotion. Healthy people 2020: heart disease and stroke. 2014. https://www.healthypeople.gov/2020/topics-objectives/topic/heart-disease-and-stroke Accessed July 10, 2021 .

Ambrosius

, Sink

, Foy

, et al. The design and rationale of a multicenter clinical trial comparing two strategies for control of systolic blood pressure: the Systolic Blood Pressure Intervention Trial (SPRINT). Clin Trials, 2014; 11:532–546.

Lawes

, Bennett

, Feigin

, Rodgers

. Blood pressure and stroke: an overview of published reviews. Stroke, 2004; 35:1024.

Zanchetti

, Thomopoulos

, Parati

. Randomized controlled trials of blood pressure lowering in hypertension: a critical reappraisal. Circ Res, 2015; 116:1058–1073.

Chobanian

, Bakris

, Black

, Cushman

, Green

, Izzo

. Seventh report of the Joint National Committee on prevention, detection, evaluation, and treatment of high blood pressure. Hypertension, 2003; 42:1206–1252.

Rui

, Okeyode

. National ambulatory medical care survey: 2015 state and national summary tables. 2015. https://www.cdc.gov/nchs/data/ahcd/namcs_summary/2015_namcs_web_tables.pdf Accessed July 9, 2021 .

Mozaffarian

, Benjamin

, Go

, et al. Heart disease and stroke statistics-2015 update: a report from the American Heart Association. Circulation, 2015; 131:e29–e322.

Zhang

, Wang

, Zhang

, Fang

, Ayala

. Medical expenditures associated with hypertension in the US, 2000–2013. Am J Prev Med, 2017; 53(6 Suppl 2):S164–S171.

10.

Wang

, Zhou

, Zhuo

, Zhang

. Annual total medical expenditures associated with hypertension by diabetes status in U.S. adults. Am J Prev Med, 2017; 53(6S2):S182–S189.

11.

IOM (Institute of Medicine). A population-based policy and systems change approach to prevent and control hypertension. Washington, DC: The National Academies Press, 2010.

12.

Kharrazi

, Chi

, Chang

, et al. Comparing population-based risk-stratification model performance using demographic, diagnosis and medication data extracted from outpatient electronic health records versus administrative claims. Med Care, 2017; 55:789–796.

13.

Lemke

, Gudzune

, Kharrazi

, Weiner

. Assessing markers from ambulatory laboratory tests for predicting high-risk patients. Am J Manag Care, 2018; 24:e190–e195.

14.

Chang

, Richards

, Shermock

, et al. Evaluating the impact of prescription fill rates on risk stratification model performance. Med Care, 2017; 55:1052–1060.

15.

Kan

, Kharrazi

, Leff

, et al. Defining and assessing geriatric risk factors and associated health care utilization among older adults using claims and electronic health records. Med Care, 2018; 56:233–239.

16.

Kharrazi

, Weiner

. A practical comparison between the predictive power of population-based risk stratification models using data from electronic health records versus. Med Care, 2018; 56:202–203.

17.

Kharrazi

, Chang

, Heins

, Weiner

, Gudzune

. Assessing the impact of body mass index information on the performance of risk adjustment models in predicting health care costs and utilization. Med Care, 2018; 56:1042–1050.

18.

Kharrazi

, Ma

, Chang

, Richards

, Jung

Comparing the predictive effects of patient medication adherence indices in electronic health record and claims-based risk stratification models. Popul Health Manag, 2021 [Epub ahead of print].

19.

Pandya

, Chang

, Kharrazi

Electronic health record-based risk stratification: a potential key ingredient to achieving value-based care. Popul Health Manag, 2021 [Epub ahead of print].

20.

Kharrazi

, Lasser

, Yasnoff

, et al. A proposed national research and development agenda for population health informatics: summary recommendations from a national expert workshop. J Am Med Inform Assoc, 2017; 24:2–12.

21.

Health Partners. About healthpartners: quick facts. 2019. https://www.healthpartners.com/about/facts/ Accessed July 3, 2021 .

22.

Marler

, Jacob

, Lehoczky

, Shapiro

. The statistical analysis of treatment effects recordings in 24-hour ambulatory blood pressure. Stat Med, 1988; 7:697–716.

23.

Patel

, Lagord

, Waterall

, Moth

, Knapton

, Deanfield

. Online self-assessment of cardiovascular risk using the Joint British Societies (JBS3)-derived heart age tool: a descriptive study. BMJ Open, 2016; 6:e011511.

24.

Starfield

, Weiner

, Mumford

, Steinwachs

. Ambulatory care groups: a categorization of diagnoses for research and management. Health Serv Res, 1991; 26:53–74.

25.

Weiner

, Starfield

, Steinwachs

, Mumford

. Development and application of a population-oriented measure of ambulatory care casemix. Med Care, 1991; 29:452–472.

26.

Afshar

, Li

, Chen

, et al. An exploratory data quality analysis of time series physiologic signals using a large-scale intensive care unit database. JAMIA Open, 2021; 4:ooab057.

27.

Charlson

, Charlson

, Peterson

, Marinopoulos

, Briggs

, Hollenberg

. The Charlson comorbidity index is adapted to predict costs of chronic disease in primary care patients. J Clin Epidemiol, 2008; 61:1234–1240.

28.

Health Services Research & Development Center at the Johns Hopkins University Bloomberg School of Public Health. The Johns Hopkins ACG case-mix system reference manual version 11.0. Baltimore: The Johns Hopkins University Bloomberg School of Public Health; 2014. Technical Reference Guide.

29.

Duncan

Healthcare risk adjustment and predictive modeling. Winsted, CT: ACTEX Publications, 2011.

30.

de Myttenaere

, Golden

, Le Grand

, Rossi

. Mean absolute percentage error for regression models. Neurocomputing, 2016; 192:38–48.

31.

Greenland

, Senn

, Rothman

, et al. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol, 2016; 31:337–350.

32.

Lin

, Lucas

, Shmueli

. Research commentary—too big to fail: large samples and the p-value problem. Inf Syst Res, 2013; 24:906–917.

33.

, Jung

, Chang

, et al. Assessing the population-level correlation of medication regimen complexity and adherence indices using electronic health records and insurance claims. J Manag Care Spec Pharm, 2020; 26:860–871.

34.

Chang

, Kan

, Shermock

, et al. Integrating e-prescribing and pharmacy claims data for predictive modeling: comparing costs and utilization of health plan members who fill their initial medications with those who do not. J Manag Care Spec Pharm, 2020; 26:1282–1290.

35.

Chen

, Dredze

, Weiner

, et al. Identifying vulnerable older adult populations by contextualizing geriatric syndrome information in clinical notes of electronic health records. J Am Med Inform Assoc, 2019; 26:787–795.

36.

Chen

, Dredze

, Weiner

, et al. Extraction of geriatric syndromes from electronic health record clinical notes: assessment of statistical natural language processing methods. JMIR Med Inform, 2019; 7:e13039.

37.

Brekke

, Puntervoll

, Pedersen

, Kellett

, Brabrand

. The value of vital sign trends in predicting and monitoring clinical deterioration: a systematic review. PLoS One, 2019; 14:e0210875.

38.

Lasser

, Kim

, Hatef

, et al. Social and behavioral variables in the electronic health record: a path forward to increase data quality and utility. Acad Med, 2021; 96:1050–1056.

39.

Hatef

, Weiner

, Kharrazi

. A public health perspective on using electronic health records to address social determinants of health: the potential for a national system of local community health records in the United States. Int J Med Inform, 2019; 124:86–89.

40.

Kan

, Kharrazi

, Chang

, et al. Exploring the use of machine learning for risk adjustment: a comparison of standard and penalized linear regression models in predicting health care costs in older adults. PLoS One, 2019; 14:e0213258.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.05 MB

Assessing the Added Value of Blood Pressure Information Derived from Electronic Health Records in Predicting Health Care Cost and Utilization

Abstract

Introduction

Objective

Methods

Data source

Study design and subjects

Independent variables

Outcome variables

Statistical methods

Results

Population specifications

Impact of adding BP measures on the adjusted R 2 of costs

Impact of adding BP measures on the MAPE of costs

Impact of adding BP measures on the AUC of utilization

Discussion

Limitations

Conclusion

Footnotes

Acknowledgments

Authors' Contributions

Author Disclosure Statement

Funding Information

Supplementary Material

References

Supplementary Material

Impact of adding BP measures on the adjusted R ² of costs