Abstract
In England, general practitioners are incentivized through a national pay-for-performance scheme to adopt evidence-based quality improvement initiatives using a portfolio of Quality and Outcomes Framework (QOF) indicators. We describe the development of the methods used to assess the cost-effectiveness of these pay-for-performance indicators and how they have contributed to the development of new indicators. Prior to analysis of new potential indicators, an economic subgroup of the National Institute for Health and Care Excellence (NICE) Indicator Advisory Committee is formed to assess evidence on the cost-effectiveness of potential indicators in terms of the health benefits gained, compared to the cost of the intervention and the cost of the incentive. The expert subgroup is convened to reach consensus on the amounts that could potentially be paid to general practices for achieving new indicators. Indicators are also piloted in selected general practices and evidence gathered about their practical implementation. The methods used to assess economic viability of new pilot indicators represent a pragmatic and effective way of providing information to inform recommendations. Current policy to reduce QOF funding could shift the focus from national (QOF) to local schemes, with economic appraisal remaining central.
Introduction
The Quality and Outcomes Framework (QOF) is a financial incentive scheme for English general practices to improve quality of care. Practices’ performance is measured against a set of quality indicators. The scheme was introduced in 2004/2005 and is a component of the revised General Medical Services contract between the Government and general practice. The funding is intended to reward quality of care, leading to benefits for patients and the National Health Service (NHS). Research has shown that financial incentive schemes have the potential to improve the quality of primary care, although longer-term effects are still unknown.1–3 In addition, there is some evidence to suggest that these incentives have the potential to improve delivery of clinical care in underserved populations.4,5
QOF domains, indicators and points, 2015/2016.
Clinical and Public Health QOF indicators with a high points allocation, 2015/2016.
The National Institute for Health and Care Excellence (NICE) convenes the Indicator Advisory Committee, quarterly to
Prioritize suggestions for new clinical or public health topics and make recommendations for indicator development. Consider the outcome of piloting, consultation and economic appraisal of potential indicators, and make final recommendations on new indicators. Review information on the uptake of current indicators in the QOF and recommend whether any should be retired, considered for changes to points and/or thresholds, or be subject to further assessment.
Once the Committee has agreed new topic areas for indicator development, work is carried out to develop the indicators and to pilot them with up to 40 practices across the UK using a combination of qualitative and quantitative methods. For example, this may include gathering data on levels of achievement for piloted indicators and interviewing practice staff about the implementation of indicators. 10 As part of this process evaluation, work is also undertaken to assess the cost-effectiveness of incentivizing the adoption of appropriate pilot indicators. This is done to provide the Committee with evidence on the potential costs and benefits of pilot indicators alongside the evidence from piloting. This is particularly important in the current economic climate, where the efficient use of general practice resources reinforces the need to adopt cost-effective decision-making.
Methods for assessing the cost-effectiveness of QOF indicators
The methods applied to determine the cost-effectiveness of QOF indicators adopt a net (monetized) benefit approach. In summary, the method applies the following calculation to an indicator:
Variables required in cost-effectiveness analysis of QOF indicators.
To estimate the net benefit of an indicator, it is necessary to have information on the benefits and costs associated with the indicator. To gather these costs and benefits, a rapid review of the available economic evidence is carried out. The opinions of those involved in delivering the indicators at pilot GP sites are also sought. The costs associated with an indicator include service delivery costs; for example, the costs of additional GP or nurse consultations to monitor a patient’s health status over a period of time, the initial cost of implementing the intervention, and if relevant, secondary care service usage. The unit costs for these activities are obtained from sources11,12 such as the Personal Social Services Research Unit (PSSRU) unit cost database and the NHS hospital tariff. Indicator costs also include indirect consequences such as an increase in referrals resulting from more intensive monitoring. This may lead to increased health service usage costs in the short-term, as in the case of chronic kidney disease (CKD), where the introduction of estimated glomerular filtration rate (eGFR) as a prognostic indicator led to a 61 per cent increase in new patient referrals in a NHS Trust. 13
The benefits refer to health benefits which might be gained by a patient as a result of the introduction of the indicator. The monetized health benefit of the indicator refers to the value of the health improvements associated with achieving the predicted benefits of the indicator. For example, in the case of using ambulatory blood pressure monitoring (ABPM) to confirm a diagnosis of hypertension, these include the avoidance of cardiovascular events in people correctly diagnosed as hypertensive. These benefits are presented in terms of the change in quality-adjusted life years (QALYs) as a result of introducing the new indicator compared to standard practice without the new indicator. QALYs measure the life years gained adjusted for their utility using a health-related quality of life measure, such as the EQ-5D which categorizes a person’s health status as usually ranging from a value of one for someone in perfect health to a value of zero representing death. 14 NICE recommends the use of QALYs as a measure of health benefit to enable a standardized approach to economic evaluations across all health areas. 15 In making recommendations on cost-effectiveness, interventions generating QALYs valued between £20,000 and £30,000 are considered potentially cost effective by NICE. For evaluating cost-effectiveness of new indicators, the lower QALY value of £20,000 is used. In addition to the health benefits for patients, the cost-effectiveness analysis also takes into account any cost savings that might be achieved, such as reduced adverse events or avoided hospital admissions.
The QOF payments that are made on reaching particular levels of achievement for the indicator are also considered in the net benefit analysis. The QOF payment is assumed to be an incentive payment that is additional to the delivery cost. Payments are triggered once performance for an indicator exceeds the minimum threshold, i.e., the minimum proportion of the eligible population within a GP practice who receive the intervention associated with the indicator. Payments increase linearly until performance reaches the maximum threshold, which is usually around 85–90% of the eligible population. For instance, blood pressure management in secondary prevention of coronary heart disease (CHD) is incentivized. The payments for this indicator are triggered once 53% of CHD patients in a general practice have a blood pressure reading of 150/90 mmHg or less in the preceding 12 months. The level of payment increases linearly up to a maximum of 93% of CHD patients with a measured blood pressure reading of 150/90 mmHg or less. 16
Having taken account of the costs and benefits, the output of this process is an estimate of net benefit. This analysis is conducted based on the entire population of England which is assumed to comprise 7962 practices with a mean practice size of 7034 patients. 17 The findings are presented in the form of a net benefits table. The table shows different combinations of QOF points and levels of achievement (percentages of the eligible population to whom the indicator has been applied) at which the indicator can be considered to be cost-effective. This allows the Committee to consider the number of QOF points (and thus the associated incentive payments) that could be offered before the indicator would stop being cost-effective. Where the net benefit is positive, then the indicator is considered to be cost-effective (the benefits to the National Health Services (NHS) outweigh the costs). For instance, cost-effectiveness analysis for nine indicators for cardiovascular disease and diabetes (BP5, CHD9, CHD10, CHD11, CS1, DM15, DM21, LVD3 and Stroke12) implemented in 2004/2005 with direct therapeutic impact were found to have positive net benefits, with mean payments per treated patient made to general practice, ranging from £0.63 to £40.61. 18 In some cases, there may be a lack of evidence to support the use of the indicator on economic grounds, but it may be warranted according to other criteria, for example, if it is considered by patients, the public and general practices to be valuable. In order to test the robustness of the results, sensitivity analysis is conducted to indicate the extent to which costs would have to rise, or benefits and eligible population would have to fall before the indicator ceased to be cost-effective for specific numbers of points.
Procedure for cost-effectiveness analysis of new indicators: role of the economic subgroup
The Indicator Advisory Committee establishes an economic subgroup to appraise the work of the health economists involved in carrying out the economic evaluation of the pilot indicators. The subgroup is made up of committee members with relevant expertise, including practice managers, general practitioners, and patient representatives. This ensures the analysis reflects clinical practice and patients’ preferences. Prior to analysis of any new pilot indicators, the indicators taken forward for economic evaluation are agreed with the economic subgroup. The subgroup scrutinizes the rationale for the evidence and costs to be used in the net-benefit analysis. Evidence of benefits of new indicators is derived from NICE evidence-based guidelines or other robust sources of evidence if NICE guidelines are not available. 19 Once the indicators to evaluate are agreed, the health economists perform the economic analysis and present their findings to the subgroup for consideration and approval.
Considering the process in more detail, each clinical indicator attracts incentive payments through achievement of specified QOF points. The subgroup advises the health economists about the range of QOF points that should be considered for the economic analysis of each new indicator, for example, between five and 15 points. To determine the appropriate range of points, the QOF point allocation is appraised by the health economists for similar indicators which have already been implemented. This helps to promote a reasonably consistent approach within the existing QOF menu. For example, indicators which include confirmation of diagnosis consistently have relatively low levels of QOF points allocated; e.g., COPD002 (The percentage of patients with COPD, diagnosed on or after 1 April 2011, in whom the diagnosis has been confirmed by post bronchodilator spirometry between 3 months before and 12 months after entering on to the register) has a maximum of five QOF points. New indicators that are amenable to cost-effectiveness analysis are those that:
Lead to a specific treatment or therapy, e.g., cardiac rehabilitation after myocardial infarction. Have clinically significant outcomes or are a surrogate measure of a clinically significant outcomes, e.g., lower cholesterol levels in diabetes. Are likely to have relevant and robust data on costs and benefits, e.g., reducing blood pressure in older people with hypertension.
Those indicators which are less likely to be considered for cost-effectiveness analysis are those that:
Are solely process measures, e.g., the creation of a register but with no other action implied. Are unclear or inconsistent in their impacts, e.g., indicators that refer to ‘support and advice’. Are likely to lack high quality level evidence of effectiveness (e.g., no randomized controlled trials).
These are not rules but provide a guide to the usual rationale adopted in deciding whether an indicator can be assessed for its cost-effectiveness. In some cases, the effectiveness of a clinical indicator may not be clear and, in these circumstances, economic evaluation can still be performed using an approach called ‘threshold analysis’. This helps to determine the point at which an indicator is likely to become cost-effective (i.e., the net benefit becomes positive). For example, if there is a lack of evidence of the benefits of a treatment that could be incentivized through clinical indicators, threshold analysis can be used to estimate how much clinical improvement would have to be achieved before financial incentivization could be cost-effective at different levels of QOF points. In 2011, an indicator was piloted on the percentage of patients with asthma who in the previous 15 months had a record of structured asthma educational discussion. There was insufficient evidence on the health benefits of the intervention, measured by QALYs, to allow for net benefit analysis. Instead, the delivery costs, eligible population and levels of reported points achieved were modelled against estimated cost savings generated through assumptions about reductions in hospital admissions and accident and emergency department (A&E) visits avoided with the intervention. Scenarios were developed to indicate how many points might be justified before the indicator would cease to be cost-effective.
Once economic evidence is collated for the relevant pilot indicators, the information is reported to the economic sub-group for discussion and agreement. More recently, the economic sub-group has recommended further analysis to assess the extent to which the cost-effectiveness of an indicator exceeds the upper range of agreed QOF points. This allows an assessment of those indicators that are analysed as being very cost-effective, as opposed to simply reporting that the indicator is cost-effective to the upper limit of points originally agreed by the economic subgroup. As there are only a fixed number of QOF points available to incentivize general practices, it is not feasible for these extra points to be recommended for adoption. However, the upper limit of points does provide an indication of the extent to which the indicator is cost-effective.
Cost-effectiveness of new proposed pilot indicators, 2012–2013.
Reasons why cost-effectiveness analysis of new proposed pilot indicators was not possible, 2012–2013.
Limitations to the cost-effectiveness assessment method
The net benefit approach lends itself to evaluating the cost-effectiveness of indicators that have a clearly established direct therapeutic effect, ideally from robust trial evidence; that is, indicators where achievement can be shown to clearly affect health status or life expectancy, allowing the benefits to be expressed in terms of QALYs. Many of the ‘Ongoing management’ indicators have these characteristics. This approach is less applicable to evaluating ‘Initial diagnosis’ indicators, although this does not mean that such indicators are not cost-effective. In the absence of robust trial evidence, these indicators are normally evaluated for their cost-effectiveness by using clearly defined but very conservative modelling assumptions; i.e., using lower range estimates of benefits and higher range estimates for costs. Due to these conservative assumptions, ‘Initial diagnosis’ indicators may not appear as cost-effective as ‘Ongoing management’ indicators.
An example of such an indicator is the (now retired) chronic kidney disease indicator, CKD2 (The percentage of patients on the CKD register whose notes have a record of blood pressure in the preceding 15 months), which was based on the assumption that monitoring can lead to differing levels of therapy resulting in improvement in renal function. However, it could equally be argued that monitoring per se, provides little if any health benefit and the benefits associated with improved control only occur as a result of subsequent treatment. Critically, a clear link needs to be made between process measures and their clinically relevant outcomes.
It is important to note that NICE does not consider the broader societal perspective in considering costs. Hence, analyses are limited to a health services perspective which, for practical purposes, simplifies the economic evaluation. External costs such as the productivity loss or lost household production are not included in the analyses.
Additionally, the current NICE recommendation for conducting cost-effectiveness analysis only assesses the incremental changes in quality of life for patients. Conditions which require long-term care, such as dementia, will affect the quality of life of both patients, and their care-givers and family members. The 2014/2015 NICE indicators included two indicators to improve dementia care (NM64, NM65). 20 However, cost-effectiveness analysis assessing these indicators did not involve outcomes associated with family members and care-givers. If these outcomes were to be included in the analysis, these dementia indicators might be extremely cost-effective as seen in studies which have assessed outcomes in care givers. 21
Discussion
This approach to assess the economic viability of new pay-for-performance indicators represents a pragmatic and effective way of providing the NICE Indicator Advisory Committee with information to inform its decision making and recommendations for new national QOF indicators. The clear presentation of the economic net benefit taking account of both delivery and reimbursement costs, supported by close scrutiny of the clinical evidence and regular expert input, allows the Committee to understand whether the new indicator is likely to be cost-effective in general practice and, more importantly, the extent to which it is likely to be cost-effective through application of sensitivity analysis.
Currently, the economic subgroup of the Committee is presented with cost-effectiveness data based on published trial evidence to determine the effectiveness of interventions to inform the development of new pay-for-performance indicators. Based on the evaluation of the impact of QOF over the past decade, there is a risk that this may lead to over-optimism about the population health benefits of indicators22,23: at inception, the general practitioners’ pay-for-performance scheme was estimated to reduce mortality by 11 lives per 100,000 people over the first year of implementation, below the 56 lives per 100,000 people that could have been saved if all eligible patients had been treated. 22 This may be partly related to general practices already exceeding the target performance for full payments (e.g., percentage of patients already treated) when the scheme was introduced. 22 The issue of setting realistic payment thresholds is partially ameliorated by pilot testing of new indicators before implementation. Further, while there has been modest observed improvement in quality of care in the short to medium-term in indicator areas, such as reduction in hospital admissions, 24 decrease in short-term mortality 25 and modestly improved quality of care for chronic diseases, 26 the long-term impacts on costs, practitioner behaviour and population health outcomes still need evaluating. In the future, direct extraction of data on health benefits and baseline target performance from general practices, rather than basing these on trial evidence, would allow the economic subgroup and NICE to make more robust and realistic decisions about the likely effectiveness and cost-effectiveness of specific indicators within the pay-for-performance scheme.
Finally, the recommendations for new QOF pay-for-performance indicators are being developed in a climate of financial restraint. Across England in 2011–2013, practices achieved the upper payment thresholds for 87% of all clinical indicators. 27 This level of achievement was greater than policy makers had anticipated. To reduce the financial cost, the English Department of Health increased payment thresholds and reduced the points allocation. 28 As a consequence, although economic analysis may show that an indicator may be cost-effective up to a very high number of QOF points, it is not feasible to recommend those points. However, there may be other opportunities to adopt such indicators: commissioners at local and regional level are already incentivizing certain clinical activities, in some cases, using QOF-like pay-for-performance schemes. 29 Commissioners may also consider clinical areas, identified as highly cost-effective, to be appropriate to incentivize locally over and above the national QOF scheme. This may be particularly relevant for indicators that have been shown to be cost-effective up to much higher levels of QOF points than would be feasible to be awarded within the finite and reducing budget of the national QOF scheme.
In conclusion, the reduced level of funding available through the national pay-for-performance scheme means economic appraisal is more important than ever to ensure that the most cost-effective indicators are incentivized among those which are retained in the scheme.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
