Abstract
Introduction
Even though the use of telemedicine and telecare interventions in healthcare has grown over the last 30 years, clinicians, healthcare providers, and decision makers still have to make choices on which interventions to implement or use in healthcare because of financial constraints. Therefore, decisions have to be made to see whether the additional benefit of this new telemedicine/telecare intervention compared with the current alternative(s) is worth the additional cost. 1 The question remains as to whether telemedicine services could provide a significant improvement, in terms of both clinical and cost-effectiveness.
Over the last two decades, growing numbers of studies have been published on the effectiveness and cost-effectiveness of telemedicine interventions; however, the economic analyses associated with these interventions have not always been correctly reported. Several guidelines exist for assessing the quality of health economic studies, such as the standard checklists for economic evaluations by Drummond et al., 1 Evers et al., 2 and Husereau et al. 3 Likewise, McIntosh and Cairns 4 and Sisk and Sanders 5 have provided frameworks to serve as a guide when conducting economic evaluations of telemedicine technologies. They also point out some of the challenges faced when evaluating telemedicine technologies, such as the constantly changing technology, small sample sizes, and the lack of methodological rigor.
Systematic reviews are becoming increasingly popular in evidence-based research. They summarize a vast body of evidence and can be used by policy makers to make informed decisions. Because of the growing number of studies reporting on the cost-effectiveness of telemedicine, there have also been several systematic reviews on the cost-effectiveness of telemedicine studies being published. However, there has been no general agreement in the methods and analytical results that these systematic reviews have raised, and therefore a critical appraisal of the quality of these review articles is warranted. There have been several reviews looking at systematic reviews of telemedicine: Bahaadinbeigy et al. 6 highlighted the need for an overview of the current situation regarding systematic reviews of telemedicine, Ekeland et al. 7 looked at the effectiveness of telemedicine with a brief section on economic analysis, and Ekeland et al. 8 looked at the methodologies for assessing telemedicine. As far as we are aware, this is the first systematic review looking at the systematic reviews of cost-effectiveness of telemedicine.
The aim of this study was to conduct a review that was aimed at identifying published systematic reviews on the cost-effectiveness of telemedicine studies, providing a critique of these reviews in order to determine their validity, and also providing recommendations to guide future research aimed at assessing the cost-effectiveness of telemedicine studies.
Materials and Methods
Search Strategy and Data Extraction
A systematic search was undertaken to identify all published systematic reviews looking at the cost-effectiveness of telemedicine studies. This article focuses on the general field of telemedicine (i.e., on all health areas) rather than the application of telemedicine in certain health areas, such as heart failure 9 or chronic diseases, 10 to make the results more generalizable. The following databases were searched from their inception date until the last week in July 2012: Medline, Embase, ISI Social Science Index, Cochrane Database of Systematic Reviews, NHS Economic Evaluation Database, and the Database of Abstracts of Reviews of Effects. Search terms included “review” terms combined with “telemedicine” terms (Fig. 1). We did not use specific terms such as cost-effectiveness or economics as we did not want to miss any potential reviews.

Example of the search strategy in Medline.
The citation indices including abstracts for each of the search strategies were exported into the citation manager software (Endnote Windows version 9). All duplicates within Endnote were deleted. For three of the databases—NHS Economic Evaluation Database, Database of Abstracts of Reviews of Effects, and Cochrane—abstracts could not be exported to Endnote; therefore these abstracts were cross-checked against the Endnote database for duplicates by hand.
This review adheres to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) checklist, except for items that were not relevant for a systematic review of reviews. 11 All titles, bibliographic data, and abstracts of the results from these searches were then read and scanned by H.M. and H.G. for relevance based on the inclusion and exclusion criteria. Any disagreements were resolved by a third reviewer (R.O.). The inclusion criteria were journal articles relating to published systematic reviews that examined the cost-effectiveness of telemedicine applications. The exclusion criteria for the abstracts included reviews with no cost-effectiveness studies, in specific telemedicine areas (and not a general overview of telemedicine), studies looking at e-health, non-journal articles such as books and reports, non-English articles, and articles where one review led to multiple reporting of the same review results. The study flow was documented using a PRISMA diagram. 11
Data extraction forms were set up in Microsoft® (Redmond, WA) Excel™ to help synthesize the results extracted from the reviews in the form of descriptive statistics. A comprehensive quality assessment of each review using the validated scale of Oxman and Guyatt 12 was undertaken by H.M. and H.G. The instrument contains nine items pertaining to individual aspects in the reporting of a systematic review. A 10th item collates the information from the first nine and asks the assessor to grade the review articles based on an ordinal scale ranging from 1 (minimal flaws) to 7 (extensive flaws).
Results
A detailed search of the electronic databases retrieved 5,066 abstracts; after the removal of duplicates, 4,116 abstracts remained. The full titles and abstracts of the 4,116 abstracts were read to check for relevance. After the exclusion criteria were applied, 113 abstracts remained, and the full-text articles were obtained. Nine published systematic reviews remained after reviewing the full-text articles (Fig. 2).

Preferred Reporting Items for Systematic Reviews and Meta-Analysis flow diagram showing the review selection process.
Summary of Identified Reviews
Nine reviews were published between 2001 and 2012 that assessed studies on the cost-effectiveness of general telemedicine, arranged chronologically according to publication year. All of the reviews were similar in terms of their stated purpose, and the objectives were clearly stated. Table 1 highlights the details for the literature searches used for each systematic review. The number of databases searched ranged from 413 to 10. 14,15 In terms of time frame for review searches, most of the reviews were from inception of the database to just before the article was published; however, Hailey et al. 16 had only a 2½-year time frame for their review searches, as this was an extension of their previous review. 17 The numbers of telemedicine and economic search terms were not stated in two reviews, 16,18 so the search strategy that was undertaken could not be replicated; the number of telemedicine terms ranged from 413,17,19,20 to 10, 21 and the number of economic terms ranged from 220 to 10. 21 The number of abstracts retrieved (after removal of duplicates) ranged from 60516 to 5,219. 18 In terms of the articles that were included in the reviews, the numbers ranged from 921 to 8015; perhaps this higher number of articles for the latter review was because of the fact it was the latest review to be published.
Details of Literature Search for Each Systematic Review Including Databases and Search Strategy
Hailey et al.(16) provided 46 studies from 48 articles (that is, two articles reported the same study) and was an extension of Hailey et al.(17)
Peeters et al. (2011) reported nine studies from 20 articles.
Table 2 shows the general characteristics of studies that were included in each of the systematic reviews. All reviews except one 13 stated the different health areas in which telemedicine was used, and only five of the nine reviews stated the different countries in which the telemedicine intervention was used. 13,15,17,18,21 In terms of study design, the three earlier studies 13,17,19 only stated the study design for some of the articles that were included; however, the majority of reviews tended to have nonrandomized controlled trials, although Rojas and Gagnon 18 and Wade et al. 20 had more randomized trials included in their reviews than nonrandomized trials. The majority of reviews highlighted that telemedicine was only compared with one comparator; however, from the review of Wade et al., 20 over half of the included studies did not state how many comparators with which telemedicine was compared. Most of the reviews did not mention for how long the telemedicine intervention was evaluated, nor did they state the sample sizes of the included studies.
Before and after studies.
Five before and after studies, six diagnostic accuracy studies, and one decision modeling study.
Four before and after studies and six decision modeling studies.
Two decision modeling studies.
NS, not stated; RCT, randomized controlled trial.
With the exception of three of the reviews, 14,18,19 all other reviews used checklists for economic evaluation. One review 13 used a checklist for assessing the quality of economic evaluations formulated by Haycox and Walley, 22 three reviews 15 –17 adopted the checklist of Drummond et al., 1 one review 21 adopted the checklist of Evers et al., 2 and the final review 20 assessed included articles according to the grading system of Chiou et al. 23
The quality scores obtained using the index of Oxman and Guyatt 12 varied from 4 (major/minor flaws) to 7 (minimal flaws); five of the nine reviews indicating minimal flaws in study performance of risk assessments 13,15,17,20,21 (Table 1).
Summary of Economic Evaluation Methods Used
Table 3 shows the type of economic evaluation that was conducted and the perspective adopted for each of the included studies in the review. The four earlier reviews 13,16,17,19 included studies in which the majority were either cost analyses or cost-minimization analyses; one review did not state the type of economic evaluations that were conducted in their included studies, 18 the later four reviews 14,15,20,21 tended to use the different range of economic evaluations that are available, 1 and there was a growing number of cost-utility studies. In terms of perspective adopted by the studies, Rojas et al., 18 Roine et al., 19 and Hailey et al. 16 did not state in their reviews the perspective of the analysis that was adopted by any of their included studies; however, the other six reviews found that most articles did not state the perspective from which the analysis was conducted.
Type of Economic Evaluation and Perspective of Analysis
CBA, cost–benefit analysis CCA, cost-consequences analysis; CEA, cost-effectiveness analysis; CMA, cost minimization analysis; CUA, cost-utility analysis; NS, not stated.
Table 4 shows the results from the economic evaluations for each of the included studies in the reviews. Hailey et al. 16 and Wade et al. 20 did not clarify in their reviews what type of costs were included in each of the articles; the majority of studies that were included in each of the reviews had measured direct medical costs, whereas some of the reviews also included direct nonmedical and indirect costs. In terms of outcomes that were included in these studies, from the reviews we found that the majority of outcomes were either clinical or economic, and very few studies in each of the reviews included any utility-based measures as an outcome measure. These outcome measures included the generic-preference based measures such as the EQ-5D, SF-12, and SF-36, which can measure overall health-related quality of life. The majority of reviews found that the included studies did not report any incremental cost-effectiveness ratios; the latter study by Mistry 15 found that 20% of the articles included in the review had reported an incremental cost-effectiveness ratio. Examples of some of the cost-effectiveness ratios included were cost per patient journey avoided, cost per additional patient treated, cost per waiting day avoided, and cost per quality-adjusted life year gained. All four ratios are useful to clinical decision makers; however, the last ratio is perhaps more informative for policy makers as this ratio allows for comparison across different diseases and treatments, and this ratio, depending on the threshold that policy makers are willing to pay for a new treatment/intervention, would enable them to decide whether this new treatment/intervention is cost-effective or not to the health service. Discounting was not conducted in most of the studies that were included in each of the nine reviews. In terms of sensitivity analyses, all but two reviews 18,21 mentioned the number of actual studies in their reviews that conducted some sensitivity analyses.
Costs, Outcomes, Incremental Cost-Effectiveness Ratios, Discounting, and Sensitivity Analyses
DALYs, disability-adjusted life years; ICER, incremental cost-effectiveness ratio; LYGs, life-years gained; NA, not applicable; NS, not stated; QALYs, quality-adjusted life-years.
Cost-Effectiveness of Telemedicine
In terms of conclusions provided by the different reviews, a general consensus found that there was no further evidence that telemedicine was cost-effective, and further assessment was needed.
The reviews also highlighted various issues: conclusions were influenced strongly by local conditions, thereby making it hard to generalize 17 ; a major dearth in the methodology of the studies and its ability to produce robust and generalizable conclusions 13 ; the overall reliability of a study for decision making will depend on both performance and study design 16 ; and the challenges faced in economic evaluations in telemedicine are diverse because of nature of both study contexts and methods applied. 14 Overall, there was also a general consensus that most studies were not carried out in accordance with standard economic evaluation guidelines.
Discussion
As far as we are aware, this is the first systematic review of published systematic reviews looking at the cost-effectiveness of telemedicine. Based on our specific inclusion criteria, this appraisal identified nine systematic reviews that span a 12-year period, addressing the cost-effectiveness of telemedicine applications in the delivery of healthcare. The reviews identified and addressed a wide variety of telemedicine applications in different health areas; however, the evidence with regard to its cost-effectiveness has remained inconclusive.
In summary, there were slight variations in the research question for each review; however, all reviews wanted to systematically assess the cost-effectiveness of general telemedicine interventions. Methods for literature search used are vital to ensure that the review is complete and up-to-date and also for readers to replicate these searches. For most of the reviews, the search strategies used were clear and comprehensible, and there was a more coherent flow of methodology with each succeeding review.
Most reviews reported that analyses of costs and cost-effectiveness in many of the included studies lack consistency as to how cost estimates were collected and reported. The information on benefits in all the studies ranged from clinical outcomes to cost savings from avoided travel through to quality of life measures; however, these benefits were not properly reported in most studies to enable a proper cost-effectiveness analysis.
The overall cost-effectiveness of telemedicine was inconclusive mainly because many studies reported both significant and nonsignificant findings. All the reviews expressed difficulty when drawing conclusions on the overall cost-effectiveness, as included studies were often small scale with poor designs and ill-defined outcomes. Conversely, Rojas and Gagnon 18 did not provide assertions on the cost-effectiveness, presumably because of the lack of proper measures used in reporting cost-effectiveness, so therefore the authors based their conclusions on general statements rather than results presented in their review.
There are also challenges in generalizability of the cost-effectiveness of telemedicine due to the heterogeneous nature of the fields to which telemedicine was applied. Issues such as the variation in study design, setting, technologies used, and health areas to which telemedicine was applied can impact on its generalizability. However, Wade et al. 20 tried to deal with this by only considering real-time telemedicine. These authors suggested that grouping telemedicine applications into similar organizational settings can mitigate the problem of generalizability. Because of the heterogeneous nature of the data of the included studies, no reviews have conducted a meta-analysis; however, this is not unusual as results from cost-effectiveness studies are not usually presented as a meta-analysis or a meta-review.
We found that, over time, reporting of the cost-effectiveness has generally improved, but there is still room for a lot more improvement. As with the previous systematic reviews, Mistry 15 found that there was a continual increase in use of economic tools for evaluation of telemedicine over time. However, there still remains a lag in economic studies' adherence to the general standards for health economic evaluations 1,2 and the transparency of reporting methodologies and results. In addition, researchers do not use the existing frameworks for conducting economic evaluations in telemedicine. 4,5 Zanaboni and Wootton 24 found that the widespread adoption of telemedicine was a major and still an underdeveloped challenge. They concluded that even though “cost-effectiveness of telemedicine is necessary, it was not a sufficient condition for the widespread adoption of telemedicine.”
The findings from this appraisal also raise questions as to whether reviews published may have included studies that reported bad-quality results. The only way to look at this in more detail would have been to compare the results reported in the reviews with the original studies included in each of the reviews. The most recent review by Mistry 15 also highlighted some important considerations in assessing articles for publication in journals, such as restrictions on word counts preventing detailed reporting of methodologies and findings, and often health economists are not named as co-authors on articles, or they may not have contributed to analysis or to the write-up of the cost-effectiveness analysis.
This review did not consider applications in a specific health area, and we realize that this can be seen as a limitation. Although this review covers a wide variety of clinical specialty applications, it is quite difficult to reach reliable conclusions, as each health area and the nature of the applications are very different, but, as we have emphasized earlier, this is the first review of systematic reviews looking at the cost-effectiveness of telemedicine. Another limitation is that the conclusions in this article regarding the cost-effectiveness of telemedicine are drawn from the findings of systematic reviews that are mostly based on cost-effectiveness studies conducted that have immense methodological flaws. As Booth 25 reported, evidence from a critical appraisal is in itself limited by the quality of reporting of the included reviews. Furthermore, these reviews on the cost-effectiveness of telemedicine need to be continuously updated because new research is being done all the time as technologies are changing and some telemedicine technologies are becoming obsolete.
Ultimately, the systematic reviews unanimously concluded that evidence on the cost-effectiveness of telemedicine in general is inconclusive. Therefore, this review still reaffirms the knowledge of the inconclusive nature of telemedicine assessment, which is based on small sample sizes, short-term time horizons, failure to state the study perspective, and no proper calculation of costs and benefits.
Even though the quality of most the systematic reviews indicated minimal flaws, the methodological deficiencies found in the individual studies included in the systematic reviews made them more prone to bias and limit their value to aid in decision making. Therefore, decision makers should take caution when making informed decisions, and their decisions should be made based on both the methodological rigor of the research and the quality of its findings. However, caution needs to be taken as both the aspect of study context and methods applied are equally important. Ultimately, the value of systematic reviews depends on their replicability, validity, precision, and relevance.
Finally, findings from this appraisal reinforce the need for further studies targeted at economic evaluation of telemedicine to provide robust and empirically valid evidence regarding the cost-effectiveness of telemedicine.
Footnotes
Acknowledgments
This review originally formed part of the MSc dissertation thesis of H.G. and has been updated accordingly for publication.
Disclosure Statement
No competing financial interests exist.
