Abstract
Introduction
Overweight and obesity are significant health problems in the United States as well as globally. Recent statistics from the U.S. Centers for Disease Control and Prevention demonstrate that 62% of American adults are either overweight or obese. 1 Additionally, the World Health Organization (WHO) estimated that, in 2005, ∼1.6 billion adults aged 15 years and older were overweight and 400 million were obese worldwide. WHO projects that this number will increase, with 2.3 billion adults overweight and 700 million obese by the year 2015. 2
One contributing factor to these high numbers is a lack of physical activity (PA) required for maintaining health. Data from the United States show that 33% of adults are inactive and only 35% engage regularly in leisure-time PA. 1 Similarly, WHO estimates that at least 60% of individuals worldwide do not engage in the recommended amount of PA, 3 which, for both WHO and the U.S. Department of Health and Human Services, is either a minimum of 150 min of moderate-intensity PA per week or 75 min of vigorous-intensity PA, or an equivalent combination of both. 4,5 Engaging in PA can have numerous health benefits; in addition to addressing obesity, it can aid in the prevention of many chronic diseases, such as cardiovascular disease, diabetes, cancer, and hypertension, as well as in the prevention of premature death. 6
Considering the positive outcomes linked to PA, it is easy to understand why many have attempted to encourage PA among various populations. E-health is an increasingly popular method of carrying out PA interventions, as it capitalizes on the prevalence of technology, can cover far distances, improves access, and allows for more flexibility in treatment. Norman et al. 7 conducted a systematic review of PA and dietary change interventions through the year 2005, analyzing 13 peer-reviewed articles describing PA interventions, which utilized various technologies, such as e-mail, interactive compact discs (CD-ROMs), or Web sites. Overall, results showed a need for more rigorous research in the evaluation of e-health interventions.
This research seeks to continue where Norman et al. 7 left off and is a systematic review of e-health PA interventions from January 2006 to November 2010. Because of the increase in frequency of published e-health articles since then, this review is more exclusive, as it limits the sample to only PA interventions while leaving out those examining dietary change, and it also focuses only on studies employing randomization to see whether more conclusive findings have been demonstrated. The following research questions guided this systematic review and aided in the assessment of the current state of e-health PA interventions:
RQ1: What is the quality of the study designs used in the PA interventions?
RQ2: What are the characteristics of the PA interventions, in terms of their technologies used, intervention periods, and theories utilized?
RQ3: What are the characteristics of the PA interventions' samples?
RQ4: Do the outcomes of the PA interventions provide support for e-health as an effective intervention medium?
The remainder of this article will explain the methods used in this review to answer these research questions, provide an overview of the studies examined, discuss the results, and address the implications of the findings.
Materials and Methods
Article Collection
A systematic literature review was conducted to locate scientific peer-reviewed journal articles detailing e-health PA interventions from January 2006 to November 2010. Searches were performed in the Web of Science and PubMed databases, with each search including a technology keyword, a topic keyword, and the word “intervention.” Technology keywords included “Internet,” “web,” “multimedia,” “personal digital assistant,” “mobile phone,” “cellular phone,” “computer,” “email,” “e-mail,” and “game.” Topic keywords included “physical activity,” “exercise,” “weight loss,” “obesity,” and “body mass index.” Additionally, a citation search was conducted by examining the reference lists of the selected articles as well as by examining articles that cited those that were collected.
Selection Criteria
Inclusion and exclusion criteria are listed in Table 1. The criteria were chosen to ensure that only rigorous studies were included in the analysis. Additionally, the analysis contained PA-only interventions to narrow the realm of study, examine approaches specifically related to this topic area, and analyze their overall effectiveness on the PA of individuals.
Inclusion and Exclusion Criteria
PA, physical activity; PDA, personal digital assistant.
Data Synthesis
Each study was rated for its design quality on the basis of nine methodological characteristics, taken from the analysis of e-health interventions by Norman et al. 7 These criteria include whether or not the study employed individual randomization, had a control group, isolated the technology (i.e., control group participants were not exposed to any form of intervention materials), used a pre- and post-test, had retention ≥80%, demonstrated equivalence between groups at baseline, accounted for missing data, calculated the necessary sample size, and utilized validated measures. For the retention ≥80% criterion, study participation was assessed at the last recorded measurement. Details of these criteria can be found in Appendix A. Each study's score was then calculated as a percentage based on how many of the criteria it met out of the maximum criteria possible. The studies were rated independently by one researcher, and a check was performed on all of the articles by another researcher to establish reliability. Disagreements were discussed until differences were reconciled.
Additionally, the studies were assessed according to the amount of support they provided for e-health interventions. This analysis distinguishes between pure control studies (in which the technologies were completely isolated) and studies with comparison groups and was based upon the last reported measurement of PA in the study. Studies were classified according to whether or not significant differences were reported and whether these differences were in relation to pure control or comparison groups. For more details on this comparison, see Table 2.
Study Design Scores and e-Health Support
Validated measures were assessed only for the PA measurement.
Difference is for leisure-time PA only (not overall PA).
Validated measure was coded as “N” because authors did not feel the reliability calculations established true validity of the measure.
Analysis did not allow for comparison between pure control and e-health technology, although there was a pure control group present.
Spontaneous users not included in assessment of baseline equivalence.
++, e-health intervention resulted in significant improvements in PA compared with pure control group; = =, e-health intervention resulted in no significant differences in PA compared with pure control group; +, e-health intervention resulted in significant improvements in PA compared with nontechnology comparison group; =, e-health intervention resulted in no significant differences in PA compared with nontechnology comparison group; 0, support for e-health indeterminate from results or study had a lack of a pure control or nontechnology comparison group; Y=yes; N=no; UK, unknown.
Results
Following the literature search, 185 studies were initially selected and abstracts were obtained for potential inclusion. After applying the inclusion and exclusion criteria to the studies based on the abstracts, 36 remained. Full texts of these articles were acquired, and upon further examination and refinement of the criteria, nine more studies were removed from the analysis. Reasons for removal included a lack of randomization, follow-ups to articles already included, a lack of discussion of control group outcomes, a lack of actual PA levels reported in the results, and descriptions of only a subsample of an intervention already discussed in an included article. These attributes were then added to the inclusion and exclusion criteria. Following this process, four more articles were added based on a citation search of the included publications. This left a total of 31 articles 8 –38 to be included in the final analysis.
RQ1: Design Quality Scores
The average design score across all of these technologies was 66.7%, reflecting an average of six of the nine design quality criteria being met. The range of scores was from 44.4% to 88.9%. Full results of the design quality assessment can be found in Table 2. All of the studies met the control group and pre- and post-test criteria, and a majority met the individual randomization (26/31, 83.9%), the validated measures (25/31, 80.6%), and the equivalent baseline groups (23/31, 74.2%) criteria. Conversely, only 8 studies met the “isolate technology” criteria and only 11 calculated the sample size for their intervention.
RQ2: Intervention Characteristics
The included studies used a variety of technologies: 10 used Web sites (without e-mail), 3 utilized e-mail only, 3 used mobile phones, 3 used digital games, 3 used offline computer-tailored interventions powered by software or a CD/digital video disc (DVD)-ROM, and 9 used multiple technologies (often a combination of e-mail and Web sites). Intervention periods ranged from 2 weeks to 13 months. Twenty-seven of the studies (87.1%) incorporated theory into their interventions; the most common theories mentioned were social cognitive theory (13/31, 41.9%), the transtheoretical model (11/31, 35.5%), and either the theory of reasoned action or the theory of planned behavior (9/31, 29.0%). More details on the interventions can be found in Table 3.
Summary of e-Health Physical Activity Studies
This article detailed two separate studies; only study 2 assessed for review.
Int., intervention; Ctrl., control; Avg., average; BMI, body mass index; Sig., significant/significantly/significance; Cond., condition; TTM, transtheoretical model; SCT, social cognitive theory; MVPA, moderate to vigorous physical activity; IPAQ-A, adolescent adaptation of international physical activity questionnaire; TPB, theory of planned behavior; IPAQ, International Physical Activity Questionnaire; HBM, health belief model; AAQ, Active Australia Questionnaire; SMS, short message service; AWAS, Australian Women's Activity Survey; FPAQ, Flemish Physical Activity Questionnaire; SoC, stage of change; ANCOVA, analysis of covariance; ELM, elaboration likelihood model; CHAMPS, Community Healthy Activities Model Program for Seniors questionnaire; CVD, cardiovascular disease; PAR, 7-day Physical Activity Recall; YRBSS, CDC Youth Risk Behavior Surveillance System; TRA, theory of reasoned action; PAQ-C, Physical Activity Questionnaire for Children; GLTEQ, Goldin Leisure Time Exercise Questionnaire; PMT, protection motivation theory; SWET, self-report walking and exercise tables; BRFSS, U.S. Behavioral Risk Factor Surveillance Survey; AQuAA, Activity Questionnaire for Adolescents and Adults; HEPA, Health-Enhancing Physical Activity (Switzerland); AP, action planning; PAM, personal activity monitor.
RQ3: Sample Characteristics
Overall, the sample sizes for interventions discussed within the articles ranged from 20 to 1,531 individuals. Nine studies were conducted with child or adolescent populations, 7 with college students, 13 with adult populations, and 2 with middle-aged individuals. Additionally, seven studies included only females, and one study included only males. Seven of the studies were exclusively targeted toward overweight, sedentary, or insufficiently active populations.
RQ4: Intervention Outcomes
This research focused only on the reported PA outcomes of the studies, although many other types of outcomes were also reported (e.g., psychological, weight loss, and fitness). The studies used a variety of measures to assess participants' PA. Seven studies objectively measured PA through the use of either a pedometer or an accelerometer, whereas the rest relied on self-report. The most common questionnaire used to measure PA was the International Physical Activity Questionnaire (7/31, 22.6%), which has various forms.
The analysis of the studies' support for e-health interventions revealed that seven of the studies could be analyzed for their support when compared with a pure control group (22.6%). Of these, none reported that the e-health interventions fared worse than the control, three reported no significant differences between the groups, and four reported outcomes in favor of the e-health technology. Thirteen studies could be analyzed for e-health support when compared with a comparison group (41.9%). Of these, four reported outcomes in favor of the e-health technology, nine reported no significant differences between groups, and none reported that the e-health interventions fared worse than the comparison group. Eleven studies (35.5%) could not be analyzed for e-health support because of indeterminate results or a lack of necessary information reported. Full results of this analysis are shown in Table 3.
Discussion
This systematic review demonstrates that it is becoming increasingly common to use e-health technologies to address PA. The analysis by Norman et al. 7 included only 13 PA articles, and this review included 36 while using more stringent inclusion/exclusion criteria and a smaller time frame. Thus, as technology has become more ubiquitous and accessible, the use of e-health to facilitate PA interventions has increased and will likely continue to do so.
Study Designs and Quality
Overall, the articles analyzed demonstrate high-quality methods in terms of their use of pre- and post-tests, individual randomization, validated measures, and equivalent baseline characteristics of groups. However, there remains room for improvement regarding study design, as none of the articles analyzed met all of the study design criteria. Future research should use power analyses to calculate sample sizes, as the scarce use of this method may explain why a majority of the articles were not able to find statistically significant differences between intervention groups. Future work would also benefit from comparing e-health technologies to pure control groups, as not doing so makes it difficult to account for other variables that might be affecting PA outcomes and hence weakens arguments for the intervention's effectiveness. A lack of pure control groups also inhibits the synthesis of research, as the comparison groups are not consistent across studies. It should be noted that pure control groups are difficult to achieve in “real-world” settings because of the potential for contamination and exposure to other health materials. This can be addressed, however, by utilizing groups that are unlikely to interact and by requiring that participants not participate in any other PA programs during the study period.
Intervention Characteristics
The interventions analyzed were very diverse in terms of their target audiences, sample sizes, technologies utilized, and intervention lengths. For example, targeted age groups ranged from 7-year-old children to adults in their 60s, and both healthy and underactive individuals were included within study populations. This increasingly widespread and varied use of e-health to facilitate PA interventions is positive, as it is evidence of continued innovation in this field and shows the potential of technology to bring healthcare services to many different population segments. However, improvements could be made in terms of involving more racial and ethnically diverse audiences in interventions. Of those studies that reported racial/ethnic demographics, almost all had a Caucasian majority. Such homogenous samples could have skewed the results and makes it difficult to generalize them to wide populations.
Another positive finding is that so many of the articles utilized theory to guide their interventions. This shows that researchers are using past literature to inform their efforts, it furthers the field of e-health by integrating it with established research traditions, and it will ultimately allow studies to build upon one another more effectively if they are using consistent frameworks and constructs between interventions. Additionally, research investigating Internet-based health interventions found that a more extensive application of theory was associated with greater effect sizes 39 of behavior.
One way in which future e-health PA interventions could be improved is by increasing the use of objective measurements of PA, such as accelerometers, pedometers, or body bugs/sensors, as only seven studies analyzed did so. It has become much easier to use such methods in recent years, as these technologies have become more advanced and have decreased in price, which is especially the case for pedometers. 40 Objective measurements are ideal, because they increase reliability of measurement by decreasing the potential for human error. Also, more widespread use of these technologies would allow for easier research synthesis because of the consistent measurement methods.
Support for E-Health
The support provided by the analyzed studies for e-health technologies is mixed, at best. It is disappointing that 11 of the studies could not be analyzed in terms of their support for e-health, which shows a need for more rigorous methods and reporting of results. Of the studies that compared e-health with pure control groups (n=7), just over half showed a significant positive difference favoring e-health. The rest of these studies showed no difference between the e-health and control groups, meaning that the interventions were no more effective than everyday life for the participants, which is a discouraging finding for the use of e-health in this area. In examining the studies wherein the primary e-health technologies were compared with another form of intervention (n=13), either print or technology based, results showed that only 31% led to significant positive differences between the groups in favor of e-health. The other 69% of this group showed that e-health conditions were not significantly different than others in terms of PA outcomes. Altogether, these results make it difficult to argue for the benefits of e-health when compared with other forms of interventions.
However, these findings do not mean that e-health technologies should be entirely dismissed as a potential medium for PA interventions. The equivalence found between e-health and comparison groups is not necessarily a negative finding, because if e-health is equivalent to more traditional intervention forms, then their use could enable expanded reach, accessibility, and efficiency without sacrificing positive outcomes. Also, none of the studies showed that e-health conditions fared worse than others, implying that these interventions can at least do no harm. Finally, the presence of studies with results supporting e-health compared with both pure control and comparison groups show that such interventions can be successful. It is difficult to determine, however, what characteristics of e-health interventions are more likely to lead to positive PA outcomes, as no significant patterns emerged among these studies in terms of their technologies used, audiences, or intervention lengths.
Limitations
The analysis of PA outcomes conducted in this review was always based on the last measurement reported and earlier survey results were not assessed; thus, in many cases, the outcomes reported here refer to a follow-up survey conducted some time after the intervention was completed. This may have led to the review's lack of positive results, as it is a difficult task to maintain intervention effects long after the participants have been exposed to it. However, only three of the studies analyzed had positive outcomes that were lost at follow-up assessments, 25,32,37 so it appears as though the analysis of last measurements reported did not have a significant impact on the results. Another potential limitation is the fact that this study only examined quantitative PA outcomes that were expressed in terms of levels of activity. Many of the articles also reported psychological outcomes related to PA, such as self-efficacy or stage of change, but these effects were not examined in this analysis to narrow the scope of study. Future research should examine the effects on such variables across studies, as it may point to the more subtle benefits of e-health PA interventions.
Conclusions
This systematic review of e-health interventions for PA has demonstrated that the use of e-health in this field is increasing, but that there is much room for improvement in the studies that are conducted. Research would greatly benefit from more systematic and rigorous methods, which would make it easier to synthesize results from multiple studies and better determine the progress of this field. The suggestions for improvement in future research put forth by this review include the use of power analyses to calculate sample sizes, pure control groups instead of comparison groups, more racial and ethnically diverse samples, theory to guide interventions, and objective measurements of PA.
Overall, in terms of support for e-health technologies in facilitating PA interventions, the results of this review cannot provide definitive evidence for the effectiveness of this medium. However, this does not mean that researchers should stop exploring the potential of e-health for PA, as there were some studies that did show support for it. Instead, this review points to the need for continued research and refinement of approaches. It is the authors' hope that future researchers can learn from this review and the weaknesses can be identified to pursue more success in this line of research, as PA is an important issue that needs addressing.
Footnotes
Disclosure Statement
No competing financial interests exist.
Study Quality Coding Criteria a
| TABLE HEADING | SCORING CRITERIA |
|---|---|
| Individual randomization | Were participants randomized to study conditions? If so, was randomization at the individual level? Stratified and blocked randomization is acceptable. Studies that used individual randomization combined with a small proportion of randomized matched pairs are also considered YES. Appropriately designed and powered group randomization would also be acceptable if group was also unit of analysis. Individual randomization is NO when the authors fail to mention randomization, specify that another method of assigning group status was used, or randomize at the group level and analyze at the individual level. |
| Control group | Did the study include a comparison group? Comparison group could be a no treatment, treatment as usual, or alternate treatment group. |
| Isolate technology | Did study design allow for a test of effectiveness of the technology? For example, Web-based delivery versus no treatment. To isolate the technology, the authors had to test the technology alone and compare with a group with no technology |
| Pre/post-test design | Was assessment of behavior completed preintervention and postintervention? |
| Retention | Was study retention at least 80% of subjects who initially agreed to participate in the study? Retention is calculated for the entire sample and not by group. For studies that did not report retention or dropout rates, retention can be calculated using the sample sizes used for analyses (e.g., 300 randomized, but only 250 were included in analyses=83.3% retention). |
| Baseline groups equivalent | Were tests conducted to determine if groups were equivalent at baseline on important variables (e.g., gender, age, weight)? If no tests mentioned, then UK. If subset of tests indicated any group differences at baseline, then=NO. |
| Missing data | Were analyses conducted with consideration for missing data that maintains the fidelity of the randomization (e.g., intent-to-treat, imputation). Listwise case deletion (completer analysis)=NO, if only analysis conducted. If 100% retention then completer analysis is appropriate=YES. If authors compared the “dropped subgroup” to the selected or randomized sample, but did not consider the impact of the dropped subgroup on randomization (e.g., ITT or imputation), then code as NO. |
| Sample size calculation | Was power analysis reported to determine study sample size? |
| Validity of measures | Did description of measures include reliability and validity information? If reference or coefficients, then YES. If well-established measure that is known to be validated, then YES. For objective measures without validity evidence, if the objective measure is used as a proxy (e.g., food receipts for nutrition intake), then NO. If the objective measure is used as a direct measure of behavior (e.g., food receipts for food purchase), then YES. If validity not reported and measure unknown, then UK. |
| Total | Number of Yes answers out of number possible |
Taken and adapted from Norman et al.7 Additions are written in boldface.
Correlates identified from Sallis et al.41
Correlates identified from Trost et al.42
UK, unknown; ITT, intention-to-treat analysis.
