Abstract
Background:
Early Treatment Diabetic Retinopathy Study (ETDRS) seven-standard-field color stereoscopic retinal photography (ETDRS photos) has been a gold standard for determining diabetic retinopathy (DR) severity. The Automated Retinal Imaging System (ARIS™, model 110, Visual Pathways, Inc., Prescott, AZ) acquires seven-sequential color stereoscopic digital images (ARIS images) by a semiautomated technician-run process generally corresponding to ETDRS photos. We assessed the correlation between a single semiautomated ARIS imaging session without any re-imaging and ETDRS photos performed by a certified photographer for the determination of DR severity.
Methods:
Two independent masked readers graded mydriatic ARIS images and ETDRS photos. A third masked retinal specialist adjudicated discrepancies. Correlation between the two modalities was compared using weighted-κ statistics.
Results:
We evaluated 211 eyes of 106 patients with varying levels of DR. Partially ungradable images were present in 3.4% of ETDRS photos versus 31.8% of ARIS images. Exact agreement and agreement within one level between ETDRS photos and ARIS images using only completely gradable image sets occurred in 69% (κ=0.81) and 90% of cases, respectively. Exact agreement for clinically significant macular edema was 92.1% (κ=0.59). There was 100% agreement for eyes with high-risk proliferative DR. Within one level of DR severity, 100% agreement occurred for the following: questionable nonproliferative DR (NPDR), moderate NPDR, and severe NPDR.
Conclusions:
Results suggest that semiautomated ARIS images compare favorably with ETDRS photos when full image sets can be obtained; however, partially ungradable image sets occurred almost 10 times more frequently with ARIS images than with ETDRS photos. In the two-thirds of cases where ARIS images can be utilized, ARIS can obtain retinal images comparable to ETDRS photos while requiring less highly trained personnel than generally needed for standard ETDRS photos.
Introduction
Because of these limitations of ETDRS photos, alternative methods of fundus photography have been investigated. 4,5 Digital imaging modalities, including digital photography, compare favorably with 35-mm slides and are used in clinical practice and in numerous clinical trials, including the Diabetic Retinopathy Clinical Research network and the Epidemiology of Diabetes Interventions and Complications studies. 4 –8 To date, these modalities include mydriatic and nonmydriatic retinal cameras, digital-video imaging, and other modifications to ETDRS standard photos, such as nonstereoscopic (monoscopic) images and the acquisition of fewer fields than the seven standard retinal fields. 3,5,7,9 –11
The Automated Retinal Imaging System (ARIS™, model 110,Visual Pathways, Inc., Prescott, AZ) is a digital imaging system that acquires seven sequential color stereoscopic digital images (ARIS images), approximating the ETDRS photo fields, by a semiautomated, technician-run process. Because of the automation of the process with ARIS images, only minimally trained technicians are required for acquisition as opposed to highly trained, certified retinal photographers. If accurate and effective, this semiautomated process may represent significant operational cost savings for clinical, clinical trial, and telemedicine programs.
ARIS acquires seven standard stereoscopic fields approximately as defined by the ETDRS protocol. However, the feasibility, effectiveness, and utility of the ARIS images have yet to be compared directly with those of ETDRS photos. The present study compares the agreement of these two imaging modalities with clinical DR severity and DR lesion-level findings.
Subjects and Methods
This single-center, masked, multireader study evaluated the agreement in assessing severity of DR at the retinal lesion level between ARIS images and ETDRS photos. Subject eligibility was determined from medical record review of the most recently diagnosed clinical level of DR. Based on the most recent clinical exam for DR, subjects were selected sequentially to ensure adequate distribution of various levels of DR, ranging from no DR (ETDRS level 10) to high-risk proliferative DR (PDR) (ETDRS level 75). At least 80% of subjects enrolled had documented DR on clinical examination, at least 50% had varying levels of nonproliferative DR (NPDR), and at least 30% had PDR. All patients were recruited during regularly scheduled appointments from the clinic population of the Beetham Eye Institute at the Joslin Diabetes Center, a tertiary eye care center specializing in eye disease due to diabetes, in Boston, MA. Inclusion criteria were age 18 years or older, type 1 or type 2 diabetes mellitus as defined by the American Diabetes Association, and willingness to sit through photography and ARIS imaging sessions. Patient exclusion criteria were a history of conditions in either eye that may preclude pupil dilation, use of eye drops (mydriatic or miotic) that would alter pupil size or reactivity, and prior laser treatment or media opacities precluding adequate imaging of the retina. The research was conducted according to the tenets of the Declaration of Helsinki and received Joslin Committee of Human Studies review, and patients consented to sit through photography and imaging sessions after explanation of the nature of the study.
Two hundred eleven eyes of 106 patients were evaluated in the study. Patients were recruited to ensure an adequate representation of the various levels of DR, ranging from no DR to high-risk PDR. The recruitment of patients was weighted towards those with mild to moderate levels of DR because more severe changes of DR would most likely be easier to detect. Therefore, detection of the more subtle changes of mild to moderate NPDR would provide a more rigorous test of lesion detection capability by the ARIS imaging system.
Each patient underwent a comprehensive eye exam including pupil dilation followed by a dilated retinal exam by a retina specialist. During the same visit, patients underwent imaging by an ophthalmic photographer certified for capturing ETDRS photos to obtain seven-standard-field stereo digital images of each eye. Mydriatic ARIS imaging was conducted by an ETDRS-certified photographer using the ARIS model 110 (2003), a semiautomated imaging system that has 26 internal fixation targets, seven of which are automatically and sequentially turned on during the preprogrammed ETDRS7 imaging procedure. For this study, ARIS images were taken in seven predetermined fields using the preprogrammed ETDRS7 imaging procedure. Multiple wavelengths that yield differential light penetration into the retina are used. Multilingual voice prompts in the system directed the patient to fixate for each of the seven fields. Of important note is that only one imaging attempt was made with the ARIS system, and no attempt was made to retake ARIS images of poor quality. Re-imaging of poor quality images with the ARIS system would disrupt the automated image capture and require a trained photographer to identify ungradable images, which is contrary to the purpose for which the system was designed. For each field, high-resolution digital stereo pairs of images were collected using single exposures of approximately 100 ms in duration. All ARIS digital images were stored electronically in a proprietary format that could be automatically exported using any of the common image formats. Color images were formed by combining the red and green wavelength images utilizing retinal feature registration and a proprietary method of generation and addition of a blue wavelength image.
After adequate pupil dilation, ARIS imaging was performed followed by seven-standard-field stereo ETDRS photography taken by a study-certified ETDRS photographer on 35-mm slide film. ETDRS photos were processed through the Beetham Eye Institute Photography Department according to standard clinical practice and ETDRS Reading Center protocol.
ARIS images were displayed and graded on a 21-inch Sony (Tokyo, Japan) Trinitron® cathode ray tube color monitor at a resolution of 1,280×1,024×24 bits. Images were viewed monoscopically and stereoscopically and as “red free” images (digital filtering of red and blue channels). Stereoscopic viewing of ARIS images was achieved using liquid crystal display shuttered goggles (NuVision 60GX®, MacNaughton, Inc., Beaverton, OR) where the left and right images of the stereo pair were sequentially displayed on the display monitor at 140 Hz, allowing a flicker-free three-dimensional image to be perceived. ARIS images were graded using two methods: Color Only (ColorOnly) and Color Plus (ColorPlus). Grading by the ColorOnly method involved grading the captured ARIS color images without any image adjustments made by the reader. The ColorPlus images were graded after the reader was free to adjust contrast, alignment, and color to assist in grading. Grading of 35-mm slide ETDRS images was performed on a standard slide light box through Donaldson stereo viewers according to ETDRS protocol.
Each eye was evaluated for the presence and severity of specific index retinal lesions and degree of diabetic macular edema according to the ETDRS extension of the Modified Airlie House Classification of Diabetic Retinopathy. 1 These index lesions were hemorrhages and/or microaneurysms (HMa), venous beading (VB), intraretinal microvascular abnormalities (IRMA), neovascularization elsewhere (NVE), neovascularization of the disc (NVD), lesions of diabetic macular edema and clinically significant macular edema (CSME), and hard exudates (HE). Each eye was assigned a clinical level of DR severity based on these findings. 1
One grader (J.D.C.) graded all ARIS images, and a second grader (T.J.M.) graded all ETDRS images. Grader disagreements were adjudicated by a third, independent retinal specialist (L.P.A.). All graders were masked to patient demographics, medical information, results of the other imaging modalities, and results from the clinical exam by a retina specialist. The two initial graders entered findings directly into a patient-specific, de-identified, electronic findings template modified from the Wisconsin Reading Center ETDRS retinal evaluation form. 1 The results were recorded on the electronic medical record templates by ETDRS photographic field, and these templates were electronically verified to ensure that there were no omissions or internal inconsistencies. A separate section was included on the template for identifying nondiabetic retinal lesions and for grader comments.
Statistical analysis
The adjudicated grades derived from the ETDRS images were considered the reference standards. The clinical ETDRS level of DR severity and the presence/absence of ETDRS index retinal lesions were compared. Agreement of clinical ETDRS level of DR severity between ARIS and ETDRS images was cross-tabulated, and both simple kappa (κ) and weighted κ statistics (κ
w) were calculated. A weight of 1 was used for exact agreement, disagreements between absence and definite presence of a characteristic received 0 weight, and those between absence and questionable presence (wherein the evaluator was between 50% and 90% certain of the presence of the characteristic) received a weight of 0.75. Eyes with photographs classified as ungradable were excluded from the analysis. Guidelines for interpretation were based on those of Landis and Koch
12
as used in ETDRS Report Number 10: 0.0–0.2=slight agreement, 0.21–0.40=fair agreement, 0.41–0.60=moderate agreement, 0.61–0.80=substantial agreement, and 0.81–1.00=almost perfect agreement. Severity of DR and ETDRS retinal lesion grading comparisons were tested using κ statistics for agreement, sensitivity/specificity, and positive/negative predictive values. All statistical analyses were performed using SAS (version 9.2) (SAS, Inc., Cary, NC) and the VassarStats Website for statistical computing (
Results
Two hundred eleven eyes of 106 patients with diabetes mellitus were evaluated. One patient had a prosthetic eye. Baseline patient characteristics are shown in Table 1. DR severity and lesion prevalence derived from ETDRS photos are shown in Table 1 and Figure 1, respectively. The prevalence of DR and DR lesions did not reflect the prevalence of lesions or severity distribution of DR in the clinic, but rather was intentionally enriched with subjects having DR so as to more rigorously test the camera system. In this study only 11% of subjects enrolled did not have DR, in contrast to general clinic populations, which vary from 60% to 80% with no DR.

Diabetic retinopathy retinal lesions: 35-mm Early Treatment Diabetic Retinopathy Study photos compared with ARIS ColorPlus (n=211 eyes). CSME, clinical significant macular edema; HMA, hemorrhage/microaneurysm; HE, hard exudates; IRMA, intraretinal microvascular abnormalities; NVD, new vessels at the disc; NVE, new vessels elsewhere.
CSME, clinical significant macular edema; ETDRS, Early Treatment Diabetic Retinopathy Study; NPDR, nonproliferative diabetic retinopathy; PDR, proliferative diabetic retinopathy.
Some images were ungradable for specific lesions in a particular field because of either poor overall image quality or inability to grade at least three disc areas of retina within a single field. These ungradable images occurred in at least one retinal quadrant in 3.4% of ETDRS as compared with 31.8% of ARIS ColorPlus images. Excluding the eye image sets containing ungradable images in one or more retinal quadrants, exact agreement for DR severity occurred in 68% of cases (95% confidence interval, 62–75%). As shown in Table 2, agreement within one level of clinical DR severity was observed in 88% of cases (95% confidence interval, 84–93%). Exact agreement occurred for all eyes with high-risk criteria. In addition, agreement within one level of DR severity occurred for all eyes with questionable NPDR, moderate NPDR, or severe NPDR on ETDRS photos.
The measures graded here did not include photos that cannot be graded by ARIS and 35-mm ETDRS film. n (included)=198.
Simple κ statistic, 0.59 (95% confidence interval 0.50–0.67); weighted κ statistic (linear scale), 0.77 (95% confidence interval 0.70–0.83).
Perfect agreement: 68% (95% confidence interval 62–75%).
Within one-step agreement: 88% (95% confidence interval 84–93%).
DR, diabetic retinopathy; NPDR, nonproliferative diabetic retinopathy; PDR, proliferative diabetic retinopathy.
Table 3 and Figure 2 show the agreement of ARIS ColorPlus with ETDRS photos at different DR levels. Comparing eyes with no to mild NPDR with moderate NPDR or worse, ARIS photos showed agreement with ETDRS photos 92% and 81% of the time, respectively (substantial agreement, κ=0.65±0.08 and 0.62±0.05, respectively). Comparing eyes with very severe NPDR or better to eyes with PDR (including PDR with less than high-risk criteria, PDR with high-risk criteria, and quiescent PDR), ARIS photos showed near perfect agreement, matching ETDRS photos 93% of the time (κ=0.85±0.04). ARIS photos had a positive predictive value of 0.93 for ruling-in the presence of any DR. The negative predictive value of ARIS photos in ruling-out moderate NPDR or worse was 0.96.

Diabetic retinopathy threshold: 35-mm Early Treatment Diabetic Retinopathy Study photos compared with ARIS ColorPlus (n=198 eyes). NPDR, nonproliferative diabetic retinopathy; PDR, proliferative diabetic retinopathy.
Eyes graded as questionable nonproliferative diabetic retinopathy (NPDR) were considered to have NPDR.
CI, confidence interval; DR, diabetic retinopathy; NPV, negative predictive value; PDR, proliferative diabetic retinopathy; PPV, positive predictive value.
Lesion-by-lesion comparisons of ARIS photos to ETDRS gold standard photos are shown in Figure 1 and Table 4. Agreement with ETDRS photos was 60–90% for both ColorPlus and ColorOnly. Correlations had substantial κ agreement for NVD (κ=0.69±0.1), moderate agreement for CSME (κ=0.59±0.12), HMa (κ=0.54±0.02), NVE (κ=0.56±0.05), and HE (κ=0.47±0.04), fair agreement for IRMA (κ=0.29±0.03), and slight agreement for VB (κ=0.18±0.04). ARIS ColorPlus detected key DR lesions as accurately as ETDRS film photos in 94% of cases of NVD, 92% of NVE, 92% of CSME, 87% of HE, 76% of IRMA, 77% of VB, and 62% of HMa (Fig. 1 and Table 4). ARIS photos demonstrated very high negative predictive values for nearly all lesions (mean, 0.93; range, 0.84–0.99 [see Table 4]). ColorPlus and ColorOnly grading had near perfect agreement for all lesions and thus did not add substantially to ColorOnly's ability to detect lesions or match ETDRS grading.
CI, confidence interval; CSME, clinically significant macular edema; HE, hard exudate; HMA, hemorrhage or microaneurym; IRMA, intraretinal microvascular abnormality; κ w, weighted kappa statistic; NPV, negative predictive value; NVD, neovascularization of the disc; NVE, neovascularization elsewhere; PPV, positive predictive value; VB, venous beading.
Discussion
This was a single-center, masked instrument, category 3 validation study that adhered to the recommendations of the American Telemedicine Association guidelines for telehealth programs for diabetic retinopathy. 13 A category 3 American Telemedicine Association validation study demonstrates the capability to match dilated ETDRS photos for the evaluation of DR. All persons with diabetes may potentially benefit from such an evaluation, and this study design and recruitment were tailored to potentially assess performance in this diverse population.
In this study, when gradable images are obtained, the ARIS imaging system produced results similar to ETDRS photos for the evaluation of clinical DR severity. ARIS images also showed high negative predictive values in ruling out DR. The agreement obtained with this automated imaging system was comparable to those from previous reports using nonautomated technician-driven digital imaging systems. 14 However, as discussed below, the substantial percentage of ungradable images may affect efficiency and performance of the current ARIS systems in larger teleophthalmology implementations.
Compared with ETDRS photos, the ARIS system had nearly 10 times more ungradable retinal fields, and ungradable fields approached one-third of all ARIS images (3.4% ETDRS, 31.8% ARIS). This high percentage of ungradable images could clearly impact efficiency and usefulness of the ARIS system in clinical or research settings. However, when evaluating gradable fields present for each eye, DR severity could be determined by the ARIS system in 93.4% of eyes, which is similar to that reported in other teleophthalmology systems. 15,16
When comparing ARIS ColorPlus with ETDRS photos in lesion-by-lesion analyses, ARIS ColorPlus detected ETDRS retinopathy lesions to a similar extent as ETDRS photos in the majority of the cases. More than 90% of NVD and NVE was detected by ARIS. ARIS ColorPlus and ETDRS photos showed a high level of agreement with respect to clinically significant macular edema at 92% (κ=0.59±0.12). The sensitivity and specificity for detecting sight-threatening PDR were 92% and 94%, respectively. The ungradable/technical failure rate with ARIS images was 6.6%. The ARIS system compares favorably with the Australian National Health and Medical Research Council guidelines (approximately 60% and 90–95%) and the United Kingdom's Exeter Standards (approximately 80% and approximately 95%) in terms of sensitivity and specificity, respectively.
The lesions having the lowest κ values were VB (0.18±0.04) and IRMA (0.29±0.03), which also were among the least prevalent lesions at 10% and 23%, respectively. Other lesions had a high percentage of exact agreement (VB=77%, IRMA=77%) and high negative predictive values (VB=0.93, IRMA=0.84). If these lesions were more prevalent, it is possible that a higher κ value may have been observed as has been previously described. 17,18
Seven-standard-field ETDRS 35-mm stereoscopic fundus photography has been the gold standard for evaluating DR severity; however, obtaining ETDRS photos has certain drawbacks, including cost and the requirement for certified photographers. Previous studies have investigated alternatives to using standard ETDRS photography, including using monoscopic digital imaging and photographing fewer than seven fields. Although these methods have been shown to be time saving, they may be associated with decreased specificity for assessing certain lesions such as retinal thickening and elevated neovascularization. 19,20 The key advantage of the ARIS system is that it does not need a skilled photographer to image a retinal area equivalent to ETDRS photos. Based on the results of this study, the ARIS system compares favorably with ETDRS photos for evaluating DR when gradable images are obtained. The ARIS system replaces the variability that may arise with multiple operators with a standardized automated protocol that can achieve results consistent with the recommendations of established standards and guidelines.
Results using ARIS, when gradable, compared favorably with mydriatic ETDRS film photography obtained by certified personnel for determining clinical DR severity and individual DR lesions. These findings suggest that ARIS could provide a less labor-intensive and lower-cost alternative to ophthalmic photographer–obtained ETDRS protocol imaging, especially if a higher percentage of gradable images could be obtained across a broader population. Individual lesion identification in ARIS images had both a high specificity and high negative predictive value, indicating lesions graded as “not present” by ARIS were unlikely to have been missed. These results suggest that when the extent of ungradable images is not a limiting factor, ARIS images may be useful for determining retinopathy level as well as the presence and severity of individual ETDRS retinal lesions.
The ARIS is a semiautomated system that does not require significant operator input. Although only one operator used the ARIS in this study and the impact of multiple operators was not explicitly evaluated, substantial intra-operator variability is not expected based on the automated operation of this machine. The semiautomation of this retinal imaging process may improve access to retinal imaging because of less training requirements and may optimize workflow, thus enhancing efficiency. However, the ARIS imaging system could be optimized by substantially reducing the number of ungradable fields. Additional future innovations such as applying automated software algorithms and computer lesion detection might be applied to this technology once this hurdle is overcome. Such features combined with the limited expertise needed to run the system would be a significant benefit to many teleophthalmology programs. If future generation imaging systems incorporate a similar semiautomated process with enhanced image quality and gradability, they would likely have a significant positive impact on clinical care.
With ungradable image rates of over 30%, there may be a significant impact on efficiency and usefulness of the ARIS system. However, despite nearly 10 times more ungradable fields in ARIS images as compared with ETDRS photos (31.8% vs. 3.4%), the overall ungradable rate for an eye imaged with ARIS was 6.6%. The better correlation at the eye level is likely due to the acquisition of multiple retinal fields as is done with ETDRS photos. Unlike most teleophthalmology programs, which take one or two images of the disc and macula, the ARIS acquires multiple retinal images, including the optic disc, macula, temporal macula, and four peripheral retinal fields. Such coverage may lead to substantial agreement with a low overall ungradable rate despite having a substantial number of ungradable individual fields. Imaging of the retinal periphery is more technically demanding, and to achieve consistent results often requires a skilled retinal photographer. The ARIS acquires multiple retinal images that include the optic disc, macula, temporal macula, and four peripheral retinal fields without the need for a skilled photographer. The addition of the peripheral retinal fields compared with the usual one- or two-field retinal imaging results in a sevenfold increase in the number (1–2 vs. 14) and two- to threefold increase of retinal area (30–45° vs. 90°) that may be evaluated for diabetes pathology. Despite the number of ungradable individual fields with ARIS, there is substantial agreement with ETDRS photos and low ungradable rate for eyes overall.
In summary, when gradable images can be acquired using ARIS, they are comparable to ETDRS photos for determining clinical severity of DR based on individual ETDRS retinal lesions. Accuracy increases with more severe DR. Although this tool has the potential to reduce the need for a highly trained ophthalmic photographer to obtain the images and may be more time efficient, these benefits must be considered in light of nearly 30% of images being ungradable and a loss of sensitivity for VB and IRMA. Retinal imaging systems that automate the acquisition of multiple retinal fields may eventually prove to be a useful alternative to the labor- and resource-intensive ETDRS photography in large telemedicine programs where automation of workflow is becoming essential.
Footnotes
Author Disclosure Statement
No competing financial interests exist.
