The Role of Telemedicine in Strabismus Assessment: A Narrative Review and Meta-Analysis

Abstract

Purpose:

Strabismus is a common ocular condition requiring precise quantification of gaze deviation and qualification of strabismus category. Telemedicine refers to the use of technology to remotely diagnose and treat medical conditions. This narrative review aimed to assess the efficacy of a variety of telemedicine modalities for the assessment of strabismus. A secondary objective was to quantify overall accuracy, sensitivity, and specificity of automated methods using meta-analysis of available data.

Methods:

A literature search was conducted using the Ovid MEDLINE, Embase, and Cochrane Library data libraries. Keywords, including “strabismus,” “phoria,” “telemed*,” and “telehealth,” were used to locate relevant studies, with Medical Subject Headings terms, free text, and synonyms. No year restrictions were applied. Studies not in English were excluded. Risk of bias was assessed using the QUADAS-2 tool.

Results:

Thirty-four studies were included. All outcomes relating to accuracy and reliability of telemedicine versus a reference standard were extracted, as well as qualitative observations. High sensitivity, specificity, accuracy, and agreement were consistently shown across studies. Meta-analysis of two subsets featuring automated methods, for which relevant data were available, revealed a pooled accuracy of 0.877 (0.806–0.949), sensitivity of 0.856 (0.805–0.907), and specificity of 0.900 (0.845–0.954). Subcategories “remote standard assessment,” “digital image analysis,” “wearable devices,” “mobile health (mHealth),” and “artificial intelligence” were independently examined.

Conclusions:

The majority of systems achieved parity with standard physician assessment, with the added benefit of eliminating subjectivity. Meta-analysis results suggest potential introduction of remote automated assessment where conventional assessment is unavailable, although accuracy of current technologies remains limited compared to in-person examination. Telemedicine modalities described offer convenience for patients, shorter examination times, and the potential to go beyond in-person assessments. The evidence gathered in this review supports the beginning of telemedicine integration into the world of strabismus diagnosis.

Introduction

Major advances in telemedicine over recent decades have created a plethora of opportunities to enhance patient care.¹ Strabismus is an ocular condition arising from misalignment of the eyes. Standard assessment of strabismus involves the observation of the eyes in different positions of gaze, providing the potential for remote assessment and automated analysis of images/videos.² The term “telemedicine” was interpreted in its broadest sense as any use of technology removing the need for in-person examination. The purpose of this narrative review was to evaluate the scope for involvement of telemedicine to facilitate strabismus assessment, as well as quantify the accuracy of automated methods without human input.

Methods

A literature search was conducted using the medical databases Embase Ovid (Fig. 1), MEDLINE (Fig. 2), and the Cochrane Library (Fig. 3). The Cochrane Library was searched for related reviews, and the reference lists of said reviews were scanned for any studies that could help answer the research question. The keywords used included “strabismus,” “squint,” “phoria*,” “telemed*,” and “telehealth,” among synonyms. No filters or limits were applied. All abstracts picked up through the search were screened according to the predefined inclusion criteria:

Fig. 1.

Embase search.

Fig. 2.

Medline search.

Fig. 3.

Cochrane search.

Primary research—randomized controlled trials, nonrandomized controlled trials, and observational studies.

Quantify outcome measures relating to the validity of telemedicine, including reliability measures, sensitivity, and specificity.

Studies not in the English language were excluded. Reviews were excluded after checking their reference list for relevant articles. Following this, included studies were fully extracted onto a Microsoft Excel spreadsheet by one reviewer extracting platform, location, and agreement metrics.

Supplementing the primary literature search, a gray literature search was carried out with the Ovid Global Health database and the British Library e-thesis online service. EndNote was used throughout for organization of selected literature and deduplication. Risk of bias was judged with the QUADAS-2 tool. Meta-analysis of automated diagnosis was subsequently performed using statistical analysis software. Investigation was subdivided into five approaches: remote standard assessment, digital image analysis, wearable devices, mobile health (mHealth), and artificial intelligence.

Results

Thirty-four studies complied with the specified inclusion criteria and were fully extracted (Table 1). Twenty out of 34 studies had median age data available. Of these, the average median age was 20.82 years, and average number of participants was 79.15 (Table 2). A PRISMA flow diagram was generated (Fig. 4).

Table 1.

Full Results Table

AUTHOR	YEAR	STUDY REF	CLASS OF TELEMEDICINE	OUTCOME	UNIT	VALUE
Almeida	2015	#286	Computer-aided methodology	Esotropia identification accuracy	%	88.0
				Exotropia identification accuracy	%	100
				Hypertropia identification accuracy	%	80.3
				Hypotropia identification accuracy	%	80.3
Almeida	2012	#372	Computer-aided methodology	All sensitivity	%	95.14
				Correlogram sensitivity	%	88.05
				Covariogram sensitivity	%	84.06
				Semimadogram sensitivity	%	94.23
				Semivariogram sensitivity	%	90.32
				All specificity	%	95.38
				Correlogram specificity	%	72.38
				Covariogram specificity	%	69.3
				Semimadogram specificity	%	92.53
				Semivariogram specificity	%	84.96
Bakker	2013	#347	Wearable camera device	Error angle horizontal	Degrees	1.81
				Error angle vertical	Degrees	1.21
				Error angle varying head rotation fixation object 1 horizontal	Degrees	1.68
				Error angle varying head rotation fixation object 1 vertical	Degrees	1.4
				Estimate error angle varying head rotation fixation object 2 horizontal	Degrees	1.59
				Error angle varying head rotation fixation object 2 vertical	Degrees	1.47
Bindiganavale	2022	#24	VR display	Traditional vs VR double maddox rod correlation	R ²	0.94
Capo-Aponte	2012	#354	Computer-aided methodology	Lateral phoria correlation	R	0.914
				Lateral phoria LOA range	PD	−0.63 ± 1.30
				Lateral phoria sensitivity	%	83
				Lateral phoria specificity	%	100
				Vertical phoria correlation	R	0.935
				Vertical phoria LOA range	PD	−0.12 ± 0.39
				Vertical phoria sensitivity	%	86
				Vertical phoria specificity	%	100
Cheng	2021	#72	Automated photographic Hirschberg test	Application sensitivity threshold 3 PD	%	83.3
Cheng	2021	#72	Automated photographic Hirschberg test	Application specificity threshold 3 PD	%	76.5
Cheung	2000	#672	Synchronous remote assessment	Angle of deviation agreement horizontal 6 m fixation	ICC	0.79 (0.64–0.88)
				Angle of deviation agreement horizontal 0.33 m fixation	ICC	0.70 (0.5–0.82)
				Angle of deviation agreement vertical 6 m fixation	ICC	0.78 (0.63–0.88)
				Angle of deviation agreement vertical 0.33 m fixation	ICC	0.65 (0.45–0.80)
				Adjusted risk of disagreement category of strabismus 6 m fixation	Odds ratio	2.6 (1.17–5.76)
				Adjusted risk of disagreement category of strabismus 0.33 m fixation	Odds ratio	2.52 (1.29–5.61)
				Adjusted risk of disagreement angle of deviation 6 m fixation	Odds ratio	2.51 (1.14–5.50)
				Adjusted risk of disagreement angle of deviation 0.33 m fixation	Odds ratio	2.42 (1.25–4.67)
				Adjusted risk of disagreement eye muscle movements right eye	Odds ratio	1.89 (1.10–3.54)
				Adjusted risk of disagreement eye muscle movements left eye	Odds ratio	2.82 (1.36–5.82)
				Category of strabismus agreement horizontal 6 m fixation	Unweighted kappa	0.66 (0.47–0.85)
				Category of strabismus agreement horizontal 0.33 m fixation	Unweighted kappa	0.74 (0.57–0.91)
				Category of strabismus agreement vertical 6 m fixation	Unweighted kappa	0.28 (0.02–0.54)
				Category of strabismus agreement vertical 0.33 m fixation	Unweighted kappa	0.25 (0.01–0.51)
Dawson	2002	#1376	Synchronous remote assessment	Full agreement	%	80.0
Dawson	2002	#1376	Synchronous remote assessment	At least partial agreement	%	83.3
De Figueiredo	2021	#61	AI app	Global accuracy left eye gaze 1	%	45
				Global accuracy left eye gaze 2	%	76
				Global accuracy left eye gaze 3	%	67
				Global accuracy left eye gaze 4	%	81
				Global accuracy left eye gaze 6	%	58
				Global accuracy left eye gaze 7	%	73
				Global accuracy left eye gaze 8	%	61
				Global accuracy left eye gaze 9	%	50
				Global accuracy right eye gaze 1	%	42
				Global accuracy right eye gaze 2	%	71
				Global accuracy right eye gaze 3	%	56
				Global accuracy right eye gaze 4	%	73
				Global accuracy right eye gaze 6	%	54
				Global accuracy right eye gaze 7	%	92
				Global accuracy right eye gaze 8	%	57
Dericioglu	2019	#153	Computer-aided methodology	Correlation real and estimated gaze angles	R	0.99 (p < 0.001)
				Mean error estimated angle	PD	0.03 ± 4.60
				Average measurement error	PD	0.03 ± 4.60
				Correlation real and estimated imaging distance	R	0.997 (p < 0.001)
				Total reliability coefficient	Cronenbach’s alpha	0.983
Fisher	2007	#498	Synchronous remote assessment	Accuracy 50 group	%	100
Fisher	2007	#498	Synchronous remote assessment	Accuracy 65 group	%	100
Ho	2021	#69	Synchronous remote assessment	Horizontal angle measurements	ICC	0.95 (0.92–0.97)
				Vertical angle measurements	ICC	0.91 (0.69–0.98)
				Overall agreement in angle measurements Pivothead original	ICC	0.92 (0.65–0.97)
				Overall agreement in angle measurements Smart series	ICC	0.98 (0.97–0.99)
				Overall agreement in angle measurements Total videos	ICC	0.95 (0.93–0.97)
				Ocular motility agreement inferior oblique	%	93
				Ocular motility agreement superior oblique	%	98
				Presence of vertical deviation	Weighted kappa	1 (1.00–1.00)
				Presence of horizontal deviation	Weighted kappa	1 (1.00–1.00)
				Overall agreement in degree manifest Pivothead original	Weighted kappa	0.96 (0.92–0.99)
				Overall agreement in degree manifest Smart series	Weighted kappa	0.92 (0.87–0.96)
				Overall agreement in degree manifest Total videos	Weighted kappa	0.94 (0.90–0.97)
Huang	2022	#17	Face-detection model	MetaOptNet accuracy	%	70.9 (70.7–71.1)
				Proposed method accuracy	%	80.5 (80.3–80.6)
				MetaOptNet sensitivity	%	74 (73.7–74.4)
				Proposed method sensitivity	%	76.8 (76.6–77.1)
				MetaOptNet specificity	%	67.8 (67.5–68.1)
				Proposed method specificity	%	84.2 (83.9–84.4)
Huang	2021	#53	Face-detection model	Positional similarity estimate normal	Sample mean	1.073 ± 0.014
Huang	2021	#53	Face-detection model	Positional similarity estimate strabismus	Sample mean	1.924 ± 0.169
Kang	2020	#26	Deep-learning algorithm	Limbus segmentation accuracy	%	99.92
				Sclera segmentation accuracy	%	99.84
				Limbus segmentation sensitivity	%	95.63
				Sclera segmentation sensitivity	%	97.47
				Limbus segmentation specificity	%	99.96
				Sclera segmentation specificity	%	99.9
Li	2022	#1036	Synchronous remote assessment	Angle measurement overall agreement	ICC	0.97 (0.96–0.98)
				Ocular motility agreement	%	90.2 (85.5–93.9)
				Strabismus category vertical	Weighted kappa	1 (1.00–1.00)
				Strabismus category horizontal	Weighted kappa	1 (1.00–1.00)
				Strabismus category overall degree manifest	Weighted kappa	0.91 (0.88–0.93)
Lu	2018	#1366	AI algorithm	Diagnosis sensitivity	%	93.3
				Diagnosis specificity	%	96.17
				Diagnosis accuracy	%	93.89
				Diagnosis AUC	AUC	0.9865
Ma	2020	#1231	Smartphone app	Strabismus detection accuracy	%	94
				Strabismus detection sensitivity	%	80
				Strabismus detection specificity	%	98
Mesquita	2021	#59	Smartphone app	Cutoff 11 PD accuracy	%	92.8 (88.6–95.8)
				Cutoff 6 PD accuracy	%	84.5 (79.0–89.0)
				Cutoff 11 PD sensitivity	%	68.75 (41.3–88.9)
				Cutoff 6 PD sensitivity	%	89.47 (66.8–98.7)
				Cutoff 11 PD specificity	%	93.27 (88.96–96.27)
				Cutoff 6 PD specificity	%	84.39 (78.68–89)
				Cutoff 11 PD kappa coefficient	Unweighted kappa	0.49 (0.35–0.61)
				Cutoff 6 PD kappa coefficient	Unweighted kappa	0.43 (0.38–0.48)
Miao	2020	#119	VR display	Ocular deviation angle difference direct measurement	Degrees	0.3 ± 1.3
Miao	2020	#119	VR display	Ocular deviation angle difference stepwise analysis	Degrees	0.3 ± 1.3
Phanphruk	2017	#1172	Smartphone app	Increased detection of alignment abnormalities vs clinical photos (77%)	%	12 (p < 0.05)
Pundlik	2019	#222	Automated photographic Hirschberg test	Mean absolute difference app vs. cover test with prism neutralization	PD	5.4 ± 4.2
				Mean absolute difference app vs. Synoptophore	PD	10.1 ± 11
				Dissociated phoria measurement correlation	R ²	0.97 (p < 0.001)
				Monocular eye deviation measurement correlation	R ²	0.97 (p < 0.001)
				Strabismus measurement correlation	R ²	0.95 (p < 0.001)
Racano	2021	#79	Refractometer corneal reflexes app	Esotropia correlation	R ²	0.582 (p < 0.765)
				Exotropia correlation	R ²	0.241 (p < 0.056)
				Vertical correlation	R ²	0.168 (p < 0.0001)
				Esotropia sensitivity	%	88.9
				Subject under 5 years sensitivity	%	88.9
				Total sensitivity	%	79.2
				Vertical sensitivity	%	50.0
				Esotropia specificity	%	91.6
				Subject under 5 years specificity	%	80.0
				Total specificity	%	86.2
				Vertical specificity	%	100
Stewart	2022	#41	Synchronous remote assessment	Primary gaze angle measurements vertical Distance without correction	ICC	1.00
				Primary gaze angle measurements vertical Near without correction	ICC	1.00
				Primary gaze angle measurements vertical Distance with correction	ICC	1.00
				Primary gaze angle measurements vertical Near with correction	ICC	1.00
				Primary gaze angle measurements horizontal Distance without correction	ICC	0.98
				Primary gaze angle measurements horizontal Near without correction	ICC	1.00
				Primary gaze angle measurements horizontal Distance with correction	ICC	0.98
				Primary gaze angle measurements horizontal Near with correction	ICC	0.98
				Motility disease categorization horizontal Distance without correction	ICC	0.98
				Motility disease categorization horizontal Near without correction	Unweighted kappa	1.00
				Motility disease categorization horizontal Distance with correction	Unweighted kappa	0.96
				Motility disease categorization horizontal Near with correction	Unweighted kappa	0.97
				Motility disease categorization vertical Distance without correction	Unweighted kappa	1.00
				Motility disease categorization vertical Near without correction	Unweighted kappa	0.94
				Motility disease categorization vertical Distance with correction	Unweighted kappa	1.00
				Motility disease categorization vertical Near with correction	Unweighted kappa	1.00
Valente	2017	#1353	Eye-tracking system	Diagnosis sensitivity	%	80.00
				Diagnosis specificity	%	100.0
				Diagnosis accuracy	%	93.33
				Average error in deviation	PD	2.57
Weber	2017	#198	Video goggles	Agreement with Hess screen test horizontal deviations	ICC	0.83 (0.77–0.88)
				Agreement with Hess screen test vertical deviations	ICC	0.76 (0.68–0.82)
				Agreement with Hess screen test total deviations	ICC	0.82 (0.71–0.89)
Yang	2012	#375	3D photo analyzer	Concordance vs. Krimsky test	CCC	0.808 (0.6466–0.9001)
				Concordance vs. PCT	CCC	0.7437 (0.5522–0.8606)
				Correlation vs. Krimsky esotropia	CCC	0.949 (p < 0.001)
				Correlation vs. Krimsky exotropia	CCC	0.957 (p < 0.001)
				Correlation vs. Krimsky total	CCC	0.990 (p < 0.001)
				Correlation vs. PCT esotropia	CCC	0.831 (p < 0.001)
				Correlation vs. PCT exotropia	CCC	0.821 (p < 0.001)
				Correlation vs. PCT total	CCC	0.809 (p < 0.001)
Yang	2013	#341	3D photo analyzer	Bland–Altman half-width of 95^th LOA selective wavelength filter analysis	PD	±4.3
				Concordance correlation coefficients	CCC	0.96 (0.4–0.97)
				Test–retest reliability	PD	±8.5
				PCT-Correlation	R	0.900 (p < 0.001)
				PCT-Correlation esotropia	R	0.904 (p < 0.001)
				PCT-Correlation exotropia	R	0.900 (p < 0.001)
Yeh	2021	#63	VR-based ocular deviation measurement system	VR vs APCT	ICC	0.897 (0.810–0.945)
				VR vs APCT esotropia subgroup	ICC	0.962 (0.902–0.986)
				VR vs APCT exotropia subgroup	ICC	0.862 (0.651–0.950)
				Bland–Altman 95% LOA APCT vs VR	PD	11.32
				Bland–Altman 95% LOA APCT vs VR esotropia subgroup	PD	6.62
				Bland–Altman 95% LOA APCT vs VR exotropia subgroup	PD	11.27
Yehezkul	2020	#115	Eye tracking and dedicated full occlusion glasses	Bland–Altman half-width manual prism alternating cover test vs. automated ACT	PD	±11.4
				Correlation between automated and manual tests esodeviation	R	0.9 (p < 0.002)
				Correlation between automated and manual tests exodeviation	R	0.88
				Correlation between automated and manual tests vertical deviations	R	0.91
Zheng	2021	#77	Deep-learning algorithm	Deep learning model accuracy	%	96.8 (94.7–98.9)
				Deep learning model AUC	AUC	0.99 (0.989–1.000)
				Deep learning model sensitivity	%	99.3 (98.3–100.0)
				Deep learning model specificity	%	94.0 (91.9–96.8)
Zrinscak	2021	#46	Eye-tracking system strabiscope	ROC analysis os-scm variable cutoff 1.950 AUC	AUC	0.789 (0.701–0.894)
				ROC analysis os-scm variable cutoff 2.050 AUC	AUC	0.809 (0.713–0.904)
				ROC analysis os-scm variable cutoff 1.950 sensitivity	%	82.9
				ROC analysis os-scm variable cutoff 2.050 sensitivity	%	78.0
				ROC analysis os-scm variable cutoff 1.950 specificity	%	62.5
				ROC analysis os-scm variable cutoff 2.050 specificity	%	72.5

APCT, alternate prism cover test; AUC, area under the curve; CCC, concordance correlation coefficient; ICC, intraclass correlation coefficient; LOA, limit of agreement; PCT, prism cover test; PD, prism diopters; ROC, receiver operating characteristics; VR, virtual reality.

Table 2.

Study Characteristics

AUTHOR	YEAR	REFERENCE	LOCATION	MEDIAN AGE	PARTICIPANTS	CATEGORIES OF STRABISMUS
Almeida	2015	#286	Brazil	Not specified	200	4
Almeida	2012	#372	Brazil	Not specified	45	1
Bakker	2013	#347	The Netherlands	Not specified	3	1
Bindiganavale	2022	#24	USA	33	31	3
Capo-Aponte	2012	#354	USA	32.65	40	2
Chandna	2009	#438	UK	Not specified	27	41
Chen	2018	#173	Hong Kong	43	42	1
Cheng	2021	#72	China	Not specified	133	5
Cheung	2000	#672	Canada	16.85	85	11
Dawson	2002	#1376	UK	39	30	9
De Figueiredo	2021	#61	Brazil	41.47	110	2
Dericioglu	2019	#153	Turkey	8.6	72	1
Fisher	2007	#498	UK	Not specified	115	8
Ho	2021	#69	USA	8	37	2
Huang	2022	#17	Korea	Not specified	60	1
Huang	2021	#53	Korea	Not specified	60	1
Kang	2020	#26	Korea	Not specified	166	1
Li	2022	#1036	USA	7	18	2
Lu	2018	#1366	Korea	Not specified	5685	1
Ma	2020	#1231	China	9	100	1
Mesquita	2021	#59	Brazil	10	224	2
Miao	2020	#119	Korea	Not specified	17	1
Phanphruk	2017	#1172	USA	31.3	30	2
Pundlik	2019	#222	USA	13	66	2
Racano	2021	#79	Italy	7.9	137	3
Stewart	2022	#41	USA	6	210	34
Valente	2017	#1353	Brazil	Not specified	15	4
Weber	2017	#198	Switzerland	37	58	1
Yang	2012	#375	Korea	9.37	100	2
Yang	2013	#341	Korea	Not specified	90	3
Yeh	2021	#63	Taiwan	39.4	40	2
Yehezkul	2020	#115	Israel	7.17	72	1
Zheng	2021	#77	China	Not specified	1404	1
Zrinscak	2021	#46	Croatia	16.6	81	1

Fig. 4.

PRISMA flow diagram.

REMOTE ASSESSMENT

Cheung et al. first described the potential for strabismus examination remotely in 2000.³ Forty-two patients were examined both in-person and remotely, compared with 43 examined in-person only. Agreement on ocular muscle action was lower in the telemedicine study for all muscle groups. Odds of disagreement were consistently increased for all measurements (category, angle, ocular muscle movements) by two to three times.

Dawson iterated upon this initial integration of telemedicine into strabismus assessment.⁴ A total of 30 patients were examined by different ophthalmologists in person and through telemedicine, who were presented by the same orthoptist. Qualitatively, ophthalmologists noted straightforward diagnoses of large limitations, for example, Duane’s syndrome. Conversely, latent squint was more difficult to diagnose.

Mobile applications facilitated convenient remote assessment as well. In 2017 Phanphruk et al. developed the StrabisPIX tool, a mobile application allowing processing of images in the nine cardinal positions of gaze independently taken by the patient.⁵ The comparison was with images taken professionally by an orthoptist. Concurring with the findings of Cheung et al., significantly more acceptability of images for horizontal versions was found over vertical versions and head posture.

Ho et al. in 2021 introduced high-definition video smart glasses to simultaneously record a strabismus examination while performing one in person.⁶ The recorded videos were then assessed later in a store-and-forward manner. Agreement was classified as excellent for both vertical deviations, bucking the trend of inferior vertical deviation agreement noted in earlier studies. Equivalent agreement was found between the gold-standard in-person examinations and the store-and-forward videos.

Li et al. expanded on this work by examining real-time video feeds obtained with video glasses.⁷ In-person assessment was compared with store-and-forward review of recorded videos three years later. Strabismus category, degree manifest, angle measurement, and extraocular motility agreement were high.

Synchronous streaming was evaluated by Stewart et al., who analyzed the agreement between examinations streamed to ophthalmologists remotely and in-person re-examinations of the patients by the same doctor on the same day.⁸ Of the families, 98.5% were comfortable with quality of telemedicine examination, and 97.1% agreed that they would be happy to participate in another similar study in the future. Changes in management plan and discrepancies in diagnosis were lower than a reasonable noninferiority threshold set at 1.5% and 15%, respectively.

DIGITAL IMAGE ANALYSIS

Almeida et al. formulated a multistage computational methodology for automatic strabismus detection first in 2012, using digital automation of the Hirschberg test.⁹ Cover test was performed in all patients, with division into strabismic and nonstrabismic control groups. Reference alternate prism cover test (APCT) was applied for strabismic patients.

Yang et al. then validated the effectiveness of a novel 3D Strabismus Photo Analyzer to estimate ocular alignment through images.¹⁰ The analyzer tool calculated the angle k (between corneal light reflex and pupil center) from a primary 2D image captured. The tool requires minimal operator input and does not rely on constant Hirschberg ratio due to its 3D, rather than 2D, nature. Adjustment for age and angle-k ophthalmical biometry was performed.

This study was followed up by Yang et al. again, assessing the efficacy of a selective wavelength filter with an infrared camera followed by the 3D Strabismus Photo Analyzer previously described.¹¹ Functionality with infrared images allowed the measurement of latent strabismus only discernible following disruption of fusion.

Almeida et al. iterated upon their previous work in another analysis of images previously diagnosed by a specialist.¹² An important change in protocol was introduced by including patients with deviations up to 90 prism diopters (PD), facilitating both initial checking and diagnosis. Low accuracy noted in the diagnosis of orthotropic patients was attributed to the disconnect in precision between the tool and the specialist unaccustomed to working with such small shifts and precision.

Techniques used in static image analysis were transferred to digital videos in work by Valente et al., who proposed a computational methodology involving data extracted from a cover test video.¹³ Utilizing the eye-region detection and search-space delimitation technology introduced by Almeida et al., eye-tracking software was integrated to facilitate the classification of strabismus through selecting the highest average of deviation measure. Compared with previous relevant studies, Valente claimed superiority in affordability of equipment, classifying multiple directions of deviation, and diagnosis of nonapparent strabismus, although only exotropic patient videos were available.

Using a different method to quantitively measure the extent of horizontal strabismus, Dericioglu et al. observed and clinically validated the ratio between the geometrical corneal center and light reflex to calculate gaze angle and imaging distance.¹⁴ A high correlation was reported between real and estimated gaze angles, as well as imaging distance. The error rate was not correlated with patient age or deviation angle.

The functionality of dedicated occlusion glasses with eye-tracking software was investigated by Yehezkel et al.¹⁵ No significant interexaminer variability for APCT and automated ACT was detected. The average automated test duration was 46 s. The repeatability of automated test was significantly higher with just under a twofold reduction in average standard deviation for horizontal and vertical deviations.

Zrinscak et al. introduced eye-tracking software to detect manifest strabismus without the need for a skilled examiner.¹⁶ The novel Strabiscope system was described, to calculate strabismus diagnosis parameters. Strabismic participants were shown to have higher values for all measurements compared with the nonstrabismic control group.

Kang et al. reviewed an automated mathematical algorithm to quantitively measure strabismus from analysis of cardinal gaze position images, validated with confusion matrices.¹⁷ Through direct application to clinical scenarios, this study was an improvement on De Figueiredo’s previous work on a convolutional neural network (CNN) web application.¹⁸ Having many categories of strabismus was an improvement on Zheng’s proposed algorithm.¹⁹

WEARABLE DEVICES

Capo-Aponte examined the effectiveness of a fixed computerized oculomotor vision screening system.²⁰ While Pearson correlation was strong, Bland–Altman analysis showed moderate discrepancies to fusional vergence and monocular accommodative facility measurements. Persistent overestimation of left hyperdeviation and underestimation of right hyperdeviation were observed.

Upgraded components included in other hardware devices offer additional advantages. Bakker et al. introduced the Delft Assessment Instrument for Strabismus in Young Children combining infrared light-emitting diodes and a high-resolution stereo system, allowing unrestrained head movement for the quick and reliable quantification of strabismus angles in young children.²¹

Novel video goggles to deliver a simple noninvasive test for strabismus were innovated by Weber et al.²² The video goggles used infrared light with liquid crystal display shutters and projection of a laser target to measure ocular deviations on a nine-point target grid. Patients with visual suppression and comitant strabismus, who are not able to be examined by Hess chart, were able to be examined by the strabismus video goggles.

Virtual reality (VR) also has a role to play in facilitating assessment. Miao et al. integrated a VR system into the measurement of ocular deviation for strabismus patients, alternating fixation targets between the eyes to emulate a standard cover test.²³ The study contrasted a direct measurement (DM) strategy with stepwise approximation (SA, measuring ocular deviation through feedback calibration). While both DM and SA had excellent performance in orthotropia and exotropia, SA displayed more stable results than DM on 95% limit of agreement (LOA) of difference.

Following on from Miao et al’s VR work., Yeh et al. researched the viability of an eye-tracking VR headset for strabismus measurement simulating APCT.²⁴ Eye-tracking software recorded patient eye movements between two screens with alternating fixation targets. A large standard deviation of 5.77 PD was revealed between VR and APCT, which was attributed to large degrees of strabismus and 5-PD increments in the prism set.

mHealth

A quantitative strategy first pioneered by Pundlik et al. applied an automated photographic Hirschberg test for the measurement of ocular deviation using the EyeTurn mobile application, automatically processing corneal reflection position relative to the globe center.²⁵ Consistency of app measurements with cover test was further increased following correction of cover test values for near fixation.

Ma et al. continued exploring the scope of a mobile device’s potential for automated strabismus diagnosis, formulating a “one-step streamlined screening solution,” aiming for comprehensive examination of children in resource-limited areas such as low-skilled technicians.²⁶ The procedure proved highly scalable, calculating values in only ten seconds, leveraging artificial intelligence (AI) algorithms for a potential throughput of 200 children an hour.

Following Pundlik et al., EyeTurn was also evaluated for feasibility in a cross-sectional trial by Cheng et al. in the context of routine vision screening in a school performed by an untrained school nurse, using the photographic Hirschberg method.²⁷ The optimal threshold for strabismus detection was 3.0PD. From regression analysis, a high positive correlation was observed.

To tackle the public health problem of amblyopia, Mesquita et al. designed a trial looking at the concordance between a low-cost mHealth application for instant strabismus diagnosis and expert clinical evaluation by ophthalmologists.²⁸ Difference between measurements of horizontal and vertical deviations was attributed to the tangential observation of deviation to the mobile camera.

Racano et al. proceeded to validate the 2WIN corneal reflexes app using the AAPOS 2013 guidelines of a >8 PD threshold.²⁹ Difference with the standard calculated from Wilcoxon signed rank sum test was significant for vertical deviations (poor correlation) although not for eso- and exodeviations. Fair correlation was observed for esotropia compared to exotropia correlation.

Finally, VR was integrated by Bindiganavale et al., who validated a VR approach for the measurement of torsional strabismus.³⁰ A virtual double Maddox rod (DMR) examination was implemented using a commercially available smartphone and VR viewer. At higher degrees of cyclodeviation, more biasing of VR-DMR measurements was observed. Of the participants, 54.8% found VR-DMR to be easier to use than standard DMR, and all participants were optimistic about using smartphone applications for testing.

ARTIFICIAL INTELLIGENCE

Fisher et al. first described an artificially intelligent expert system featuring a backpropagation learning method to progressively categorize strabismus.³¹ This study was the first attempt to use real clinical data rather than a parametric biomechanical model. The StrabNet system was shown to be effective for diagnosis. Outside expert input showed potential for teaching and learning as well.

Chandna et al. further investigated the same StrabNet system simplifying the system to six directions.³² Matching between StrabNet and expert diagnosis occurred in a majority of cases, including those for which the StrabNet tool had not been explicitly trained.

In 2018, Chen et al. introduced the application of eye-tracking CNNs for recognition of strabismus using nine-point gaze acquisition data, reducing labor costs and increasing diagnosis efficiency.³³ The highest accuracy was achieved by the Visual Geometry Group-S model, only misclassifying one strabismic and one nonstrabismic participant.

An application for automated strabismus detection using telemedicine was described by Lu et al. in 2018 through the establishment of a tele-strabismus dataset.³⁴ Past a threshold of 1500 training examples, detection results improved significantly. Compared with previous studies such as Valente and Chen, the system described by Lu is superior in that it does not require on-site assistance of specialists.^13,33

De Figueriredo et al. designed an early prototype mobile app integrating AI technology with the aim of a fully objective automated classification of eye versions using the programming language Python.¹⁸ Results were limited due to several classes with few or no observations, decreasing overall global quality metrics.

The pediatric aspect of AI for strabismus was investigated by Zheng et al. in a study applying a deep learning approach to screening of photos of children’s eyes for horizontal strabismus.¹⁹ Promisingly, a difficult case of epicanthal pseudostrabismus was diagnosed correctly. Relative to the previous literature, this system proved superior in providing complete end-to-end learning without need for manual adjustment.

An automatic strabismus screening method utilizing CNNs was also validated by Huang et al., in a pretrained model for detection of facial landmarks to extract eye region for measurement of positional similarity.³⁵ Novel use of Otsu’s binarization and the hue saturation value model, mitigating confusion from lashes and canthi, was described. In comparison to Zheng, manual adjustment was not needed for extraction of eye region due to Huang’s use of facial landmark detector. Superiority was also achieved over Almeida’s work by obviating the need for image acquisition on-site and additional labor.

Huang et al. followed up this study a year later with an improvement on the proposed method, adding an image processing component to the meta-learning MetaOptNet architecture.³⁶ A data-scarce environment was suggested as a particular use-case for said supplementation.

META-ANALYSIS

For studies utilizing remote automated methods (defined as not involving human input, excluding remote assessment), for which data were available, a meta-analysis was conducted to determine their overall effectiveness. Statistical calculations were performed with the analysis software Stata (StataCorp. 2023. Stata Statistical Software: version 18.0. College Station, TX). A random effects meta-analysis model with the restricted maximum likelihood method was selected to compute raw effect sizes for estimating the relevant overall proportions. Eight studies using automated methods had available data on accuracy. From these studies, the overall accuracy computed was 0.877 (0.806–0.949) (Fig. 5). A different subset of 11 studies contained data on sensitivity and specificity. The overall sensitivity calculated was 0.856 (0.805–0.907) (Fig. 6), and specificity 0.900 (0.845–0.954) (Fig. 7).

Fig. 5.

Forest plot accuracy.

Fig. 6.

Forest plot sensitivity.

Fig. 7.

Forest plot specificity.

RISK OF BIAS

Risk of potential study bias was assessed using the QUADAS-2 tool, across the four domains patient selection, index test, reference test, and patient flow. Two studies were deemed to have high risk of bias, seven unclear, and 25 low. High risk of bias was attributable to patient selection and matching review question.

GRAY LITERATURE REVIEW

No relevant articles were identified from review of the surrounding gray literature.

Discussion

There is a large clinical need for technological innovations permitting timely diagnosis and treatment of strabismus, especially for children below the age of seven for whom amblyopia is still reversible.³⁷ Overall, the selected studies demonstrate a high level of confidence in the safety, reliability, and feasibility of utilizing telemedicine technology for the purpose of strabismus diagnosis across various metrics. Success of telemedicine modalities was also described qualitatively in several domains relating to clinical efficacy, such as short duration of examination time, patient satisfaction, and scalability.

Assuming that the gold standard of full eye examination has perfect sensitivity and specificity for strabismus diagnosis, the computed meta-analysis values may be high enough for automated methods to start to be considered in clinical practice, especially where access to conventional assessment is limited. However, considerable room for improvement remains before widespread adoption, as values equal to or less than 0.9 mean that at least one in 10 patients with strabismus is missed and one in 10 without strabismus incorrectly diagnosed. Interestingly, the calculated specificity was marginally higher than the sensitivity, indicating that the automated methods were slightly better at ruling out, rather than ruling in. This could speak to an overly cautious tendency in automated methods, incorrectly categorizing aberrant biomechanical parameters within the spectrum of physiological variance where a human would not. It will be useful to follow whether this disparity remains as technologies are iterated upon further, as a consistently higher specificity could implicate a greater utility for automated methods as a second-line confirmatory test, following a more sensitive human-mediated triaging system. In addition, it could be worthwhile for scientists designing said methods to be cognizant of this by modifying the in-built parameters of their algorithms to be less conservative.

LIMITATIONS OF INCLUDED EVIDENCE

Study designs and settings were heterogenous in nature, with wide variance in outcome measures used. Participant numbers in many studies were low, reducing statistical power.³⁰ Confirmation bias was introduced where blinding was absent.^5,8 Selection methods were highly biased through the arbitrary exclusion of patients due to young age and disability.^3,7,8 A subset of studies also excluded problematic diagnoses from the beginning, artificially inflating efficacy.^11,16 Systems utilizing population Hirschberg ratio averages were not controlled for ethnicity or sex, disproportionately affecting reliability for some demographic groups.²⁵

LIMITATIONS OF THE REVIEW

The validity of the meta-analysis conducted in this review is limited by high heterogeneity, as well as the calculations lacking several relevant included studies due to variation in study outcomes (lacking data on accuracy, sensitivity, and specificity). As only one reviewer was available, no consensus could be reached during the abstract and full article review process.

Conclusions

The future of strabismus assessment will likely involve integration of advanced, highly accurate systems into convenient platforms such as smartphone apps. Beyond diagnosis, rehabilitation could also be delivered through telemedicine, such as visual exercises for amblyopia. For clinical validation, studies should be designed directly comparing outcomes of discrete modalities, guiding the direction of future research. This review strengthens the argument that advancements in telemedicine technology have significant potential to improve the accuracy and availability of strabismus assessment.

Footnotes

Acknowledgment

Many thanks to University College London for supporting this work.

Authors’ Contributions

Dominic Wong: Conceptualization, Methodology, Software, and Writing—Original draft preparation. Malik Alsaif: Writing—Reviewing and Editing. Lloyd Bender: Supervision

Author Disclosure Statement

No competing financial interests exist.

Funding Information

No funding was received for this article.

References

Wilson

, Maeder

. Recent directions in telemedicine: Review of trends in research and practice. Healthc Inform Res, 2015; 21(4):213–222; doi: 10.4258/hir.2015.21.4.213

Bommireddy

, Taylor

, Clarke

. Assessing strabismus in children. Paediatrics and Child Health, 2023; 33(12):401–405; doi: 10.1016/j.paed.2023.09.007

Cheung

, Dick

, Kraft

, et al. Strabismus examination by telemedicine. Ophthalmology, 2000; 107(11):1999–2005; doi: 10.1016/s0161-6420(00)00377-8

Dawson

, Kennedy

, Bentley

, et al. The role of telemedicine in the assessment of strabismus. J Telemed Telecare, 2002; 8(1):52–55; doi: 10.1258/1357633021937361

Phanphruk

, Liu

, Morley

, et al. Validation of StrabisPIX, a mobile application for home measurement of ocular alignment. Transl Vis Sci Technol, 2017; 8(2):9; doi: 10.1167/tvst.8.2.9

, Kolin

, Stewart

, et al. Evaluation of high-definition video smart glasses for real-time telemedicine strabismus consultations. J AAPOS, 2021; 25(2):74.e1–e6; doi: 10.1016/j.jaapos.2020.11.016

, Nguyen

, Kolin

, et al. Evaluation of video glasses for real-time hardware-to-software telemedicine strabismus consultations across multiple graders. Journal of AAPOS, 2022; 26(4):e26; doi: 10.1016/j.jaapos.2022.08.096

Stewart

, Coffey-Sandoval

, Reid

, et al. Reliability of telemedicine for real-time paediatric ophthalmology consultations. Br J Ophthalmol, 2022; 106(8):1157–1163; doi: 10.1136/bjophthalmol-2020-318385

Almeida

, Silva

, Paiva

, et al. Computational methodology for automatic detection of strabismus in digital images through Hirschberg test. Comput Biol Med, 2012; 42(1):135–146; doi: 10.1016/j.compbiomed.2011.11.001

10.

Yang

, Han

, Hwang

, et al. Assessment of binocular alignment using the three-dimensional Strabismus Photo Analyzer. Br J Ophthalmol, 2012; 96(1):78–82; doi: 10.1136/bjophthalmol-2011-300305

11.

Yang

, Seo

, Hwang

, et al. Automated analysis of binocular alignment using an infrared camera and selective wavelength filter. Invest Ophthalmol Vis Sci, 2013; 54(4):2733–2737; doi: 10.1167/iovs.12-11400

12.

Almeida

, Silva

, Teixeira

, et al. Computer-Aided Methodology for Syndromic Strabismus Diagnosis. J Digit Imaging, 2015; 28(4):462–473; doi: 10.1007/s10278-014-9758-0

13.

Valente

TLA

, de Almeida

JDS

, Silva

, et al. Automatic diagnosis of strabismus in digital videos through cover test. Comput Methods Programs Biomed, 2017; 140:295–305; doi: 10.1016/j.cmpb.2017.01.002

14.

Dericioğlu

, Çerman

. Quantitative measurement of horizontal strabismus with digital photography. Journal of Aapos: American Association for Pediatric Ophthalmology & Strabismus, 2019; 23(1):18.e1–e6; doi: 10.1016/j.jaapos.2018.08.014

15.

Yehezkel

, Belkin

, Wygnanski-Jaffe

. Automated diagnosis and measurement of strabismus in children. Am J Ophthalmol, 2020; 213:226–234; doi: 10.1016/j.ajo.2019.12.018

16.

Zrinscak

, Grubisic

, Skala

, et al. Computer based eye tracker for detection of manifest strabismus. Acta Clin Croat, 2021; 60(4):683–694; doi: 10.20471/acc.2021.60.04.16

17.

Kang

, Yang

, Kim

, et al. Automated mathematical algorithm for quantitative measurement of strabismus based on photographs of nine cardinal gaze positions. Biomed Res Int, 2022; 2022:9840494; doi: 10.1155/2022/9840494

18.

de Figueiredo

, Dias

JVP

, Polati

, et al. Strabismus and artificial intelligence app: Optimizing diagnostic and accuracy. Transl Vis Sci Technol, 2021; 10(7):22; doi: 10.1167/tvst.10.7.22

19.

Zheng

, Yao

, Lu

, et al. Detection of referable horizontal strabismus in children’s primary gaze photographs using deep learning. Transl Vis Sci Technol, 2021; 10(1):33; doi: 10.1167/tvst.10.1.33

20.

Capo-Aponte

, Tarbett

, Urosevich

, et al. Effectiveness of computerized oculomotor vision screening in a military population: Pilot study. J Rehabil Res Dev, 2012; 49(9):1377–1398; doi: 10.1682/jrrd.2011.07.0128

21.

Bakker

, Lenseigne

BAJ

, Schutte

, et al. Accurate gaze direction measurements with free head movement for strabismus angle estimation. IEEE Trans Biomed Eng, 2013; 60(11):3028–3035; doi: 10.1109/TBME.2013.2246161

22.

Weber

, Rappoport

, Dysli

, et al. Strabismus Measurements with Novel Video Goggles. Ophthalmology, 2017; 124(12):1849–1856; doi: 10.1016/j.ophtha.2017.06.020

23.

Miao

, Jeon

, Park

, et al. Virtual reality-based measurement of ocular deviation in strabismus. Comput Methods Programs Biomed, 2020; 185:105132; doi: 10.1016/j.cmpb.2019.105132

24.

Yeh

, Liu

, Sun

, et al. To measure the amount of ocular deviation in strabismus patients with an eye-tracking virtual reality headset. BMC Ophthalmol, 2021; 21(1):246; doi: 10.1186/s12886-021-02016-z

25.

Pundlik

, Tomasi

, Liu

, et al. Development and preliminary evaluation of a smartphone app for measuring eye alignment. Transl Vis Sci Technol, 2019; 8(1):19; doi: 10.1167/tvst.8.1.19

26.

, Guan

, Yuan

, et al. a one-step, streamlined children’s vision screening solution based on smartphone imaging for resource-limited areas: Design and preliminary field evaluation. JMIR Mhealth Uhealth, 2020; 8(7):e18226; doi: 10.2196/18226

27.

Cheng

, Lynn

, Pundlik

, et al. A smartphone ocular alignment measurement app in school screening for strabismus. BMC Ophthalmol, 2021; 21(1):150; doi: 10.1186/s12886-021-01902-w

28.

Mesquita

MJTAM

, Azevedo Valente

, de Almeida

JDS

, et al. A mhealth application for automated detection and diagnosis of strabismus. Int J Med Inform, 2021; 153:104527; doi: 10.1016/j.ijmedinf.2021.104527

29.

Racano

, Di Stefano

, Alessi

, et al. Validation of the 2WIN corneal reflexes app in children. Graefes Arch Clin Exp Ophthalmol, 2021; 259(6):1635–1642; doi: 10.1007/s00417-020-05066-z

30.

Bindiganavale

, Buickians

, Lambert

, et al. Development and Preliminary Validation of a Virtual Reality Approach for Measurement of Torsional Strabismus. J Neuroophthalmol, 2022; 42(1):e248–e53; doi: 10.1097/WNO.0000000000001451

31.

Fisher

, Chandna

, Cunningham

. The differential diagnosis of vertical strabismus from prism cover test data using an artificially intelligent expert system. Med Biol Eng Comput, 2007; 45(7):689–693; doi: 10.1007/s11517-007-0212-z

32.

Chandna

, Fisher

, Cunningham

, et al. Pattern recognition of vertical strabismus using an artificial neural network (StrabNet). Strabismus, 2009; 17(4):131–138; doi: 10.3109/09273970903234032

33.

Chen

, Fu

, Lo

, et al. Strabismus recognition using eye-tracking data and convolutional neural networks. J Healthc Eng, 2018; 2018:7692198; doi: 10.1155/2018/7692198

34.

, Fan

, Zheng

, et al. Automated strabismus detection for telemedicine applications. arXiv Preprint, 2018:180902940.

35.

Huang

, Lee

, Kim

, et al. An automatic screening method for strabismus detection based on image processing. PLoS One, 2021; 16(8):e0255643; doi: 10.1371/journal.pone.0255643

36.

Huang

, Lee

, Kim

, et al. An improved strabismus screening method with combination of meta-learning and image processing under data scarcity. PLoS One, 2022; 17(8):e0269365; doi: 10.1371/journal.pone.0269365

37.

Holmes

, Lazar

, Melia

, et al. Effect of age on response to amblyopia treatment in children. Arch Ophthalmol, 2011; 129(11):1451–1457; doi: 10.1001/archophthalmol.2011.179