Deep Reinforcement Learning-Based Retinal Imaging in Alzheimer’s Disease: Potential and Perspectives

Abstract

Alzheimer’s disease (AD) remains a global health challenge in the 21st century due to its increasing prevalence as the major cause of dementia. State-of-the-art artificial intelligence (AI)-based tests could potentially improve population-based strategies to detect and manage AD. Current retinal imaging demonstrates immense potential as a non-invasive screening measure for AD, by studying qualitative and quantitative changes in the neuronal and vascular structures of the retina that are often associated with degenerative changes in the brain. On the other hand, the tremendous success of AI, especially deep learning, in recent years has encouraged its incorporation with retinal imaging for predicting systemic diseases. Further development in deep reinforcement learning (DRL), defined as a subfield of machine learning that combines deep learning and reinforcement learning, also prompts the question of how it can work hand in hand with retinal imaging as a viable tool for automated prediction of AD. This review aims to discuss potential applications of DRL in using retinal imaging to study AD, and their synergistic application to unlock other possibilities, such as AD detection and prediction of AD progression. Challenges and future directions, such as the use of inverse DRL in defining reward function, lack of standardization in retinal imaging, and data availability, will also be addressed to bridge gaps for its transition into clinical use.

Keywords

Alzheimer’s disease deep learning deep reinforcement learning reinforcement learning retinal imaging

BACKGROUND

Alzheimer’s disease (AD), the most common cause of dementia, poses a significant clinical challenge worldwide amid today’s aging population [1]. With evidence revealing the presence of pathophysiological markers before the manifestation of clinical symptoms [2, 3], the retina has been considered a “window” to study AD as an accessible extension of the brain in terms of embryology, anatomy, and physiology [4].

The advancement in retinal imaging modalities, such as optical coherence tomography (OCT), OCT angiography (OCT-A), and fundus photography (FP), has enabled detailed investigation into the neuronal and microvascular structures of the retina non-invasively [5–8]. Concomitantly, the development of cutting-edge artificial intelligence (AI) algorithms has the potential to facilitate retinal imaging analysis [5, 9–11], with deep learning (DL) showing promising results in detecting systemic diseases from retinal images, such as AD [10, 12, 13], cardiovascular disease [14, 15], kidney disease [16, 17], and hepatobiliary diseases [18]. Meanwhile, deep reinforcement learning (DRL), a less explored facet of ML in medical imaging, attempts to combine DL with reinforcement learning (RL), a learning strategy that constructs an “optimal policy” through trial-and-error to solve decision-making problems. DRL eliminates manual data engineering in conventional RL methods and is better for handling high-dimensional input.

The recent approval of two drugs, aducanumab and lecanemab, by the U.S. Food and Drug Administration signifies a paradigm shift in the management of AD, from a sole symptomatic treatment approach to the exploration of disease-modifying therapies. These monoclonal antibodies selectively bind to aggregated forms of amyloid-β (Aβ) and reduce the accumulation of Aβ plaques in the human brain [19], which was believed to be the culprit of cognitive decline in AD pathogenesis. Although studies regarding the clinical benefit of aducanumab and lecanemab on real-life patients remain inconclusive [20], their use was generally recommended for individuals with mild cognitive impairment or mild dementia due to AD [21], which calls for an unprecedented need for the development of a feasible screening tool that is sensitive towards the detection of early AD-associated changes once more convincing evidence arises.

With increasing recognition of the conjoint use of retinal imaging and AI as a clinically viable tool for systemic disease detection [9], this review aims to introduce DRL and discuss its applications, challenges, and future directions in using retinal imaging to study AD.

OVERVIEW OF DEEP REINFORCEMENT LEARNING

Deep learning

DL utilizes deep neural networks through representation learning, which can directly process raw data into suitable representations for pattern recognition [22]. In terms of information processing, the neural network is analogous to a biological brain: artificial neurons are arranged into immense layers [23], with the weight of the connections continuously adjusted throughout the learning process to attain remarkably complex functions. Manual steps such as feature extraction, selection, and dimensionality reduction are no longer required as the prerequisite in applying DL algorithms, making it desirable for large-scale, high-dimensional medical imaging analysis [24].

The most commonly used type of DL in analyzing images is the convolutional neural network (CNN), due to its exceptional feature recognizing ability [25]. CNNs are specialized variants of multilayer perceptron, in which each neuron in one layer is fully connected to all neurons of the next layer [26], similar to the human visual cortex. This particular architecture contributes to its two distinguishing properties: shared weights enable CNNs to learn features in a translational equivariance manner, i.e., the same feature would elicit a response in all neurons in a given convolutional layer [27], while pooling offers a certain degree of local translational invariance so that the features would still be recognized with slight distortion or positional shift [27]. The two properties work synergistically to achieve better generalizability in CNN architecture.

Reinforcement learning

RL is one of ML’s three basic paradigms, along with supervised learning and unsupervised learning. Unlike the other 2 paradigms that solely consider instant reward, the RL paradigm (Fig. 1A) contains an RL agent interacting with the environment through trial and error and aims to maximize the cumulative reward [28]. Its distinct nature in balancing exploration and exploitation, as well as not requiring prelabelled input and output, provides it with a substantial edge over supervised learning in real-time scenarios, where sequential decisions are made without an analytic solution.

Fig. 1

A) This is a simple illustration of the reinforcement learning (RL) process. The RL agent interacts with the environment in discrete time steps. First, it receives its current state from the environment. Second, it selects an action out of all possible actions depending on the information received. Finally, it transitions into the new state and the reward generated is determined by the environment. The iterative feedback loop is repeated until the terminal state is reached, such that the RL learns a policy that maximizes the cumulative reward through dynamic programming, which solves the optimization problem by breaking it into smaller sub-problems and finding their optimal solutions recursively. For example, in the detection of Alzheimer’s disease, the environment is constructed using input from imaging modalities such as fundus photography and optical coherence tomography. The task of the agent is to interact with the environment and obtain an optimal policy that maximizes the cumulative reward. The outcome, i.e., whether the patient has AD or his/her predicted progression, can be retrieved through the optimal policy. B) An RL problem is represented by a Markov Decision Process (MDP), a discrete-time stochastic control process that is composed of agent states, possible actions, the actions’ effects in each state and a reward function. It provides a mathematical framework for modelling decision-making in situations where outcomes are partly random and partly under the control of a decision-maker. In this figure, the states S_t are represented in blue circles, while the actions available at that state A_t are represented by the red circles. For instance, when the agent is at state S₀, it will be presented with the action A₀, where it can stay at S₀ or move to S₁. The probability that action A₀ in state S₀ will lead to it staying at state S₀ at the next time state is 0.5, while the probability of it transitioning to state S₁ is 0.5, which are represented by the black numbers on the arrows and are defined by the policy. The reward which the agent will receive by moving into the new state is represented by the green numbers. By repeating the process, the aim is to find an optimal policy for the agent, where the cumulative reward of the actions are maximized.

An RL problem is represented by a Markov Decision Process (MDP), composed of agent states, possible actions, the actions’ effects in each state, and a reward function (Fig. 1B) [29]. The RL agent interacts with the environment in discrete time steps. Firstly, it receives its current state from the environment. Subsequently, it selects an action out of all possible actions depending on the information received. Finally, it transitions into the new state and the reward generated is determined by the environment [28]. The iterative feedback loop is repeated until the terminal state is reached, such that the AI learns an “optimal reward policy” through dynamic programming [30], i.e., solving the optimization problem by breaking it into smaller sub-problems and finding their optimal solutions recursively.

Deep reinforcement learning

Although RL-based approaches have achieved success in a variety of problems, their applicability is limited to domains with fully observed and low dimensional spaces, and their efficacy is bottlenecked by challenges in hand-crafted feature design (i.e., conventional ML) in shallow models. Thus, with the integration of DL as a nonlinear approximator of value function or policy function to generalize across enormous state spaces [28], DRL is particularly useful for sophisticated, consequential decision-making problems with high-dimensional medical images such as three-dimensional (3D) anatomical landmark detection [31], vessel centerline tracing in 3D volumetric magnetic resonance imaging (MRI) and computerized tomography (CT) scans [32]. The value function, policy, and model in RL (i.e., an MDP) are approximated with the application of a deep neural network to manage continuous states or actions that are difficult to be represented in tabular functions.

USING RETINAL IMAGING IN AD DETECTION

Fig. 2

Retinal imaging offers a non-invasive alternative for the detection of Alzheimer’s disease (AD), with established quantitative associations between neurovascular changes in the retina and AD disease status.

While in vivo neuroimaging modalities such as MRI and positron emission tomography (PET) achieve high accuracy in AD detection, they are often limited by their high cost, low accessibility, invasiveness, technical complexity, and the risk of using radioactive tracers [5]. Given the status quo, scientists have actively explored other viable biomarkers of AD through retinal imaging to facilitate screening and risk stratification. Three major imaging modalities, namely OCT, OCT-A, and FP, have been extensively utilized for detecting changes in neuronal structure, capillary network, and larger vasculature, respectively. Substantial quantitative associations between retinal changes and AD disease status have been established (Fig. 2).

Optical coherence tomography in detecting retinal neuronal changes in AD

OCT is a non-invasive imaging technique that employs low-coherence interferometry for in vivo ocular measurements [33]. The individual retinal layers could be visualized in the high-resolution images, including the retinal nerve fiber layer (RNFL), the ganglion cell layer (GCL), and the inner plexiform layer (IPL). The quantitative segmental analysis of retinal layers enabled scientists to study their relationship with cognitive function impairment. Neuronal loss in the hippocampus and cerebral neocortex is a typical characteristic of AD [34]. The loss of RGCs and their axons in AD patients [35] is usually exhibited in the RNFL surrounding the optic nerve head (peripapillary RNFL). RNFL thinning in all four quadrants, especially in the superior and inferior quadrants, is observed in patients with AD in multiple OCT studies [36]. Macular GC-IPL thickness can also be measured through OCT, which demonstrates thinning around the fovea in most sectors under the manifestation of AD [37]. While the relative accuracy and sensitivity between the two parameters remain controversial, their association with AD demonstrates potential use as indicators of neurodegeneration for assisting early diagnosis.

Optical coherence tomography angiography in observing retinal capillary changes in AD

Apart from neuronal structures in the retina, vascular structures also demonstrate significant values in exhibiting pathological changes of AD. OCT-A is commonly used to visualize retinal and choroidal microvasculature at capillary level based on the principle of motion contrast: a series of OCT B-scans is taken at the same cross-section, and the laser reflectance from red blood cells in the vascular lumen is compared. The difference in decorrelation signal allows retinal capillaries and blood flow rate to be precisely mapped out non-invasively [38]. Computer algorithms are employed to reduce motion noise and segment the image into multiple zones, most commonly the superficial capillary plexus, the deep capillary plexus, and the radial peripapillary capillary plexus.

In recent years, there has been emerging evidence showing that AD is a multifactorial disease that overlaps neuropathologically with cerebrovascular diseases [39]. Possible speculation about the phenomenon is that hypoperfusion modulates β and γ-secretase which increases Aβ production [40]. Besides, vascular-mediated Aβ clearance is impaired by cerebrovascular diseases, thus further exacerbating the condition of AD patients [41]. The paradigm shift in the understanding of AD pathophysiology encouraged scientists to probe into microvascular changes in the retina. For instance, Bulut et al. have identified a reduction in retinal vascular density and enlargement in the foveal avascular zone in patients with Alzheimer’s type dementia [42]. Yet, most current studies are limited by their small sample size and often neglect other confounding factors [5]. More investigations are required to achieve a credible consensus.

Fundus photography in monitoring retinal arteriolar and venular changes in AD

Besides retinal capillaries, changes in larger vessels may also serve as a potential indicator for observing the microvascular pathology related to AD. FP is a simple but effective technique for capturing signs of vascular diseases from optically accessible retinal vessels. As hypoperfusion is a possible contributing factor to the development of AD, a reduction in retinal vascular density is expected in diseased individuals. Indeed, Cheung et al. reported a reduction in both arteriolar and venular fractal dimensions in association with AD dementia [43], which indicates a sparser vascular network and is consonant with the hypothesis. As for vessel caliber, Frost et al. and Cheung et al. concluded that narrower venule caliber is associated with an increased risk of AD [43, 44], although the effect is not consistently demonstrated in some studies. The relation of vessel tortuosity with the development of AD remains contentious, as Cheung et al. discovered that both increased retinal alveolar and venular tortuosity are associated with the risk of AD [43], while William et al. stated that reduced retinal alveolar tortuosity is associated with the risk of AD, with no significant correlation with retinal venular tortuosity [45]. The disputing results are attributed to cardiovascular comorbidities such as hypertension and diabetes in the subjects, which were believed to increase retinal vessel tortuosity as well [46, 47]. Studies adjusted to other clinical factors are required to confirm the validity of the hypothesis.

USING DEEP LEARNING TO DETECT AD FROM RETINAL IMAGES

The associations between retinal changes and AD disease status provided a solid foundation for using DL [48]-based retinal image analysis for AD detection. Cheung et al. [12] has recently developed a DL algorithm based on four retinal photographs (optic nerve head and macula-centered fields from both eyes) for each subject for detecting AD-dementia, which discriminated Aβ-positive from Aβ-negative with accuracy, sensitivity, and specificity ranging from 80.6% to 89.3%, 75.4% to 90.0%, and 92.0% to 100.0%, respectively, in testing datasets with data on PET. Wisely et al. [10] proposed a multimodal DL system to predict AD using images and measurements from multiple ocular imaging modalities (OCT, OCTA, ultra-widefield retinal photography, and retinal autofluorescence), which achieved the highest area under the receiver operating characteristic curve (AUROC) of 0.861 on the validation set and 0.841 on the test set. Tian et al. [13] has developed a highly modular DL algorithm that enables automated image selection, vessel segmentation, and classification of AD, achieving an accuracy of over 80%. With the general accuracy of AI-based AD detection revolving around 73% to 86% [49], the previous studies provided promising evidence for using AI-based ocular image analysis for AD detection as all of them achieved an accuracy of higher than 80%.

It is also promising to use AI-based ocular image analysis for AD detection as a screening tool when comparing with commonly used methods. For instance, Mini-Mental State Examination (MMSE) achieves sensitivity and specificity of 71.1% to 85.1% and 81.3% to 95.6%, respectively, when performed in various settings [50], while neuroimaging modalities achieve 87.5% to 95.0% accuracy in the binary classification of AD patients and control with the use of computer-based automated classification [51].

DEEP REINFORCEMENT LEARNING FOR ALZHEIMER’S DISEASE: CURRENT STUDIES AND POTENTIAL APPLICATION

Alzheimer’s disease classification

As multi-modal DL is shown feasible for AD classification from multiple ocular imaging modalities [10] and RL further offers a robust alternative by mimicking the cognitive flow of the human brain, the DRL algorithm can potentially be applied to the decision-making of AD classification via multimodal evaluation. For instance, Zhang et al. have proposed a multimodal DRL model for recognizing pathological features from MRI and diagnosing AD, which achieved an AUROC of over 95% in multiple datasets [52]. In the study, DRL was utilized to reconstruct compressed-sensing MRI images in order to improve clinical feasibility. A pixel-level disease probability risk map of the brain was then generated by feeding the reconstructed images into a full CNN. Finally, the risk map of specific brain regions was integrated with individual clinical data for biomarker analysis and obtaining a diagnosis with DL algorithms. With the advancement in the DRL technique itself, such as applying off-policy learning to enable the values of an optimal policy to be learned independently of the agent’s action [53], DRL is more catered towards personalized scenarios compared to other ML modalities, which is of particular importance due to the high degree of individual variance in multifactorial diseases such as AD. Besides, DRL possesses the edge over DL in terms of adaptability in sequential decision-making, especially in unseen cases with incomplete information, where DRL is able to take account into domain knowledge and learn policies that are able to generalize well in uncertain scenarios. Compared with physicians’ diagnosis, which is often suboptimal even in developed countries due to the insidious onset of AD and time-consuming diagnostic process [54], DRL can potentially offer an automated alternative that is less prone to human error and with a lower technical barrier, as it can take account into the diverse factors with minimal bias under an optimal policy. As retinal imaging allows the visualization of various retinal neuronal changes and vascular changes that correlates with AD, DRL exhibits its potential in the manipulation of high-dimensional retinal images such as 3D volumetric OCT images into a more interpretable form, which aids in the development of assessable AD detection techniques (Fig. 3). In the future, more studies are warranted to further investigate the advantages of DRL in AD classification and the comparisons to physicians or other ML models.

Fig. 3

Deep reinforcement learning (DRL) integrates deep learning as a nonlinear approximator of value function or policy function into conventional reinforcement learning. It is particularly useful for sophisticated and consequential decision-making problems with high-dimensional input such as medical images.

The application of DRL also offers an autonomous alternative for meticulous tracing, classification, and segmentation of changes in the retina. By applying DRL, the agent can learn a Q-function that enables it to search for the landmark effectively without examining the image exhaustively [55]. The agent begins at a coarse level to capture the global context and gradually increases its scale level to search for local details until the finest scale is reached. The landmark is pinpointed at the site of convergence, where the trajectory of the agent moves in a small, oscillatory cycle [56]. The implementation of DRL improves both the speed and accuracy of vessel detection and more importantly reduces the computational cost required for multidimensional images. The entire coronary tree can then be extracted with a single initial seed point with DRL, which achieves better performance than the supervised CNN approach with its optimization ability [56], holding great potential in developing multi-modal retinal imaging-based models for AD detection.

Alzheimer’s disease progression prediction

Understanding AD progression is of equal importance to identifying its presence, as it can be used to evaluate the severity of patients in a wide spectrum, formulate personalized management plans, and aid therapy development. In the hope of elucidating the pathophysiological process of AD, Saboo et al. have proposed a contemporary model that incorporates both RL and differential equation (DE), which is often used in mathematics to represent time-depending phenomena [57]. The DE-based model utilizes acknowledged mathematical relationships between factors and leverages the optimization power of RL to derive the unknown associations based on domain knowledge. The objective function aims to minimize the deficit between the cognitive demand of a task (represented by the maximum score of MMSE) and the cognition provided by the brain (proportional to the activated area measured by MRI), as well as reduce the energetic cost of cognition (proportional to the total brain activity measured by fMRI) [57]. By solving the optimization problem through RL, the model was able to outperform state-of-the-art models and the proposed model without RL in predicting a 10-year cognition trajectory, showing the effectiveness of RL implementation in presenting a more comprehensive view of the disease. It was also able to exhibit the compensatory process during disease progression, which involves the recruitment of specific brain regions for cognition to compensate for the neurodegeneration [58], without explicit manual programming.

This model presents a new direction for integrating RL with DE to construct a progression model, potentially bridging the knowledge gap and providing more insight into the biological processes that contribute to neurodegenerative diseases. The generic framework can be further explored to encompass retinal imaging by combination of DRL and application of different DEs, as retinal imaging offers a comprehensive view of the eye through different field-of-view (e.g., optic disc-centered and macula-centered) and neuronal layers (e.g., RNFL and GCIPL). With analogous changes associated with AD progression in the brain such as Aβ accumulation [5], parameters extracted from retinal images can possibly be derived similarly into an optimization problem for the prediction of AD progression. Studies of intervention approaches are also possible due to the mechanistic property of the model through perturbation analysis of the variables [59].

Long-term management of Alzheimer’s disease

As one of the major causes of disability and dependency on elderlies, AD carries a hefty economic burden on the healthcare system, with an estimated total global societal cost of US$ 1.3 trillion in 2019 [60]. With no curative treatment currently, the management of AD mainly falls under symptom relief and caregiving, thus requiring substantial effort from both clinicians and caregivers to keep track of the patient’s status. With this in mind, DRL offers a promising solution to provide management for patients in a more personalized and streamlined approach. For instance, the reward of the DRL algorithm could be defined by diminishing degenerative changes in the retina from images obtained by various modalities or reducing decline in cognitive function. Based on the patient’s state, the DRL agent can choose from a variety of actions, including the prescription of symptom-relieving drugs such as cholinesterase inhibitors and memantine [61], lifestyle modifications such as physical activity, and diet arrangement, or arrange follow-up by a clinician. Through off-policy learning, DRL can potentially learn from the disease progression trajectory of past patients and learn an optimal policy for managing AD patients on a case-by-case basis. Its immediate application can provide more timely intervention under the high patient volume, and potentially derive more systematic management protocols that can immensely benefit patients in low-resource settings.

CHALLENGES AND IMPLICATIONS OF DRL-BASED RETINAL IMAGE ANALYSIS IN AD

With its advantages of constructing an “optimal policy” through trial-and-error to solve decision-making problems, DRL is a promising tool that could be integrated with retinal imaging for AD, such as identifying participants with a high risk of AD, predicting the progression from mild cognitive impairment into AD dementia, and deciding the best intervention strategies for AD patients (Fig. 4). However, a few intrinsic challenges as well as prerequisites for its transition into clinical use should be further investigated.

Fig. 4

The potential application scenarios of deep reinforcement learning for AD.

Difficulty in defining reward function due to complexity of AD

A well-constructed reward function remains the cornerstone for a DRL algorithm’s performance, as it determines how the agent behaves. However, the complexity of AD can greatly convolute the process, especially when immediate reward may not be accessible to provide feedback and navigate the agent towards an optimal policy [56], due to various knowledge gaps in AD pathophysiology. To tackle real-world scenarios where many factors remain unknown, the application of inverse DRL is actively explored. Inverse DRL is a type of apprenticeship learning that extracts a reward function through observing demonstrations of a task performed by experts (expert trajectories) [62]. This reduces the necessity of manual specification, which is often intractable to take account of the diverse factor in medical diagnosis.

Curse of dimensionality

The curse of dimensionality has been a prominent problem in dynamic programming. It refers to the difficulty in identifying meaningful patterns when working with high-dimensional data. The rationale behind the phenomenon is that the same number of data points become sparser when distributed among different dimensions, resulting in an exponential increase in the amount of data and computational power required for ML models to obtain a reliable result [63]. In the context of medical research, dimensionality refers to the number of describing features. For instance, age, gender, medical history, the neuronal and vascular changes in the brain and the retina are all useful features for AD identification. With the extensive interindividual variation, it becomes challenging for the algorithms to generalize the correlation between each feature with the patient’s probability of having AD. Dimensionality reduction is one of the approaches to resolve the issue by projecting the states onto a lower-dimension manifold. Previous studies have utilized dimensionality reduction algorithms such as principal component analysis, kernel principal component analysis, isometric feature mapping, linear discriminant analysis and autoencoder to optimize AD classification [64–66]. The trade-off between reduced performance and faster convergence should be investigated to maximize the effectiveness of DRL in medical imaging.

Data availability

Predictive accuracy is of paramount importance in constructing a disease prediction model, which is generally ‘data-hungry’ when applied to complex, multifactorial diseases such as AD. DRL is generally applied in gaming or robotics at the current developmental stage, where new data can be generated through simulated runs in the training process. However, the data in medical research is generally obtained from individual patients, which can be costly and time-consuming to build extensive datasets. For instance, 12,949 retinal photographs from 648 AD patients and 3,240 control subjects were used to train, validate and test the DL model from Cheung’s group, while the retinal photographs from 36 AD patients and 123 control subjects were used to develop the model from Wisely’s group, where AUROCs of 0.93 and 0.861 were achieved, respectively [10, 12]. While there is no one-size-fits-all answer for the data size required to develop an AI-based model for disease detection, it is generally believed that a larger data size contributes to better accuracies of analytical models. Large longitudinal cohorts are also lacking for retinal imaging on AD patients compared to other chronic diseases [49]. Therefore, the development of more data-efficient algorithms for AD detection and management is vital for its further application.

Reproducibility and reliability of RL results

Sustainable advancement of medical research is often reliant on the reproducibility of existing works, to evaluate the legitimacy of novel discoveries and further build upon the knowledge. However, the reproduction of state-of-the-art DRL methods is often problematic, due to the lack of standard benchmark environments and the high degree of variance inherent to the mechanism of DRL [67]. To ensure straightforward interpretation of previous works, standardization of metrics in experimental reporting is necessary to reduce ambiguity. Regarding the decision-making aspect of RL, whether the model had complete access to all confounding factors such as concomitant systemic diseases and ocular diseases, shifts in hospital diagnosis and treatment standards, as well as errors in problem formulation or data processing can also affect its reliability [68]. Therefore, methods to improve the interpretability of DRL models should be actively explored to reduce the ‘black box’ phenomenon and ensure that it will behave in a regulated and intended way.

Lack of generalizability

Although many studies demonstrate associations between retinal parameters and AD [5, 35–37, 42–45], their practicality in clinical use is in doubt due to the intrinsic bias of the datasets and comorbidities of subjects that alter retinal parameters. To isolate the effect of AD on retinal changes, larger datasets from different populations are required to reduce sampling bias. The diversity of datasets could also be increased by including subjects with concomitant ocular neurodegenerative or cardiovascular pathologies, thus enhancing the generalizability of the result in real-life scenarios with complex patient conditions.

Education and infrastructure changes

Despite the numerous successes of RL in recent years, its development in the medical field has remained rather stagnant. One of the major reasons is the lack of systematic understanding of its potential implementation, as well as its advantages and shortcomings compared to other ML subtypes. Education, through reviews or seminars, are necessary for researchers and physicians to gain more exposure to the mechanisms of DRL and consider it a feasible tool for AD detection and management. Once it has accumulated its popularity, infrastructural changes should be introduced to accelerate the application of DRL in AD.

CONCLUSION

The recent explosive progress in AI offers exciting new opportunities. In this review, we have focused on the potential of DRL-based retinal image analysis for AD, most of which have not yet been deeply explored and hold great potential to enhance AD classification and progression prediction. Further improvement in generalizability, consistency, and reproducibility should be prioritized in future studies to accelerate its translation into clinical use.

Footnotes

ACKNOWLEDGMENTS

The authors have no acknowledgments to report.

FUNDING

The authors have no funding to report.

CONFLICT OF INTEREST

The authors have no conflict of interest to report.

References

(2021) 2021 Alzheimer’s disease facts and figures. Alzheimers Dement 17, 327–406.

Sperling

, Mormino

, Johnson

(2014) The evolution of preclinical Alzheimer’s disease: Implications for prevention trials. Neuron 84, 608–622.

Byun

, Park

, Lee

, Yi

, Jeon

, Choi

, Joung

, Ghim

, Park

, Kim

, Shin

, Yu

, Lee

, Group

(2021) Association of retinal changes with Alzheimer disease neuroimaging biomarkers in cognitively normal individuals. JAMA Ophthalmol 139, 548–556.

London

, Benhar

, Schwartz

(2013) The retina as a window to the brain—from eye research to CNS disorders. Nat Rev Neurol 9, 44–53.

Cheung

, Mok

, Foster

, Trucco

, Chen

, Wong

(2021) Retinal imaging in Alzheimer’s disease. J Neurol Neurosurg Psychiatry 92, 983–994.

Cheung

, Ikram

, Chen

, Wong

(2017) Imaging retina to study dementia and stroke. Prog Retin Eye Res 57, 89–107.

Cheung

, Chan

VTT

, Mok

, Chen

, Wong

(2019) Potential retinal biomarkers for dementia: What is new? Curr Opin Neurol 32, 82–91.

Snyder

, Alber

, Alt

, Bain

, Bouma

, Bouwman

, DeBuc

, Campbell

MCW

, Carrillo

, Chew

, Cordeiro

, Dueñas

, Fernández

, Koronyo-Hamaoui

, La

Morgia C

, Carare

, Sadda

, van

Wijngaarden P

, Snyder

(2021) Retinal imaging in Alzheimer’s and neurodegenerative diseases. Alzheimers Dement 17, 103–111.

, Cheung

, Milea

, Ting

DSW

(2021) Artificial intelligence and machine learning for Alzheimer’s disease: Let’s not forget about the retina. Br J Ophthalmol 105, 593–594.

10.

Wisely

, Wang

, Henao

, Grewal

, Thompson

, Robbins

, Yoon

, Soundararajan

, Polascik

, Burke

, Liu

, Carin

, Fekrat

(2022) Convolutional neural network to identify symptomatic Alzheimer’s disease using multimodal retinal imaging. Br J Ophthalmol 106, 388.

11.

Wagner

, Fu

, Faes

, Liu

, Huemer

, Khalid

, Ferraz

, Korot

, Kelly

, Balaskas

, Denniston

, Keane

(2020) Insights into systemic disease through retinal imaging-based oculomics. Transl Vis Sci Technol 9, 6.

12.

Cheung

, Ran

, Wang

, Chan

VTT

, Sham

, Hilal

, Venketasubramanian

, Cheng

, Sabanayagam

, Tham

, Schmetterer

, McKay

, Williams

, Wong

, Au

LWC

, Lu

, Yam

, Tham

, Chen

, Dumitrascu

, Heng

, Kwok

TCY

, Mok

VCT

, Milea

, Chen

, Wong

(2022) A deep learning model for detection of Alzheimer’s disease based on retinal photographs: A retrospective, multicentre case-control study. Lancet Digit Health 4, e806–e815.

13.

Tian

, Smith

, Guo

, Liu

, Pan

, Wang

, Xiong

, Fang

(2021) Modular machine learning for Alzheimer’s disease classification from retinal vasculature. Sci Rep 11, 238.

14.

Wong

DYL

, Lam

, Ran

, Cheung

(2022) Artificial intelligence in retinal imaging for cardiovascular disease prediction: Current trends and future directions. Curr Opin Ophthalmol 33, 440–446.

15.

Cheung

, Xu

, Cheng

, Sabanayagam

, Tham

, Yu

, Rim

, Chai

, Gopinath

, Mitchell

, Poulton

, Moffitt

, Caspi

, Yam

, Tham

, Jonas

, Wang

, Song

, Burrell

, Farouque

, Li

, Tan

, Ting

DSW

, Hsu

, Lee

, Wong

(2021) A deep-learning system for the assessment of cardiovascular disease risk via the measurement of retinal-vessel calibre. Nat Biomed Eng 5, 498–508.

16.

Zhang

, Liu

, Xu

, Yuan

, Cai

, Chen

, Wang

, Gao

, Nie

, Xu

, Qin

, Su

, Xu

, Olvera

, Xue

, Li

, Zhang

, Zeng

, Zhang

, Li

, Zhang

, Zhu

, Xu

, Kermany

, Zhou

, Pan

, Li

, Lai

, Chi

, Wang

, Pei

, Zang

, Zhang

, Lau

, Lam

, Zou

, Wumaier

, Wang

, Shen

, Hou

, Zhang

, Xu

, Zhou

, Wang

(2021) Deep-learning models for the detection and incidence prediction of chronic kidney disease and type 2 diabetes from retinal fundus images. Nat Biomed Eng 5, 533–545.

17.

Sabanayagam

, Xu

, Ting

DSW

, Nusinovici

, Banu

, Hamzah

, Lim

, Tham

, Cheung

, Tai

, Wang

, Jonas

, Cheng

, Lee

, Hsu

, Wong

(2020) A deep learning algorithm to detect chronic kidney disease from retinal photographs in community-based populations. Lancet Digit Health 2, e295–e302.

18.

Xiao

, Huang

, Wang

, Lin

, Zhu

, Chen

, Yang

, Xiao

, Zhao

, Li

J-PO

, Cheung

CY-L

, Mise

, Guo

, Du

, Chen

, Hu

, Zhang

, Lin

, Wen

, Liu

, Chen

, Zhong

, Lin

(2021) Screening and identifying hepatobiliary diseases through deep learning using ocular images: A prospective, multicentre study. Lancet Digital Health 3, e88–e97.

19.

Sevigny

, Chiao

, Bussière

, Weinreb

, Williams

, Maier

, Dunstan

, Salloway

, Chen

, Ling

, O’Gorman

, Qian

, Arastu

, Li

, Chollate

, Brennan

, Quintero-Monzon

, Scannevin

, Arnold

, Engber

, Rhodes

, Ferrero

, Hang

, Mikulskis

, Grimm

, Hock

, Nitsch

, Sandrock

(2016) The antibody aducanumab reduces Aβ plaques in Alzheimer’s disease. Nature 537, 50–56.

20.

Knopman

, Jones

, Greicius

(2021) Failure to demonstrate efficacy of aducanumab: An analysis of the EMERGE and ENGAGE trials as reported by Biogen, December 2019. Alzheimers Dement 17, 696–701.

21.

Cummings

, Aisen

, Apostolova

, Atri

, Salloway

, Weiner

(2021) Aducanumab: Appropriate use recommendations. J Prev Alzheimers Dis 8, 398–410.

22.

Esteva

, Robicquet

, Ramsundar

, Kuleshov

, DePristo

, Chou

, Cui

, Corrado

, Thrun

, Dean

(2019) A guide to deep learning in healthcare. Nat Med 25, 24–29.

23.

LeCun

, Bengio

, Hinton

(2015) Deep learning. Nature 521, 436–444.

24.

Plis

, Hjelm

, Salakhutdinov

, Allen

, Bockholt

, Long

, Johnson

, Paulsen

, Turner

, Calhoun

(2014) Deep learning for neuroimaging: A validation study. Front Neurosci 8, 229.

25.

Valueva

, Nagornov

, Lyakhov

, Valuev

, Chervyakov

(2020) Application of the residue number system to reduce hardware costs of the convolutional neural network implementation. Math Comput Simul 177, 232–243.

26.

O’Shea

, Nash

(2015) An introduction to convolutional neural networks. ArXiv, https://doi.org/10.48550/arXiv.1511.08458.

27.

Mouton

, Myburgh

, Davel

(2020) Stride and Translation Invariance in CNNs. In Artificial Intelligence Research, ed. Gerber A Springer International Publishing, Cham, pp. 267–281.

28.

(2017) Deep reinforcement learning: An overview. arXiv preprint arXiv:1701.07274.

29.

Otterlo

, Wiering

(2012) Reinforcement learning and Markov decision processes. In Reinforcement Learning. Adaptation, Learning, and Optimization, Wiering M, van Otterlo M, eds. Springer, Berlin, Heidelberg, pp. 3–42.

30.

Busoniu

, Babuska

, De

Schutter B

, Ernst

(2010) Reinforcement Learning and Dynamic Programming Using Function Approximators, CRC Press.

31.

Ghesu

, Georgescu

, Mansi

, Neumann

, Hornegger

, Comaniciu

(2016) An artificial agent for anatomical landmark detection in medical images. In Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, Ourselin S, Joskowicz L, Sabuncu MR, Unal G, Wells W, eds. Springer International Publishing, Cham, pp. 229–237.

32.

Zhang

, Wang

, Zheng

(2018) Deep reinforcement learning for vessel centerline tracing in multi-modality 3D volumes. In Medical Image Computing and Computer Assisted Intervention – MICCAI 2018 Frangi AF, Schnabel JA, Davatzikos C, Alberola-López C, Fichtinger G, eds. Springer International Publishing, Cham, pp. 755–763.

33.

Fercher

, Mengedoht

, Werner

(1988) Eye-length measurement by interferometry with partially coherent light. Optics Lett 13, 186–188.

34.

Niikura

, Tajima

, Kita

(2006) Neuronal cell death in Alzheimer’s disease and a neuroprotective factor, humanin. Curr Neuropharmacol 4, 139–147.

35.

Mahajan

, Votruba

(2017) Can the retina be used to diagnose and plot the progression of Alzheimer’s disease? Acta Ophthalmol 95, 768–777.

36.

Thomson

, Yeo

, Waddell

, Cameron

, Pal

(2015) A systematic review and meta-analysis of retinal nerve fiber layer change in dementia, using optical coherence tomography. Alzheimers Dement (Amst) 1, 136–143.

37.

Chan

VTT

, Sun

, Tang

, Chen

, Wong

, Tham

, Wong

, Chen

, Ikram

, Whitson

, Lad

, Mok

VCT

, Cheung

(2019) Spectral-domain OCT measurements in Alzheimer’s disease: A systematic review and meta-analysis. Ophthalmology 126, 497–510.

38.

Koustenis

A Jr

, Harris

, Gross

, Januleviciene

, Shah

, Siesky

(2017) Optical coherence tomography angiography: An overview of the technology and an assessment of applications for clinical research. Br J Ophthalmol 101, 16–20.

39.

Sweeney

, Montagne

, Sagare

, Nation

, Schneider

, Chui

, Harrington

, Pa

, Law

, Wang

DJJ

, Jacobs

, Doubal

, Ramirez

, Black

, Nedergaard

, Benveniste

, Dichgans

, Iadecola

, Love

, Bath

, Markus

, Al-Shahi

Salman R

, Allan

, Quinn

, Kalaria

, Werring

, Carare

, Touyz

, Williams

SCR

, Moskowitz

, Katusic

, Lutz

, Lazarov

, Minshall

, Rehman

, Davis

, Wellington

, González

, Yuan

, Lockhart

, Hughes

, Chen

CLH

, Sachdev

, O’Brien

, Skoog

, Pantoni

, Gustafson

, Biessels

, Wallin

, Smith

, Mok

, Wong

, Passmore

, Barkof

, Muller

, Breteler

MMB

, Román

, Hamel

, Seshadri

, Gottesman

, van

Buchem MA

, Arvanitakis

, Schneider

, Drewes

, Hachinski

, Finch

, Toga

, Wardlaw

, Zlokovic

(2019) Vascular dysfunction-The disregarded partner of Alzheimer’s disease. Alzheimers Dement 15, 158–167.

40.

Cai

, Liu

, Xiao

, Wang

, Tian

(2017) Chronic cerebral hypoperfusion promotes amyloid-beta pathogenesis via activating β/γ-secretases. Neurochem Res 42, 3446–3455.

41.

Saito

, Ihara

(2016) Interaction between cerebrovascular disease and Alzheimer pathology. Curr Opin Psychiatry 29, 168–173.

42.

Bulut

, Kurtuluş

, Gözkaya

, Erol

, Cengiz

, Akıdan

, Yaman

(2018) Evaluation of optical coherence tomography angiographic findings in Alzheimer’s type dementia. Br J Ophthalmol 102, 233–237.

43.

Cheung

, Ong

, Ikram

, Ong

, Li

, Hilal

, Catindig

, Venketasubramanian

, Yap

, Seow

, Chen

, Wong

(2014) Microvascular network alterations in the retina of patients with Alzheimer’s disease. Alzheimers Dement 10, 135–142.

44.

Frost

, Kanagasingam

, Sohrabi

, Vignarajan

, Bourgeat

, Salvado

, Villemagne

, Rowe

, Macaulay

, Szoeke

, Ellis

, Ames

, Masters

, Rainey-Smith

, Martins

, AIBL Research Group (2013) Retinal vascular biomarkers for early detection and monitoring of Alzheimer’s disease. Transl Psychiatry 3, e233.

45.

Williams

, McGowan

, Cardwell

, Cheung

, Craig

, Passmore

, Silvestri

, Maxwell

, McKay

(2015) Retinal microvascular network attenuation in Alzheimer’s disease. Alzheimers Dement (Amst) 1, 229–235.

46.

Sasongko

, Wong

, Nguyen

, Cheung

, Shaw

, Wang

(2011) Retinal vascular tortuosity in persons with diabetes and diabetic retinopathy. Diabetologia 54, 2409–2416.

47.

Cheung

CY-l

, Zheng

, Hsu

, Lee

, Lau

, Mitchell

, Wang

, Klein

, Wong

(2011) Retinal vascular tortuosity, blood pressure, and cardiovascular risk factors. Ophthalmology 118, 812–818.

48.

O’Bryhim

, Apte

, Kung

, Coble

, Van

Stavern GP

(2018) Association of preclinical Alzheimer disease with optical coherence tomographic angiography findings. JAMA Ophthalmol 136, 1242–1248.

49.

Kumar

, Koul

, Singla

, Ijaz

(2022) Artificial intelligence in disease diagnosis: A systematic literature review, synthesizing framework and future research agenda. J Ambient Intell Humaniz Comput, doi: 10.1007/s12652-021-03612-z.

50.

Mitchell

(2009) A meta-analysis of the accuracy of the Mini-Mental State Examination in the detection of dementia and mild cognitive impairment. J Psychiatr Res 43, 411–431.

51.

Kim

, Jeong

, Stiles

, Choi

(2022) Neuroimaging modalities in Alzheimer’s disease: Diagnosis and clinical features. Int J Mol Sci 23, 6079.

52.

Zhang

, Du

, Liu

(2021) A whole-process interpretable and multi-modal deep reinforcement learning for diagnosis and analysis of Alzheimer’s disease. J Neural Eng 18, 066032.

53.

Zhu

, Zhu

(2021) Learning a diagnostic strategy on medical data with deep reinforcement learning. IEEE Access 9, 84122–84133.

54.

Podhorna

, Winter

, Zoebelein

, Perkins

, Walda

(2020) Alzheimer’s diagnosis: Real-world physician behavior across countries. Adv Ther 37, 883–893.

55.

Liu

, Zhou

, Bernhardt

, Comaniciu

(2010) Search strategies for multiple landmark detection by submodular maximization. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 2010, pp. 2831–2838, doi: 10.1109/CVPR.2010.5540016.

56.

Zhou

, Le

, Luu

, Nguyen

, Ayache

(2021) Deep reinforcement learning in medical imaging: A literature review. Med Image Anal 73, 102193.

57.

Saboo

, Choudhary

, Cao

, Worrell

, Jones

, Iyer

(2021) Reinforcement learning based disease progression model for Alzheimer’s disease. ArXiv, arXiv:2106.16187

58.

Skouras

, Falcon

, Tucholka

, Rami

, Sanchez-Valle

, Lladó

, Gispert

, Molinuevo

(2019) Mechanisms of functional compensation, delineated by eigenvector centrality mapping, across the pathophysiological continuum of Alzheimer’s disease. Neuroimage Clin 22, 101777.

59.

Conrado

, Duvvuri

, Geerts

, Burton

, Biesdorf

, Ahamadi

, Macha

, Hather

, Francisco

Morales J

, Podichetty

, Nicholas

, Stephenson

, Trame

, Romero

, Corrigan

, Drug Development Tools in the Alzheimer Disease Continuum (DDT-AD) Working Group (2020) Challenges in Alzheimer’s disease drug discovery and development: The role of modeling, simulation, and open data. Clin Pharmacol Ther 107, 796–805.

60.

World Health Organization (2021) Dementia, https://www.who.int/news-room/fact-sheets/detail/dementia.

61.

Yiannopoulou

, Papageorgiou

(2013) Current and future treatments for Alzheimer’s disease. Ther Adv Neurol Disord 6, 19–33.

62.

Abbeel

, Ng

(2004) Apprenticeship Learning via Inverse Reinforcement Learning. Proceedings, Twenty-First International Conference on Machine Learning, ICML.

63.

Bellman

(2003) Dynamic Programming, Dover Publications.

64.

Aouragh

, Bahaj

, Gherabi

(2022) Comparative study of dimensionality reduction techniques and machine learning algorithms for Alzheimer’s disease classification and prediction. In 2022 IEEE 3rd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), pp. 1–6.

65.

, Yang

, Lanfranchi

(2021) Examining and evaluating dimension reduction algorithms for classifying Alzheimer’s diseases using gene expression data. In 2021 17th International Conference on Mobility, Sensing and Networking (MSN), pp. 687–693.

66.

Gao

, Ding

, Zhang

, Yu

, Zhu

, Wang

, Yang

(2022) KNN-based classification on Alzheimer’s disease data after dimensionality reduction using principal component analysis. SPIE/COS Photonics Asia, SPIE.

67.

Henderson

, Islam

, Bachman

, Pineau

, Precup

, Meger

(2017) Deep reinforcement learning that matters. arXiv, arXiv:1709.06560.

68.

Gottesman

, Johansson

, Komorowski

, Faisal

, Sontag

, Doshi-Velez

, Celi

(2019) Guidelines for reinforcement learning in healthcare. Nat Med 25, 16–18.