Abstract
Quantitative retinal imaging is essential for advanced study and clinical management of eye diseases. However, spatial resolution of retinal imaging has been limited due to available numerical aperture and optical aberration of the ocular optics. Structured illumination microscopy has been established to break the diffraction-limit resolution in conventional light microscopy. However, practical implementation of structured illumination microscopy for in vivo ophthalmoscopy of the retina is challenging due to inevitable eye movements that can produce phase artifacts. Recently, we have demonstrated the feasibility of using virtually structured detection as one alternative to structured illumination microscopy for super-resolution imaging. By providing the flexibility of digital compensation of eye movements, the virtually structured detection provides a feasible, phase-artifact-free strategy to achieve super-resolution ophthalmoscopy. In this article, we summarize the technical rationale of virtually structured detection, and its implementations for super-resolution imaging of freshly isolated retinas, intact animals, and awake human subjects.
Keywords
Impact statement
High-resolution retinal imaging is important for eye disease detection and treatment assessment. This article summarizes technical rationale and experimental implementations of virtually structured detection (VSD) for resolution improvement in retinal imaging. In conjunction with rapid line-scan imaging and digital registration to minimize the effect of eye movements, VSD enables in vivo super-resolution ophthalmoscopy of individual rod and cone photoreceptors, promising a solution for better management of eye diseases that can cause photoreceptor dysfunctions.
Introduction
The retina is responsible for capturing photons, converting light energy to bioelectric signal, and preliminary visual information processing. Given its delicate function, the retina can be frequently targeted by eye diseases that can cause vision loss and even blindness. Retinal examination plays one essential role in eye disease detection and treatment assessment. In addition to traditional fundus photography, multiple imaging modalities such as scanning laser ophthalmoscopy (SLO), 1 fluorescein angiography (FA), 2 optical coherence tomography (OCT), 3 and OCT angiography (OCTA) 4 have been established to enable quantitative assessment of the retina. In principle, a better imaging resolution corresponds to a better opportunity to reveal subtle abnormalities at the early stage of eye diseases. Therefore, spatial resolution of retinal imaging is never more than enough to improve eye disease detection and treatment assessment. Adaptive optics (AO) has demonstrated excellent capability to compensate for optical aberrations of the ocular optics for resolution improvement in fundus camera, 5 SLO, 6 and OCT 7 systems. However, the spatial resolution of conventional optical instruments is diffraction limited. The numerical aperture (NA) of the human eye is limited by the available pupil size, restricting the spatial resolution of in vivo retinal imaging.
By rejecting out-of-focus light, confocal microscopy has been well established to enhance image contrast and sectioning capability. 8 When the confocal pinhole in the confocal microscopy is small enough (e.g. ≤0.5 Airy Disk Diameter, ADD), the lateral resolution limited by the diffraction can be surpassed. Sub-ADD pinhole detection 9 and annular pupil illumination 10 have been demonstrated for resolution improvement in confocal ophthalmoscopy. However, the small pinhole in the confocal system also discards in-focus light to reduce the signal-to-noise ratio (SNR) which can be essential for imaging biological tissues, such as the retina in which the useable signal level is frequently low. Pixel reassignment technique has been explored as one alternative to confocal imaging for resolution improvement in SLO without SNR loss. 11 Without discarding any useable signal, structured illumination microscopy (SIM) has been established to break the diffraction limit for resolution improvement in light microscopy. 12 However, direct implementation of SIM for in vivo ophthalmoscopy of the human retina is challenging due to unavoidable phase-artifact because of eye movements. Randomly shifted pattern illumination has been theoretically proposed for in vivo SIM of the retina, but the experimental implementation is yet to be validated. 13
Recently, we have demonstrated the feasibility of using virtually structured detection (VSD) as one alternative to SIM for super-resolution imaging. 14 By providing the flexibility of digital compensation for the effect of eye movements, the VSD provides a feasible, phase-artifact-free strategy to achieve resolution improvement in in vivo retinal imaging.15,16 In this minireview, we summarize the basic principle of SIM, technical rationale of VSD, and its implementation in scanning laser microscopy (SLM) of freshly isolated retinas, and SLO for super-resolution imaging of intact animals and awake humans.
Basic principle of SIM for resolution improvement
SIM has been developed to surpass the optical resolution limit by a factor of two. 12 As shown in Figure 1(a), the resolution improvement in SIM is based on the recording and processing of a sequence of images with structured pattern illumination at different orientations and phases. As shown in Figure 1(b), the SIM image contrast and resolution can be significantly improved, compared to conventional light microscopy. 17

(a) Basic principle of SIM. (a1) Moiré fringes, i.e. the dark vertical stripes, are observed in the overlap region of two superposed line patterns. (a2) Spatial resolution of conventional light microscopy is diffraction limited. The observable region of reciprocal space produced by an objective is limited at its edge by the highest spatial frequencies (0.61λ/NA). (a3) A sinusoidally striped illumination pattern has only three Fourier components, i.e. the 0th (red dot) and ±1st (yellow dots) order diffraction components. If the pattern is at the limit of resolution, the 1st order spots fall at the very edge of the “observable field”. (a4) By frequency mixing, the observable region contains, in addition to the normal image of spatial frequencies (center circle), two new offset frequency images, each centered on the edge of the original field. The offset images contain spatial frequencies that are not observed in conventional light microscopy. (a5) From a set of images prepared from three phases at each of three orientations, a super-resolution image can be generated that has twice the spatial resolution in conventional light microscopy. (b) Comparative conventional fluorescence light microscopy (b1) and SIM images (b2) of a living HeLa cell stained with MitoTracker Green. Source: Reprinted from Gustafsson 12 and Murphy et al. 17 (A color version of this figure is available in the online journal.)
SIM has been extensively used for high resolution microscopy of biological cells and tissues.18,19 However, its practical implementation for imaging non-stationary targets is challenging because the multiple images with different orientation and phase illumination are required (Figure 1(a5)). Any movement between the sequential images can damage the phase relationship for super-resolution reconstruction. In theory, SIM can also be realized in a point scanning system through spatiotemporal modulation, either by modulating light source intensity in the illumination arm with a digital camera (Figure 2(a)) or by placing a moving mask in the light detection arm with a single-channel photodetector (Figure 2(b)). 20 The purpose of the spatiotemporal modulation of the point scanning illumination in Figure 2(a) is to produce a set of images with equivalent structured illumination in SIM for super-resolution reconstruction. The spatiotemporal modulation of the detection path illustrated in Figure 2(b) indicates that the structured modulation in illumination path is equivalent to that in the detection path. 20 In Figure 2(a) and (b), the objective is shared for both light illumination and detection. If we assume the optical property, i.e. optical transfer function (OTF), is the same for illumination and detection pathways, the spatiotemporal modulations in illumination path in Figure 2(a) and detection path in Figure 2(b) can be equivalent. Given only a single point is illuminated at each time point, crosstalk noise among adjacent volumes can be reduced, compared to wide-field SIM, and thus image contrast and penetration capability can be further improved. However, the spatiotemporal modulation of the illumination/detection arm still requires nine frames to reconstruct a high-resolution image. Sample motion during acquisition of these nine frames leads to reconstruction artifacts, making it difficult for imaging non-stationary targets such as the retina with inevitable eye movements.

(a) Schematic of super-resolution SLM through spatiotemporally modulated light illumination. The peak intensity of the focused excitation spot is temporally modulated in a controlled manner, while the spot scans across the specimen. The epi-collected fluorescence signal, after spectrally separated from the excitation light by a dichroic and/or a filter, is imaged onto a nondescanned camera. Each complete frame scanning with a particular modulation sequence builds up one picture cumulatively. Several such pictures are generated with a set of modulation sequences that are phase-shifted in space in order to produce images equivalent to structured illumination required in SIM for super-resolution reconstruction. (b) Schematic of super-resolution SLM through modulated light detection. A mask with modulated transmittance is positioned at the image plane of the epi-fluorescence signal, and a single element large-area nondescanned detector such as a PMT is placed behind the mask. For each scanning position, the detector sums up all transmitted signals through the mask and assigns the integrated intensity to the single pixel corresponding to the current scanning position. After each complete frame is scanned with a particular mask, one PMT-picture is built up. Several such PMT-pictures are generated with a set of masks whose modulated transmittance is phase-shifted in space, in order to produce images equivalent to structured illumination required in SIM for super-resolution reconstruction. Source: Reprinted from Lu et al. 20 (A color version of this figure is available in the online journal.)
Technical rationale of VSD-based super-resolution SLM
In principle, a digital camera can be used to record the 2D light profile, corresponding to a point illumination, and thus VSD can be implemented for virtually spatial modulation. In other words, the VSD implementation on 2D light profiles is equivalent to the function of the combined system that includes the photomultiplier tube (PMT), the moving mask, and corresponding laser scanning modulation required in Figure 2(b). 20 Differently, the VSD implementation on 2D light profiles can overcome the technical challenge of super-resolution imaging of non-stationary targets. If the camera speed is fast enough, i.e. kHz, the intra-frame movement within each 2D light profile can be ignored. The inter-frame movement can be digitally compensated by precise image registration among raw 2D light profiles, before implementing VSD for super-resolution reconstruction. Therefore, VSD-based super-resolution imaging holds the promise for in vivo retinal imaging to tackle the problem of eye movements.
Figure 3(a) shows an optical diagram of the VSD-based SLM.
14
As conventional SLM, the illumination light is focused to a single spot on the sample. The X- and Y- scanners steer the spot across the sample to produce a raster scanning pattern. The light reflected from the sample is de-scanned by the same scanners and focused to the detection apparatus. Instead of using a single-pixel detector, such as one avalanche photodiode (APD) or PMT, for collecting total reflected light from the sampling volume in conventional SLM, a 2D digital camera is used to map the light profile patterns of each sampling point (Figure 3(b)). For a super-resolution image with frame size

VSD-based super-resolution SLM. (a) Optical diagram of the SLM system. SLD: superluminescent laser diode; CO: collimator; BS: beam splitter; L1-L3: lens; and OB: objective. (b) Methodology of VSD. A 2D reflectance profile (b1) is recorded, corresponding to each point illumination. Digital sinusoidal modulations can be implemented to the 2D image profile at orientations θ = 0° (b2), 120° (b3), and 240° (b4). (c) The modulated 2D reflectance profile of each sampling point in (b) is used to construct the spatially modulated transmittance images at three orientations (θ1 = 0°, θ2 = 120°, and θ3 = 240°) with three phase shifts (α1 = 0°, α2 = 120°, and α3 = 240°). The spatially modulated transmittance images are equivalent to the images with structured illumination required in SIM for following super-resolution reconstruction. (d) Comparative imaging of one freshly isolated frog retina with conventional SLM (d1) and VSD-based super-resolution SLM (d2). (d3) The white curve and the red curve are normalized intensity profiles along the white line in (d1) and the red line in (d2), respectively. Source: Modified from Lu et al. 14 and Zhi et al. 30 (A color version of this figure is available in the online journal.)
In the spatial domain, a point spread function (PSF) can be used to evaluate the resolution of one imaging system. For the imaging system in Figure 3(a), we assume that the PSFs of the illumination (
In the Fourier domain, the corresponding cutoff frequency of the PSF can be expressed as
In other words, only frequencies below the cutoff frequency are able to pass through the conventional SLM system
In VSD, at the scanning position
By integrating equation (4) we have
Considering
Equation (7) exactly represents the acquired image of the conventional wide-field SIM in which the modulation function
In other words, the theoretical resolution is enhanced by a factor of two, which is equivalent to SIM.12,14,20
We used freshly isolated frog (Rana pipiens) retinas for a functional test of VSD-based SLM. According to equation (2), given that λ = 830 nm, the lateral resolution of the conventional SLM 5X objective with 0.1 NA was 5 µm. Given that the diameter of frog photoreceptors varies from 1 to 8 µm (rods ∼5–8 µm, and cones ∼1–3 µm),21,22 the conventional SLM with the theoretical lateral resolution of 5 µm resolved photoreceptors partially, while VSD-based super-resolution imaging delineated individual photoreceptors unambiguously (Figure 3(d)). In addition to the super-resolution SLM, we also demonstrated the feasibility of using VSD for resolution improvement in OCT. 23
In vivo super-resolution ophthalmoscopy of animal retinas
VSD-based SLM in Figure 3 has been demonstrated for resolution improvement in imaging of freshly isolated retinas. 14 The VSD method requires neither physical modulation of the light source intensity in the illumination arm nor a physical mask in the light detection arm. Without the system complexity of SIM for precise light phase and pattern controls, the VSD promises an easy, low-cost, and phase-artifact-free strategy to achieve super-resolution imaging. However, practical application of the VSD for in vivo retinal imaging is still challenging due to the limited frame-speed. For the single-point scanning prototype in Figure 3, ∼160 s is required for collecting 2D reflectance profiles to produce a super-resolution image with frame size of 400 × 400 pixels.
In order to minimize the effect of eye movements, line-scanning strategy has been demonstrated for high-speed in vivo retinal imaging. 24 Recently, we demonstrated the feasibility of the line-scanning strategy for rapid VSD-based super-resolution imaging of both isolated retinas 25 and intact frog eyes.26,27 Figure 4 shows the experimental setup and representative in vivo super-resolution images captured with 127 frames/s speed. As shown in Figure 4(b), the VSD-based SLO enabled unambiguous observation of individual retinal photoreceptors (Figure 4(b1)) and ganglion fibers (Figure 4(b2)). 26 By providing high-spatial (µm) and high-temporal (∼10 ms) resolutions, the VSD-based super-resolution SLO has been used to validate in vivo monitoring of transient retinal phototropism evoked by oblique light stimulation. 27

VSD-based super-resolution SLO. (a) Photographic illustration of the experimental setup. (b) Representative in vivo super-resolution imaging of photoreceptor (b1) and ganglion fiber (b2) layers. (c) In vivo monitoring of photoreceptor movement due to oblique light stimuli. (c1) The time-magnitude courses of photoreceptor movement corresponded to three stimulus intensity levels. Each trace is an average of 12 datasets recorded from six different retinal samples. The colored area that accompanies each trace illustrates the standard deviations. Shaded area represents the 500-ms stimulation period. PR: photoreceptor. The means and standard deviations of peak amplitude (c2) and time-to-peak (c3) of the traces in (c1). Source: Reprinted from Liu et al. 26 (b) and Lu et al. 27 (c). (A color version of this figure is available in the online journal.)
In vivo super-resolution ophthalmoscopy of human retinas
As shown in Figure 4, VSD has been demonstrated for in vivo super-resolution retinal imaging in anesthetized frogs. However, practical implementation of VSD for in vivo retinal imaging of awake humans is challenged by two factors: (1) involuntary eye movements;28,29 and (2) uncertain cut-off frequency of the ocular optics of each subject. In principle, increased imaging speed can minimize the intra-frame blurs, and digital image registration can be applied to raw image sequence to compensate for the inter-frame movements before the VSD processing for super-resolution reconstruction. Accurate assessment of the cut-off frequency of ocular optics, i.e. choosing a proper frequency shift in VSD processing is essential. An insufficient frequency shift will result in decreased high-frequency components being involved for image reconstruction and result in reduced resolution improvement. In contrast, an excessive frequency shift will introduce high frequency noises and artifacts to degrade the image quality. For our preliminary study with anesthetized frogs (Figure 4), which are known to have relatively low aberration, empirical frequency shift was adopted for VSD processing.26,27 However, optical property of human subjects can be significantly variable among different subjects, making the selection of a proper frequency shift difficult. Recently, we developed one objective method to retrieve the modulation transfer function (MTF) of the imaging system to enable an objective identification of the cut-off frequency of ocular optics and validated it for in vivo retinal imaging of awake human subjects. 15
Figure 5 illustrates the experimental setup, i.e. a line-scan SLO system. A near-infrared SLD with 830 nm central wavelength and 60 nm bandwidth was used as the light source. The line illumination was achieved by using a cylindrical lens (CL, Figure 5(b)) to condense the illumination light in one dimension. The galvo scanning mirror was placed at the conjugate plane of the pupil to minimize the vignetting effect. The reflected light from the retina with line-illumination was de-scanned by the scanning mirror and recorded by a high-speed CMOS camera (FastCam Mini Ax50, Photron) as a 2D line-profile pattern (Figure 5(b)). The line profile of the illumination was along the X-direction, and the scanning was performed in the Y-direction. The 2D line-profile patterns were acquired at a speed of 25,000 Hz to minimize the intra-frame blur correlated with eye movements.

Photograph (a) and schematic diagram (b) of the super-resolution SLO. SLD: superluminescent diode; CO: collimator; CL: cylindrical lens; L1–L4: lens; BS: beam splitter; SM: scanning mirror. The illumination light is projected onto the retina as a focused line. A representative 2D line-profile pattern is shown in the dashed window. Source: Reprinted from Lu et al. 15 (A color version of this figure is available in the online journal.)
Human subjects, without known ocular diseases, were recruited for functional validation of the system in Figure 5. The experiment was conducted in a room with minimized ambient light to maximize the available pupil size, without the requirement of pharmacologically pupil dilation. Digital registration of sequential line-profile patterns was implemented to compensate for inter-frame shifts correlated with eye movements before the VSD processing. Basic principle of VSD processing has been described previously.14,25,30 Here we focus on the process of retrieving the MTF of the imaging system. As the modulation is achieved by using digital mask, two modulation phases (α = 0 and π/2) can be employed to solve equation (8)
Taking the case of
As
Figure 6(a) and (b) illustrate representative MTF (

(a1) Representative spectra of
Figure 7(a) and (b) show the representative equivalent wide-field (EWF) image and the VSD image reconstructed from the same dataset acquired at ∼3° eccentricity from fovea. Compared to the EWF image (Figure 7(a)), the VSD image (Figure 7(b)) revealed enhanced resolution and contrast to resolve individual photoreceptors and visualize the projection of micro-capillary structures (yellow arrowheads in Figure 7(b1)). Based on the high speed VSD image sequence, motility processing (standard deviation map of time-lapse VSD images) was applied to further enhance the contrast of individual photoreceptors. The visibility of photoreceptor boundaries was significantly improved in Figure 7(c). Because the signal intensity of rod photoreceptors was typically weaker compared to cones, logarithmic scale display was able to enhance the visibility of rod photoreceptors (yellow arrows in Figure 7(d)). Surprisingly, sub-cellular structure, i.e. the dark center region (red arrows in Figure 7(d)), was observed in some cone photoreceptors. We speculate that the sub-cellular dynamics may reflect motility of cell membrane, disc, or waveguide property of retinal photoreceptors. Further investigation is required to understand the mechanism of the sub-cellular motility properties, which may provide new biomarkers of functional assessment of retinal photoreceptors.

EWF (a1) and VSD-based super-resolution (b1) images reconstructed from the same dataset with center at ∼3° eccentricity from the fovea. Arrowheads in (b1) identify the projection of retinal vasculatures on the photoreceptor layer. (c1) and (d1) Standard deviation map of a time-lapse recording of 80 super-resolution images in linear and logarithmic scale, respectively. Images are cropped to 400 × 512 pixels for display. (a2), (b2), (c2), and (d2) show magnified views of the areas specified by the yellow dashed rectangles in (a1), (b1), (c1), and (d1), respectively. The yellow and red arrowheads in (d2) specified rod photoreceptors, and the sub-cellular structures at the center of photoreceptors revealed by motility processing. Scale bars represent 50 μm. The scanning direction is along Y-direction. Source: Reprinted from Lu et al. 15 (A color version of this figure is available in the online journal.)
Figure 8 shows representative montaged images to demonstrate the capability of super-resolution imaging at retinal regions with different eccentricities relative to the foveal center (Asterisk, Figure 8).

Montaged super-resolution (a) and motility processed images (b) of photoreceptor mosaics. Asterisk points to the fovea center. Scale bars represent 100 μm. Reprinted from Lu et al. 15 (A color version of this figure is available in the online journal.)
Discussion
In summary, VSD-based super-resolution SLM and SLO have been developed and validated for resolution improvement in both in vitro and in vivo retinal imaging. Instead of using a single-channel photodetector, such as PMT in a conventional SLM system, a digital camera is employed to capture 2D light profiles corresponding to individual point illumination. VSD can be implemented to the 2D light profiles to achieve equivalent information in SIM to surpass the diffraction limit of optical resolution in the lateral direction.
Wide-field SIM has been extensively used for high resolution study of biological cells and tissues. Practical implementation of wide-field SIM for in vivo retinal imaging is difficult, due to inevitable eye movements. Because multiple images are required to be collected with different illumination orientations and phases (Figure 1(a)), any shift among the sequential images can distort the location/phase relationship for super-resolution reconstruction. The demonstrated VSD method uniquely overcomes the technical challenges, i.e. eye movement and uncertain cut-off frequency of the ocular optics, for in vivo retinal imaging.
In order to minimize the effect of the eye movements, a rapid line-scanning was adopted for the super-resolution SLO in Figure 5. By providing 25,000 Hz speed for recording 2D profiles corresponding to focused line illumination, the intra-frame blur due to eye movements within the 40 µs time interval can be omitted. The inter-frame movement could be digitally compensated by precise image registration among sequential 2D light profiles, before implementing VSD for super-resolution reconstruction. In order to pursue robust in vivo super-resolution imaging of human retina, an objective method has been developed to derive MTF from digital reflectance profiles to enable quantitative estimation of the cut-off frequency required for reliable VSD processing. In conjunction with rapid line-scan imaging and digital registration to minimize the effect of eye movements, VSD enabled resolution improvement for unambiguous observation of individual retinal photoreceptors without the involvement of AO (Figure 7). Dynamic motility processing further enhanced the visualization of individual photoreceptors and allowed differential identification of individual rod and cone photoreceptors (Figures 7 and 8). Interestingly, sub-cellular dark centers (red arrows in Figure 7(d)) were observed in some cone photoreceptors. Because not all cone photoreceptors show such sub-cellular dark centers, we speculate that the sub-cellular dynamics may reflect different functions relative to the motility property of cell membrane, disc, or waveguide property of individual cone photoreceptors. Further investigation is required to understand the mechanism of the sub-cellular motility properties, which may provide new biomarkers of functional assessment of cone photoreceptors.
As a proof-of-concept study, the system in Figure 5 was constructed with all commercially available components. In principle, further optimization of the system, such as customized optical design, can further improve the imaging performance and promise a cost-effective method to foster clinical deployments of quantitative analysis of retinal photoreceptors.
The high-speed and super-resolution imaging with single photoreceptor resolution also promises a feasible solution to advance functional intrinsic optical signal (IOS) imaging,31,32 also termed as optophysiology 33 or optoretinography (ORG),34–36 for objective assessment of retinal physiology. It is known that eye diseases, such as age-related macular degeneration (AMD),37–39 retinitis pigmentosa (RP),40,41 diabetic retinopathy (DR),42,43 can cause photoreceptor dysfunctions. Early detection and therapeutic assessment are essential steps to prevent vision loss due to eye diseases. Stimulus-evoked IOS, which is tightly correlated with the activation phase of phototransduction in retinal photoreceptors, has been demonstrated.44,45 Recent studies suggested that the photoreceptor-IOS attributes to transient outer segment (OS) shrinkage in stimulus activated photoreceptors.46,47 Therefore, high resolution is essential for robust detection of the localized photoreceptor-IOS due to transient OS change. Using the VSD-based super-resolution imaging system, transient photoreceptor movement has been demonstrated in intact frog eyes 27 (Figure 4). We anticipate that further development of the VSD-based super-resolution ophthalmoscopy may provide a feasible strategy to enable functional assessment of human photoreceptors at cellular resolution.
Conclusions
VSD provides a new pathway to achieve super-resolution SLO for high resolution imaging of the retina. In conjunction with rapid line-scan imaging and digital registration to minimize the effect of eye movements, VSD enabled resolution improvement for unambiguous observation of individual retinal photoreceptors without the involvement of AO. We anticipate that further development of the VSD-based imaging system can provide an easy, low-cost solution to foster clinical deployment of quantitative imaging of retinal photoreceptors, which are essential for advanced study and diagnosis of AMD, RP, and other eye diseases that are known to damage photoreceptors, particularly rod cells at the early stages.37,39,48
Footnotes
AUTHORS’ CONTRIBUTIONS
XY, RL, BW, and YL drafted a first version of the manuscript, and TK contributed to manuscript revision and figure preparation.
DECLARATION OF CONFLICTING INTERESTS
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
FUNDING
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported in part by NIH grants R01 EY023522, R01 EY030101, R01EY030842, R01EY029673, P30 EY001792; by Richard and Loan Hill endowment; by unrestricted grant from Research to Prevent Blindness.
