Abstract
Background:
A number of theoretical accounts have been put forward to explain the ability to simultaneously track multiple visually indistinguishable objects over a period of time. Serial processing models of visual tracking focus on the maintenance of the spatial locations of every single item over time. A more recent mechanism describes multiple object tracking as the ability to maintain a higher order representation of an abstract spatial configuration built by the illusory connection of the tracked items through their transition.
Objective:
The current study investigates the correspondence between these serial and parallel tracking accounts and the right hemispheric specialization for the space-based vs. left hemispheric for object-based attentional processing.
Methods:
Electrophysiological brain responses were recorded in two groups of patients with right- and left hemispheric lesions while performing in a multiple object tracking task.
Results:
The results suggest a failure to distinguish single item information for the right hemispheric patients accompanied by the absence of a known electrophysiological marker associated with single item tracking. Importantly, left hemispheric patients showed a graded behavioral and electrophysiological response to probe stimuli as a function of the congruence of the probe with the relevant target stimuli.
Conclusions:
The current data suggest that the differential contribution of serial and parallel tracking mechanisms during object tracking can partly be explained by the different functional contributions of the right and left brain hemispheres.
Introduction
Healthy subjects show a surprisingly effortless ability to visually track a number of randomly moving objects over a time of several seconds (Pylyshyn, 1989). In the classical multiple object tracking (MOT) task, subjects are presented with several visually ind-istinguishable objects and are required to keep track of a subset of them while they move around the vi-sual field, before identifying all relevant targets (Pylyshyn, 2004; Scholl, Pylyshyn, & Feldman, 2001). The normal student population is reported to be able to track up to five objects simultaneously with an accuracy of around 80% (Alvarez & Franconeri, 2007; Franconeri, Alvarez, & Cavanagh, 2013; Pylyshyn, 1989). A number of theories have been put forward in order to explain this feature of the visual system including pre-attentive indexing (Pylyshyn, 1989), a rotating attentional spotlight (Hogendoorn, Carlson, & Verstraten, 2007) and mul-tifocal attention (Cavanagh & Alvarez, 2005). Common to all these theories is the reliance of the respective process on all individual spatial locations of the relevant targets as well as a monotonous increase of resource requirements with higher task demands.
However, a further feasible tracking mechanism has been proposed, operating solely on the configurational information of the convex hull, formed by the entire set of relevant items during motion (Yantis, 1992). Behavioral data supports the notion that such a high-order object representation would be maintained partly outside the usual retinal reference space, since local spatiotemporal transformations of individual items do not impair tracking performance in such cases (Liu et al., 2005). Further results show that serial tracking processes operating on individual spatial item information and mechanisms maintaining a morphing high-order object representation of the entire object set might exist simultaneously (Merkel, Hopf, Heinze, & Schoenfeld, 2015; Merkel et al., 2014). Electrophysiological data show an early visual component indicating an object-based, parallel process operating on the entire target set only, while a latter visual component seems to modulate with the congruity of the relevant target items and a set of probe items (Merkel et al., 2014), thus indicating an individual item-based tracking mechanism.
Interestingly, several studies suggest an application of the object-based tracking mechanism mostly for experienced subjects (Merkel, Hopf, & Schoenfeld, 2017; Oksama & Hyönä, 2008; Yantis, 1992). Nevertheless, the relative contributions of parallel and serial tracking processes determining tracking performances are still unknown.
Data of hemispheric asymmetry suggests a dissociation between object-based and space-based attentional mechanisms within the left and right parietal cortex respectively (Egly, Driver, & Rafal, 1994). Moreover, a general precedence of the right hemisphere over the left hemisphere in attentional function has been suggested (Mesulam, 1981). In this context, the application of serial and parallel tracking processes might translate into a difference in relative resource contribution of right and left attentional networks.
The current study aims to resolve the role of left and right hemispheres in different multiple object tracking mechanisms by having the MOT paradigm performed by left or right hemispheric patients. Electrophysiological markers for object-based tracking and single item tracking (Merkel et al., 2014) have been identified previously and are used in the current study to operationalize the contribution of object-based and space-based tracking mechanisms in left and right hemispheric patients.
Methods
Subjects
Twenty-one stroke patients in their chronic state were recruited during rehabilitation in the Kliniken Schmieder hospital Heidelberg. In all patients a medial artery infarction was diagnosed and determined as cause for lesions within the left hemisphere in 11 patients (5 female) and the right hemisphere in 10 patients (5 female). The age of all patients ranged from 29 to 79 years of age (M = 57.8, SD = 12.3) and did not differ between both lesion groups (t(19) = 1.04, p > 0.3). All subjects were right-handed and determined to have retained their vision across the full visual field. Patients gave written informed consent for the experiment, which was approved by the local ethics board. All subjects were naïve to any form of multiple object tracking task.
Multiple object tracking task
For the MOT experiment, stimuli were presented against a black background subtending 21°×21° of visual angle. A central fixation cross of 0.5° was present throughout the experiment. Each trial started with the presentation of six visually identical items (black squares, 1°×1° with white outline) distributed randomly across the background (Fig. 1). Next, for 2 seconds three of the six items flashed three times, designating them relevant targets for the current trial to be tracked. Subsequently, all six items moved within the display on predefined semi-random, non-occluding, independent trajectories for 2800–3200 ms (180+-12 frames). Upon stopping, three of the six items color changed to white, designating them probes. The task for the subjects was to indicate by button press whether all three probes matched the relevant tracked targets or not. Participants had to press the right arrow button with their middle finger if all three probes matched all targets fully (Match3). On the other hand, if only two, one or none of the probes matched the tracked items, the left arrow button had to be pressed with the index finger (Match2, Match1, Match0).

Experimental design. a) Three out of six visually identical objects were assigned targets at the beginning of each trial and had to be kept track of throughout a motion period of 2800 ms –3200 ms. Once movement ceased, three objects were probed and subjects had to decide whether b) none, one or two of those objects were targets (Match0, Match1, Match2). In this case a response using the index finger was required. If all probe items matched the target items (Match3), subjects had to respond with the middle finger.
Importantly, the items trajectories were calculated in a way that the last frame (during movement cas-eation) was visually identical across all trials. Thus, not only were the locations of all items during that last frame constant but the identity of the probes as well. Differently matching conditions between targets and probes were created by varying the assignment between target and probe items at the beginning of each trial. This design allows for a better comparison of electrophysiological responses towards differently matching probe trials, since the stimulus evoked potential, time-locked to that last frame, would be identical (Merkel et al., 2014). In short, semi-random non-occluding trajectories sharing an identical final position were calculated offline by creating a set of paths with identical starting positions, each ‘chasing’ a random location, that changes position every 2–60 frames, and reversing their final course. For each of the six final locations a set of several thousand trajectories were calculated offline and subsequently merged randomly into groups of six trajectories, each comprising one trial with the restriction that at each frame path locations can never fall below 2° in order to avoid occlusion (see Merkel, Hopf, & Schoenfeld, 2020).
Each patient performed 200 trials overall. Each of the 4 conditions (Match3, Match2, Match1, Match0) was presented 50 times, randomly distributed thro-ughout the session. Subjects were seated comfortably in a chair within a dimly lit room and stimuli were presented on a 17 inch LCD screen placed 70 cm in front of the patients. Subjects were asked to fixate the cross in the middle throughout the experiment. They were instructed to keep track of the three target items throughout each trial and respond to the probe as quickly and accurately as possible. Before the session, each subject was given a number of practice trials to get familiar with the task.
Continuous electroencephalographic signals were recorded from each subject throughout the object tracking task using a wireless dry electrode Nielsen Tele Medical F1 v2.0 system (Nielsen Tele Medical GmbH, Magdeburg, Germany). Signals were recorded from 19 electrodes placed according to the 10/20 system with a sampling rate of 500 Hz and an online bandpass filter of 0.53–70 Hz. Impedances were kept below 5 kOhm–10 kOhm. The raw signal was re-referenced to the average of all electrodes. Epochs were extracted from the signal in a time window between –200 ms to 500 ms timelocked to the onset of the probe stimulus and subsequently baseline corrected. An artefact reduction was performed on the epoched dataset using an independent component analysis. The runica-function as implemented in EEGLAB for Matlab was used for the decomposition of the signal. Components containing ocular artefacts were removed manually based on their scalp topography and characteristical temporal pattern. The artefact corrected, epoched signal was additionally subjected to a peak-to-peak amplitude rejection criterion of 500 uV. Event-related potentials were finally averaged across all epochs for each of the four target-probe-match conditions. Mean ERP-amplitudes were extracted for the N290 time window (Merkel et al., 2014) between 200 ms and 300 ms within O1 and O2, in which the component reached its maximum amplitude, for each subject.
Voxel-based lesion mapping (VBLM)
In order to compare the extent and location of the lesions caused by the infarction between the left and right hemisphere, a lesion mapping was performed for each patient. For each but one patient structural images of different modalities were available from the time of diagnosis. For six patients CT images were used to locate the lesion while for another 14 T2-weighted FLAIR images were used to determine the damage caused by the media infarct. First, the clusterize-toolbox for SPM (de Haan, Clas, Juenger, Wilke, & Karnath, 2015) was used for a semiautomatic lesion segmentation within each structural image by manually selecting clusters of hypo- or hyperintensities that demarcate the observable structural damage (Clas, Groeschel, & Wilke, 2012). Thresholds for those hypo- and hyperintensity clusters were adjusted for each slice separately.
Subsequently, a lesion masked normalization was performed using the clinical toolbox for SPM (Rorden, Bonilha, Fridriksson, Bender, & Karnath, 2012) which applies a lesion masked normalization (Brett, Leff, Rorden, & Ashburner, 2001) for both CT and MRI imaging modalities. This toolbox provides a specialized template applicable for an aged stroke population in both modalities (Rorden et al., 2012) and is therefore optimally suited for the current application. As a result a normalized lesion mask was available for each patient save one, for which no structural image was available. To determine a potential difference in the extend of the lesions between the two groups an unpaired t-test was performed on the number of voxels within the lesion masks. Additionally, lesion maps for both groups were created to examine differences in their spatial pattern.
Analysis
Systematic variations in the response patterns towards the different match conditions for the two patient groups were examined using a two-factorial mixed model repeated measures ANOVAe. Since the distribution of the reaction times differed significantly between performance groups (levene: F(1,19) = 50.708, p < 0.001) a non-parametric Wilcoxon rank-sum test was employed to determine any overall performance differences indicated by reaction time. Within-subject differences in reaction time and error rates between the match conditions were additionally tested for each group separately using four-level repeated-measures ANOVAe. In order to determine systematic differences in the mean ERP-responses towards the different match conditions for the two patient groups a two-factorial rANOVA with factors group (left, right) and congruity (Match0, Match1, Match2, Match3) as well as rANOVAe within each of the patient groups were performed.
Results
The results seem to show a general slower re-sponse for right-hemispheric patients towards the probe stimulus than the left-hemispheric patients (Fig. 2a). However, a non-parametric Wilcoxon rank- sum test did not reveal a significant effect (U(11,10) = 42, p > 0.38). Furthermore, performing a rANOVA for each group separately confirmed no systematic variation of reaction times between the four match conditions neither for the left (F(3,30) = 2.617, p > 0.12, η2p = 0.207) nor right (F(3,27) = 0.644, p > 0.53, η2p = 0.067) hemispheric patients. The error rates did differ across the match conditions in both groups (F(3,57) = 10.5, p < 0.002, η2p = 0.356) but not between groups (F(1,19) = 0.97, p > 0.75, η2p = 0.005). Interestingly, both groups showed a different pattern of error rates towards the four match conditions (F(3,57) = 5.02, p < 0.029, η2p = 0.209). The left hemispheric patients showed a linear increase in errors with increasing match between target and probe items, while the right lesion group showed a sole steep increase in errors only for the full match condition (Fig. 2b). Note that the task was a two alternative forced choice task, requiring the same response for all partly matching probes and a different response for the fully matching probe. Figure 2b displays the individual responses of all subjects towards all match conditions, as well. The response pattern of the right hemispheric group is of particular interest, since it suggests that the ‘partly matching’ response was applied mostly as a default response irrespective of general performance, being pressed throughout the experiment in 100% of the trials by three right hemispheric patients. In general no variation between the partly matching probes could be observed within this group (F(2,18) = 0.347, p > 0.6, η2p = 0.037). However, within the right hemispheric group the full match condition did elicit correct responses in some subjects, thus having an error rate < 1 (t(9) = –3.83, p < 0.004, η2p = 0.62) suggesting that some patients were able to clearly identify a full match trial.

Behavioral results. a) Patients with right hemispheric lesions tended to be slower to respond to any of the match condition compared to the left hemispheric patients. Four of the ten patients with lesions on the right side showed substantially slower response times. b) Right and left hemispheric patients showed a different pattern of responses towards the match conditions. For the left hemispheric patients, between and within subject variability in the error rates for partly matching probes suggests a process operating on the individual tracked items. In contrast, the error rates of the right hemispheric patients suggest the adoption of using the ‘partly matching’ response as a default response throughout the experiment. Patients excluded from the VBLM are marked.
The pattern of the course of the mean ERP res-ponse amplitudes towards the different match probes for the two groups shares some of the properties of the error rate patterns (Fig. 3a). ERP amplitudes just failed to significantly vary across match conditions (F(3,57) = 3.003, p = 0.052, η2p = 0.136). The two-factorial rANOVA did not reveal a significant interaction with the patient group (F(3,57) = 1.707, p > 0.17, η2p = 0.08). No variation in the event related potential could be observed between match conditions in the right hemispheric group (F(3,27) = 0.279, p > 0.76, η2p = 0.03). Mean amplitudes differed how-ever within the left hemispheric patients (F(3,30) =3.877, p < 0.034, η2p = 0.279) showing more negative potentials during probes that are more congruent with the tracked targets (Fig. 3b).
Comparing the spatial distribution of the lesion maps between groups one can see a similar spatial extension of the lesions within the left and right STG respectively (Fig. 4). Although in two patients (pat1, pat2) with right hemispheric lesions the damage involves parts of the IPL, the spatial extent between the groups was not statistically significant (t(18) = 1.53, p > 0.14, η2p = 0.115). After exclusion of the two right-hemispheric patients with parietal lesions, the lesion patterns between groups were virtually identical (Fig. 5a) also with respect to the size (t(16) = 0.88, p > 0.39, η2p = 0.042). Figure 5b illustrates that the right-hemispheric parietal lesions in those two subjects did not drive the distinctive behavioral and electrophysiological patterns observed in that lesion group. The right-hemispheric pattern continued to show a different response towards the full match condition in an interaction effect with the left-hemispheric pattern that just felt short of exhibiting significance (F(3,51) = 3.39, p = 0.072, η2p = 0.166). Likewise, no variation in match conditions between ERP-responses between 200 ms and 300 ms was observed for the right-hemispheric patients (excluding the two subjects exhibiting parietal lesions) (F(3,21) = 0.69, p > 0.51, η2p = 0.089).

Electrophysiologic responses. a) Occipital electrode sites showed a different response pattern towards the probe display within the time range between 200 ms and 300 ms for right and left hemispheric patients. b) While the probe display did not elicit a differential visual evoked response in patients with lesions in the right hemisphere, patients with left hemispheric lesions showed an increase in the negative potential with higher correspondence between probe and target set.

Voxel-based lesion mapping. Lesions of all patients (save one) were semi-automatically delineated by manually thresholding and selecting automatically segmented hyper- and hypointensities within structural images of each patient. Lesion masked normalization was subsequently used to normalize those lesions onto a standardized brain template. The degree of anatomical overlap of the patient groups with lesions in the right- and left hemisphere are visualized separately in red and blue respectively. The main focus for both groups is centered within the insula and superior temporal gyrus. A second, more distributed focus for the right-hemispheric group can be seen within the inferior parietal lobe.

A summary of the results excluding two right-hemispheric patients with severe damage to the parietal cortex. a) The spatial extend of the lesion without two right-hemispheric patients (pat1, pat2) is virtual identical for the two groups. b) The behavioral and electrophysiological patterns observed for the right-hemispheric patients do not seem to depend on an involvement of the parietal cortex.
Twentyone patients with middle cerebral artery stroke lesions within the left or right hemisphere per-formed in a multiple object tracking task. Event related potentials were recorded at the same time. Behavioral data suggest that patients with right-hemispheric lesions are mostly unable to track three out of six objects over a course of several seconds. Electrophysiological data of those subjects support this conclusion, since potentials evoked by different matching probes did not show any significant variation in amplitude.
In contrast, damage to the frontoparietal network within the left hemisphere seems to leave the general ability to track multiple independently moving objects in those effected patients mostly intact. The electrophysiological data for this group shows a parametrical amplitude increase towards conditions in which fewer targets are being probed. Such a pattern was previously shown to reflect a serial tracking process operating on the individual relevant objects of the target set (Merkel et al., 2014).
Voxel based lesion maps for each group show consistent damage centered within the white matter of the STG and Insula in left and right hemispheric patients. Apart from lesions within the right IPL, VBLM patterns for both groups are mostly symmetric and on average of similar extent. For the present patient groups the observed lesions most likely resulted in intrahemispheric disconnections of fronto-parietal attention subnetworks (Doricchi, Thiebaut de Schotten, Tomaiuolo, & Bartolomeo, 2008; Karnath, Fruhmann Berger, Küker, & Rorden, 2004). Such lesions do not require a direct involvement of the grey matter of IPL or SPL regions to cause severe attentional deficits (Doricchi & Tomaiuolo, 2003; Suchan et al., 2014). On the contrary, it has been shown that the integrity of white fiber long range connections like SLF II and AF, located within the white matter of the STG, are associated with the extent of the cognitive impairment during neglect syndrome (He et al., 2007; Karnath, Rorden, & Ticini, 2009; Thiebaut de Schotten et al., 2008) for both right and left-hemispheric patients (Beume et al., 2017; Suchan & Karnath, 2011). Such disconnection might even have a larger impact on attentional networks since small damage to such fiber bundles can affect large sections of cortical target regions (Bartolomeo, 2007) and would not be subject to compensation by perilesional cortical areas (Catani & Mesulam, 2008; Doricchi et al., 2008). The current results confirm this argument, since the behavioral and electrophysiological pattern associated with patients with right hemispheric lesions do not necessarily require damage to the right parietal cortex. Thus, the observed lesion patterns most likely account for most of the presently described attentional deficits, since those patterns are well in line with earlier reports on voxel-based lesion mapping and tractography studies in relation to neglect symptoms.
Especially, the detrimental effect of right-hem-ispheric lesions on the ability to maintain multiple items over time is well in line with a large body of literature observing a central role of the right hemisphere in visual spatial attention. The left parietal cortex is believed to only exert control over contralateral shifts of spatial attention, while the right parietal cortex is thought to entail a representation of both hemifields (Mesulam, 1981; Posner, Walker, Friedrich, & Rafal, 1984). Such an asymmetry of hemispheric engagement in which the right parietal cortex exerts control over the entire visual field has been shown with the use of electrophysiology (Heilman & Van Den Abell, 1980) and imaging techniques (Corbetta, Miezin, Shulman, & Petersen, 1993; Pollmann & Morrillo, 2003; Thiel, Zilles, & Fink, 2004; Vandenberghe et al., 1997). This crucial role of the right hemisphere in the control of attentional mechanisms is believed to translate at clinical level into much more severe cognitive symptoms in neglect patients with right-hemispheric lesions (Mesulam, Waxman, Geschwind, & Sabin, 1976).
Apart from a general larger right-hemispheric eng-agement in attentional functions, further asymmetries between left and right parietal cortex associated with more specific visual processes were also described. Reports of functional dissociations between the left and right hemispheres in visual perception are equally well in line with the current results. Consistently, a larger perceptional bias for abstract stimulus features within the left hemisphere versus more concrete stimulus properties processed by the right hemisphere was previously described (Burgund & Marsolek, 1997; Koutstaal et al., 2001; Marsolek, Nicholas, & Andresen, 2002; McMenamin, Deason, Steele, Koutstaal, & Marsolek, 2015). Similarly, perceptual discriminations within an object-centered reference frame are faster and more accurate for stimuli presented to the left hemisphere, while the opposite holds true for a viewer-centered presentation (Behrmann & Moscovitch, 1994). Another description frames the same dissociation as a difference in processing categorical and coordinate spatial visual information within the left and right hemisphere, respectively (Hellige, Taylor, & Eng, 1989; Kosslyn et al., 1989). The common idea behind these frameworks is a right hemispheric bias for a representation of spatial information within a metric coordinate system retaining quantitative spatial distances (Ivry & Robertson, 1998).
The majority of object tracking theories stipulate processes operating on the spatial information of the individual relevant items (Alvarez & Franconeri, 2007; Pylyshyn, 1989). Previous results indicate that tracking performance is mostly determined by speed and spatial separation between targets (Shim, Alvarez, & Jiang, 2008), thus absolute spatial distance measures of the individual items (Franconeri, Jonathan, & Scimeca, 2010). An omission of this metric spatial information by right-hemispheric lesions, especially by disconnecting the parietal cortex, could therefore disrupt general tracking abilities, as the current results indicate. Interestingly, some imaging studies investigating cortical engagement during multiple object tracking observed a slight right-hemispheric (although not significant) bias throughout the task (Culham, Cavanagh, & Kanwisher, 2001; Culham et al., 1998). A failure to use specific spatial visual information compared to more abstract representations in right-hemispheric patients has been reported previously during perceptual priming experiments (Schnaider Beeri, Vakil, Adonsky, & Levenkron, 2004). Moreover, right-sided patients performed worse than left-sided patients and controls in recognizing changes in spatial relations of multiple objects in a delayed match to sample task (Laeng, 1994).
Furthermore, the above mentioned hemispheric functional asymmetry is not limited to differences in perceptual biases but is believed to also provide a dissociation between space- and object-based attentional mechanisms. Shifting attention through the same distance of space entails a larger engagement of the left hemisphere if this region is bound by some feature (Egly et al., 1994; Fink, 1997). This left-hemispheric bias for object-based attentional shifts over space-based attentional shifts was confirmed in several imaging studies (Arrington, Carr, Mayer, & Rao, 2000; Wilson, Woldorff, & Mangun, 2005). Furthermore, enhanced BOLD responses can be observed only within the left parietal cortex during the discrimination of two features within one object compared to one feature of the same object (Vandenberghe et al., 1997). The functional dissociation of space- and object-based attention observed in these studies provides an additional background for the current results. Lesions within the right hemisphere result in severe impairments of maintaining multiple objects based on their spatial location. Thus, tracking for this group is almost impossible. Left-hemispheric patients on the other hand retain the ability to track individual item information. The electrophysiological data show a graded visual mismatch effect for probes with a decreasing congruity with the currently tracked targets.
Apart from tracking individual items it has been suggested, that multiple object tracking could be facilitated by maintaining an abstract representation of the convex hull of all relevant items through time (Yantis, 1992). Recently, electrophysiological (Merkel et al., 2017, Merkel et al., 2017, 2014) as well as neuroimaging data (Merkel et al., 2015) provided evidence for such a process. While recording electrophysiological responses during a tracking task similar to the current one, an object-based tracking mechanism has been indicated by an early negative amplitude variation only for probes that match the relevant targets fully (Merkel et al., 2014). The current results do not show such an electrophysiological component for the left-hemispheric patients, suggesting a mostly space-based attentional process responsible for performing in the tracking task. This is well in line with the notion of a left-hemispheric bias for object-based attentional mechanisms, which would be impaired in the patient group showing a disruption of the left fronto-parietal attention network.
The current study provides evidence for hemis-pheric-specific contributions of attentional processes enabling the maintenance of multiple objects through space and time. The right parietal cortex seems to be crucial in operating on individual spatial information of each item while the left hemisphere would provide resources, that could maintain abstract representations of the spatial configuration of all relevant items. Hereby, the former mechanism seems to be a prerequisite of the latter, as right hemispheric lesion impede object tracking in its entirety.
Footnotes
Acknowledgments
This work was supported by the non-profit foundation Schmieder for science and research.
