Abstract
The hippocampus plays an indispensable role in episodic memory, particularly during the consolidation process. However, its precise role in retrieval of episodic memory is still ambiguous. In this study, we investigated the correlation of hippocampal morphometry and the performance in an autobiographical memory task in 27 healthy controls and 24 patients suffering from Alzheimer’s disease (AD). Most importantly, correlations were defined separately and comparatively for memory contents with different retrieval frequency in the past. In healthy subjects, memory performance for seldom retrieved autobiographical events was significantly associated with gray matter density in the bilateral hippocampus, whereas this correlation was not present for events with high retrieval frequency. This pattern of findings confirms that retrieval frequency plays a critical role in the consolidation of episodic autobiographical memories, thereby making them more independent of the hippocampal system. In AD patients, on the other hand, successful memory retrieval appeared to be related to hippocampal morphometry irrespective of the contents’ retrieval frequency, comprising events with high retrieval frequency, too. The observed differences between patients and control subjects suggest that AD-related neurodegeneration not only impairs the function, but also decreases the functional specialization of the hippocampal memory system, which, thus, may be considered as marker for AD.
Keywords
INTRODUCTION
Alzheimer’s disease (AD) is a progressive, neurodegenerative disease of the brain and is the most common cause of dementia [1]. Thereby, memory disturbances are cardinal symptoms resulting from early neurodegenerative processes in the medial temporal lobe (MTL) structures [2]. According to consolidation theories, structures within the MTL, particularly the hippocampus, but also the adjacent parahippocampal, perirhinal, and entorhinal cortices, are crucial for the acquisition of new memories [3]. However, the specific role of the hippocampus within the MTL structures as well as the time course of its involvement in the storage and retrieval of declarative knowledge (i.e., episodic and semantic memory) is ambiguous [4–6]. AD patients typically develop a retrograde amnesia, which is the inability to retrieve memories acquired before the onset of the disorder [7]. It is characterized by a loss of autobiographical memories [8–10] and public knowledge [9, 12]. Autobiographical memory has been found to be impaired in the earliest stages of AD [13–15], also present in individuals suffering from amnestic mild cognitive impairment (aMCI) and relates to the ability to retrieve personally experienced events from the past and includes general knowledge about the self [16]. Empirical evidence suggests that in retrograde amnesia remote memories are typically better preserved than recent memories, indicating a temporal gradient (TG) of memory decline. A temporally graded retrograde amnesia after damage to the hippocampal complex suggests that remote memories are not dependent on this structure anymore, indicating an important role of the hippocampus for the consolidation process but not for the long-term storage of memory. However, reports of a TG in retrograde amnesia have been inconclusive. Some investigators found higher preservation of remote memories compared to more recently acquired memories in patients suffering from AD [17–21], whereas others did not [9, 23].
In a previous study [12], our group examined the recall of memories for past public events in AD patients and healthy controls (HC) as a function of their retrieval frequency and age, by the newly developed Historic Events Test (HET), which assesses knowledge about famous events from the past 60 years. We found impaired memory for past public events in AD patients relative to HC and additionally a strong effect of retrieval frequency across all time segments, which, most importantly, occurred irrespective of the age of the memory. From this finding, we inferred that more frequently retrieved memories become independent of the hippocampus, whereas less frequently recalled knowledge remains dependent on an intact hippocampus and might therefore be more susceptible to early hippocampal damage due to AD. In another study [17], we found impaired autobiographical memory in aMCI and AD patients with better preservation of remote compared to recent memories. As remote memories were recalled more often than recent ones, we concluded that the temporal gradient of memory decline is mediated by retrieval frequency. Consequently, this indicates that more frequently recalled memories become gradually independent of the hippocampal complex, probably integrated into neocortical memory structures and are therefore less affected by early neuropathological changes in the MTL.
In the present study, we used voxel-based morphometry (VBM) and a region of interest (ROI) analysis to investigate the relationship between hippocampal morphometry and memory performance for autobiographical events subdivided according to their retrieval frequency in 24 patients suffering from early dementia of Alzheimer type (eDAT) or aMCI and 27 HC. Based on the above reported results and related interpretations, we hypothesized that autobiographical memory performance correlates with hippocampal volume specifically for seldom retrieved events, whereas no such correlation was expected for frequently retrieved ones.
METHODS
Participants
Our study included 51 participants (23 males and 28 females) with a mean age of 72±5.5 years. No participant had a physical handicap that affected his or her ability to perform the tasks or any indication of other neurological or psychiatric disorders unrelated to his or her diagnosis. To exclude acute symptoms of depression, all participants completed the German 15-item version of the Geriatric Depression Scale (GDS) [24]. The local ethical committee at the University Hospital of Tübingen approved the study. All participants signed an informed consent form after receiving a detailed description of the study. The study population contained HC, and patients with aMCI and eDAT. For the statistical analysis, we pooled aMCI and eDAT patients together into an AD cluster.
Healthy control group
HC individuals (n = 27) had no history of neurological or psychiatric disease or any sign of cognitive decline, which was confirmed by a clinical interview. The MRI of each HC subject was evaluated by trained radiologists to confirm that no signs of AD (mesiotemporal atrophy) were present.
Patients with aMCI or eDAT
Patients with aMCI or eDAT (n = 24) were recruited from the Memory Clinic of the Department of Psychiatry and Psychotherapy at the University Hospital of Tübingen. They underwent physical, neurological, neuropsychological, and psychiatric examinations, as well as brain imaging.
The diagnosis of aMCI was defined by Mayo criteria [25, 26], which include the presence of a memory complaint (corroborated by an informant), objectively impaired memory function, intact activities of daily living, and the absence of dementia. All patients in this group met the MCI-core clinical criteria according to the National Institute on Aging-Alzheimer’s Association (NIA-AA) clinical research criteria for MCI due to AD [27]. Although NIA-AA criteria do not explicitly exclude deficits in cognitive domains other than memory, we focused on memory impairment, as pre-dementia AD is most frequently characterized as an amnestic syndrome [1].
Patients with eDAT met the diagnostic criteria of probable Alzheimer’s dementia according to the National Institute of Neurological and Communicative Disorders and Stroke Alzheimer’s Disease and Related Disorders Association [25, 28]. All of these patients had a score of four on the Global Deterioration Scale [29].
Neuropsychological assessment
All participants completed the Mini-Mental State examination (MMSE) [30] and underwent the HET. The HET was developed to assess the ability to recall famous events of the past 60 years and the autobiographical, contextual information surrounding the events. The test was performed according to a previous study of our group [12], in which we used an updated and extended version of the HET with additional events occurring between 2006 and 2010. The original HET is provided in another paper of our group [9]. The 56-year period was subdivided into four time segments: A (1954–1974), B (1975–1999), C (2000–2009), and D (2010), each consisting of six famous events. The 24 historical events were presented to participants in a random order and for each event, they were asked to perform three tasks: a subjective memory rating task, a dating accuracy task, and a contextual memory task. In this study, we focused on the contextual memory task, which evaluates episodic autobiographical memories.
For the contextual memory task, participants were required to recall their personal circumstances surrounding a historic event, i.e., they had to extensively describe the situation in which they found themselves when they first heard about the incident. The amount of detail provided and the specificity in space and time was rated. Three points were given for a fully comprehensive answer (i.e., answers were spontaneous and reflected a memory that was specific for the time and space of the situation in which participants heard about or experienced the event). Two points were given for a personal but non-specific memory or a specific memory for which time and place were not recalled. For a vague personal memory, participants received one point. Zero points were given if participants did not remember the event, provided an incorrect answer, made general, non-specific statements, or provided a response based on a semantic memory. Informants, such as spouses or life-long companions, verified the accuracy of the responses.
Retrieval frequency was operationalized by a paired comparison analysis. For each of the four time segments, participants matched each event pairwise with each of the other events and indicated which of the two events they remembered more frequently during their lifetime. Fifteen paired comparisons were performed for each time segment. Retrieval frequency of an event was categorized as “high” when it was preferred four to five times in the head-to-head record (i.e., suggesting that the event was remembered more frequently than four or five other events during the participant’s lifetime), categorized as “medium”, when it was preferred two to three times in the head-to-head record, and categorized as “low” when it was preferred zero to one time in the head-to-head record. By doing so, for each participant, there were two events with a high, two events with a medium, and two events with a low retrieval frequency for each of the four time segments. Subsequently, for each subject, mean values from these two scores per time segment were calculated whereby there remained one memory performance score with low, one with medium, and one with high retrieval frequency per person and time segment. Eventually, these scores were averaged across time segments, finally leading to three values for each participant: one memory performance score for seldom, one for medium often, and one for frequently retrieved events. To examine our specific hypothesis, in this analysis we focused on the polar values, meaning, we only used events with low and high retrieval frequency for regression analysis.
Statistical analysis of sociodemographic and behavioral data
Sociodemographic and neuropsychological data were analyzed with the statistical software package SPSS 24.0 (IBM Corp., Somers, NY, USA). For all tests, the level of statistical significance was set to p < 0.05. Levene’s test was used to assess homogeneity of variance. Differences in age, years of education, and MMSE were assessed using independent t-tests. The Pearson chi-square-test was applied to detect group differences in gender distribution and the nonparametric Mann-Whitney U test was used to detect group differences in GDS scores. Total intracranial volume (TIV) was compared between HC and AD subjects applying an independent t-test.
Multivariate (two-factorial) repeated measures analysis of variance (MANOVA) with retrieval frequency (low, medium, high) as within-subject factor and group (HC, AD) as between-subject factor was calculated to examine the main effects of group and retrieval frequency on the contextual memory and dating accuracy task scores. Significant effects of the MANOVA were specified by post-hoc tests, comprising of one-way ANOVA and subsequent paired t-tests, using Bonferroni adjustments to control for alpha error accumulation. Mauchly’s test was used to evaluate violations of the sphericity assumption.
Differences in the three retrieval frequency categories between groups were assessed using one-way ANOVA. To test the effect of retrieval frequency on memory performance paired sample t-tests were applied within each group (retrieval frequency low versus medium, low versus high, medium versus high). For each group, we calculated the correlations between the performance scores for low, medium, and high retrieval frequency. Moreover, we present descriptive statistics of sociodemographic and behavioral data separately and comparatively for the two patient sub-groups (aMCI and eDAT) and also provide related t statics to evaluate the corresponding group differences (see Supplementary Table 1).
Image acquisition
MRI data were acquired on a 3.0 Tesla-Scanner (Siemens, Trio, Erlangen, Germany). A high resolution structural T1-weighted image was obtained using a magnetization prepared rapid acquisition gradient echo (MPRAGE) sequence with a voxel size of 1×1×1 mm3.
Voxel-based morphometry processing
VBM [31] analyses were performed with the CAT12 toolbox (Gaser & Dahnke, 2016; http://www.neuro.uni-jena.de/hbm2016/GaserHBM2016.pdf) implemented in the statistical analysis software Statistical Parametric Mapping version 12 (SPM12; http://www.fil.ion.ucl.ac.uk/spm) and MATLAB (R2016b, The MathWorks, Natick, MA, USA). Initially, pre-processing steps were applied according to the CAT12 manual (http://dbm.neuro.uni-jena.de/cat12/CAT12-Manual.pdf), using default settings. T1 images were first spatially normalized to the Montreal Neurological Institute (MNI) space template using the DARTEL algorithm [32]. Then, the whole brain structural data was segmented into gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF). Finally, the normalized GM images were smoothed using a Gaussian kernel of 8 mm. After pre-processing, all scans passed through an automated quality check protocol.
Voxel-based morphometry analysis
For statistical analysis, we applied a parametric general linear model (GLM) approach implemented in SPM12. To avoid possible edge effects around the border between GM and WM, and to exclusively include voxels with sufficient GM proportion, only voxels with values above an absolute GM threshold of 0.1 were analyzed (absolute threshold masking). TIV was included as nuisance variable in the GLM in order to correct for different brain sizes among subjects and remove the related variance. To identify significant clusters and eliminate spurious small clusters, the minimum cluster size (k) was empirically determined for each contrast separately. We used the number of expected voxels per cluster as cluster extent threshold. This is necessary to avoid arbitrary threshold setting.
We performed voxel-wise independent two sample t-tests to compare GM tissue volume between HC subjects and AD patients. Statistical inferences were made at single voxel level and thresholded at p < 0.05 with whole brain family-wise error (FWE) correction.
Region of interest analysis
In addition, a region of interest (ROI) analysis focusing on the bilateral hippocampus was conducted. The corresponding target regions were defined using the anatomical automatic labeling (AAL) atlas [33] and mean GM values of the included voxels were calculated. Statistical analyses on these values were run in the R environment (R version 3.5.0) and defined the correlational relationship between the hippocampal morphometry and the quality of the mnemonic recall (centered values) in dependence of the experimental group. More specifically, we defined two general linear regression models including the bilateral hippocampal morphometry (i.e., average GM values) as dependent variable. Independent variables were group and the performance scores for seldom retrieved events in the first model—or rather the performance scores for often retrieved events in the second model—as well as the interaction between group and the respective performance score. Subsequently, for each group we calculated the Pearson correlation between memory performance and hippocampal morphometry, separately for seldom and often retrieved memory events. For a pairwise comparison of the corresponding correlation coefficients, we used the Pearson and Filon’s Z test [34] for dependent groups, as implemented in the R package cocor [35]. Tests were thresholded at p < 0.05, one-sided (based on a directional hypothesis).
RESULTS
Sociodemographic data
Sociodemographic characteristics and mean composite scores for both groups are presented in Table 1. AD and HC groups did not significantly differ in terms of age, years of education, and gender distribution. Also, GDS scores were not significantly different between groups and well below the cut-off score of six, indicating that participants did not exhibit signs for depression [24]. As expected, AD subjects had significantly lower MMSE scores compared to HC (t[49] = 7.691, p < 0.001), indicating greater impairment in global cognition. TIV did not significantly differ between patients and healthy subjects (t[49] = 3.53, p = 0.725), indicating equivalent total brain volume between.
Descriptive statistics of sociodemographic and clinical variables
Values are expressed as mean (standard deviation). HC, healthy control subjects; AD, patients with Alzheimer’s disease; N, number (sample size); f/m, female/male; GDS, Geriatric Depression Scale (higher score indicates more severe depressive symptoms; maximum 15; a cut-off of 6 indicates mild depression); MMSE, Mini-Mental State Examination; TIV, total intracranial volume.
Episodic autobiographical memory performance
We found a significant main effect of group on the autobiographical memory task scores (F = 24.986; p < 0.001). More specifically, a one-way ANOVA indicated that patients performed worse in the contextual memory task than HC to a significant extent in and across all frequency conditions (low retrieval frequency: F[1,49] = 12.820; p = 0.001; medium retrieval frequency: F[1,49] = 15.557; p < 0.001; high retrieval frequency: F[1,49] = 23.636; p < 0.001). We also found a significant main effect of retrieval frequency on the autobiographical memory task scores (F[2,98] = 49.7; p < 0.001). Both HC and patient subjects reported significantly less detailed memories when retrieval frequency was low compared to medium (p < 0.001/p = 0.002) or high (p < 0.001/p<0.001) and when retrieval frequency was medium compared to high (p = 0.005/p = 0.002). Correlations between the performance scores in the low, medium, and high retrieval frequency conditions are depicted in Table 2 and Table 3, separately for each group.
Descriptive statistics for performance of healthy control subjects in the autobiographical memory test: means, standard deviations, and correlations with confidence intervals
M and SD are used to represent mean and standard deviation, respectively. Values in square brackets indicate the 95% confidence interval for each correlation. *p < 0.05; **p < 0.01.
Descriptive statistics for performance of Alzheimer’s disease patients in the autobiographical memory test: means, standard deviations, and correlations with confidence intervals
M and SD are used to represent mean and standard deviation, respectively. Values in square brackets indicate the 95% confidence interval for each correlation. *p < 0.05; **p < 0.01.
Voxel-based morphometry analysis
Whole-brain analyses
Comparison of HC and AD patients showed several clusters of significantly reduced GM volume. These regions include the bilateral amygdala, the bilateral entorhinal cortex, the bilateral hippocampus, and the right temporal pole. The reported clusters (MNI coordinates and related statistics) are listed in Table 4 and graphically depicted in Fig. 1. HC subjects (compared to patients), on the other hand, exhibited no regions of reduced cortical volume.
Regions of gray matter volume loss in the patients with Alzheimer’s disease showed by voxel-based morphometry group comparison analysis
MNI, Montreal Neurological Institute; k, cluster size; cluster size in voxels; voxel size = 1×1×1 mm; cluster size threshold of 22 contiguous voxels. Thresholded at pFWE < 0.05. *survives FWE correction at cluster-level, **survives FWE correction at peak-level.

Results of group comparison of gray matter volume between patients with Alzheimer’s disease and healthy controls are presented on coronal brain slices (MNI template). Statistical parametric maps were thresholded at pFWE < 0.05 at peak level with a minimum cluster extent of 22 contiguous voxels.
Region of interest analyses
Prediction of hippocampal morphometry by memory performance and group. First, we defined a multiple regression model (AIC: – 100.70) to explain variance in hippocampal cortical volume by (a) memory performance for seldom recalled events, (b) group (HC versus patient), and (c) their interaction. The estimated regression equation was significant at p < 0.000 (F = 22.43), with an R2 of 0.589 and an adjusted R2 of 0.563. Thereby, both the regression coefficients for memory performance and the one for group were significant at p < 0.001, while their interaction exhibited no significant effect (regression equation/intercept and coefficients: Y = 0.806 + 0.058 + 0.052 – 0.019; see Fig. 2A).

Illustrations of the association (linear regression) between hippocampal morphometry and recall quality (A) in seldom recalled events and (B) in often recalled events, separated by group. Blue color of data points and corresponding trend lines represent the healthy control group, red color of data points and corresponding trend lines represent the AD patients.
In a second analogous regression model (AIC: – 91.160), hippocampal cortical volume was predicted by (a) the memory performance score for often recalled events, (b) group affiliation and (c) their interaction. Again, the estimated regression equation was found to be significant (F[3,47] = 15.93, p < 0.000), with an R2 of 0.504 and an adjusted R2 of 0.473, while all three regression coefficients reached the level of statistical significance (memory performance: p < 0.05; group: p < 0.001; interaction term: p < 0.05; regression equation / intercept and coefficients: Y = 0.819 + 0.035 + 0.057 – 0.039; see Fig. 2B).
Relationship between hippocampal morphometry and memory performance, depending on retrieval frequency. The correlational relationship between hippocampal morphometry and memory performance was expected to be different for seldom and frequently recalled memory contents. This expectation was confirmed by the data of the HC subjects, who exhibited a significantly decreased correlation for frequently recalled events (difference: z[27] = 1.666, p < 0.05; seldom recalled events: r[27] = 0.340, n.s.; often recalled events: r[27] = –0.036, n.s.; see Fig. 3A). AD patients, on the other hand, exhibited no differential, i.e., equivalently high correlations for the two frequency conditions (difference: z[26] = 0.877, n.s.; seldom recalled events: r[24] = 0.670, p < 0.001; often recalled events: r[24] = 0.549, p < 0.05; see Fig. 3B).

Illustration of the “recall quality gradient”, i.e., the association (linear regression) between recall quality and hippocampal morphometry in (A) the healthy control group and (B) AD/aMCI patients. Red color of data points and corresponding trend lines represent the association of the hippocampus with recall quality in seldom recalled events, blue color of data points and corresponding trend lines represent the association of the hippocampus with recall quality in often recalled events.
DISCUSSION
In this combined VBM-based and neuropsychological study, we examined whether episodic autobiographical memory performance is associated with hippocampal morphometry (i.e., GM volume) and whether this relationship is modulated by the retrieval frequency of the corresponding stored memory episodes. This research question was addressed comparatively in two study groups: (a) a patient group of subjects diagnosed with aMCI and early AD and (b) a control group of healthy elderly subjects. Patients compared to control subjects showed several regions of significantly reduced GM volume, which lie in the MTL involving the bilateral amygdala, the bilateral entorhinal cortex, the bilateral hippocampus, and the right temporal pole. These are basic (but not trivial) findings that essentially reflect the AD-typical neuropathology (i.e., neurodegeneration) and which therefore are highly consistent with the prior MRI literature reporting volume reductions in temporal lobe structures in both patients with AD and patients with aMCI [2, 36]. Moreover, we found impaired autobiographical memory performance in patients across all retrieval frequency conditions. Again, this is a quite basic finding which is well in line with results from our previous studies [12, 17] and also empirical evidence from others, which emphasizes that autobiographical memory loss occurs early in the course of the disease and is already present in patients with aMCI [15, 38]. Such neuropsychological deficits are commonly explained by neurodegeneration or other neuropathological changes within mesiotemporal structures, brain regions that are well known to be crucial for memory formation [8]. Beyond between-group differences, there was a significant effect of retrieval frequency on memory performance across groups. More specifically, both patients and control subjects exhibited a better memory performance (in terms of a richer, more detailed recall of contextual episodic information) for events with a higher retrieval frequency, which moreover occurred across all investigated time segments. This finding is in line with previous studies postulating a preserving effect of retrieval on episodic memories in AD patients [12, 39].
The hippocampal memory system is known to underlie age-related declines occurring even in absence of neurodegenerative disorders, so that also “healthy aging” may be accompanied by marked impairments in autobiographical memory performance [40]. Accordingly, several studies found age-related reductions in the contextual richness of autobiographical memory reports which have been construed to emanate from impaired strategic retrieval processes and poor recruitment of the hippocampus and ventrolateral prefrontal cortex [40]. In line with this reasoning, there are consistent reports of age-related decreases in hippocampal activity during episodic memory performance in non-demented subjects [41–44]. Such neurofunctional declines in the hippocampal system during normal aging can also explain why in the present work not only patients, but also healthy non-demented subjects exhibited better performance for memory episodes with high retrieval frequency.
Based on the outlined findings, the present study focused on the differential relationship between hippocampal morphometry and episodic memory performance in dependence on the retrieval frequency of the corresponding mnemonic contents (i.e., autobiographical episodes). Thereby, based on theoretical assumptions and prior empirical findings [17, 45], we expected a strong association between hippocampal GM volume and memory performance for seldom retrieved but no such (or a significantly reduced) relationship between hippocampal morphometry and the memory for frequently retrieved events. The analysis confirmed this expectation, however only for the healthy control subjects, whereas the patient subjects exhibited a significant correlation between hippocampal morphometry and memory performance irrespective of retrieval frequency, i.e., across all frequency conditions. Our finding of a differential relationship of seldom retrieved and frequently retrieved memory episodes with hippocampal morphometry in healthy subjects essentially corroborates that the retrieval of autobiographical memory episodes plays an important role in memory consolidation making memory traces stronger and more stable and, at the same time, becoming gradually independent of the hippocampal system. This consolidation process presumably protects (at least in part) the respective memory entries from decomposition related to neurodegeneration particularly in mesiotemporal regions [2]. Moreover, a differential association of hippocampal morphometry with seldom and frequently retrieved autobiographical memory contents may be considered in the context of the “semanticization” theory. This theory assumes that memory entries undergo a gradual transition over time from episodic to semantic memory by a process of abstraction, coined semanticization [46, 47]. Accordingly, this theory assumes an episodic-semantic continuum (rather than a categorical distinction) of memory entries, which also provides a conceptual explanation for Ribot’s temporal gradient according to which more recent memories are relatively more susceptible to loss than more remote memories in retrograde amnesia [48]. Most importantly, neuroimaging studies have shown that the semanticization of autobiographical memories is accompanied by a gradual disengagement of hippocampal activity related to memory retrieval [49]. The disengagement of the hippocampus for frequently retrieved memory events observed in the present work may be based, at least in part, on the same neurofunctional transition process. In this sense, the repeated retrieval of autobiographical events may be assumed to lead to a cumulative abstraction of the respective entries in memory, making them more factual and disconnected from their original temporal context in terms of less naturalistic or lively.
The increased relation of hippocampal morphometry to the memory of frequently retrieved episodes in the patient group arguably reflects that AD-related neurodegeneration not only impairs neuronal functioning, but also diffuses the neurofunctional specialization of involved brain regions, arguably based on disturbed neuroanatomical connections [50, 51]. In this context, a reduced neurofunctional specialization may be construed as part of the degenerating brain’s efforts to compensate for the increasing functional deficits by using other or additional available neural resources. Corroborating this assumption, exclusively in the performance data of the patient group we found a significant correlation between the memory performances for seldom retrieved events and frequently retrieved events, suggesting that they involve a broader common neurofunctional basis than in non-demented subjects. Based on this reasoning, one may speculate that the ratio of the memory performances for often and seldom recalled events could provide a useful neuropsychological marker for AD pathology.
Possible limitations of the presented work have to be mentioned. First, given the methodological approach of our study, it is important to emphasize that a present (or absent) statistical relationship between brain morphometry in a specific region on the one hand and a cognitive performance parameter on the other does not necessarily reflect the functional neuroanatomy of the respective cognitive process. Therefore, future studies using functional neuroimaging methods may confirm and also amend or refine the conclusions of the present work. Moreover, in the investigation of associations between neural structure and cognitive performance, “cognitive reserve” (CR) represents an important theoretical concept, which, however, was not explicitly considered in the present work. CR is construed as capacity to compensate for a given neural damage by flexibly using alternative brain networks [52–54]. Accordingly, CR provides an explanatory framework for the fact that individuals could markedly differ in their levels of cognitive or intellectual impairment, despite exhibiting similar levels of neural damage [55, 56]. In this respect, parameters of CR may represent important nuisance variables which could suppress or bias functional associations of brain-structural parameters. Moreover, if in a between-subject design compared groups are not matched for their CR, this may lead to either an over- or underestimation of group differences and, thus, complicate their interpretation [57]. Although the present study did not explicitly control for subjects’ CR, it was confirmed that the two groups (AD and HC subjects) did not deviate significantly in terms of age, years of education, as well as their gender distribution, which are all variables which have been shown to be importantly related to CR [58]. Moreover, age was confirmed to be normally distributed across groups without any outliers. Therefore, taken together, we can assume that CR did not significantly influence the here presented findings. Finally, in the present study, we did not assess CSF-based biomarkers which may be considered a limitation of the between-group study design. More specifically, amyloid-beta 42 and tau proteins could have been used, on the one hand, to confirm a common (i.e., homogeneous) AD-related neuropathology in the patient group and, on the other hand, to confirm the absence of neurodegenerative processes in the healthy control subjects [59]. However, our group assignment can be still considered sufficiently reliable as it is based on highly valid and internationally accepted diagnostic criteria (see method section). Moreover, each control subject‘s MRIs were assessed by trained radiologists who confirmed that the images exhibit no signs of AD (particularly no signs of mesiotemporal atrophy). In this context, the literature clearly shows that neuroimaging biomarkers already have a high predictive value for early onset AD [59, 60]. Concerning the aMCI patient subjects, admittedly we cannot be absolutely sure that really all of them will actually convert to AD. This is due to the purely cross-sectional study design and may be considered another potential limitation of the group classification. However, the aMCI diagnosis was established according to high standard criteria (the Mayo criteria and NIA-AA criteria; see Methods), which provide a strong argument to assume that the vast majority of aMCI subjects of our sample will actually convert to AD.
Conclusions
In the present combined brain-morphometric and neuropsychological study, we were able to define differential associations between hippocampal GM volume and the retrieval of autobiographical memory episodes of different retrieval frequencies. The results suggest that memory retrieval is associated with a neurofunctional transition (conceptually related to consolidation or “semanticization”) of the respective episodic memory entries whereby they gradually become independent of the hippocampal system. Most importantly, this neurofunctional transition process seems to be diffused or reversed in the progression of AD-pathology, so that the recall of frequently retrieved memory episodes is presumably related to hippocampal functional integrity in affected patients, as well. Future studies using functional neuroimaging methods are desirable to amend and refine the outlined conclusions.
