Abstract
Background:
We investigated a sample of cognitively healthy subjects with normal Alzheimer’s disease (AD) cerebrospinal fluid (CSF) biomarker levels to identify the earliest variables related to longitudinal memory changes.
Objective:
Employing a new highly demanding learning and memory test (the Ancient Farming Equipment Test; AFE-T), we aimed to investigate whether a biomarker related to neurodegeneration (i.e., CSF tau) was associated with longitudinal memory decline.
Methods:
Thirty-two cognitively and biologically normal (CBN) subjects underwent MRI, neuropsychological assessment, and the AFE-T at baseline and 18 months later. To explore the relationship between cognitive performance and relevant factors, a linear model was set up. For a secondary analysis that further explore the effect of tau, the subjects were divided into CBN-Tau↓ (tau < 228.64 pg/ml; n = 16) and CBN-Tau↑ (tau > 228.64 pg/ml; n = 16). We also performed voxel-based morphometry (VBM) to identify regions of grey matter volume that would predict both baseline and longitudinal cognitive performance.
Results:
Our main finding was an association between CSF tau and longitudinal memory decline measured with AFE-T (B = –0.17, p < 0.05; r = –0.414; p < 0.01), and further analyses showed different evolvement between subgroups, with an accelerated decline in individuals with higher tau (F(1,31) = 8.37; p < 0.01). VBM results suggested that AFE-T performance is related to grey matter volume in a medial temporal, middle frontal, and posterior cerebellar network at baseline, and that there are strategic brain areas driving the longitudinal cognitive changes.
Conclusions:
The present findings provide evidence for structural and biological markers linked to cognitive aging by highlighting the role of tau, a marker of neurodegeneration, which can be related with the earliest memory changes in healthy subjects.
INTRODUCTION
Understanding the factors underlying age-related changes in cognition has long been a challenge. Since cognitive changes in normal aging and in incipient pathological processes (e.g., preclinical Alzheimer’s disease; AD) are closely related, there is a need to identify the earliest factors driving the cognitive decline in both populations in order to ensure an early detection of the pathological processes. While there is extensive literature on the aging process in large cohorts of cognitively impaired and unimpaired subjects, we have very little information on biomarkers of age-related cognitive changes in cognitively healthy subjects with a confirmed normal AD cerebrospinal fluid (CSF) biomarker profile.
The cognitive profile in normal aging has been well-documented in the scientific literature [1–3]. However, one important question that remains open concerns the factors related to these observed cognitive changes. First, demographic variables such as age, years of education, and cognitive reserve [4–6] are known to have an impact on the trajectory of cognition over the life span. A prospective study involving a large sample of 2,509 cognitively healthy elderly adults reported that both age and educational level predicted maintenance of cognitive function over an 8-year period [7]. Second, some neuropsychological variables have been identified as predictors of cognitive decline. Memory decline has been considered as the major cognitive risk factor for developing age-related pathologies, such as mild cognitive impairment (MCI) or AD [8]. Third, the maintenance of cognitive functioning in aging is thought to be primarily related to brain maintenance, that is, relative lack of structural and functional brain changes [9]. In line with this, a recent report suggested that subjects with larger medial temporal lobe (MTL) volumes at baseline were more likely to maintain their cognitive function over time [10]. Fourth, biological markers on cognition have been a topic of intensive research during the last few years. The most studied AD CSF biomarkers are the tau protein levels and the 42 amino acid form of amyloid-β (Aβ42). Some studies have suggested that CSF tau levels rather than Aβ pathology are more closely related to cognition, specifically to memory decline [11–13].
Due to the proximity of cognitive changes related to normal aging versus incipient pathological processes, there is an increasing need to develop more sensitive tests for an early detection of subtle cognitive difficulties in populations at risk. For that reason, new highly demanding neuropsychological tests such as the Face Name Associative Memory Exam [14–16] or the Short-Term Memory Binding test [17] have been developed. Using previous research on the neural mechanisms involved in language learning and memory, we recently evaluated a highly demanding learning and memory test called the Ancient Farming Equipment Test (AFE-T) (for a review, see Laine & Salmelin [18]). This task engages the declarative memory system in learning to associate unfamiliar names (new labels or words) to completely new objects. To date, the AFE paradigm has been used to study acquisition of new words in groups of healthy adults [19, 20] and, in two studies, in MCI and AD patients [21, 22]. A fMRI study using the AFE paradigm in an aphasic patient and in healthy controls showed a clear involvement of MTL regions during the learning phase [23]. In our recent study, we employed the AFE-T to detect subtle cognitive difficulties in preclinical AD subjects. The AFE-T was found to be a promising tool for characterizing the cognitive profile of preclinical AD, being sensitive enough to detect initial learning difficulties in our at-risk population. Furthermore, the AFE-T was associated with the CSF Aβ42/tau ratio [24].
Since there is recent literature regarding the relationships between cognitive function and AD CSF biomarkers in cognitively normal subjects [25–29], the present research pushed further by focusing on a specific sample of cognitively healthy subjects with normal AD CSF biomarker levels. The aim of the present study was to examine and follow up this well-characterized sample in order to detect demographical, structural and biological variables related to the earliest cognitive changes in aging. Employing the sensitive AFE-T cognitive measure, we specifically wanted to investigate whether a biomarker related to neurodegeneration (i.e., CSF tau) is associated with the earliest longitudinal decline of learning and memory in cognitively and biologically normal aging.
MATERIAL AND METHODS
Participants
The present participants represent a sub-sample of Tort-Merino et al. [24], and thus the present methods description follows that paper. Thirty-two cognitively healthy subjects with normal AD CSF biomarker levels were included in the present study and followed up for 18 months. The participants were recruited at three Spanish memory centers: Hospital Clinic (HC) and Hospital de la Santa Creu i Sant Pau (HSP) in Barcelona, as well as the CITA-Alzheimer Foundation (CITA) in San Sebastian. Due to the nature of this recruitment, some of the subjects included in the study presented memory complaints. The ethics committee of the Hospital Clinic of Barcelona approved the study, and all participants provided a signed, informed consent before undergoing the neuropsychological assessment, MRI and the lumbar puncture. All subjects had to meet the following inclusion criteria: a) at least three years of formal education, b) Mini-Mental State Examination (MMSE) score >24, and c) Clinical Dementia Rating (CDR) score = 0 and objective cognitive performance within the normal range (cutoff 1.5 SD from normative mean) in all tests on a neuropsychological battery (see below). The following exclusion criteria were applied: a) presence of any neurological diagnosis, b) presence of a serious or unstable medical condition, c) diagnosis of a major psychiatric disorder including schizophrenia, major depression or substance abuse, and d) presence of any abnormality in CSF Aβ42, tau, and/or phosphorylated tau at threonine-181 (ptau) levels.
For a secondary analysis that further explored the effect of CSF tau in normal aging, we divided the present sample into CBN-Tau↓ and CBN-Tau↑ subgroups according to their CSF tau levels. These secondary analyses were done in order to visualize the main findings of the study. The mean of the CSF tau values (228.64 pg/ml) of the 32 subjects was used as a cut-off point to ensure an equal distribution between groups. Subjects with CSF tau levels below 228.64 pg/ml were classified as CBN-Tau↓ (n = 16) and subjects with CSF tau above 228.64 pg/ml were included in the CBN-Tau↑ group (n = 16).
Determination of biological and AD CSF biomarkers
All subjects underwent a lumbar puncture between 9 a.m. and 12 p.m. to collect 10 ml of CSF. The samples were centrifuged and stored in polypropylene tubes at –80°C within the first hour after extraction. CSF Aβ42 levels, tau, and ptau were measured by enzyme-linked immunosorbent assay kits (Innogenetics, Ghent, Belgium). Cut-off values of abnormality for each AD CSF biomarker were defined according to previous work [30]: a) Aβ42 ≤550 pg/ml, b) tau ≥400 pg/ml for subjects between 50–70 years old, and ≥450 pg/ml for subjects older than 70 years, and c) ptau ≥75 pg/ml. As noted in the inclusion criteria, all subjects included in the study presented normal levels for all AD biomarkers. The AFE-T administrator and the participants were blind to CSF results.
Apolipoprotein E analysis
Genomic DNA was extracted from peripheral blood of probands using the QIAamp DNAblood minikit (Qiagen AG, Basel, Switzerland). APOE genotyping was performed by polymerase chain reaction amplification and HhaI restriction enzyme digestion. Both the AFE-T administrator and the study participants were blind to the APOE results.
Neuropsychological assessment
All participants were assessed both at the baseline and at the follow-up session with a comprehensive neuropsychological battery, administered by a trained neuropsychologist blind to the CSF results. The battery encompassed four cognitive domains. The memory domain included the Free and Cued Selective Reminding Test (FCSRT) [31], the language domain comprised of the Boston Naming Test [32], and a Semantic Fluency Task [33]; the visual perception domain contained the Number Location subtest of the VOSP battery [34]; and the executive functions domain consisted of the Trail Making Test [35], the Stroop Test [36], the Symbol Digit Modalities Test [37], and the Digit Span test of the WAIS [38]. Global cognition was assessed with the MMSE [39]. Premorbid intelligence was assessed with the Spanish Word Accentuation Test [40]. Subjective memory complaints were measured by the Subjective Cognitive Decline Questionnaire (SCD-Q) [41]. The average time lapse between the baseline neuropsychological assessment and the baseline AFE-T (list A) was +0.4 (SD 0.6) months and the time lapse between the follow-up neuropsychological assessment and the follow-up AFE-T (list B) was –0.4 (SD 0.4) months.
The Ancient Farming Equipment Test
The AFE-T called for learning of two lists of new object/name pairs. For both lists, the objects were 24 black-and-white images of ancient farming equipment taken from the AFE paradigm [18]. Each object was paired with a pseudoword, that is, a non-existing word that follows the phonotactic rules of Spanish [42]. The object names consisted of 14 bisyllabic and 10 trisyllabic pseudowords that do not exist in the Spanish dictionary. All stimuli were presented on a computer screen against a white background using the E-prime 2.0 version (Psychology Software Tools, Inc., PA, USA).
The study had a total duration of 18 months and included a baseline testing session and an 18-month follow-up. List A was administered at the baseline assessment and list B at the 18-month follow-up. The assessments are explained in detail below.
Initial learning sessions, total learning score, and immediate cued recall
List A was administered in two initial learning sessions that were performed on two consecutive days. Each learning session included a total of seven runs and took approximately 45 min. Before starting the learning phase, each of the 24 object/name pairs was displayed for seven seconds with a 500 ms pause between the pairs. The participants were asked to read aloud the name of the object printed below, and to try to learn each object/name pair. After the presentation, the seven learning runs were performed. In each run, the participants were presented with the objects one at a time, and were asked to spontaneously say its name aloud. They were given a maximum of 7 s to recall the name of each object. After this, the correct name appeared below the object for 4 s, regardless of whether the participant had been able to produce the correct name. The following object was presented after 500 ms. The order of presentation in each run was randomized. For each run, the range of scores was 0–24.
After the last run of the second learning day (i.e., total learning score), the immediate cued recall (ICR) was administered. In this test, each object was presented one at a time. When the object appeared, the experimenter verbally provided the first syllable of the object’s name (phonemic cue). The participant then had a maximum of 7 s to provide the correct name. This time feedback (i.e., the correct name) was not provided after the response.
Follow-up session at 18 months
The 18-month follow-up included exactly the same procedure as at the baseline except that list B was administered.
Scoring system
All verbal responses were recorded for offline scoring. Following the scoring procedure of the AFE paradigm, a response was considered correct (score = 1) when (a) the participant recalled the exact name of the object, or when (b) the name recalled differed only by a single phoneme from the original name. Under (b), the following cases were considered: substitution, addition, or omission of a single phoneme at any given position of the word, or a change in position of an otherwise correct phoneme. This criterion was applied for all runs.
Validation study
A validation study with 30 young adults was conducted to confirm that the word lists A and B had comparable difficulty. We ran independent t-tests to explore possible between-group differences between word lists A and B. In addition, we compared the performances between the participants of the validation study (younger adults) and the subjects included in the study.
Neuroimaging
Acquisition
For each participant, two T1-weighted, high-resolution, MPRAGE structural MRI (echo time [TE] 2.98 ms, repetition time [TR] 2300 ms, inversion time 900 ms, flip angle 9°, bandwidth 240 Hz/pixel, matrix 256×256, 240 axial slices, isometric voxel size 1/4 1.0 mm3) scans were acquired at the IDIBAPS’s Imaging core facilities with a 3T whole-body MRI scanner (Siemens Magnetom Trio; Hospital Clínic, Barcelona).
The mean time between scans was 1.9 (SD 0.2) years. Three participants were excluded due to severe motion artifacts.
Processing and analyses
Image processing was performed using the unified segmentation procedure [43], DARTEL toolbox [44], and Pairwise Longitudinal Registration [28] toolbox implemented in SPM12 (Welcome Trust for Neuroimaging) according to standard procedures. In the whole sample, we performed whole brain voxel-based morphometry (VBM) analyses evaluating which regions of grey matter volume significantly predicted the total learning score and the immediate cued recall of the AFE-T at baseline (cross-sectional VBM, n = 29), as well as their longitudinal change (longitudinal VBM), using multiple regression within the SPM12 environment controlling for age and total intracranial volume.
In the longitudinal VBM analysis, we only included subjects whose scores declined (n = 20 for total learning score; n = 23 for immediate cued recall), because our focus was on identifying the brain regions associated with impairment over time. We chose these two measures as they are most representative of total learning and cued recall, respectively. We reported results at a threshold of p < 0.001 uncorrected for multiple comparisons and used a threshold of p < 0.005 uncorrected for multiple comparisons for visualization purposes (http://www.nitrc.org/projects/mricron).
Statistical analyses
Statistical analyses were performed using the SPSS (v. 22.0) package for Windows. Following the recommendations of the American Physiological Society [46] and in order to avoid type I errors, alpha value of p < 0.01 was considered to be significant for all the comparative analyses.
Whole sample analyses
For the whole sample, demographical data, levels of CSF Aβ42, CSF tau, and CSF ptau, and APOE ɛ4 frequencies were calculated. Regarding the longitudinal change in both the AFE-T and the standard neuropsychological tests, repeated-measures analyses of variance (ANOVA) were run in order to compare baseline and follow-up performances.
Pearson bivariate correlations were calculated to assess overall associations between the demographic (age, years of education, and cognitive reserve) and biological (AD CSF biomarkers) data and the difference between baseline and follow-up score in the immediate cued recall of the AFE-T (immediate cued recall difference score). The difference between baseline and follow-up in the immediate cued recall difference score was considered to be the best AFE-T outcome measure, as it is the final score of the test (end of the 2nd learning day).
To explore the relationships between the longitudinal memory performance (in both the AFE-T and a standard neuropsychological test) and relevant factors, a linear model was set up. The first analysis included the AFE-T immediate cued recall difference score as the dependent variable and the second one included the FCSRT cued recall difference score. For the second analysis, the FCSRT cued recall difference score was used as a homologous variable of the immediate cued recall difference score of the AFE-T (final cued recall output). Age, the SCD-Q score, CSF Aβ42, and CSF tau were included as covariates in both analyses.
Further analyses: differences between CBN-Tau↑ and CBN-Tau↓
Demographic data, levels of CSF Aβ42, CSF tau, and CSF ptau, and APOE ɛ4 frequencies were compared using t-tests for independent samples and Chi-square analyses when appropriate. We ran ANOVAs to explore possible cross-sectional differences between CBN-Tau↑ and CBN-Tau↓ on the AFE-T at the baseline and at the follow-up assessments.
Regarding the longitudinal change, dependent samples t-tests were run to compare within-group differences between baseline and follow-up scores of the AFE-T in both CBN-Tau↑ and CBN-Tau↓. Follow-up between-group differences were analyzed using mixed-model ANCOVA controlling for age, years of education, and CSF Aβ42 levels with post-hoc Bonferroni corrections.
RESULTS
Sample characteristics
Thirty-two cognitively and biologically normal (CBN) subjects were included in the present study. Age ranged between 53 and 78 years, and educational level between 5 and 18 years. Female/male ratio was 62.5/37.5. Regarding the AD CSF biomarker levels, the mean CSF Aβ42 was 824.1 (SD 210.9) pg/ml [557.5–1405.0], CSF tau was 228.6 (SD 72.3) pg/ml [83.5–364.2], and CSF ptau was 50.9 (SD 13.5) pg/ml [23.5–71.0]. Only 2 subjects (6.2%) were APOE ɛ4 positive (see Table 1).
Demographics, biological data, and AD CSF levels of the whole sample
Data are presented as means (SD; standard deviation) and [range]. CSF, cerebrospinal fluid; Aβ42, amyloid-β isoform 42; Tau, total tau; ptau, phosphorylated tau; CRQ, Cognitive Reserve Questionnaire.
Variables associated with longitudinal cognitive decline
Pearson correlations were run in order to find overall associations between demographic and biological data and cognitive changes. The immediate cued recall difference score (i.e., the difference between the follow up and baseline score on the final cued recall output) was used as an indicator of longitudinal cognitive decline. Demographic data such as age (r = –0.24; p = 0.20), years of education (r = 0.19; p = 0.29) and cognitive reserve (r = 0.16; p = 0.41) were not associated with the immediate cued recall difference score. Neither did the biological variables of CSF Aβ42 (r = –0.04; p = 0.82) and CSF ptau (r = –0.24; p = 0.18). However, we found a significant negative correlation between CSF tau and the immediate cued recall difference score (r = –0.414; p < 0.01; see scatter plot in Fig. 1), indicating worse recall associated to higher CSF tau levels.

Correlations in the whole sample between CSF tau and CSF pTau levels and the immediate cued recall difference score of the AFE-T. ICR, immediate cued recall of the AFE-T (difference between baseline and follow-up scores).
CSF tau driving longitudinal memory changes
The first linear model analysis showed that CSF tau (B = –0.17; p < 0.05) predicted the performance in the immediate cued recall difference score of the AFE-T. Age (B = –0.029; p = 0.705), the SCD-Q score (B = –0.107; p = 0.155), and CSF Aβ42 (B = 0.002; p = 0.317) did not predict change on this cognitive variable. The second model, with the FCSRT cued recall difference score as the dependent variable, did not reveal any statistically significant predictors [age (B = –0.089; p = 0.381), SCD-Q score (B = –0.130; p = 0.188), CSF Aβ42 (B = –0.003; p = 0.327), and CSF tau (B = 0.014; p = 0.148)].
Performance in the AFE-T
Validation study
Thirty healthy younger adults were recruited and randomly divided into two groups (group A, for word list A; and group B, for word list B). Age ranged between 17 and 29 years (mean 20.27; SD = 2.57) and all subjects were college students. There were no significant differences between the validation groups on age or educational level.
Importantly for the present purposes, independent samples t-tests revealed no significant differences between the validation groups A and B in any of the learning runs or in the immediate cued recall (with p-values ranging between 0.20 and 0.86; see Fig. 2). As expected, when comparing the overall performances in lists A and B between the validation subjects and the study participants, significant differences were found in favor of the younger validation subjects in both list A (t(14) = 3.74, p < 0.01) and list B (t(14) = 3.98, p < 0.01).

AFE-T baseline and follow-up performance in the whole sample. ICR, immediate cued recall, *p < 0.01.
The AFE-T in the whole sample
We compared baseline and 18-month follow-up scores of the AFE-T in the whole sample (Table 2; Fig. 2). ANOVAs showed significant differences with a better performance in the baseline session in comparison with the follow-up in the runs 6 (F(1, 31) = 7.13; p < 0.01) and 7 (F(1, 31) = 18.19; p < 0.01) of the first learning day, and in the runs 1 (F(1, 31) = 13.09; p < 0.01), 2 (F(1, 31) = 7.20; p < 0.01), 7 (i.e., total learning score; F(1, 31) = 7.81; p < 0.01) and immediate cued recall (F(1, 31) = 7.61; p < 0.01).
ANOVA of the AFE-T baseline and follow-up scores in the whole sample
Data are presented as means (SD; standard deviation). LS1, 1st learning day; LS2, 2nd learning day; R, learning run number; LS2_R7, total learning score; ICR, immediate cued recall. *p < 0.01; **p < 0.005.
Standard neuropsychological tests in the whole sample
Comparisons of baseline and follow-up scores on standard neuropsychological tests are shown in Supplementary Table 1. There was no significant difference in global cognition between baseline and follow-up scores (F(1,30) = 1.37; p = 0.25), as assessed by the MMSE. Nor was there a significant difference on the verbal intelligence score (F(1,30) = 4.12; p = 0.51). Most of the scores obtained at the follow-up assessment were higher than at the baseline, even reaching statistical significance for the free recall subtest of the FCSRT (F(1,30) = 12.25; p < 0.01).
Further analyses: comparisons between CBN-Tau↓ and CBN-Tau↑ groups
Group characteristics
Demographic and biological data for both CBN-Tau↓ and CBN-Tau↑ groups are shown in Table 3. No differences between CBN-Tau↓ and CBN-Tau↑ groups were found in age (t(30) = 0.80; p = 0.428), years of education (t(30) = 0.09; p = 0.928), cognitive reserve (t(26) = –1.03; p = 0.312), or CSF Aβ42 levels (t(30) = 2.04; p = 0.039). There were no significant differences between groups in terms of gender distribution (χ2 = 0.35; p = 0.554) or in APOE ɛ4 allele frequency (χ2 = 1.88; p = 0.170). Significant differences were found in CSF tau (t(30) = 9.32; p < 0.01) and CSF ptau (t(30) = 6.31; p < 0.01).
Demographics, biological data, and AD CSF levels of the CBN-Tau↓ and CBN-Tau↑ groups
Data are presented as means (SD; standard deviation). CSF, cerebrospinal fluid; CBN, cognitively and biologically normal; Aβ42, amyloid-β isoform 42; Tau, total tau; ptau, phosphorylated tau; CRQ, cognitive reserve questionnaire. aχ2 statistic; *p < 0.01.
Differences between CBN-Tau↑ and CBN-Tau↓ groups
ANOVAs were run to determine whether CBN-Tau↑ and CBN-Tau↓ groups showed cross-sectional differences at baseline and/or follow-up performance in the AFE-T.
There were no significant differences between groups in either baseline or follow-up performance (with p values ranging from 0.184 to 0.960 and 0.596 to 0.967, respectively).
Regarding the longitudinal change within each group, dependent samples t-tests were run in order to examine differences between baseline and follow-up learning runs. For CBN-Tau↓, learning runs 5 and 7 of the first learning day of the follow-up were significantly lower than at the baseline. In the CBN-Tau↑ group, these differences were found in the runs 6 and 7 (i.e., total learning score) and in the immediate cued recall of the second learning day (Table 4; Fig. 3). When ANCOVAs were run to analyze longitudinal between-group differences, the performance difference between the baseline and follow-up immediate cued recall scores between the CBN-Tau↓ and CBN-Tau↑ groups revealed that the CBN-Tau↑ had a larger difference in the immediate cued recall difference score than the CBN-Tau↓ subjects did (F(1,31) = 8.37; p < 0.01). The differences in the rest of the learning runs showed no significant differences between groups (see Supplementary Table 2; Fig. 3).

Comparison of the AFE-T baseline and follow-up performance in the CBN-Tau↓ and CBN-Tau↑ groups. CBN, cognitively and biologically normal; ICR, immediate cued recall; *p < 0.01 (within-group differences); **p < 0.01 (between-group difference).
Dependent samples t-tests for the AFE-T baseline and follow-up scores of the CBN-Tau↓ and CBN-Tau↑ groups
Data are presented as means (SD; standard deviation). CBN, cognitively and biologically normal; LS1, 1st learning day; LS2, 2nd learning day; R, learning run number; LS2_R7, total learning score; ICR, immediate cued recall. *p < 0.01.
Neuroimaging analyses
We evaluated which regions of grey matter volume predicted the total learning score and immediate cued recall of the AFE-T at baseline, as well as their longitudinal change (total learning score and immediate cued recall difference scores).
Cross-sectional VBM
At baseline, both total learning score and immediate cued recall showed positive correlations with grey matter volume in a similar network including the left and right posterior cerebellar lobes, right and left medial temporal regions, and the right middle occipital gyrus (Fig. 4). For the total learning score, an additional left middle frontal cluster was a significant predictor. No grey matter regions showed a significant negative correlation with either score.

Grey matter volume correlates of the Immediate Cued Recall (A) and Total Learning Score (B) of the AFE-T. Grey matter volume correlates of the immediate cued recall (A) and total learning score (B) performance on the AFE-T. Green represents cross-sectional performance at baseline. Hot color represents the correlates of longitudinal change. Results are displayed at a p < 0.005 threshold for visualization purposes.
Longitudinal VBM
The longitudinal decline in both scores correlated with the rate of grey matter volume reduction in the right posterior cerebellar lobe (Fig. 4). Volume reduction in additional clusters in the left insula for the total learning score, and the left anterior inferior temporal lobe for the immediate cued recall, correlated with decline in memory performance. All grey matter volume correlates of cross-sectional and longitudinal performance on the AFE-T are shown in Supplementary Table 3.
DISCUSSION
We conducted the first study examining a specific sample of cognitively healthy subjects with a normal pattern of AD CSF biomarkers using a highly demanding learning and memory test and VBM, with the aim to identify the earliest biological and structural variables related to longitudinal cognitive decline in cognitively and biologically normal aging. Our main finding showed that CSF tau is associated with longitudinal cognitive changes in this population. Furthermore, we found different longitudinal cognitive patterns by dividing our cohort into two subgroups based on their CSF tau levels. The VBM results also suggested that performance on the AFE-T is related to grey matter volume in a medial temporal, middle frontal, and posterior cerebellar network at baseline, and that there are strategic brain areas driving the observed longitudinal cognitive changes. Our findings suggest that there are biological and structural markers reflecting normal cognitive aging and highlight the critical impact of tau as a good predictor of the earliest memory decline in healthy subjects with normal AD CSF biomarkers. Taken together, since both CSF tau and anatomic MRI are considered biomarkers of neurodegeneration in the current ATN classification [47], our results point the presence of neurodegeneration as a potential predictor of cognitive change.
In the last few years, the relationships between biomarkers and cognition have been intensively studied [25–29, 48] and some studies have suggested a stronger association between cognition and tau than between cognition and Aβ [11, 49]. According to the literature, Aβ accumulates first in the neocortex and then in subcortical areas [50]. Instead, tau usually concentrates first in the medial temporal areas [51] that are closely related to encoding of new information, learning processes and memory. The main finding of the present study was to find of an association between CSF tau and the longitudinal cognitive decline in the AFE-T in a sample of subjects with normal AD CSF biomarker levels. Some longitudinal studies have found associations between cognition and tau [12, 49] in cognitively healthy subjects. Hessen et al. [49] studied 122 subjects with subjective cognitive decline (SCD) and found that a subgroup presenting memory decline during the study period had significantly higher CSF tau levels at baseline. However, it has been suggested that SCD population could have a somewhat higher risk of developing incipient cognitive decline than controls [52]. Furthermore, a study by Glodzik et al. [12] found that elevated p-tau231 levels were related to both decreased memory function and MTL atrophy in a group of normal elderly subjects that experienced longitudinal memory decline. Nevertheless, in this study it is important to note that 1) CSF p-tau231 is a specific marker of neurofibrillary pathology and 2) the group with decreased memory function presented lower levels of the Aβ42/Aβ40 ratio indicating a noticeable contribution of amyloid deposition. In the present study, our statistical model showed that SCD (measured by the SCD-Q) and CSF Aβ42 were not associated with the immediate cued recall difference score of the AFE-T, and allowed the identification of a unique contribution of CSF tau into predicting the longitudinal memory decline.
An important issue was to further investigate our main finding regarding the CSF tau impact on the cognitive performance of cognitively healthy subjects. In this regard, we identified distinct patterns of cognitive decline in the AFE-T between the two groups that were established according to their CSF tau levels. Our findings are enhanced by the fact that the study groups showed a marginal difference in the CSF Aβ42 levels in favor of the group with higher tau levels. This rejects the possibility that Aβ was contributing to the observed cognitive differences and, again, argues in favor of a unique effect of CSF tau in the memory function. As seen in Fig. 3, the group with higher tau levels performed worse in more runs of the follow-up assessment when compared with the group exhibiting lower levels of tau. Furthermore, in their last runs, the lower tau group reached their baseline performance level, whereas the group with higher levels of tau remained below their baseline level throughout the whole follow-up testing. These results suggest that subjects with lower tau levels exhibited practice effects in comparison with those with higher tau. Importantly, the lack of learning effects has been recently identified as a cognitive marker of subtle cognitive changes [53], highlighting its importance as a predictive variable of future cognitive decline. In addition, the change between the baseline and the follow-up immediate cued recall showed a significant difference in favor of the lower tau group. Taken together, our results suggest that CSF tau levels could have an impact on the learning and memory functions of cognitively healthy individuals, and that highly demanding memory tests such as AFE-T could detect this pathological process even in this population.
Our VBM results also suggest that AFE-T performance is related to grey matter volume in a medial temporal, middle frontal, and posterior cerebellar network at baseline (cross-sectional VBM, n = 29), and that there are strategic brain areas driving the observed longitudinal cognitive changes (longitudinal VBM, n = 20 for total learning score and n = 23 for immediate cued recall). We were able to identify grey matter regions the volume of which predicted performance on the AFE-T at baseline, as well as those in which the longitudinal change in volume predicted the amount of decline in the studied measures. Cross-sectional performance at baseline in both total learning and cued recall measures was associated with grey matter volume in a medial temporal and posterior cerebellar network. For the total learning score, an additional left middle frontal cluster was a significant predictor, perhaps reflecting a larger executive function component than that involved in cued recall. These results are in line with extensive previous literature showing that the medial temporal and prefrontal regions are related to performance on a variety of memory tasks [54–56]. The predominant brain region involved in both cross-sectional and longitudinal performance was the posterior cerebellum. This finding was unexpected. However, the role of the posterior cerebellum in a variety of cognitive tasks including language, learning, memory and especially executive functions such as working memory and planning is well established [57–61]. The large extension of the cerebellar clusters with respect to the cortical clusters may be explained by the healthy cognitive and biological state of our cohort. In normal aging, the posterior cerebellar lobes show accelerated volume reduction with respect to other structures [62], and this reduction may be more relevant to cognitive performance than typically shown by studies of diseased patients with cortical lesions. In line with our results, other recent studies including similarly aged and cognitively healthy cohorts [63–65] show involvement of the cerebellum in memory and executive function tests. Taken together, our findings showed structural correlates of cognitive performance in a well-characterized sample of subjects with normal cognition and normal AD CSF biomarker levels, providing evidence on the most vulnerable brain systems related to learning and memory in aging.
Another important topic of the present work concerned the sensitivity of cognitive measures in the assessment of cognitively healthy subjects. Our whole-sample analyses of the standard neuropsychological tests showed improved follow-up performance on most of the tasks, suggesting clear practice effects that are commonly observed in cognitively healthy individuals [53, 66–68]. Practice effects have been explained by several factors such as knowledge about testing procedures, previously learned strategies, and a reduced sense of novelty and nervousness when re-testing [69]. On the other hand, our findings on the AFE-T showed a poorer performance at the follow-up assessment when compared with the baseline, suggesting that the use of a demanding cognitive measure can alleviate practice compound that may mask subtle cognitive changes in cognitively normal subjects. This is crucial for pinpointing age-related cognitive changes that otherwise could become underestimated [70]. Importantly for the present research, our validation study demonstrated that both baseline and follow-up lists of stimuli were equally challenging and therefore comparable, and thus the observed follow-up changes could not be explained by list-related differences. Taken together, our findings support the idea that standard neuropsychological tests are not sensitive enough to detect subtle cognitive changes in a cognitively healthy population [71, 72], and suggest that the AFE-T could be a sensitive task for the early detection of longitudinal cognitive decline.
This study has some limitations. One important issue is its small sample size which limits the strength of the statistical analyses. However, the comprehensive AFE-T protocol allowed for a better and more fine-grained characterization of learning and memory processes. With AFE-T, participants’ learning and memory functions were evaluated using 15 learning runs at both baseline and follow-up assessments for both free and cued recall. Regarding the AFE-T, it is also important to note that only the form A was administered at baseline and form B at the 18-month follow-up. Although our pilot study indicated that lists A and B did not differ in terms of level of difficulty, it would have been more appropriate to counterbalance their presentation for the baseline and follow-up. Another potential limitation of the present study concerns the multiple comparisons problem that arises from the large number of statistical comparisons performed. This was dealt with post-hoc Bonferroni corrections, albeit this is an admittedly conservative method. Moreover, following recommendations [46] to avoid type I errors, alpha level was set to p < 0.01 for all the comparative analyses. Finally, considering the novelty of the present findings and the limited sample size, the results have to be interpreted with caution and replication is called for in further studies involving larger samples of cognitively and biologically normal subjects.
Conclusions
The present study is the first to show that in a sample of cognitively healthy individuals with normal AD CSF biomarker levels, tau, a marker of neurodegeneration, is associated with longitudinal cognitive decline. Furthermore, our results pinpoint critical brain areas related to cross-sectional and longitudinal learning and memory performance in this well-characterized sample. Taken together, our results suggest the presence of neurodegeneration as a potential predictor of cognitive change in healthy individuals. The present findings shed light on the impact of CSF tau in cognition and provide important knowledge about the relationships between biological status and the earliest age-related cognitive changes.
Footnotes
ACKNOWLEDGMENTS
This work has been supported by the Carlos III Health Institute (project PI14/00563), integrated in the State Plan of Scientific and Technical Research and Innovation (2013-2016), and co-financed by the European Regional Development Fund (ERDF) “Una manera de hacer Europa”.
We thank all volunteers for their participation in this study; without their collaboration this work would have not been possible.
