Abstract
Background:
The early diagnosis of Alzheimer’s Disease (AD), particularly in its prodromal stage, mild cognitive impairment (MCI), still remains a challenge. Many computational tools have been developed to successfully explore and predict the disease progression. In this context, the Spherical Brain Mapping (SBM) proved its ability in detecting differences between AD and aged subjects without symptoms of dementia. Being a very visual tool, its application in predicting MCI conversion to AD could be of great help to understand neurodegeneration and the disease progression.
Objective:
In this work, we aim at predicting the conversion of MCI affected subjects to AD more than 6 months in advance of their conversion session and understanding the progression of the disease by predicting neuropsychological test outcomes from MRI data.
Methods:
In order to do so, SBM is applied to a series of MRI scans from the Alzheimer’s Disease Neuroimaging Initiative (ADNI). The resulting spherical brain maps show statistical and morphological information of the brain in a bidimensional plane, performing at the same time a significant feature reduction that provides a feature vector used in classification analysis.
Results:
The study achieves up to 92.3% accuracy in the AD versus normal controls (CTL) detection, and up to a 77.6% in detection a of MCI conversions when trained with AD and CTL subjects. The prediction of neuropsychological test outcomes achieved R2 rates up to more than 0.5. Significant regions according to t-test and correlation analysis match reported brain areas in the literature.
Conclusion:
The results prove that Spherical Brain Mapping offers good ability to predict conversion patterns and cognitive state, at the same time that provides an additional aid for visualizing a two-dimensional abstraction map of the brain.
Keywords
INTRODUCTION
Alzheimer’s disease (AD) is the most common neurodegenerative disorder in the world, with more than 46 million affected, a number that is expected to grow up to 131.5 million by 2050 [1]. Many computational tools have been developed to aid in the diagnosis of AD, however, its early diagnosis remains a challenge, especially in its prodromal stage known as mild cognitive impairment (MCI). An early and more accurate diagnosis of possible AD would reduce the mistreatment of MCI and an earlier treatment of the disease when its symptoms are still subtle. This might positively impact the progression of AD, delaying the apparition of a more severe symptomatology and improving the life expectancy and quality of life of the patients and their families.
Currently, cognitive testing is often used to guide in the diagnosis procedure. Clinical tests such as Mini-Mental State Exam (MMSE) [2] or the Alzheimer’s Disease Assessment Scale-cognitive subscale (ADAS-cog, in 11 and 13-question variants) [3] are often used to assess cognitive decline in patients at risk of suffering AD. However, recent studies [4, 5] show that non-specific neuropsychological scores, such as MMSE, could be a possible source of false positives in MCI diagnosis.
Neuroimaging, with its ability to perform in vivo exploration of brain structure and function, could be a key source of information to enhance MCI diagnosis. Many works have explored the possibility of multivariate analyses of structural and functional imaging using techniques such as principal component analysis [6, 7], support vector machines [8–10], texture analysis [11–13], or volume and shape analysis [14–19]. These works used semi-automatic methodology to segment and extract features from images that led to higher discriminative Computer Aided Diagnosis (CAD) systems [20], offering performances up to 80% in the diagnosis of MCI.
Obtaining reliable and objective information from neuroimaging data could effectively increase the accuracy of MCI diagnosis. For its part, predicting neuropsychological tests outcomes could help to understand the relationship between structural changes in the brain and cognitive state (as measured by cognitive tests). This is key to validate diagnostics and establish a connection between neuroimaging and clinical tests. However, only a few works, such as [21], attempt to predict cognitive test outcomes.
In this work, we will evaluate the usefulness of the Spherical Brain Mapping (SBM) texture measures [22] in two main environments: to predict conversion of MCI subjects and to predict outcomes of different cognitive tests such as MMSE, ADAS-11, and ADAS-13. In the first case, we will use a support vector machine classifier (SVC) to classify the derived SBM maps, as in [8]. For predicting test outcomes, we will test two regression models [23]: support vector machine regression (SVR) with linear kernel, a robust and generalizable model even in high-dimensional spaces [24], and a least absolute shrinkage and selection operator (LASSO), that estimates sparse coefficients and can therefore be used for feature selection [25]. In addition to this, we will identify and visualize the brain areas more related to disease progression using hypothesis testing and correlation analysis, and we will test our approaches on MRI data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) [2]. Finally, we draw some conclusions about this work.
METHODS
Spherical brain mapping
SBM is a framework proposed in [22], whose main feature is to project three-dimensional structural brain images to bidimensional maps representing the spatial distribution of a certain texture measure. To do so, it constructs a mapping vector

Definition of the mapping vector vθ,φ and the spherical coordinates.

Example of the six SBM maps generated over the GM segmented image of a CTL subject.
The average of Vθ,φ is defined as:
The remaining values are computed using the general formula for the r
th
moment:
And finally, the only morphological measure: the thickness of the tissues crossed by
Once we have computed the different measures at each coordinate pair, we can build the SBM maps by putting each value at its corresponding pair. This way, we obtain maps such as the ones presented in Figs. 2 and 3. The python package used has been uploaded at http://github.com/SiPBA/mapBrain.

Example of the six SBM maps generated over the WM segmented image of a CTL subject.
Hypothesis testing
A widespread technique for assessing where class differences are located is hypothesis testing. In this context, we assume a null hypothesis of H0, that is, the hypothesis that there is no difference between two populations. On the other hand, we formulate the alternative hypothesis H1: the assumption that there are some differences. Then, a statistical test is applied, from which a certain p-value—the probability of rejecting H0 by chance—can be derived.
The t-test is a hypothesis testing statistic extensively used in the neuroimaging community, since it is the basis for the SPM and VBM analyses [26]. In this work, we will apply an independent two-sample t-test [27] to each pixel of the resulting SBM measures, in order to display maps where the more discriminant areas are highlighted.
The t-test used in this work quantifies the differences between two classes assuming normal distribution and independent variances, which most of the SBM measures at different (θ, φ) comply with. Let
Pearson’s correlation coefficient
Correlation maps based on the Pearson’s correlation coefficient (PCC, also known as Pearson’s r) have been generated in order to visually assess which regions are more correlated with test outcomes (MMSE, ADAS-11, and ADAS-13). This is done for each measure value vθ,φ in each direction.:
Regression analysis
Support vector regression
SVR trains support vector machines to predict a continuous variable from previously unseen data [24]. In contrast to the general lineal model (GLM) regression, if two features are highly correlated, the computed SVR weights are similar. This is a clear advantage for its use in SBM maps, where the measures are spatially distributed in coordinate pairs, sharing inherent similarities due to neighborhood. In this work, we use the ɛ-SVR implementation. For each training vector
LASSO
Least absolute shrinkage and selection operator (LASSO) is a regression model that estimates sparse coefficients. This is very useful under the assumption that very few features influence the dependent variable
Evaluation
ADNI dataset
Data used in the preparation of this article were obtained from the ADNI database (http://adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early AD.
The database used in this article was obtained with from the ADNI1 initiative, containing subjects that have undergone periodical assessment of cognitive function and MRI acquisition every 6 months, with a total 2,182 T1-weighted MRI images (for further information, see http://www.adni-info.org). These images were spatially normalized to the standard SPM template using the SPM8 software [26], normalized with default parameters (non-rigid, preserve concentrations), and them segmented into grey matter (GM) and white matter (WM) using the VBM8 toolbox.
In the classification experiments, we will consider four different cohorts. The first subset corresponds to all AD subjects in their baseline acquisition (that is, when the disease is less advanced), and will be regarded as AD from this point. Two more subsets containing the last available visit from CTL and MCI stable will be selected as well, namely the CTL and MCI-S (from MCI stable) subsets. Finally, a MCI subset containing all converters in the last session available prior to their conversion will form the MCI-C (from MCI converter) set. Check the full demographics in Table 1.
Demographics of the ADNI dataset
This choice is not trivial. Since the final aim is to estimate how discriminant are the SBM maps in predicting MCI conversion, we wanted to make the classification task as difficult as possible. Since the MCI-to-AD progression can be viewed as a continuum, the last CTL acquisition will be more difficult to differentiate from the earliest AD, and likewise, the last MCI stable acquisition will be more difficult to differentiate from MCI converters. That way, the MCI-S will be more representative of the whole spectrum commonly classified as MCI but never progress to AD.
For comparison purposes, we will provide in Table 2 the Voxels-As-Features (VAF) [28] performance of this database using the segmented GM and WM maps in the AD versus CTL and MCI-C versus MCI-S classification.
VAF performance of this dataset
Evaluation parameters
We will test the SBM statistic maps in two different environments: in predicting the conversion of MCI to dementia and trying to predict neuropsychological test outcomes. The first one is a classification approach, and therefore, we will employ a linear SVC to estimate how relevant the differences between classes are. The final performance values estimate how discriminant the different measures are when classifying two classes. Accuracy (acc.), Sensitivity (sens.), and Specificity (spec.) and their standard deviation (SD) will be estimated as:
The performance estimates are obtained via a stratified 10-fold cross-validation [29]. In brief, this procedure divides the whole dataset in 10 subsets with the same proportion of classes, and iteratively uses 9 of this subsets for training and tests against the remaining one. The procedure is repeated 10 times, from which we obtain the average and standard deviation of the different proposed estimates. Performance estimates of the classification of AD versus CTL groups and MCI converters versus stable will be computed. Since MCI is a rather heterogeneous class, with overlapping converters and non-converters, we additionally will use AD and CTL subjects to train the classifier, and then identify the MCI subjects more similar to AD as converters and those more similar to CTL as stable.
The second environment corresponds to regression analysis. Since SBM maps have never been tested on this environment, we will employ two different strategies: SVR and LASSO. To quantify the quality of the prediction of our regressive model, we use the Mean Square Error (MSE) and the coefficient of determination R2. These performance measures are estimated as the average of all 10 measures computed within a 10-fold cross-validation loop. The MSE measures the mean error of the outcomes of the trained model, and is computed as:
For its part, R2 measures the proportion of variance in the dependent variable y
i
(the outcomes of the cognitive tests) that is explained by the trained model. The best R2 possible is 1.0; a 0.0 is the score of a constant model that always predicts the expected value of
Furthermore, visual maps for both environments will be provided. For differences between groups, we will use the t-test in each value of the SBM maps. To estimate the correlation of each SBM feature with the neuropsychological test outcomes, we will use the Pearson’s r, as defined before.
RESULTS
Anatomical reference
To better understand the SBM maps and the location of differences and correlations, we have created two different templates (external, Fig. 4, and internal, Fig. 5, depending on their distance to the crust) that will be used as a reference throughout this section. These templates have been created by projecting each of the regions of the well-known Automatic Anatomical Labelling atlas [30] by using the SBM-average measure, then smoothing out the projection of each individual region, and finally assigning different colors.

Structures at the cortex projected using SBM, including: 1) Precentral gyrus, 3) frontal superior gyrus, 5) frontal superior orbital gyrus 7) frontal middle gyrus, 9) frontal medial orbital gyrus, 11) frontal inferior operculum, 13) frontal inferior pars triangularis, 15) frontal inferior orbital gyrus, 17) rolandic operculum, 19) superior motor area, 23) frontal superior medial gyrus, 25) frontal medial orbital gyrus, 27) gyrus rectus, 43) calcarine sulcus, 45) cuneus, 47) lingual gyrus, 49) occipital superior gyrus, 51) occipital middle gyrus, 53) occipital inferior gyrus, 55) fusiform gyrus, 57) postcentral gyrus, 61) parietal inferior gyrus, 63) supramarginal gyrus, 65) angular gyrus, 67) precuneus, 69) paracentral lobule, 81) temporal superior gyrus, 83) temporal pole superior part, 85) temporal middle gyrus, 87) temporal pole middle part, 89) temporal inferior gyrus.

Subcortical structures projected using SBM. Same label is used for both left and right hemisphere as a reference. 21) olfactory bulb, 31) anterior part of the cingulate gyrus, 33) middle part of the cingulate gyrus, 35) posterior part of the cingulate gyrus, 37) hippocampus, 39) parahippocampal gyrus, 41) amygdala, 71) caudate nucleus, 73) putamen, 75) globus pallidus, 77) thalamus.
Visualization
In this section, we focus on the two aforementioned visualization techniques: the hypothesis testing (to assess differences between the AD, MCI-C, MCI-S, and CTL groups) and maps that display the correlation between SBM measures and neuropsychological tests.
Differences between groups
In Fig. 6, we show the significant regions (uncorrected t-test, p < 0.001) of each SBM measure computed over the GM tissue. In these maps, we can see shapes that remind us of some internal structures of the brain such as the hippocampus, amygdala, and cingulate gyrus. Blue and red colors, respectively, mean negative and positive t-values. This shows us that the average measure is smaller in AD than in CTL in some areas that have been widely linked to AD, as it could be expected. Similar areas present a similar behavior when using entropy. Conversely, the t-values for kurtosis or skewness present similar patterns to the average or variance measures, although with smaller statistical significance and negativevalues.

Significant regions according to the uncorrected t-test applied over the AD versus CTL scenario and GM, thresholded at the p < 0.001 value. The reference for internal structures is superimposed on the maps.
In Fig. 7, we show similar information (thresholded significant areas, uncorrected t-test, p < 0.001) of the WM SBM measures. In this case, the most significant areas are located at the top of the image, which might correspond to the middle part of the cingulate gyrus (33) in the internal reference and the superior motor area (19) in the external. However, these areas correspond more specifically to the longitudinal fissure, as it can be seen at Figs. 2 and 3. Regions at the inferior part of the hippocampus can also be appreciated in the average, skewness, kurtosis, and variance maps, and at the posterior part of the caudate nucleus (71). See Supplementary Figures 1–6 for a more detailed analysis of the significant regions in AD, MCI, and CTL.

Significant regions according to the uncorrected t-test applied over the AD versus CTL scenario and WM, thresholded at the p < 0.001 value. The reference for internal structures is superimposed on the maps.
Correlation with neuropsychological tests
In this section, we will assess the correlation between each SBM measure and the MMSE and ADAS-13 tests to check which regions and measures are more related to variations in cognitive ability.
In Fig. 8, one general pattern can be described in the SBM’s average, kurtosis, skewness, and variance measures for GM. In these particular cases, there is a noticeable positive (average and variance) and negative (kurtosis and skewness) correlation between MMSE scores and SBM values at the hippocampus (37), amygdala (41) and parahippocampal gyrus (39). In these cases, the PCC reaches absolute values higher than 0.46. However, two different correlation patterns (smaller, with PCCs around 0.3) appear in entropy and thickness. In these two cases, the maximum correlations are located at the bottom part of the parahippocampal gyrus (39) and, in general, the limits of the temporal lobe. In the case of entropy, we also find some high positive correlations at the frontal superior orbital gyrus (5), occipital superior (49), middle (51), and inferior gyri (53), the angular gyrus (65), and temporal middle gyrus (85), which also could match the thalamus (77) and posterior part of the caudate nucleus (71), globus pallidus (75), the hippocampus (37), and the olfactorybulb (21).

PCC maps computed between the SBM-GM measures and the MMSE. The reference of internal and external structures is superimposed on the figures, depending on the most relevant patterns.
The correlations of the SBM GM-derived measures with ADAS-cog measures are always higher (in some cases, more than 0.1 in absolute value) than the correlations found in MMSE. However, the patterns are extremely similar, but inverse, due to the nature of each kind of test.
Correlations of the WM-derived SBM measures with neuropsychological tests are in all cases smaller, as can be checked at Fig. 9 for ADAS-13 (correlations are again higher for ADAS-cog than for MMSE). The patterns are much more related to WM variations, and they can hardly be related to the previous structures. However, most of them show highest PCC values in the superior motor area (19), an area shared between the precentral and postcentral gyrus, including the parietal inferior gyrus (1, 57, and 61 respectively) and, for the thickness measure, the outermost part of the temporal gyrus, near the temporal pole (87). These patterns can also be checked in more detail in Supplementary Figures 7–13, where the correlation between SBM measures of GM and WM tissues and the MMSE and ADAS-13 is shown with the internal and external templates for reference.

PCC maps computed between the SBM-WM measures and the ADAS-13 (the best scoring of the two ADAS-cog tests).
Classification analysis
In this section, we present the results of evaluating the 6 SBM maps for GM and WM tissues under the AD versus CTL and MCI-C versus MCI-S scenarios.
AD versus CTL
Under the first scenario, the classification analysis obtained a performance shown at Table 3. As we can easily see in that table, the best results are obtained by the average measure on the GM tissue, and both average and entropy in the WM tissue. The tendency is that average, entropy, and variance obtain similar performance, whereas the skewness, kurtosis, and below all, thickness, are less discriminant between AD and CTL classes.
Performance of the different SBM measures under the AD versus CTL scenario
MCI-C versus MCI-S
Now we will focus on predicting the conversion of MCI subjects. To do so, we will analyze the system when trained and tested with MCI-C and MCI-S. The performance achieved by the different SBM measures is presented at Table 4.
In this case, we can see that the best performing measures are the average (for GM tissue) and variance (for WM tissue). These maps achieve high sensitivity, although smaller than the one obtained when using the VAF approach (see Table 2). Therefore, even when obtaining a significant feature reduction, the performance degrades. This is probably due to the MCI subclasses sparsity in the SBM space and a high overlapping rate between converters and non-converters. Therefore, we aim to test whether these two subclasses are better classified when using two more separate clusters, AD and CTL subjects, to train the SVC.
In the second part of Table 4, we provide the classification performance of that very approach, training on AD and CTL subjects and predicting converters by choosing those more similar to the AD class. We evaluate the different maps when predicting the conversion of MCI subjects by training the classifier with AD and CTL subjects. It is clear that there exists a significant performance increase, of more than 10% in for GM and 5% in WM. In this case, the best performing SBM measure is again average for GM and entropy for WM, outperforming the VAF approach (see Table 2) in this task.
Performance of the different SBM measures under the MCI-C versus MCI-S scenario
Regression analysis
Now we will evaluate how two different regression techniques, LASSO and SVR, can predict neuropsychological test outcomes using SBM maps as a source. The results for these approaches can be found on Table 5.
Coefficient of determination and MSE of the different models for the three cognitive tests, based on the SBM transformation of GM and WM maps. Best values per test and model are highlighted
Regarding the regression model, SVR performs better than LASSO in ADAS-cog and GM. Conversely, LASSO correlates better with MMSE in all cases and ADAS-cog in WM. Of the three tests, the ADAS-13 scores are best modelled, followed by ADAS-11, and, at a long distance, MMSE. The maps that better predict test outcomes are the entropy (in GM and WM) and the variance (for GM when using the SVR). MMSE prediction achieves a poorer performance, with small R2 values for MMSE, in both GM and WM maps. On the contrary, the variance/entropy maps for GM outperform the same measures in WM by almost 0.2.
DISCUSSION
The main purpose of this work is to test the ability of the SBM maps in two challenging scenarios: predicting the conversion of MCI subjects to dementia at least six months in advance and predicting neuropsychological tests outcomes from MRI data in order to have another objective measure of cognitive state. This is enhanced by the special characteristics of the SBM mapping, which allows a direct bidimensional visualization of textural changes in the brain.
However, before discussing these two scenarios, we will try to provide intuition about what the different measures mean. In the first place, average, variance, skewness, and kurtosis are related to different statistical moments of Vθ,φ. The average of these intensities is then easy to relate to tissue density, and therefore, a decrease in this measure could be a hint of atrophy in that direction. Note how this occurs in the AD and MCI-C groups at Fig. 10, where the mean of all SBM-average maps for each of the four defined groups are displayed. Similarly, the variance of Vθ,φ measures the variability in its values. Higher variance could mean higher neurodegeneration, since its values may be more disperse. Skewness and kurtosis are often considered a measure of symmetry and ‘peakedness’ of the statistical distribution of the intensities in Vθ,φ. Smaller skewness means that the intensity has decreased, which can also be associated to atrophy. A higher kurtosis is the result of extreme deviations from the mean, which is indicative of more abrupt transitions between grey levels, and therefore, better defined boundaries of GM and WM. Smaller kurtosis can also be related to neurodegeneration.

Mean of the SBM-average maps for the AD, MCI-C, MCI-S, and CTL groups. Zoom over the hippocampus region.
Thickness is the only morphological measure in this work, and it measures the distance between the first and last voxels in Vθ,φ higher than a threshold. It is a rough estimate of tissue thickness, very different from more precise -and complex- cortical thickness measures provided by software such as FreeSurfer [31] or the CAT12 toolbox [32]. Finally, entropy comes from information theory, and it measures the ‘amount of information’ or ‘randomness’ of Vθ,φ. In this work, a higher entropy means a more random distribution of the voxels in Vθ,φ, which may be linked to the structural changes due to the neurodegeneration process.
Now, regarding the first scenario, we presented the regions with more significant differences between AD and CTLs above. The highlighted areas in GM, especially in the afterward higher scoring measures, correspond mainly to the hippocampus, amygdala, and parahippocampal gyrus. Differences between MCI-C and MCI-S are also located at these areas, which also project to the external temporal middle gyrus and temporal pole. In WM, there are no significant differences in any SBM measure between MCI-C and MCI-S, which might be due to neurodegeneration affecting first the GM and later progressing to WM [33]. However, significant WM differences can be found between AD and CTL in the boundaries of the superior motor area which correspond to the longitudinal fissure, and also some other regions overlapping to the precuneus, cuneus, paracentral lobule, and the frontal middle gyrus.
These differences correspond to existing evidence in the literature. The hippocampus is widely regarded as one of the most discriminating structures in structural MRI [11, 19]. Its influence in declarative memory is patent, and many studies have reported atrophy in early AD, along with surrounding structures, such as the amygdala and the parahippocampal gyrus. Most studies report early neurodegeneration in MCI in these structures and generally in the temporal lobe [11, 19]. Martinez-Torteya et al. [12] also reported the left medial orbital gyrus and the left inferiolateral remainder of the parietal lobe (area between inferior parietal gyrus and the supramarginal gyrus). The SBM measures are then collecting useful information on neurodegeneration that can be useful for further assessment and visualization.
Regarding the classification performance, we tested four different groups: AD, CTL, MCI-C, and MCI-S. In the AD versus CTL performance, we achieved high accuracy (around 90% for WM, higher for GM) that may be due to an improvement since our last work. While in [22] we placed the origin of vθ,φ at the geometrical centre of the image, now we place it at the Anterior Commissure, one landmark anatomical position in the literature and one of the points where the two cerebral hemispheres are connected. Thank to this, it may have improved the computation of the SBM measures in different regions of interest, especially internal regions such as the hippocampus or the caudate nucleus.
However, the most relevant classification results are those that predict MCI conversion between 6 and 12 months in advance. The highest performance was obtained when training with AD and CTL subjects, which may be due to a better definition of the separation hyperplane of the SVC by two more distant clusters than the higher-variability MCI subclasses. The reported results are comparable to the state of the art, [9, 14–18]. Of these studies, which achieved accuracies ranging from 70% to 80%, the best performing approach was the one proposed by Misra et al. [15], which achieved between 75–80% accuracy on a subset of 27 converters and 76 non-converters from the ADNI dataset. Chupin et al. [14] used the larger population, with 210 individuals (76 converters) from ADNI, achieving an accuracy of 64%. Some of these even used histopathological labels, acquired postmortem, instead of clinical labels, which are more heterogeneous and make the classification task more difficult. Our performance results match those achieved by the best methodology in the state of the art reports, with a larger, more heterogeneous and less prevalent population (70 converters + 64 non-converters), which is key for comparison.
Of all SBM measures used in this article, the average, variance, and entropy measures achieved similar performance. Particularly, average worked very well with segmented GM maps, whereas the entropy always performed better in the WM. Thickness yielded the poorest performance, perhaps due to the lack of relevant features in areas of interests, such as the hippocampus.
In the second scenario, the PCC maps show the areas that are highly correlated with different neuropsychological tests. The main impression is that the ADAS-cog scores (especially the ADAS-13) achieve higher correlations with the MRI data than the MMSE, suggesting that these measures are much more specific of AD progression and cognitive decline. This is also supported by current literature [4, 5], where some authors claim that non-specific neuropsychological scores may be a source of false positives, especially in MCI.
In GM, correlations with the ADAS-13 test outcomes scored higher than 0.5 in areas such as the hippocampus and parts of the parahippocampal gyrus. This was repeated in the average, variance, skewness, and kurtosis measures. Entropy and thickness measures revealed different patterns, with maximum correlations in the lower boundary of the temporal lobe. As for the WM, the correlations were smaller (maximum values around 0.3), with more correlated areas in the longitudinal fissure and, only in the case of entropy, the occipital middle gyrus, the parietal inferior gyrus and the junction between the temporal middle and inferior gyri.
The performance of the regression methods is displayed at Table 5. The reader may note the large differences between the MSE in the ADAS-cog tests and the MMSE, but this is related to a much higher variability in the values (ADAS-13 ranges from 0 to 63 in our database, and ADAS-11 from 0 to 48) compared to the MMSE (ranges from 14 to 30). See Table 1 for more information on the demographics of the dataset. The R2, however, takes into account this variability, yielding a more consistent measure of the performance, and that is why we will focus on it throughout this section.
Two main properties can be inferred from the data in Table 5. First, the differences in R2 scores stress again the usefulness of the ADAS-cog against the MMSE in measuring Alzheimer’s progression. Second, they show that the SVR performs better than LASSO in predicting test outcomes for ADAS-cog in GM. Since LASSO internally focuses on a small number of features, it may be an indication that there are moderately correlated covariates in the maps, which would eventually degrade its performance [34]. Then, SVR is a more adequate choice in this case. Conversely, LASSO outperforms SVR in modelling MMSE, however its performance is smaller than with the ADAS-cog. In WM, our regression techniques can find even less evidence to predict neuropsychological tests, which adds support to the fact that GM neurodegeneration is a better predictor of cognitive state.
The maps with higher performance in predicting neuropsychological tests are entropy and variance. This differs with the largest PCCs computed for each vθ,φ in the maps, where average, kurtosis, skewness, and variance had the largest correlations. However, PCCs were obtained for each individual SBM value at the (θ, φ) position, where the regression methods use all values in the maps to create a linear model that predicts the neuropsychological outcomes. Therefore, it is possible that entropy, where individual PCCs were relatively smaller, contains relevant—and different—information in more areas than the other maps, and those values can be used to inform a more accurate model of the test. When looking at the PCC maps for the SBM-entropy, large negative and positive correlations were found in GM (in contrast to all other measures), including negative PCCs at the lower boundary of the temporal lobe and positive at the angular gyrus, thalamus, occipital gyrus, caudate nucleus, etc., that have also been found in functional imaging [35]. The pattern in WM pattern is similar, which may be the reason why the SBM-entropy achieves best performance in regression.
The SBM mapping, as any other technique intended to visualize three-dimensional volumes in two dimensions, is subject to deformations due to the projections. Our choice for SBM, given its simplicity, was the equirectangular projection, also known as plate carrée projection [36]. However, it is neither equal area nor conformal, introducing distortions as the elevation θ increases. Other projections, for example the stereographic projection [36] (see Fig. 11 for comparisons), might seem more convenient for visualization. This last projection is conformal and therefore preserves angular relationships, yielding a more brain-like, intuitive visualization of the SBM maps. We plan to include this approach in future releases of the python mapBrain package, which contains SBM.

Comparison between the equirectangular projection for two SBM examples (an average map and the PCC maps computed in the section “Correlation with Neuropsychological Tests”) and the stereographic projection, currently in development.
In summary, the SBM offers a series of bidimensional maps that provide a significant feature reduction along with a visual aid that could be useful to locate differences, in combination with either hypothesis testing or correlation analysis. We have proven that a classifier trained with SBM average maps from AD and CTL subjects can effectively predict conversion of MCI in the session immediately previous to their reported conversion session (that is, in all cases, between 6 and 12 months in advance) with a 77% accuracy and 73% sensitivity. And ultimately, we demonstrated that these maps can be used to predict scores that match the outcomes of clinical tests, providing an objective replication tool based on imaging data.
CONCLUSIONS
In this work, we propose the application of SBM of MRI in two challenging scenarios involving AD: predicting the conversion of MCI subjects to dementia at least six months in advance and predicting neuropsychological tests outcomes from MRI data. These two classification and regression scenarios are tested using SVM and LASSO, providing numerical evidence of their performance. Moreover, we also analyze visual evidence on the affected regions via hypothesis testing (for differences between groups) and correlation coefficient (for correlation with test outcomes).
The SBM maps provide a significant feature reduction over the million-voxel MRI images, while achieving a state-of-the-art performance (77% accuracy) in the prediction of MCI conversion more than 6 months in advance. The best prediction is obtained with a classifier trained with AD and control (CTL) classes which, thanks to being separate clusters, define a better separation hyperplane. Significant differences between maps have been found at the hippocampus, parahippocampal gyrus, and more generally the temporal lobe, all of which have a proven relationship with ADprogression.
For its part, the relationship between some of the SBM measures in GM and the outcomes of neuropsychological tests has also been proven. The highest performance of this regression analysis is achieved when predicting the ADAS-cog, supporting existing evidence that this test is more indicative of the disease stage than the MMSE. R2 scores up to 0.504 were obtained when predicting ADAS-13 with SVR on the SBM-variance maps. The SBM average and variance measures are also noticeably correlated with this test (Pearson-r higher than 0.5). A visual analysis of the Pearson-r reveals the utility of the bidimensional representation in identifying changes in texture along the brain. Regions typically related with AD, achieve highest correlation rates, but the most persistent area is the hippocampus.
In summary, SBM offers a series of bidimensional maps that provide a significant feature reduction along with a visual aid that could be useful to locate differences, in combination with either hypothesis testing or correlation analysis. These maps keep relevant textural information that allows a system to predict MCI conversion more than six months in advance. New projections to enhance visualization are currently being explored in order to provide a more intuitive insight of the brainstructure.
Footnotes
ACKNOWLEDGMENTS
This work was partly supported by the MINECO/ FEDER under the TEC2015-64718-R project, the Consejería de Economía, Innovación, Ciencia y Empleo (Junta de Andalucía, Spain) under the Excellence Project P11-TIC- 7103 and the Salvador de Madariaga Mobility Grants 2017.
Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (
). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
