Abstract
Despite much recent interest in music and dementia, music perception has not been widely studied across dementia syndromes using an information processing approach. Here we addressed this issue in a cohort of 30 patients representing major dementia syndromes of typical Alzheimer’s disease (AD, n = 16), logopenic aphasia (LPA, an Alzheimer variant syndrome; n = 5), and progressive nonfluent aphasia (PNFA; n = 9) in relation to 19 healthy age-matched individuals. We designed a novel neuropsychological battery to assess perception of musical patterns in the dimensions of pitch and temporal information (requiring detection of notes that deviated from the established pattern based on local or global sequence features) and musical scene analysis (requiring detection of a familiar tune within polyphonic harmony). Performance on these tests was referenced to generic auditory (timbral) deviance detection and recognition of familiar tunes and adjusted for general auditory working memory performance. Relative to healthy controls, patients with AD and LPA had group-level deficits of global pitch (melody contour) processing while patients with PNFA as a group had deficits of local (interval) as well as global pitch processing. There was substantial individual variation within syndromic groups. Taking working memory performance into account, no specific deficits of musical temporal processing, timbre processing, musical scene analysis, or tune recognition were identified. The findings suggest that particular aspects of music perception such as pitch pattern analysis may open a window on the processing of information streams in major dementia syndromes. The potential selectivity of musical deficits for particular dementia syndromes and particular dimensions of processing warrants further systematic investigation.
Keywords
INTRODUCTION
Despite much recent interest [1–3], the impact on music processing of Alzheimer’s disease (AD) and other dementias has not been fully defined. Music is first and foremost a complex acoustic phenomenon, and the perception of music requires the parsing of a musical stimulus of interest against the acoustic background (musical scene analysis: [4]), representation of the musical source (instrumental or vocal timbre), and tracking of pitch (melody) and temporal (rhythm, meter) information to create a coherent musical ‘object’ [5]. This formulation suggests that music presents the brain with a difficult problem of auditory information processing, entailing the decoding of a number of perceptual and cognitive modules [6, 7]. On both computational and neuroanatomical grounds, these processes are likely to be vulnerable to the effects of neurodegenerative diseases, most notably AD and primary progressive aphasia syndromes that target peri-Sylvian cortex (progressive nonfluent aphasia (PNFA) and logopenic aphasia (LPA): [8–11]). A substantial body of structural and functional neuroimaging work in the healthy brain and in patients with focal brain lesions has delineated distributed cortico-subcortical networks that analyze the dimensions of music [6, 13]: these networks closely overlap the networks targeted in canonical dementia syndromes [14, 15]. However, to date most studies of music in dementia have focused on the interaction of music and memory [16–18], preserved abilities in trained musicians developing dementia [16, 20], and potential benefits of music more widely in dementia [21–25].
Aside from its intrinsic interest, music is an attractive candidate paradigm for assessing the processing of complex information streams or patterns in both the healthy and the diseased brain. In the domain of musical pitch, patterns of pitch change can be analyzed at two levels: pitch interval (the magnitude of change between consecutive notes) and pitch change direction (the overall pattern of ‘ups’ and ‘downs’ comprising the contour of the melody: [7, 26]).By analogy with the visual domain, pitch interval and melody contour entail the processing of ‘local’ and ‘global’ pitch pattern information, respectively; according to this formulation, pitch intervals can be considered fine-grained musical features while combining these intervals to create a melody contour can be considered an overall (global) ‘gestalt’ of the musical piece. The distinction between these levels is evident in everyday music listening; changing individual pitch intervals is often perceived as a jarring distortion to the musical line, whereas simultaneously changing all pitch intervals but maintaining the relations between them (as in transposition of a melody to another key) retains the same musical gestalt (the tune is still recognizably the same). The concept of local versus global processing levels is fundamental for understanding how percepts are organized and relevant to many sensory domains. Music can be considered a non-visual test case for assessing the generality of effects on sensory information streams and the relative impact on featural (local) versus gestalt (global) perception of clinical disorders such as the dementias.
The local (pitch interval) and global (melody contour) levels of music perception can be differentially affected by focal brain lesions distributed between the cerebral hemispheres [26–30]. Functional neuroimaging studies in the healthy brain have demonstrated separable mechanisms in posterior superior temporal lobe and parietal and prefrontal projection pathways for the processing of pitch interval and melody contour [30, 31]. Available evidence suggests that the decoding of musical patterns may be affected by common dementias and may help to stratify dementia syndromes and pathologies without relying on more specialized (and potentially confounding) verbal mechanisms. Elementary pitch discrimination may be retained in AD and impaired in PNFA, consistent with relatively greater involvement of early auditory areas in neurodegenerative processes that target peri-Sylvian cortex [18, 32–36]. However, the effects of these diseases on more complex pitch pattern processing have not been resolved. Studies in the visual domain suggest that patients with AD may have disproportionate difficulty in the analysis of global structure with relatively intact analysis of local features: this profile is likely to reflect dysfunction of integrative mechanisms in parietal cortex that are particularly targeted by AD pathology but may be more difficult to interpret in the context of associated executive, verbal or spatial deficits [37–43]. In the musical domain, it follows that AD should produce more severe impairment for processing global (melody contour) than local (pitch interval) patterns; whereas in PNFA, a more pervasive impairment of local and global pitch pattern processing would be anticipated. However, currently available neuropsychological instruments for assessing pitch pattern processing often rely on comparisons between paired musical sequences [44]. Such comparisons are vulnerable to concurrent auditory working memory deficits that accompany AD and the progressive aphasias [17, 45–47]; moreover, the explicit serial comparison of sequential melodies is seldom required in everyday music listening. Whereas specific musical working memory systems are likely to be integrally linked to the perception of pitch and temporal patterns in music, these are separable from verbal and other working memory systems that might be generically involved in any auditorytask [48–50].
Temporal patterns in music can similarly be represented at interval (rhythmic, local) and longer duration stress or accent (metrical, global) levels of analysis [51]. Deficits in these dimensions of musical temporal perception occur with focal lesions involving temporal and parietal cortices [52–55] but frequently dissociate from pitch impairment [26, 56] and further dissociate from each other [52, 57]. In the healthy brain perceptual analysis of rhythm and meter engages cortico-subcortical circuitry jointly involved in preparing motor output [58–61]. While evidence in AD is not conclusive [19, 63], impairments of temporal pattern processing have been described in PNFA associated with involvement of peri-Sylvian cortex [64]: this may be attributable both to loss of dynamic precision mediated by the dominant hemisphere and the high temporal resolution required for accurate processing of speech signals[65, 66].
Under most circumstances, the listener must simultaneously decode more than one stream of musical information (whether produced by an ensemble of instruments or a single instrument played polyphonically). Such an analysis is fundamental to the initial parsing of a musical ‘scene’, before more detailed analysis can occur [7]; it is likely to entail an interaction of bottom-up mechanisms for coding perceptual structure with top-down mechanisms for resolving perceptual ambiguities based on stored templates or schemas derived from past experience of music [5, 67].Musical scene analysis has not been widely studied neuropsychologically in clinical populations but is likely to engage posterior superior temporal and parietal lobe regions and their dorsal projections [68–72]. AD has been shown to produce a generic impairment of auditory scene analysis under diverse listening tasks and conditions, including the streaming of sound sequences that bear some similarities to musical melodies; this has been linked to dysfunction of posterior temporo-parietal areas overlapping those involved in music perception [33, 73–75]. On both neuroanatomical and neuropsychological grounds, patients with AD might therefore be anticipated to have difficulties with musical scene analysis; however, this has not been addressed directly in previous work.
In this study we assessed the perceptual components of music processing systematically in a cohort of patients representing major dementia syndromes. Based on the above synthesis of the available literature in both the auditory and visual domains, we anticipated that global versus local levels of musical pitch and temporal information processing and the effect of presenting a melody against a musical background (i.e., processing of musical ‘scenes’) would be the most informative components of music perception to target in the principal neurodegenerative dementias. The framework we addressed in designing the experimental music perception battery is outlined in Fig. 1, adapting the modular model of music cognition proposed by Peretz and Coltheart [7]. We studied patients with typical AD in relation to patients with a syndromic diagnosis of primary PNFA and patients with the LPA clinical variant presentation of AD pathology. Inclusion of these syndromic groups allowed us to assess the effects of disease topography in dominant peri-Sylvian cortex in relation to the predicted underlying molecular pathology (PNFA in relation to LPA). We designed novel neuropsychological tests requiring continuous tracking of musical patterns and detection of deviants from the established pattern in the domains of pitch (interval, melody) and time (rhythm, meter). Our rationale was that detection of a deviant or ‘wrong’ note played during a performance more closely approximates natural music listening than does sequential comparison of melodies or related neuropsychological procedures and also reduces working memory and associated, extraneous executive demands. In addition, we created a test to assess detection of melody patterns within a musical ‘scene’. These dimensions of perceptual pattern processing were assessed in relation to detection of timbral deviants (a measure of sustained auditory attention and executive processing of sound sequences) and recognition of familiar tunes (a widely used index of musical semantic processing). To allow musical perceptual effects to be interpreted without potentially confounding effects from auditory working memory impairment, we controlled for this factor in analyzing the musical performance profiles of our patient groups: our concern here was to adjust for generic, task-related auditory working memory capacity rather than any more specifically musical working memory subsystem.
In line with previous evidence including studies of the healthy brain and focal brain damage, we hypothesized that musical deficits would be produced by all three target dementia syndromes, with distinctive profiles of impairment in each syndrome. More specifically, we hypothesized that typical AD would be associated with relatively greater impairment of global than local levels of musical pattern analysis and impaired musical scene analysis, with a similar profile of deficits in LPA; while PNFA would be associated with deficient analysis of both local and global pitch patterns but with more severely impaired analysis of temporal patterns in music.
METHODS
Participants
The key inclusion criterion for the study was a clinical diagnosis of one of the target dementia syndromes based on current standard, consensus diagnostic criteria [76, 77]. Sixteen patients (six female) fulfilling diagnostic criteria for typical AD (henceforth simply ‘AD’) led by episodic memory decline [76], five patients (two female) with a diagnosis of LPA, and nine patients (seven female) fulfilling criteria for PNFA [77] were recruited. Nineteen healthy individuals (ten female) matched to the patient cohort for age and musical background, with no history of significant neurological or psychiatric disorders were recruited via our Centre’s research participant database. To provide an index of musical background, patients’ caregivers and healthy control participants completed a questionnaire detailing current musical exposure (estimated hours/week) and years of previous formal musical training. Inability to comply with neuropsychological testing, a clinical history of significant hearing loss or congenital amusia would constitute exclusion criteria for a study of this kind; in the event no individuals were excluded on thesegrounds.
All participants had audiometric screening of peripheral hearing function and an elementary pitch discrimination screening test (details in Supplementary Material) designed to establish that they could comply with experimental tests involving the processing of pitch sequences. One potential participant with AD and one with PNFA were excluded as they failed to reach the criterion (>80% correct) required to pass screening.
Demographic, clinical, and general neuropsychological characteristics of the study cohort are summarized in Table 1. Syndromic diagnoses in the patient groups were corroborated with a comprehensive general neuropsychological assessment (Table 1). Brain MR images (available for 28 patients) revealed a profile of atrophy consistent with the syndromic diagnosis in each case; no brain images showed a significant cerebrovascular burden. Twelve of 12 patients in the AD group and three of four patients in the LPA group for whom CSF was available had a protein marker profile suggesting underlying AD pathology (total CSF tau: amyloid-beta1 - 42 ratio > 1, based on local laboratory reference ranges) and the remaining patient with LPA had a positive Florbetapir PET brain amyloid scan; in contrast, five of six patients with PNFA had a CSF profile that did not suggest underlying AD pathology while the remaining patient had a negative brain amyloid scan. At the time of testing, 13 patients in the AD group were receiving symptomatic treatment with donepezil and two with memantine; in the LPA group, four patients were receiving donepezil and two memantine; while in the PNFA group one patient was receiving donepezil.
The study was approved by the local institutional ethics committee, and all participants gave informed consent in accordance with the guidelines of the Declaration of Helsinki.
Experimental music perception battery
General structure
The overall structure of the music perception battery is schematized in Fig. 2; examples of the stimuli are provided in Supplementary Material.
Procedures were adapted from previously described tests of musical deviance detection [78, 79]. Detection of deviant notes has been employed in previous music psychology paradigms that sought to capture on-line analysis of musical information in pitch and temporal domains under conditions that resemble natural musical listening; such paradigms establish a continuous musical context, allow precise programming of incongruent events that violate musical expectancies, capture moment-to-moment tracking of musical structure [78] and allow estimation of processing latencies [79], while at the same time avoiding any explicit requirement to make delayed, serial comparisons with episodes held in musical memory (potentially, a particular advantage in patients with dementia). For the present pitch and temporal processing tests, participants were required to listen to a sequence of musical notes that conformed to a basic pattern with randomly presented notes that deviated from the pattern according to the musical parameter of interest; for each subtest, the task on each trial was to press a button as soon as a deviant note occurred. As a control for the attentional and response requirements of these tests, we designed a task that required detection of timbral deviants in note sequences. For the musical scene analysis (‘tune streaming’) test, highly familiar or novel melodies were presented against a harmonic background with similar perceptual characteristics; the task on each trial was to decide whether or not a familiar tune was present. As a baseline test of tune recognition, familiar or novel melodies were presented alone and the task on each trial was to decide whether or not the tune was familiar; this task acted as a control for the tune recognition component of the musical scene analysis test. The order of the experimental tests (fixed for all participants) and approximate times to administer the tests were as follows: timbre deviant task (approximately two minutes); pitch deviant tasks (approximately six minutes); temporal deviant tasks (approximately five minutes); tune streaming (approximately five minutes); tune recognition (five to ten minutes).
Note sequences were synthesized in MATLAB® (pitch, temporal, timbral deviant detection tests) or MuseScore (tune recognition tests). Stimulus parameters were in line with values used in previous work [78, 79]. Stimuli were presented from a notebook computer running MATLAB® via headphones (Audio-Technica®) at a comfortable listening level (at least 70 dB) in a quiet room. Participants were first familiarized with each test using visual aids (see examples in Supplementary Figure 1) and practice examples to ensure they understood the task instructions and were able to comply reliably. For all tests based on deviance detection, participants were instructed to press the keyboard spacebar as quickly as possible whenever they heard a ‘wrong note’; presses within a pre-specified window (see Supplementary Material for details) after deviant onset were counted as correct detections. Participant responses were recorded for offline analysis. During the tests no feedback was given about performance and no time limits were imposed.
Further details of stimulus parameters in each condition are provided in the Supplementary Material.
Assessment of pitch pattern processing
Stimulus note sequences comprised alternating tonic and dominant pitches (intervals of five or four tones) in one of three keys, spanning two octaves (range F2 to C5) and arranged to form a single simple template melody contour (five ascending – five descending – five ascending – five descending ; see Fig. 2). Individual notes lasted either 500 or 400 ms with inter-note interval of 100 or 80 ms, yielding a base tempo for the sequence of either 100 or 125 beats/minute; total sequence duration for a given trial ranged from 33.1 to 41.4 seconds. Each trialcontained five deviant notes, each of which diverged from the template pitch pattern in one of three ways: local (interval step altered, global melody contour preserved), global (melody contour direction altered), or global direction-only (melody contour direction altered, using only notes previously heard in the pattern so that only the order of notes was altered). The global direction-only condition was intended to access a ‘pure’ process of melody contour analysis that could not be performed (for example) by detecting the occurrence of novel out-of-pattern notes. The magnitude of a deviant ranged from two to eleven semitones; all deviant notes adhered to the diatonic scale of that trial. Deviant notes occurred with random onsets over the course of the trial such that the complete (unviolated) pattern occurred at least once before any deviants occurred and the interval between deviants was at least 1.5 seconds. Four trials for each deviant type were presented as blocks, yielding 20 deviants for each condition (local, global, global direction-only). Responses within 1.5 seconds from deviant onset were counted as correct detections.
If a participant correctly detected fewer than 50% of deviants for any of the condition blocks, they completed half of all subsequent blocks and continued to an easier version of the pitch test (see Fig. 2). In this ‘easy’ version of the test, the pitch pattern comprised only two notes; local deviants changed the interval and global deviants the melody contour. Two trials (10 deviants) were presented for each condition. Data on this test were also collected for six healthy control individuals, to provide a performance reference.
Assessment of temporal pattern processing
Stimulus sequences for the temporal test comprised repeated rhythmic patterns, adapted after the stimuli described by Geiser et al. [78] (see Fig. 2); a given sequence (trial) established a template rhythm with meter (time signature) fixed at either three or four beats per cycle (3/4 or 4/4 time), emphasizing the first note of the cycle (bar) with increased sound intensity. Individual notes had fixed pitch (either D4, Eflat4, or E4) with note duration 200 ms, and a base tempo for the sequence of either 100 or 120 beats/minute; total sequence duration for a given trial ranged from 22.5 to 38.4 seconds. Each trial contained four deviants, each of which diverged from the temporal template pattern in one of two ways: local (rhythm altered by varying inter-note interval by 100 to 600 ms) or global (meter altered by varying the position of a louder note, perceived as an ‘early’ or ‘late’ beat). Deviant notes occurred with random onsets over the course of the trial such that the complete (unviolated) pattern occurred at least three times before any deviants occurred and the interval between deviants was at least 2 seconds. Five trials for each deviant type were presented as blocks, yielding 20 deviants for each condition (rhythm, meter); the same set of temporal templates was used in each condition. Responses within 2 seconds from deviant onset (allowing time to make decisions on the inter-note interval) were counted as correct detections.
Assessment of acoustic deviance detection
In order to assess participants’ performance on acoustic deviance detection beyond the pitch and temporal domains, we designed a test that required detection of timbre deviants presented as elements of a note sequence based on an ascending or descending major scale. Deviants were created by altering the envelope of frequency intensities composing the spectrogram of the tone (its ‘spectral shape’) to produce one of two different timbre variants. Individual notes had duration 600 ms, with base tempo 100 beats/minute and sequence duration 32.4 seconds for each trial. Five timbre deviants were presented randomly during each trial; four trials were presented, yielding 20 timbre deviants in total. Responses within 1.5 seconds from deviant onset were counted as correct detections.
Tune recognition tests
In order to assess the parsing of melodies within complex musical scenes, we designed a test requiring detection (streaming) of tunes against a harmonic background (see Fig. 2). Stimuli were created in three part harmony in a major key with a synthetic piano carrier. The top line of the harmony carried the tune for all trials; 10 trials contained very familiar tunes (based on pilot data in older British individuals; the tunes were Auld Lang Syne, Frere Jacques, God Save the Queen, Jingle Bells, London Bridge is Falling Down, Mary had a Little Lamb, Silent Night, Three Blind Mice, Twinkle Twinkle, Little Star, When the Saints Go Marching In) while for the remaining 10 trials, the original tunes were pseudo-reversed (such that the phrase ended on a long tonic or dominant note). Trial duration ranged between 7 seconds and 13 seconds. On each trial, the task was to respond ‘yes’ if a famous tune was present and ‘no’ if not.
To provide a baseline measure of tune recognition, the same 20 famous and pseudo-reversed tunes previously presented in the tune streaming test were presented in isolation, in randomized order. On each trial, the task was to respond ‘yes’ if the tune was famous and ‘no’ if not.
Analysis of behavioral data
General characteristics
All behavioral data were analyzed using Stata12®. Most demographic and neuropsychological data violated normality assumptions and groups were therefore compared using a Kruskal-Wallis equality-of-populations rank test followed by pairwise comparisons with Wilcoxon rank sum tests with Bonferroni adjusted p-values to account for the six pairwise comparisons; gender distributions were compared using Fisher’s exact test. Tone detection thresholds on audiometry screening were analyzed using a multiple linear regression model adjusted for age, with bias corrected, accelerated confidence intervals calculated from 2000 bootstrap replications. Pairwise comparisons used Bonferroni-adjusted confidence intervals (99.17%) to account for the six pairwise comparisons between experimental groups.
Deviance detection tests
As participants were free to respond at any time, an individual participant’s proportion of correct presses was first adjusted for ‘guesses’ (or indiscriminate responses), as estimated using a Poisson distribution of that participant’s rate of incorrect presses outside the ‘correct’ time window. This can be represented by the following equation:
Processing of familiar tunes
Tune recognition performance was analyzed using a multiple linear regression model comparing groups with bias corrected, accelerated confidence intervals calculated from 2000 bootstrap replications and subsequent Bonferroni-corrected pairwise comparisons. A different approach was required for analysis of the tune streaming task: if a participant was unable to correctly identify a famous tune as famous when presented in isolation, this item was excluded from analysis of their responses on the tune streaming test. This resulted in varying numbers of famous and pseudo-reversed (non-famous) items for each participant on this test. A logistic regression model incorporating all participants’ binary responses, controlling for reverse digit span performance, was used to model scores on the tune streaming task. To take account of any bias introduced by this imbalance of trial numbers, a framework based on signal detection theory was used to fit a logistic regression model for odds of labeling a tune as famous [80]. The dependent variable was a binary category indicating for each test item whether or not each participant in a group had responded ‘famous’. Accordingly, this model assessed famous tune detection accuracy as odds ratios comparing labeling of famous and non-famous tunes across all participants in each group. Here, an odds ratio of 1 corresponds to chance level performance, i.e., the group had equal likelihood of labeling a famous or non-famous tune as famous; an odds ratio >1 corresponds to increased accuracy discriminating famous from non-famous tunes; and an odds ratio <1 corresponds to over-rejection of famous tunes as non-famous or over-labeling of non-famous tunes as famous. Overall effects of experimental group were therefore assessed through the interaction of group and labeling tunes correctly. The Wald criterion was used to test for any interaction effect or specific group differences, with Bonferroni adjusted p-values to account for the six pairwise comparisons between experimental groups.
Correlates of musical perceptual performance
Where deficits on music processing tasks relative to healthy controls were identified, Spearman’scorrelation coefficient was used to assess associations of performance on the relevant musical tasks with background musical training, general disease measures (Mini-Mental State Examination score, symptom duration) and speech encoding measures (word and sentence repetition) in the patient cohort. A threshold p < 0.05 was accepted as the criterion for statistical significance for all associations.
RESULTS
General characteristics of participant groups
The analysis of demographic, clinical, and background neuropsychological data is summarized in Table 1. Due to time constraints, reduced numbers of participants completed particular assessments (these are detailed in Tables 1 and 2). Patient and healthy control groups were well matched for age (χ2 (3) = 6.32, p = 0.10), gender (χ2 (3) = 2.23, p = 0.56), education (χ2 (3) = 6.41, p = 0.09), musical training (χ2 (3) = 3.74, p = 0.29), and current music listening (χ2 (3) = 2.81, p = 0.42). Patient groups were well matched for Mini-Mental State Examination score (χ2 (2) = 1.58, p = 0.45) and symptom duration (χ2 (2) = 0.26, p = 0.88). Patient groups showed anticipated profiles of general neuropsychological impairment.
On the screen of peripheral hearing function, relative to healthy controls, the AD and LPA groups showed no significant performance difference but deficits compared to both the healthy control and the AD group were shown by the PNFA group; there was no difference between the LPA and PNFA groups (details summarized in Supplementary Table 1). A combined audiometry score using the sum of detection thresholds for all frequencies was derived as an overall measure of peripheral hearing function to test for associations with performance on the experimental tasks: no significant associations were found and audiometry scores were therefore not included in further analyses. For the pitch discrimination screening task (Table 1), total scores did not differ significantly between experimental groups (χ2 (3) = 2.66,p = 0.45).
Performance on experimental tests of music processing
Performance profiles for each group on all conditions and mean difference between groups for pairwise comparisons for the experimental musicbattery are presented in Table 2 with further details in Supplementary Tables 2 and 3; individual data are shown in Fig. 3.
Inspection of the individual performance data prior to adjustment for general auditory working memory performance (Fig. 3) suggests that patients in each syndromic group (and most prominently, the progressive aphasia groups) performed substantially worse than the healthy control group across the experimental tests of music processing. However, this was in the context of wide individual variation within each group. An analysis of group performance profiles without adjustment for general auditory working memory effects is summarized in Supplementary Table 4; the following is based on the main analysis adjusted for this factor.
Pitch pattern processing
For the pitch pattern processing tasks, the PNFA group showed overall (across all three conditions) poorer performance compared to the healthy control (beta = –0.47, 95% CI –0.81 to –0.16) and AD groups (–0.33, 95% CI –0.61 to –0.02); no other significant overall performance differences between groups were found. Examining for effects of condition, poorer performance was found across all groups in the global-direction-only compared to the local condition (beta = –0.09, 95% CI –0.17 to –0.03). Compared to healthy controls, the AD group performed significantly worse in the global and global-direction-only pitch conditions but not the local condition; the LPA group performed significantly worse only in the global-direction-only condition; and the PNFA group performed significantly worse in all pitch conditions (Table 2). No significant performance differences between patient groups were identified. No significant correlations of task performance with prior musical training, general disease measures (Mini-Mental State Examination score, symptom duration) or standard speech encoding measures (word and sentence repetition) were found within the patientcohort.
Thirteen patients (five AD, two LPA, six PNFA) were also administered the ‘easy’ version of the pitch pattern test having detected <50% of deviants in the more difficult test (we ran an additional analysis of this subset of patients; data for all patients were included in the main analysis of the more difficult test). Although raw detection scores (Supplementary Table 2) suggested impaired performance of the PNFA and LPA groups in each pitch condition relative to the healthy control group, no significant differences between groups were found after taking auditory working memory performance intoaccount.
Temporal pattern processing
For the temporal pattern processing tasks, no significant effects of patient group on performance were found after adjusting for auditory working memory capacity (versus controls: AD beta = –0.02, 95% CI –0.12 to 0.09; LPA beta = –0.07, 95% CI –0.32 to 0.22; PNFA beta = –0.18, 95% CI –0.40 to 0.03). Across all experimental groups, the global condition resulted in poorer performance than the local condition (beta = –0.10, 95% CI –0.18 to –0.04). However, there was no indication of a significant interaction between condition and group (versus controls×condition: AD beta = –0.06, 95% CI –0.15 to 0.04; LPA beta = –0.10, 95% CI –0.29 to 0.08; PNFA beta = –0.05, 95% CI –0.14 to 0.08).
Timbral deviance detection
On the timbre processing (general acoustic deviance detection) task no significant effect of group on performance was found after adjusting for auditory working memory capacity (versus controls: AD beta = 0.05, 99% CI –0.04 to 0.26; LPA beta = –0.03, 99% CI –0.33 to 0.41; PNFA beta = –0.05, 99% CI –0.23 to 0.07).
Tune recognition tasks
No significant interactions were found between group and correctly labeling a tune as ‘famous’ in the tune streaming (musical scene analysis) task (χ2 (3) = 3.92, p = 0.27), indicating no effect of patient group on performance for this task. No effect of patient group was found for the baseline tune recognition task (versus controls: AD beta = 0.13, 99% CI –0.09 to 0.74; LPA beta = –0.95, 99% CI –4.04 to 2.60; PNFA beta = –0.34, 99% CI –2.83 to 0.74).
Correlations between dimensions of music processing
Within the patient cohort, significant pairwise correlations were found between all measures of pitch pattern and temporal pattern processing (all p < 0.05). Significant correlations were found for performance on the tune streaming and global pitch pattern processing (direction-only) tasks; and for performance on general acoustic (timbral) deviance detection and global pitch (direction-only) and local temporal processing tasks (all p < 0.05; see Supplementary Table 3). Tune recognition correlated with performance on timbral deviance detection; years of musical training correlated with global temporal processing. Peripheral audiometry detection thresholds did not correlate significantly with any of the experimental measures.
DISCUSSION
Here we have shown that canonical dementia syndromes of typical AD, LPA, and PNFA may be associated with profiles of impaired music perception relative to healthy older individuals. Deficits exhibited by the present syndromic groups affected the analysis of pitch pattern and were not simply attributable to prior musical expertise, general cognitive, elementary perceptual, or task factors. After taking general auditory working memory performance into account, detection of acoustic deviants (indexed by varying note timbre) was comparable to healthy controls in all syndromic groups. Patients’ performance on pitch pattern analysis tasks deteriorated with increasing perceptual difficulty (as indexed by the more versus less difficult versions of the pitch pattern tests), consistent with a true deficit of pitch pattern processing. Patients with typical AD had impaired processing of global pitch (melody contour) information but (after accounting for general auditory working memory capacity) intact processing of local pitch (interval) and temporal pattern, as well as intact tune recognition whether in isolation or within a polyphonic ‘musical scene’. Patients with LPA (a syndrome generally underpinned by AD pathology) showed a similar profile with predominant impairment of global pitch processing, albeit the evidence of impairment was most apparent in the more demanding processing of direction-only contour variation (produced by deviance in the ordering of the same note sequence). In contrast, patients with PNFA exhibited deficits affecting local (interval) as well as global (melody) information in pitch patterns but (again, after controlling for general auditory working memory capacity) performance that did not reach statistical significance when compared to control processing of temporal pattern, tune recognition and musical scene analysis.
These findings are broadly consistent with a modular organization of music cognition, as previously proposed [7]. More specifically, the relatively greater impairment of global than local pitch pattern analysis in typical AD and the similar profile in LPA corroborate our experimental predictions. Impaired global processing of pitch information in music is in line with other evidence for defective formation of coherent global stimulus representations in AD: this deficit might reflect increased demand for coordinated integrative computations between temporo-parietal association cortices vulnerable to AD pathology [81–85], though any disadvantage with respect to the coding of local stimulus features is likely to be relative rather than absolute [37–40, 86]. While the processing of global stimulus characteristics unfolding over longer time windows requires attentional resources [40, 86], it is unlikely that the profile of pitch deficits here was entirely underpinned by attentional compromise: as our paradigm required a single response to consecutively presented stimuli, it is unlikely to have taxed divided attention, while demands on sustained attention are likely to have been similar in the timbral deviance detection task, on which the present AD group performed normally. Moreover, pitch pattern deficits in our patient groups were documented after taking auditory working memory capacity into account. On the other hand, the present data suggest any claim that dementia syndromes differentially affect particular components of music cognition must be qualified. Syndromic profiles were documented in the context of wide individual variability (Fig. 3). Moreover, across the patient cohort, correlated performance was observed for processing local and global information and pitch and temporal patterns. As the neural mechanisms mediating different components of music perception are likely to be affected together by the spreading neurodegenerative process, the finding of correlation (or absence of differential impairment) in this setting cannot be used to draw inferences about the underlying cognitive architecture.
Our findings provide further evidence that LPA and PNFA have associated phenotypes of nonverbal auditory impairment [32, 87–89]. The musical phenotype was more severe in the PNFA group here; the involvement of pitch pattern analysis in this syndrome is in line with previous work [32] and suggests a putative mechanism linking generic mechanisms of dynamic auditory encoding with speech production via the dorsal auditory cortical pathway, extending over a range of timescales relevant to processing of individual and sequential speech sounds [9, 90–92], Marked involvement of musical perceptual mechanisms might be anticipated from the severe and focal involvement of auditory association areas in the progressive aphasias [9, 11]. Although we did not demonstrate a correlation of musical measures with standard measures of verbal encoding, pitch processing mechanisms are likely to be more relevant to prosody (a crucial non-linguistic attribute of speech signals) than phonemic sequencing, at least for non-tonal languages. Both perception and production of prosody are abnormal in PNFA [89], raising the possibility of a common mechanism linking musical pitch encoding with the programming of pitch variations in speech.
Allowing for the relatively small cohorts here, the present data offer relatively little support for specific musical signatures of particular dementia pathologies: when syndromic groups were directly compared, no measures indicated robust differences. Our findings suggest that certain musical perceptual attributes such as melody (pitch contour) tax neural computational resources across dementia syndromes; the data do not suggest any simple dichotomization of dementias according to whether they degrade or spare the perception of music. Though the overall profile of pitch pattern deficits suggested some selectivity for particular syndromes (predominantly affecting global pitch characteristics in AD and LPA and more widespread in PNFA), any syndromic effects were relative rather than syndrome-specific. An important theme emerging from this study is that auditory working memory deficits are likely to amplify any purely musical deficits (compare Fig. 3 prior to adjustment for this factor and the unadjusted analysis summarized in Supplementary Table 4 with the adjusted significance attributions in Table 2): patients presenting with impairments of music processing may be comparably impaired on processing of other extended auditory information streams. The extent to which musical deficits reflect music-specific processes might then depend on the nature of the interaction between auditory working memory and the relevant musical characteristic, as suggested by previous work [48–50]. This factor may partly explain the lack of evidence here for specific deficits of temporal pattern processing from music, which we anticipated particularly in the PNFA group [64]. While in principle this could also reflect the small study cohort or failure to sample relevant temporal windows (as temporal characteristics of music are less constrained than pitch variations), temporal analysis of musical sequences may be more intimately reliant on auditory working memory capacity than pitch analysis; moreover, the linkage between temporal analysis and working memory mechanisms may have a neuroanatomical substrate (including insular cortex) that is targeted in PNFA [93]. This is a difficult issue to resolve, as particular subsystems of working memory are likely to be music-specific [48–50]. On the other hand, it has been shown that musical listening tasks also engage domain-general working memory circuits [94]. In this study, we set out to adjust for a general index of (verbal) auditory working memory capacity that might affect performance on auditory tracking tasks; however, the relative effects of music-specific and music-independent buffer systems on the perception of musical structure will only be resolved by assessing indices of musical pitch and temporal short-term memory directly alongside standard working memory measures. Even if underpinned by separable neural substrates, music-specific and music-independent working memory systems may be affected together in neurodegenerative disease.
Perhaps more surprisingly, we found no evidence for a specific deficit of musical scene analysis (as indexed by the processing of polyphonic melodies) in any patient group. This contrasts with previous work characterizing a generic impairment of auditory scene analysis in AD [33, 72–75] and may in part reflect the relatively wide variation in healthy control performance on our tune streaming test (Fig. 3). However, it is also possible that the analysis of musical scenes benefits to a greater degree than other kinds of auditory scenes from the availability of stored templates, here familiar tunes. If (as the present data also suggest) recognition of familiar tunes is relatively preserved in these dementia syndromes, patients may be able to engage ‘top-down’ mechanisms for parsing the musical scene even despite degraded mechanisms of early scene analysis [95, 96].
The present findings have certain practical and clinical implications. Deficits of pitch pattern analysis here were demonstrated using stimuli that required tracking of musical information over time. Conventional neuropsychological (including music psychology) procedures that assess discrete stimulus tokens presented in isolation may not fully capture information processing deficits in dementia, particularly earlier in the disease course. Novel neuropsychological instruments that require on-line tracking of information streams could be relevant for assessing the encoding of verbal as well as musical sequences in these diseases. Though conclusions must be qualified pending further detailed investigation, our findings suggest that particular musical attributes (such as rhythm) might be used as a vehicle for designing musical interventions in at least some patients with dementia. At present, formal trials of music therapy in dementia often yield disappointing results despite anecdotal reports of benefit [97]. Targeting of those musical components where the prospect of benefit is greatest would provide a rational basis for music therapy in patients with dementia; moreover, rhythm-based interventions might be more straightforward to deliver and outcomes (for example, patient motor responses) may be easier to code than more complex musical interventions [98].
Considered together, our findings suggest that music perception may be a useful paradigm for assessing neural computational processes that support the analysis of information streams over different time windows and levels of complexity and the impact of dementias on those processes. Impaired encoding of pitch contour may have potential utility as a novel nonverbal and nonvisual biomarker across dementia syndromes while the overall profile of pitch pattern processing may have relative selectivity for particular pathologies such as AD. In addition, potential linkages between musical pitch processing and the processing of speech prosody warrant further investigation particularly in patients with progressive aphasia. This study has several limitations that should direct further work. The numbers of patients recruited here to particular syndromic groups were small and additionally, particular tests were not completed by all patients (Table 2), further underlining the need to study larger patient cohorts to corroborate these findings. Future study cohorts should ideally encompass a wider range of neurodegenerative syndromes and diseases with longitudinal assessments to determine the sensitivity and specificity of particular musical perceptual indices and patterns of evolution over time, ultimately with histopathological and molecular correlation. Combining multi-center patient cohorts might improve power to detect effects and potentially, to stratify neurodegenerative syndromes and pathologies. Even within the AD spectrum, factors such as age and disease stage (severity) may importantly modify phenotype [39]; moreover the present data underline the need to take into account individual variability, which may be amplified by prior musical competence. Besides the analysis of local and global information per se, dementia syndromes might degrade associated cognitive operations, such as perceptual learning, executive shifting between processing levels, or top-down attentional modulation of perceptual mechanisms [39, 43]: future musical paradigms should address these possibilities. Structural and functional neuroanatomical studies comparing patient and healthy older cohorts will be required to delineate the alterations in brain mechanisms of music processing produced by these diseases and to more fully understand the musical phenotypes demonstrated here. More broadly, our findings may provide a prima facie case for tackling theoretical and practical issues of sensory information processing in the dementias that go beyond the domains of language and vision.
Footnotes
ACKNOWLEDGMENTS
We are grateful to all participants for their involvement. The Dementia Research Centre is supported by Alzheimer’s Research UK, the Brain Research Trust and the Wolfson Foundation. This work was funded by the Wellcome Trust, the UK Medical Research Council and the NIHR Queen Square Dementia Biomedical Research Unit. HLG was supported by an Alzheimer Research UK PhD Fellowship. CNC is supported by The National Brain Appeal –Frontotemporal Dementia Research Fund. SJC is supported by an Alzheimer Research UK Senior Research Fellowship and an ESRC/NIHR (Grant no ES/K006711/1). JDW was supported by a Wellcome Trust Senior Clinical Fellowship (Grant No 091673/Z/10/Z).
