Abstract
The latent variable āĪ“ā, can accurately diagnose dementia. Its generalizability across populations is unknown. We constructed a Ī“ homolog (ādT2Jā) in data collected by the Texas Alzheimerās Research and Care Consortium (TARCC). From this, we calculated a composite d-score ādā. We then tested dās generalizability across random subsets of TARCC participants and to a convenience sample of elderly Japanese persons with normal cognition (NC), mild cognitive impairment (MCI), and dementia (AD) (nā=ā176). dT2J was indicated by Instrumental Activities of Daily Living and psychometric measures. Embedded in this battery were the Mini-Mental Status Examination (MMSE) and an executive clock-drawing task (CLOX). Only MMSE and CLOX were available in both TARCC and the Japanese cohort. Therefore, a second composite variable, āT2Jā, was constructed solely from the factor loadings of CLOX and MMSE on d. The diagnostic accuracy of T2J was estimated in the validation sample, the remainder of the TARCC cohort, and in the Japanese sample. The areas under the receiver operating curve (AUC; ROC) for T2J were compared in each sample, and against d in TARCC. The AUCs for T2J were statistically indiscriminable within TARCC, and in Japanese persons. In Japanese persons, AUCs for T2J were 0.97 for the discrimination between AD versus NC, 0.86 for AD versus MCI, and 0.79 for NC versus MCI. The AUCs for T2J in Japanese persons were higher than any individual psychometric measure in that sample. Valid d-score composites can be extracted from a subset of Ī“ās indicators. Moreover, those composites are exportable across cultural and linguistic boundaries.
INTRODUCTION
We have constructed a latent measure of dementia severity, āĪ“ā, and validated it in several datasets, including well characterized subjects participating in the Texas Alzheimerās Research and Care Consortium (TARCC) study [1] and a convenience sample of Japanese persons [2]. The latent variable Ī“ represents the ācognitive correlates of functional statusā. It is uniquely related to dementia severity as measured by the Clinical Dementia Rating scale Sum of Boxes (CDR-SOB) [3] and accurately distinguishes cases with Alzheimerās disease (AD) and mild cognitive impairment (MCI) from each other, and from controls [1, 5]. Although we have validated Ī“ in datasets that are highly selected for AD cases, Ī“ scores are likely to represent the severity of dementia itself, regardless of etiology. A Ī“ homolog has been independently validated in the National Alzheimerās Coordinating Center āUniform Datasetā among cases with frontotemporal, Lewy body, and vascular dementias, in addition to AD [6].
Ī“ can be constructed from a minimal cognitive assessment. We have constructed Ī“ homologs from as little as an executive clock-drawing task (CLOX) [7] and either the Mini-Mental State Examination (MMSE) [8] or the Executive Interview (EXIT25) [9]. In both cases, the resulting composite variable achieves areas under the receiver operating curve (AUC /ROC) >0.95 for the diagnosis of āAlzheimerās diseaseā versus controls [2, 5].
Given that Ī“ composites (i.e., ād-scoresā) retain their diagnostic accuracy when constructed from bedside measures, valid dementia case finding might be freed from the clinic if d-scores derived in a well characterized reference cohort can be generalized to other populations. This would allow the determination of Ī“ in the field, either prospectively, or post hoc, and in datasets that are currently limited by their lack of comprehensive psychometrics and/or expert consensus clinical diagnoses.
Matsuoka et al. [10] recently validated a Japanese language translation of the CLOX in a convenience sample of elderly Japanese persons with normal cognition (NC), MCI, and dementia (nā=ā176). Their sample offers an opportunity to test the generalizability of a brief Ī“ homolog. We hypothesize that a Ī“ composite constructed in a subset of the TARCC sample will generalize both to the remainder of the TARCC participants, and to Matsuokaās sample. Moreover, we hope to take advantage of TARCCās relatively extensive psychometric battery by constructing the composite from factor loadings that have been adjusted for formalcognitive performance measures. Finally, we plan to limit the composite to cognitive measures only, after dropping Ī“ās target indicator, i.e., Instrumental Activities of Daily Living (IADL). The retention of the diagnostic accuracy of such a ārestrictedā Ī“ composite would extend Ī“ās utility to the future prediction of IADL and other clinical outcomes. It would also facilitate the assessment of Ī“ in a wide range of clinical settings, by removing the requirement to obtain IADL information from a qualified informant.
METHODS
Subjects
TARCC
TARCC was established in 1999 by the 76th Texas Legislature. The Consortium consists of six state institutions: i.e., Texas Tech University Health Science Center, the University of North Texas Health Science Center, the University of Texas Southwestern Medical Center at Dallas, Baylor College of Medicine, the University of Texas Health Science Center at San Antonio (UTHSCSA), and Texas A&M. DRR is PI of the UTHSCSA TARCC site, and a member of TARCCās Steering Committee.
The Consortiumās methods have been described in detail elsewhere [11]. Briefly, the TARCC cohort is a convenience sample of 2,016 subjects containing well-characterized cases of AD (nā=ā920), MCI (nā=ā277), and NC (nā=ā819). Each TARCC participant undergoes a standardized annual examination that includes a medical evaluation, neuropsychological testing, and clinical interview. Diagnosis of AD status is based on National Institute for Neurological Communicative Disorders and Stroke-Alzheimerās Disease and Related Disorders Association (NINCDS-ADRDA) criteria [12]. Institutional Review Board approval was obtained at each site and written informed consent was obtained for all participants.
This analysis was performed on TARCC data obtained prior to 2011. We divided the cohort into two randomly selected subsets. A āvalidation sampleā (nā=ā1,018) was used to construct the Ī“ homolog and validate it against TARCC clinical diagnoses. A ātest sampleā (nā=ā998) was used to test the constructās diagnostic accuracy before exporting its factor weights into the Japanese sample.
Japanese sample
The Japanese participants represented 201 consecutive older persons who visited the Center for Diagnosis of Dementia at Kyoto PrefecturalUniversity of Medicine (nā=ā172), Ayabe City Hospital (nā=ā8), or Fukuchiyama City Hospital (nā=ā2), or who lived in a nursing home (nā=ā19). 62 men and 139 women (mean age±SD, 78.4±6.4 years old; mean education±SD, 11.4±3.0 years) participated in the study. A subset was chosen for this analysis. Inclusion criteria consisted of (i) subjectās age of 65 or above; (ii) a CDR global score of less than 3; (iii) a MMSE score of 10 or above. All participants were comprehensively assessed by a geriatric psychiatrist. On the basis of that exam, the 176 selected subjects were diagnosed as NC (nā=ā45), MCI (nā=ā40), or ādementiaā (nā=ā91). The Ethics Committee of Kyoto Prefectural University of Medicine approved the study and informed consent was obtained from all subjects.
Clinical assessments
TARCC
āTargetā indicators of Ī“: IADL scores [13] were assessed using caregiver ratings. The ability to use the telephone, shopping, food preparation (COOK), housekeeping (HSWK), laundry, use of transportation (DRIVE), ability to handle finances (MONY), and responsibility for medication adherence were each rated on the Likert scale ranging from 0 (no impairment) to 3 (specific incapacity). The IADL items rating COOK, DRIVE, HSWK, and MONY items were used as target indicators of Ī“.
Cognitive indicators of Ī“:
General cognitive ability: The MMSE [8] is a well-known and widely used test for screening cognitive impairment. Scores range from 0 to 30.
Executive function measures: The CLOX [7] is a brief executive cognitive function measure based on a clock-drawing task and is divided into two parts. CLOX1 is an unprompted task that is sensitive to executive control. CLOX2 is a copied version that is less dependent on executive skills. CLOX1 is more āexecutiveā than other comparable clock-drawing tests [14]. Each CLOX subtest is scored on a 15-point scale. Lower CLOX scores are impaired.
The Controlled Oral Word Association (COWA) [15] is a test of oral word production (verbal fluency). The patient is asked to say as many words as they can in one minute, beginning with a certain letter of the alphabet.
Memory: In the Logical Memory I (LMI) [16] test, immediately after a ten second presentation, the subject recalls two paragraphs read aloud.
Digit Span Test (DIS) [16] sums the longest set of numbers the subject can repeat back to the examiner in correct order (forwards and backwards) immediately after presentation.
Attention: Trail-Making Test Part A (Trails A) [17] provides a measure of conceptualization, psychomotor speed, and attention. The subject is asked to draw sequential connections between a series ofnumbers.
Japanese cohort
All subjects were evaluated using previously validated Japanese translations of the CDR, CLOX (J-CLOX), IADL, and MMSE (J-MMSE). Details of the translations and validations of the MMSE and CLOX are available elsewhere [10, 18]. In the Japanese sample, the CDR was scored on an algorithm that derived global ratings of CDRā=ā0, ānormalā; CDR 0.5, āquestionable dementiaā; CDRā=ā1, āmild dementiaā, CDRā=ā2 āmoderate dementiaā, or CDRā=ā3 āsevere dementiaā (excluded from these analyses). The CDR was rated blind to the J-CLOX, but not the J-MMSE
Statistical analyses
Analysis sequence
This analysis was performed using Analysis of Moment Structures (AMOS) software [19]. All analyses were conducted in an SEM framework. We used the Maximum Likelihood estimator. In our experience with large samples such as TARCC, this results in nearly identical parameter estimates to a Baysean approach. However, we repeated the model using a Baysean method to confirm this.
First we divided the TARCC cohort into two randomly selected subgroups of approximately equal size (i.e., nā=ā998 and nā=ā1,018 respectively). The larger sample became the āvalidation sampleā. The smaller became the ātest sampleā.
Next, we used the validation sample to construct a Ī“ homolog from a bifactor latent variable model, as previously described [5]. All observed measures, including both cognitive performance measures and the functional status ātarget indicatorsā (i.e., IADL items), were adjusted for age, education, ethnicity, and gender. Co-variances between the residuals were allowed to be estimated if they were significant and improved model fit.
The new Ī“ homolog [(i.e., ād: Texas to Japanā (dT2J)] was indicated by LMI, Trails A, COWA, and DIS. Its target indicators were four selected IADL items (i.e., COOK, HSWK, DRIVE, and MONY). We also embedded CLOX1, CLOX2, and the MMSE, the only three measures in the TARCC battery that were also available in Matsuokaās sample. Our intention was to use factor loadings of dT2J on those three measures to construct an exportable composite (āT2Jā). This way, the composite weights of T2J would be adjusted for dT2Jās additional formal psychometrics and IADL items, preserving those measuresā influence on the T2J composite. The composites we derived were simple linear sums of observed test scores multiplied by the homologās factor loadings.
The factor determinancy of dT2J was tested by Griceās method [20]. Next, the latent Ī“ homolog was output as two composite variables. ādā was constructed from the factor loadings of the IADL items and all the cognitive measures. āT2Jā was composed solely from the summed products of CLOX1, CLOX2, and MMSE and their respective standardized dT2J loadings. d was then validated as a predictor of observed diagnostic class (i.e., by ROC analysis) and CDR-SB (i.e., by correlation) in the validation sample, then in the test sample. The diagnostic accuracy of T2J was compared to d, first in the same validation sample, then in the test sample.
Finally, we calculated the T2J scores of the Japanese subjects. Only MMSE and CLOX overlap TARCC and the Japanese sample. Therefore, in the Japanese sample, the T2J composite is derived from J-MMSE and J-CLOX scores only, using factor weights derived in TARCC from the latent dT2J construct. Furthermore, while the Full information Maximum Likelihood (FIML) option for missing data was selected by AMOS, the Japanese dataset had no missingness, and so no correction for missing data was required.
We presume that the MMSE and CLOX factor weights of T2J reflect multivariate associations with dT2Jās additional indicators, none of which were available in Japan, but which never the less inform the factor weights exported from Texas to Japan. the diagnostic accuracy of T2J was determined by ROC in that sample and its correlation with CDR global scorestested.
Missing data
These models were all constructed in an SEM framework, using raw data. Modern Missing Data Methods were automatically applied by the AMOS software. Only the ROC analyses were limited to complete cases. Elsewhere, we used FIML methods to address missing data. FIML uses the entire observed data matrix to estimate parameters with missing data. In contrast to list wise or pair wise deletion, FIML yields unbiased parameter estimates, preserves the overall power of the analysis, and is arguably superior to alternative methods, e.g., multiple imputation [21, 22].
Fit indices
Model fit was assessed using four common test statistics: chi-square, the ratio of the chi-square to the degrees of freedom in the model (CMIN/DF), the comparative fit index (CFI), and the root mean square error of approximation (RMSEA). The chi-square fit statistic is sensitive to sample size. In large samples, such as the TARCC, it tends to achieve statistical significance when all other fit indices (which are not sensitive to sample size) show that the model fits the data well. A CMIN/DF ratio <5.0 suggests an adequate fit to the data [23]. The CFI statistic compares the specified model with a null model [24]. CFI values range from 0 to 1.0. Values below 0.95 suggest model misspecification. Values approaching 1.0 indicate adequate to excellent fit. An RMSEA of 0.05 or less indicates a close fit to the data, with models below 0.05 considered āgoodā fit, and up to 0.08 as āacceptableā [25]. All fit statistics should be simultaneously considered when assessing the adequacy of the models to the data.
ROC curves
The diagnostic performance or accuracy of a test to discriminate diseased from normal cases can be evaluated using ROC curve analysis [26, 27]. Briefly the true positive rate (Sensitivity) is plotted as a function of the false positive rate (100-Specificity) for different cut-off points of a parameter. Each point of the ROC curve represents a sensitivity/specificity pairing corresponding to a particular decision threshold. The AUC is a measure of how well a parameter can distinguish between two diagnostic groups (diseased/normal). In TARCC, the consensus diagnosis of expert clinicians was used as the ROC outcome. In the Japanese data, consensus diagnoses were not available. Instead, āADā was coded as CDR ā„1.0, āMCIā as CDRā=ā0.5, and NC as āCDRā=ā0ā. The analysis was performed in Statistical Package for the Social Sciences (SPSS) [28].
Factor determinacy
An essential limitation of the common factor model is that an infinite number of unique factor score composites can be derived from any factor [20]. While they all might be consistent with the factorās loadings, some composites may be orthogonal to others, or even inversely related, potentially resulting in wildly discrepant subject rankings, depending on the composite selected.
However, these can be divided into ādeterminantā and āindeterminantā fractions [29]. Fortunately, many common factor score estimates are highly intercorrelated and yield an identical reproduced covariance matrix [30]. Several statistical methods are available to test a factorās determinacy. We used Griceās āRefined Factor Score Evaluation Program (Equation 5)ā [20]. This method maximizes composite validity and is recommended when the factor composite scores are to be used as āobservedā variables in subsequent analyses (e.g., as predictors). We report two indices from this programās output: āTotal Item Squared Multiple Correlationā (TIMSC), and a āMinimum Correlationā (MC). Acceptable TIMSC and MC shouldbe >0.50.
RESULTS
TableĀ 1 presents mean demographic variables for both TARCC subgroups and the Japanese sample. There were no significant differences between the TARCC subgroups on any measure. However, the Japanese sample differed significantly from the TARCC samples with regard to age, education, gender, and CDR global scores. This is to be expected when comparing two convenience samples.
TableĀ 2 presents cross-group differences in the observed dT2J indicator variables. Once again, there were no significant differences between the TARCC subgroups on any measure. However, the Japanese sample differed significantly from the TARCC samples with regard to observed CLOX1, CLOX2, and MMSE scores. This is again to be expected.
TableĀ 3 presents cross-group differences in observed T2J composite indicator variables, within diagnostic subsets. In Controls, CLOX2 and MMSE scores differed significantly. In MCI, only MMSE scores differed significantly. In AD, CLOX2 and MMSE scores differed significantly. However, these significant cross group differences were clinically negligible (i.e., <3 MMSE and <2 CLOX points).
FigureĀ 1 presents the Ī“ model parameterized for the TARCC test sample (nā=ā998). All observed indicators are adjusted for age, education, ethnicity, and gender. The model has excellent fit [RMSEAā=ā0.012; CFIā=ā0.999; ChiSQā=ā36.31 (32), pā=ā0.027]. A Baysean model produced nearly identical parameter estimates. dT2J exhibited acceptable factor determinancy by Griceās method (i.e., TISMCā=ā0.86; MCā=ā0.72).
TableĀ 4 presents each compositeās correlations with the CDRSOB and /or the CDRGlobal in three samples: TARCC validation, TARCC test, and the Japanese sample. All correlations were strong, ranging from T2J (Japanese) x CDRGlobal: rā=ā0.79, pā< ā0.001, to d(test) x CDRSOB: rā=ā0.94, pā< ā0.001. In general, the correlations were slightly less strong for CDRGlobal versus CDRSOB, and for the Japanese sample versus TARCC.
TableĀ 5 presents each composite variableās AUCs for the discriminations between 1) AD versus NC, 2) AD versus MCI, and 3) NC versus MCI. Each compositeās AUC was best for the relatively easy discrimination between AD cases and controls (AUCā=ā0.95, 95% CIā=ā0.91ā0.98). However, they also performed well at more difficult discriminations, including AD versus MCI (AUCā=ā0.81, 95% CIā=ā0.72ā0.90), and MCI versus NC (0.84, 95% CIā=ā0.77ā0.91). In the TARCC, T2Jās performance was slightly less robust than dās on every discrimination. However, T2Jās discriminations were statistically indiscriminable from dās. Similarly, T2Jās performance in the Japanese sample was statistically indiscriminable from dās performance in the TARCC validation and test samples on every discrimination.
DISCUSSION
We have achieved three important objectives by this analysis. First, we have demonstrated that a Ī“ composite, constructed in one cohort, can be generalized to another well-characterized convenience sample without loss of its psychometric properties. That this can be accomplished across cultural and linguistic barriers and despite significant differences in demographics and observed cognitive performance is a testament to a latent constructās relative freedom from psychometric measurement error, as has been recently reviewed [31]. Nevertheless, the TARCC and our Japanese cohort are both convenience samples. The generalization of Ī“ to a population-based cohort might be more problematical because of a priori differences in the prevalence of dementia, and could limit unrestricted application of Ī“ homologs.
Second, we have succeeded in extracting a composite from a ārestrictedā set of indicators, without sacrificing diagnostic accuracy. This achieves two goals. First, it allows us to use the robust battery of our validation cohort to improve the psychometric properties of the final composite, T2J. Second, the restricted composite allows us to estimate Ī“ scores from cognitive performance data alone, without reference to Ī“ās target indicator (in this case IADL items). This is a remarkable advance. First, it frees us from the necessity of obtaining information from a reliable informant (in the case of IADLs) or expert clinicians (in the case of CDR scores). Dementia diagnosis by latent variables then becomes much more feasible for cases without informants, in remote clinical settings, or in the post hoc analysis of datasets that lack CDR or even IADL ratings. Second, this should allow us to use cognitive performance data to estimate the target indicator in cases where only cognitive performance data are known.
This is important because Ī“-like models are modular and can be directed to any target indicator. We havepreviously constructed one such Ī“ āorthologā representing āthe cognitive correlates of depressive symptomsā, and associated it with the default mode network by voxel based morphometry [32]. By a similar method, we have identified serum insulin-like growth factor binding protein-2 as a biomarker of age-specific cognitive change [33]. Other potential ortholog targets might be driving ability, decision-making capacity, financial, and/or medication management. Now it becomes feasible to estimate these outcomes from Ī“ās cognitive performance indicators alone. As we have demonstrated here, these need not be very sophisticated measures. They might be simple bedside measures that are easily acquired, at the bedside, via telephone, or the Internet. Moreover, they might represent only a subset of the indicators used to construct the original latent variable [2].
It is also important to note that both Ī“ and its alternative orthologs could be constructed from the same cognitive assessment, and simultaneously estimated in individuals after a single clinical interview. Moreover, given the high AUCs we have achieved for some discriminations, those outcomes might be confidently estimated and interpreted without expert review or adjudication.
Ī“ scores have had difficulty discriminating MCI from controls. We originally thought this weakness might be due to a lack of variance in Ī“ās IADL indicator(s) among MCI cases. However, MCI samples often demonstrate impairments in IADL [34], and we later found Ī“-scores can accurately discriminate controls from MCI in non-Hispanic whites [35]. Therefore, we speculated that TARCC clinicians might be experiencing difficulties with the diagnosis of MCI in Hispanic cases. If so, then T2J should perform better in distinguishing controls from MCI in the Japanese sample, where Japanese clinicians are examining ethnic Japanese patients in their native language. However, although the performance of T2J in Japanese persons was slightly more robust than in TARCC, it did not perform statistically better than in TARCC (TableĀ 5). This then suggests that MCI may itself be a heterogeneous condition, composed of subgroups with widely varying Ī“ scores and hence widely varying risks of near term conversion to dementia.
Ī“ ranks subjects along a continuous dimension of dementia severity. Thus, longitudinal change in Ī“ is a very strong determinant of future CDR scores independently of baseline Ī“ [6, 35]. In contrast, gā is unrelated to functional status, and therefore to dementia, by definition. Thus, it ranks cases along an orthogonal dimension of cognitive performance that is irrelevant to future dementia status. As a consequence, gā and Īgā make trivial contributions future CDR scores independently of baseline Ī“ and ĪĪ“ [36].
Since MCI is defined by cognitive impairment in the absence of dementia, it should be comprised of at least three subgroups: cases with poor gā, cases with intermediate d-scores, and cases with both. These are broadly recognizable as āamnesticā, ānon-amnesticā, and āmulti-domainā MCI. Only the latter two would be at near-term risk of conversion, and gās contribution to the last category would be superfluous. Thus, an adjustment for gā may improve upon the ability of Ī“ to make this important discrimination.
We have not demonstrated Ī“ās factor invariance across language or culture. That would have been impractical here because d, from which T2Jās factor weights were derived, was indicated by cognitive measures not available in our Japanese sample. It may not even be feasible to achieve factor invariance across widely disparate populations or cohorts, because Ī“ās factor loadings are determined in part by each indicatorās measurement error, which may vary across samples. Our aim instead has been pragmatic. Regardless of whether T2J can be found to demonstrate measure invariance, it achieves a statistically indiscriminable classification of Japanese persons.
This is partly because, as latent variables, Ī“ homologs will be relatively immune to the unique measurement biases of their indicators. However, any latent constructās āreificationā as a composite score potentially introduces new measurement error. In very constrained conditions, this can lead to unacceptable discrepancies in a compositeās psychometric properties [20]. Regardless, the methods that are most commonly employed to generate factor composites lead to identical outcomes [30].
As in this study, other Ī“ homologs we have tested all exhibit acceptable determinancy. Moreover, although the reification of Ī“ as a restricted T2J composite may have introduced some measurement error, this 1) explicitly does not include the measurement biases that characterize its psychometric indicators (e.g., cultural or linguistic psychometric biases) and 2) empirically does not affect the psychometric performance of T2J relative to a homolog constructed and validated within Japanese persons, despite statistical differences in these two samplesā demographics and observedcognitive performance (and those measuresā administration in three different languages).
In summary, latent variables can precisely quantify dementia status, in contrast to categorical diagnoses such as āMCIā. The resulting composites rank order subjects along a continuous dimension of dementia severity. We have demonstrated that a latent variableās target indicator can be omitted from its composite, indeed the majority of its indicators can be omitted, without a drop in diagnostic accuracy. The resulting T2J dementia phenotype is generalizable across cultural and linguistic boundaries. If this can be demonstrated in other Ī“ homologs, Ī“ composites may hold great promise for dementia case-finding.
Footnotes
ACKNOWLEDGMENTS
This study was made possible by the Texas Alzheimerās Research and Care Consortium (TARCC) funded by the state of Texas through the Texas Council on Alzheimerās Disease and Related Disorders.
DRR and RFP have disclosed the results of these analyses to the University of Texas Health Science Center at San Antonio (UTHSCSA), which has filed patent application 2012.039.US1.HSCS and provisional patents 61/603,226 and 61/671,858 relating to the latent variable Ī“/dās construction and biomarkers.
Investigators from the Texas Alzheimerās Research and Care Consortium: Baylor College of Medicine: Rachelle Doody MD, PhD, Mimi M. Dang MD, Valory Pavlik PhD, Wen Chan PhD, Paul Massman PhD, Eveleen Darby, Monica Rodriguear RN, Aisha Khaleeq; Texas Tech University Health Sciences Center: Chuang-Kuo Wu MD, PhD, Matthew Lambert PhD, Victoria Perez, Michelle Hernandez; University of North Texas Health Science Center: Thomas Fairchild PhD, Janice Knebl DO, Sid E. OāBryant PhD, James R. Hall PhD, Leigh Johnson PhD, Robert C. Barber PhD, Douglas Mains, Lisa Alvarez, Rosemary McCallum; University of Texas Southwestern Medical Center: Perrie Adams PhD, Munro Cullum PhD, Roger Rosenberg MD, Benjamin Williams MD, PhD, Mary Quiceno MD, Joan Reisch PhD, RyanHuebinger PhD, Natalie Martinez, Janet Smith; University of Texas Health Science Center ā San Antonio: Donald Royall MD, Raymond Palmer PhD, Marsha Polk; Texas A&M University Health Science Center: Farida Sohrabji PhD, Steve Balsis PhD, Rajesh Miranda, PhD; Essentia Institute of Rural Health: Stephen C. Waring DVM, PhD; University of North Carolina: Kirk C. Wilhelmsen MD, PhD, Jeffrey L. Tilson PhD, Scott Chasse PhD. This research is also supported by āRedesigning Communities for Aged Societyā project (Research Institute of Science and Technology for Society, Japan Science and Technology Agency: RISTEX, JST).
