Digital tools and biomarkers for cognitive screening: A systematic review

Abstract

Objective

This systematic review aimed to evaluate and compare three major types of digital cognitive assessment tools—digitized traditional tests, newly developed digital tools, and digital biomarkers. It also explored their validity, feasibility, and potential integration into community-based healthcare, along with clinical utility and implementation challenges.

Methods

A search of nine electronic databases (July 2024) was conducted following PRISMA guidelines. Eligible studies included those reporting diagnostic accuracy and practical application of digital cognitive assessment tools or biomarkers. Data were narratively synthesized, and diagnostic accuracy metrics such as sensitivity, specificity, and area under the curve (AUC) were extracted. The review was prospectively registered in PROSPERO (CRD42024564512).

Results

A total of 48 studies involving older adults with or at risk of cognitive impairment were included. These studies evaluated digital cognitive tools or biomarkers and reported diagnostic performance in comparison with established tests or clinical diagnosis across clinical, community, and home settings. Approaches were categorized into digitized traditional tests (AUC: 0.65–1.0), newly developed digital cognitive tools (AUC: 0.75–0.94), and digital biomarkers (AUC: 0.59–0.95). Although diagnostic accuracy was generally promising, most studies were conducted in controlled settings with limited longitudinal or real-world validation. Evidence supporting the role of community-based healthcare professionals, such as nurses, remains limited; however, self-administered and remote assessments indicate potential for future community-based implementation. Further validation in real-world, community-based settings is essential before widespread implementation.

Conclusions

Digital cognitive assessments and biomarkers offer valuable opportunities to enhance early screening and continuous monitoring of cognitive impairment. Based on these findings, we suggest that the choice between digitized traditional tests and newly developed tools should be tailored to the specific context, with digital biomarkers serving as complementary tools to strengthen cognitive assessment, particularly in community-based nursing practice. Policymakers and healthcare organizations should establish guidelines for the safe and effective adoption of digital cognitive tools in community and clinical settings.

Keywords

cognitive dysfunction, community health nursing, digital biomarkers, digital health, neuropsychological tests

Introduction

The global prevalence of dementia is increasing, with South Korea alone recording nearly 890,000 dementia patients aged 65 years and older in 2021, an increase of nearly 50,000 cases per year.¹ Dementia not only impacts cognitive and functional abilities but also imposes significant economic and psychological burdens on families and healthcare systems. Early detection plays a critical role in slowing dementia progression and improving patient outcomes, demanding convenient and effective methods of cognitive testing.²

Traditional neuropsychological measures, including the Mini-Mental State Examination (MMSE) and the Montreal Cognitive Assessment (MoCA), have served as the gold standard for cognitive screening.³ These tests, however, require expert clinicians, are time-consuming, and are often insensitive to early symptoms of cognitive decline.^4,5 Moreover, the low uptake of national dementia screening programs (only 16.6% of eligible individuals underwent screening through public health centers in 2015⁶) suggests the need to improve access to cognitive assessments and their usability across populations.

Several systematic reviews have previously examined digital cognitive assessments, focusing mainly on specific modalities such as mobile-based or AI-assisted tools.^7,8 However, few have integrated a comparative evaluation across digitized traditional tests, newly developed digital tools, and digital biomarkers. The present review addresses this gap by synthesizing and contrasting these three categories to provide a comprehensive overview of current evidence.

Traditional neuropsychological tests, although widely used as standard screening tools, raise several concerns in community settings. These evaluations usually require trained specialists, are time-intensive, and may not be accessible to older adults with limited mobility or geographical barriers.^2,4,5 Their sensitivity to subtle or early-stage cognitive deterioration is also limited, particularly when administered as single time-point assessments.^9,10 Prior systematic reviews of digital cognitive assessment tools have typically focused on particular modalities, including mobile-based applications and artificial intelligence-enabled systems.^7,8 These reviews have not combined different categories of digital approaches, nor have they assessed diagnostic accuracy together with real-world feasibility. This underscores a major gap in the literature, especially regarding the use of these tools in community healthcare environments. Digital biomarkers, defined as objective and quantifiable physiological and behavioral data obtained via digital devices, are considered an important complement to traditional cognitive evaluations.^11,12 They are clinically relevant because they capture continuous, real-life data, allowing for the early detection of subtle cognitive changes not detected by traditional tests.¹³ Such advantages make digital biomarkers an effective tool for improving early screening and long-term monitoring of cognitive outcomes.

Digital cognitive assessment tools have emerged as promising alternatives to address the limitations of traditional methods. These tools utilize mobile applications, artificial intelligence (AI)-assisted scoring, and adaptive testing to provide automated, scalable, and precise cognitive evaluations.^14,15 Moreover, self-administered remote-testing methods improve accessibility, particularly for older adults in community settings where medical resources are limited.¹⁶ Alongside these digital cognitive assessment tools, digital biomarkers comprise objective, quantifiable physiological and behavioral data collected through digital devices such as smartphones, wearable sensors, and computer-based platforms.^11,12 In the context of cognitive assessment, these biomarkers enable the detection of subtle changes in cognitive function through real-world interactions and passive data collection. For example, studies included in this review have shown that speech and language features, such as pauses and semantic complexity, can serve as early indicators of cognitive decline.^17,18 In addition, motor and behavioral patterns, including gait characteristics such as turn velocity and stride length, have demonstrated predictive value for cognitive impairment.¹⁹ Activity-based monitoring using smart home environments has also been effective in identifying changes in daily living patterns associated with cognitive decline.²⁰ These characteristics highlight the potential of digital biomarkers for continuous, non-invasive, and ecologically valid monitoring of cognitive function in real-world settings. Traditional neuropsychological assessments are limited by being time-consuming, requiring trained personnel, and lacking scalability, reinforcing the need for innovative digital approaches.^2,4,5 Recent evidence also indicates that AI-driven multimodal biomarkers have shown superior performance for early Alzheimer’s detection compared with conventional cognitive tests.²¹

In this review, digitized traditional tests refer to paper-based neuropsychological assessments that have been digitized with minimal procedural modification. Digital cognitive assessment tools denote newly developed computerized or mobile-based instruments designed to measure cognitive functions through novel interfaces. Digital biomarkers, in contrast, involve passive or active data derived from sensors, wearables, or device interactions that serve as indirect indicators of cognitive function.

Recent studies have expanded the scope of digital assessments using AI and multimodal data integration, demonstrating improved predictive validity for early Alzheimer’s detection.^22,23 Furthermore, updated systematic reviews highlight the growing importance of digital biomarkers in community-based cognitive health monitoring.²⁴ Despite these advances, several challenges remain regarding the implementation of digital cognitive assessments in real-world settings. Score discrepancies between digital- and paper-based tests, variations in digital literacy among older adults, and the need for standardized scoring frameworks must be addressed to ensure reliability.²⁵ Although digital cognitive assessments and biomarkers hold promise, their practical feasibility and effectiveness in community healthcare settings require further evaluation.

Community nurses play a critical role in bridging this accessibility gap by incorporating digital cognitive assessments into routine home visits and community health programs. However, visiting nursing services face structural and operational challenges, including limited resources, administrative burdens, and role ambiguity.^26,27 To enhance early detection and intervention, an integrated community-based cognitive function-monitoring system that leverages digital cognitive assessment and biomarkers should be developed.

This systematic review aimed to evaluate and compare different types of digital cognitive assessment tools and to examine their validity, feasibility, and potential integration into community-based healthcare. Specifically, this review assessed (a) the validity of digital adaptations of traditional neuropsychological tests, (b) the advantages and limitations of newly developed digital cognitive assessment tools, and (c) the emerging role of digital biomarkers in cognitive function monitoring.

Although previous studies have explored digital approaches for cognitive assessment, existing systematic reviews have often focused on specific tools or technologies without comprehensively integrating multiple domains, such as digitized neuropsychological tests, newly developed digital tools, and digital biomarkers.^15,16 In addition, limited attention has been given to the combined evaluation of diagnostic accuracy and practical feasibility, particularly in community-based settings. Therefore, a more integrative and practice-oriented review is needed.

Methods

This systematic review was conducted and reported in accordance with the PRISMA 2020 guidelines and was prospectively registered in the PROSPERO database (registration number: CRD42024564512). In addition, the PRISMA checklist has been included as an appendix to enhance methodological transparency and reproducibility.

The research question was developed using the PICOS framework, including Population (older adults with or at risk of cognitive impairment), Intervention (digital cognitive assessment tools and digital biomarkers), Comparator (traditional cognitive tests or clinical diagnosis), Outcomes (diagnostic accuracy and feasibility), and Study design (observational and diagnostic accuracy studies).

The data search was conducted over a period of five days, from July 24, 2024 to July 29, 2024. An updated literature search was conducted in March and April 2026 using the same databases and search strategy to identify any newly published studies. No additional studies met the inclusion criteria for this review. A comprehensive search was conducted across nine electronic databases: MEDLINE, Embase, Cochrane Library, CINAHL, PsycINFO, RISS, KISS, KMbase, and KoreaMed. The search strategy combined controlled vocabulary (e.g., MeSH terms such as “Cognitive Dysfunction” and “Dementia”) with free-text keywords (e.g., cognitive, dysfunction, impairment, Alzheimer*) covering three concepts: cognitive impairment (such as “dementia,” “mild cognitive impairment,” and “cognitive dysfunction”), cognitive assessments (such as “neuropsychological tests” and “cognitive screening”), and digital technologies (such as “digital health,” “mobile applications,” and “digital biomarkers”). Terms were combined using Boolean operators (AND, OR) to ensure comprehensive retrieval across multiple fields.
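As an illustration only, the three-concept structure described above can be sketched as a Boolean query builder. The term lists below reuse the example keywords quoted in the text; they are not the full registered search strategy, and the helper names (`or_block`, `build_query`) are hypothetical. Database-specific field tags and MeSH syntax are deliberately omitted.

```python
def or_block(terms):
    """Join synonyms for one concept with OR, quoting multi-word phrases."""
    quoted = [f'"{t}"' if " " in t else t for t in terms]
    return "(" + " OR ".join(quoted) + ")"


def build_query(concepts):
    """Combine the concept blocks with AND so every concept must be matched."""
    return " AND ".join(or_block(terms) for terms in concepts)


# Example terms taken from the text; the real strategy contains more terms.
CONDITION = ["dementia", "mild cognitive impairment", "cognitive dysfunction", "Alzheimer*"]
ASSESSMENT = ["neuropsychological tests", "cognitive screening"]
TECHNOLOGY = ["digital health", "mobile applications", "digital biomarkers"]

query = build_query([CONDITION, ASSESSMENT, TECHNOLOGY])
print(query)
```

Within a concept, OR broadens retrieval across synonyms; between concepts, AND restricts results to records that address all three.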

Most of the included studies were cross-sectional in design. Longitudinal or interventional studies were not included, as the primary objective of this review was to examine the validity and applicability of digital cognitive screening tools rather than intervention effects or longitudinal change trajectories. This criterion was defined a priori to ensure transparency in study design selection.

Eligibility criteria and selection procedure

This systematic review included studies that evaluated digital cognitive assessment tools for older adults with a focus on their validity and applicability. The inclusion criteria were as follows: (a) studies involving older adults, including those with mixed-age samples where older participants were the primary focus; (b) studies using digital cognitive assessment tools; and (c) studies that assessed diagnostic performance for identifying cognitive impairment. Review articles, study protocols, nonhuman studies, and studies unrelated to cognitive assessment were excluded.

However, direct evidence for nurse-led or community-based implementation of these tools remains limited. Self-administered and remote monitoring options have demonstrated growing feasibility, offering valuable opportunities for continuous cognitive screening in community-dwelling older adults. Future research should prioritize usability testing among older adults, training programs for nurses, and the acceptability of digital devices among healthcare professionals.

All retrieved studies were screened against the predefined inclusion and exclusion criteria, and duplicates were removed using a bibliographic management program (EndNote 20). In the first round, after duplicate removal, studies unrelated to the research topic were excluded based on title and abstract; when eligibility was difficult to judge at this stage, the full text was reviewed. In the second round, the full texts of the studies retained from the first round were reviewed, and those suitable for the research topic were selected. Two researchers independently conducted the literature search and selection processes. Disagreements about study selection were resolved by consensus through discussion.

Data extraction

We extracted data regarding the characteristics of the included studies (such as study design, sample size, and participant demographics), intervention details (including the type of digital cognitive assessment and setting), and outcomes of interest.

The primary outcomes of interest included measures of diagnostic validity, such as sensitivity, specificity, and area under the curve (AUC). In addition, extracted data included the statistical associations with traditional cognitive tests, such as correlation coefficients and score differences between digital and conventional assessments. Furthermore, information related to feasibility of the tools was collected, including self-administration formats and the general utility rates.
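For readers less familiar with these metrics, the following minimal sketch shows how sensitivity, specificity, and AUC are computed from a set of test scores and reference diagnoses. The data are invented for illustration and do not come from any included study; the sketch assumes that higher scores indicate greater likelihood of impairment, and the function names are hypothetical.

```python
def sensitivity_specificity(labels, scores, threshold):
    """labels: 1 = impaired, 0 = healthy. A score >= threshold is a positive screen."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def auc(labels, scores):
    """Probability that a random impaired case outscores a random healthy one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Invented example: four impaired and four healthy participants.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.4, 0.7, 0.3, 0.2, 0.1]

sens, spec = sensitivity_specificity(labels, scores, threshold=0.5)
area = auc(labels, scores)
print(sens, spec, area)  # 0.75 0.75 0.875
```

Sensitivity and specificity depend on the chosen cut-off, whereas AUC summarizes discrimination across all possible cut-offs, which is why the two are usually reported together.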

Study risk of bias assessment

Due to methodological heterogeneity, only a narrative synthesis was conducted rather than a meta-analysis.

The risk of bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool for studies that evaluated outcomes related to diagnostic accuracy. Two reviewers independently performed data extraction and quality appraisal, evaluating each study across the following four domains: patient selection, index test, reference standard, and flow and timing. Discrepancies were resolved through discussion and consensus or, where needed, consultation with a third reviewer. Inter-rater agreement was assessed using Cohen’s kappa coefficient to confirm the reliability of the review process.
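Cohen’s kappa corrects raw agreement for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement and p_e is the proportion expected from each rater’s marginal frequencies. A minimal sketch follows, using invented risk-of-bias judgments for ten hypothetical studies rather than the ratings from this review.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters on the same items."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    categories = set(rater_a) | set(rater_b)
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)
    # Undefined when expected == 1 (both raters used a single identical category).
    return (observed - expected) / (1 - expected)


# Invented judgments for one QUADAS-2 domain across ten studies.
reviewer_1 = ["low"] * 6 + ["high"] * 4
reviewer_2 = ["low"] * 5 + ["high"] * 4 + ["low"]

kappa = cohens_kappa(reviewer_1, reviewer_2)
print(round(kappa, 3))  # 0.583
```

Here the reviewers agree on 8 of 10 studies (p_o = 0.80), but chance alone would produce p_e = 0.52, so kappa is noticeably lower than the raw agreement.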

Synthesis method

We conducted a narrative synthesis owing to heterogeneity in study designs, participant characteristics, assessment tools, and reported outcomes. The studies were grouped into three categories based on the type of digital cognitive assessment tool used: (a) digital adaptations of traditional neuropsychological tests; (b) newly developed digital cognitive assessments; and (c) cognitive assessments using digital biomarkers. The first category included studies that digitized well-established neuropsychological tests. The second category comprised newly developed digital assessments that are not direct digital versions of traditional tests. The third category included studies that used digital biomarkers to assess cognitive function in a continuous and unobtrusive manner.

Within each category, the key findings related to diagnostic performance, feasibility, and unique technological features were summarized. Results are presented in descriptive tables and synthesized narratively to highlight similarities, differences, and notable trends across studies.

Results

A total of 48 studies examining various digital cognitive assessment tools for older adults were included in this systematic review (Figure 1). The general characteristics of the included studies are summarized in Table 1. The included studies were published between 2004 and 2024, with the majority employing a cross-sectional design (n=42) and a few (n=6) following a longitudinal approach. The sample sizes varied significantly, ranging from as few as 13 participants in pilot studies to over 900 participants in large-scale studies. The mean age of participants for each study was in the older adult range, with most studies focusing on individuals aged 60 and above, although some included younger adults for comparison. The participants were categorized into different cognitive status groups: healthy controls (HC), mild cognitive impairment (MCI), and dementia. Some studies further classified individuals into preclinical cognitive decline stages or specific conditions, such as subjective cognitive decline (SCD) and Parkinson’s disease (PD)-related cognitive impairment. Most studies were conducted in high-income countries with limited representation of diverse ethnic or socioeconomic groups.

Figure 1.

Flow chart of the study selection process.

Table 1.

General characteristics of included studies.

Author (Year)	Study design	Participants (n)	Mean age (SD)	Device	Assessment tool	Comparator test	Reference standard
1. Digital adaptation of Traditional Neuropsychological Tests
Haas (2022) (Haas et al., 2022)	Cross-sectional, Correlation	Adults (253)	Total: 46.80 (17.39)	PC	eCOGTEL	COGTEL	Not specified
Haas (2022) (Haas et al., 2022)	Cross-sectional, Correlation	Adults (253)	Older adults: 67.77 (7.03)	PC	eCOGTEL	COGTEL	Not specified
Prange (2022) (Prange & Sonntag, 2022)	Cross-sectional, Diagnostic accuracy	Older adults (40)	74.4 (4.1)	Tablet	dCDT, dTMT, dROCF	N/A	CDT, TMT, ROCF
Ruengchaijatuporn (2022) (Ruengchaijatuporn et al., 2022)	Cross-sectional, Diagnostic accuracy	HC (651), MCI (267)	67 (median)	Tablet	dCDT, dTMT, Cube-Copying Test	N/A	Clinical Diagnosis
Katz (2021) (Katz et al., 2021)	Cross-sectional, Diagnostic accuracy	HC (279), MCI (149)	78.1 (5.2)	Phone	T-MoCA	MoCA	Clinical Diagnosis
Berg (2017) (Berg et al., 2018)	Cross-sectional, Correlation	Older adults with primary memory complaints	71.6 (8.8)	Tablet	eMoCA	MoCA	Not specified
Dahmen (2017) (Dahmen, Cook, Fellows, & Schmitter-Edgecombe, 2017)	Cross-sectional, Diagnostic accuracy	HC (28), MCI (6), PD (7), PD/MCI (3), Others (10)	68.4 (9.3)	Tablet	dTMT	cTMT	Clinical Diagnosis
Scharre (2017) (Scharre, Chang, Nagaraja, Vrettos, & Bornstein, 2017)	Cross-sectional, Diagnostic accuracy	HC (21), MCI (24), Dementia (21)	75.2 (2.3)	Tablet	eSAGE	SAGE	Clinical Diagnosis
Ruggeri (2016) (Ruggeri, Maguire, Andrews, Martin, & Menon, 2016)	Cross-sectional, Correlation	HC (60)	61.37 (6.44)	Tablet	CUPDE (digital version of SLUMS)	SLUMS	Not specified
	Cross-sectional, Correlation	HC (60)	61.37 (6.44)	Tablet	CUPDE (digital version of SLUMS)	SAGE	Not specified
Wu (2015) (Y.-H. Wu et al., 2015)	Cross-sectional, Correlation	HC (94)	74.6 (7.3)	Tablet	e-CT	Neuropsychological tests	Not specified
Junkkila (2012) (Junkkila, Oja, Laine, & Karrasch, 2012)	Cross-sectional, Diagnostic accuracy	HC (22), MCI (17), AD (19)	HC: 70 (4.48) MCI: 73 (6.3) AD: 73 (6.76)	PC	CANTAB-PAL	CERAD Wordlist Learning Task	Clinical Diagnosis
2. Newly developed digital cognitive assessments
Cattaneo (2024) (Cattaneo et al., 2024)	Longitudinal, Diagnostic accuracy	HC (32), MCI (39)	76.7 (5.1)	Phone	Guttmann Cognitest	MoCA	Clinical Diagnosis
Glenn (2023) (Glenn et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (37), pAD (38)	HC: 72.7 (7.5) pAD: 77.5 (6.6)	Tablet	Neurotrack Cognitive Assessment Battery	MMSE	Clinical Diagnosis
Glenn (2023) (Glenn et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (37), pAD (38)	HC: 72.7 (7.5) pAD: 77.5 (6.6)	Tablet	Neurotrack Cognitive Assessment Battery	HDSR	Clinical Diagnosis
Lesoil (2023) (Lesoil et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (65), AD (50), others (14)	71.3 (NR)	Tablet	Santé-Cerveau digital tool	NCB	Clinical Diagnosis
Muurling (2023) (Muurling et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (57), preAD (27), proAD (37)	HC: 68 (7)	Tablet, AR	The AR app used from the Altoida Digital Biomarker Platform	Neuropsychological tests	Clinical Diagnosis
			preAD: 71 (5)
			proAD: 70 (8)
Na (2023) (Na, Seo, Kim, Yoo, & Lee, 2023)	Cross-sectional, Correlation	HC (73), MCI (93)	72.6 (8.63)	Tablet	Inbrain Cognitive Screening Test	CERAD-K	Clinical Diagnosis
Nie (2023) (Nie et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (61), MCI (46), Dementia (44)	HC: 72.44 (6.07)	Phone	Thoven Cognitive Self-Assessment	C-MoCA	Clinical Diagnosis
			MCI: 73.07 (9.96)
			Dementia: 73.30 (8.72)
Wu (2023) (J. Wu et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (175), MCI (73)	55-75 (range)	Tablet	EOmciSS	N/A	Clinical Diagnosis
Alim-Marvasti (2022) (Alim-Marvasti et al., 2022)	Cross-sectional	HC (38), MCI (34)	HC: 69.1 (5.5)	Phone	M-Cogscore	MMSE-2	Clinical Staging (CDR)
Alim-Marvasti (2022) (Alim-Marvasti et al., 2022)	Cross-sectional	HC (38), MCI (34)	MCI: 69.5 (6.6)	Phone	M-Cogscore	MMSE-2	Clinical Staging (CDR)
Skirrow (2022) (Skirrow et al., 2022)	Longitudinal case-control	HC (78), MCI or mild dementia (73)	HC: 70.37 (4.35)	Phone	Automated Story Recall Task	Wechsler Logical Memory Test	Clinical Diagnosis
			Mild dementia: 69.58 (7.30)			PACC5
			Mild dementia: 69.58 (7.30)			CDR
Papp (2021) (Papp et al., 2021)	Cross-sectional, Correlation	HC (234)	73.7 (10.63)	Tablet, Phone	Boston Remote Assessment for Neurocognitive Health	Neuropsychological tests	Biomarker-based Diagnosis
Mansbach (2020) (Mansbach, Mace, Tanner, & Simmons, 2020)	Cross-sectional, Diagnostic accuracy	HC (NR), Dementia (NR)	77.18 (11.65)	PC	myMemCheck®	BCAT-SF	Clinical Diagnosis
Freedman (2018) (Freedman et al., 2018)	Cross-sectional, Diagnostic accuracy	HC (57), MCI (50)	HC: 75.3 (7.9)	Tablet	Toronto Cognitive Assessment	Neuropsychological tests	Clinical Diagnosis
Freedman (2018) (Freedman et al., 2018)	Cross-sectional, Diagnostic accuracy	HC (57), MCI (50)	MCI: 77.7 (6.5)	Tablet	Toronto Cognitive Assessment	Neuropsychological tests	Clinical Diagnosis
Possin (2018) (Possin et al., 2018)	Cross-sectional, Diagnostic accuracy	HC (137), MCI (72), Dementia (30)	HC: 75.6 (6.3)	Tablet	Brain Health Assessment	MoCA	Clinical Diagnosis
Possin (2018) (Possin et al., 2018)	Cross-sectional, Diagnostic accuracy	HC (137), MCI (72), Dementia (30)	Dementia: 69.1 (9.9)	Tablet	Brain Health Assessment	MoCA	Clinical Diagnosis
Montenegro (2017) (Montenegro & Argyriou, 2017)	Cross-sectional, Validation	HC (NR), AD (NR)	NR	VR	Virtual reality test	Dr. Oz Memory Quiz,	Not specified
						Visual Association Test,
						Dichotic Listening Test
Jongstra (2017) (Jongstra et al., 2017)	Longitudinal, Correlation	Older adults (151)	57.3 (5.3)	Phone	iVitality	TMT	Not specified
Jongstra (2017) (Jongstra et al., 2017)	Longitudinal, Correlation	Older adults (151)	57.3 (5.3)	Phone	iVitality	Stroop test	Not specified
Dougherty (2016) (Dougherty Jr et al., 2010)	Cross-sectional, Diagnostic accuracy	HC (104), MCI (27), AD (84)	59-94 (range)	PC	Computerized Self Test	MMSE	Clinical Diagnosis
Dougherty (2016) (Dougherty Jr et al., 2010)	Cross-sectional, Diagnostic accuracy	HC (104), MCI (27), AD (84)	59-94 (range)	PC	Computerized Self Test	Mini-Cog	Clinical Diagnosis
Rentz (2016) (Rentz et al., 2016)	Cross-sectional, Validation	HC (49)	71.2 (7.6)	Tablet	Computerized Cognitive Composite for Preclinical Alzheimer’s Disease	PACC	Clinical Diagnosis
Ruano (2016) (Ruano et al., 2016)	Longitudinal, Diagnostic accuracy	HC (39), MCI (18), Mild dementia (21)	HC: 67.9 (11.9)	PC	Brain on Track	MMSE	Clinical Diagnosis
Ruano (2016) (Ruano et al., 2016)	Longitudinal, Diagnostic accuracy	HC (39), MCI (18), Mild dementia (21)	MCI/mild dementia: 68.2 (11.8)	PC	Brain on Track	MoCA	Clinical Diagnosis
Zygouris (2016) (Zygouris et al., 2017)	Longitudinal, Diagnostic accuracy	HC (6), MCI (6)	63.75 (NR)	Tablet	Virtual Super Market	Neuropsychological tests	Clinical Diagnosis
Scanlon (2015) (Scanlon, O'Shea, O'Caoimh, & Timmons, 2016)	Cross-sectional, Diagnostic accuracy	HC (20), Mild dementia (40)	HC: 72.5 (12)	Tablet	Computerised cognitive screening	MoCA	Clinical Diagnosis
Scanlon (2015) (Scanlon, O'Shea, O'Caoimh, & Timmons, 2016)	Cross-sectional, Diagnostic accuracy	HC (20), Mild dementia (40)	Mild dementia: 76.5 (12.5)	Tablet	Computerised cognitive screening	MoCA	Clinical Diagnosis
Zorluoglu (2014) (Zorluoglu, Kamasak, Tavacioglu, & Ozanar, 2015)	Cross-sectional, Validation	HC (9), Dementia (14)	HC: 81.78 (4.77)	Phone	Mobile Cognitive Screening	MoCA	Not specified
	Cross-sectional, Validation	HC (9), Dementia (14)	Dementia: 72.55 (9.95)	Phone	Mobile Cognitive Screening	MoCA	Not specified
Brouillette (2013) (Brouillette et al., 2013)	Cross-sectional, Correlation	HC (57)	67.18 (NR)	Phone	Color-Shape Test	Neuropsychological battery	Not specified
Makizako (2013) (Makizako et al., 2013)	Cross-sectional, Correlation	HC (20)	71.6 (4.6)	Tablet	National Center for Geriatrics and Gerontology functional assessment tool	Neuropsychological tests	Clinical Diagnosis
Onoda (2013) (Onoda et al., 2013)	Cross-sectional, Diagnostic accuracy	HC (34), Dementia (50)	HC: 70.7 (5.0)	Tablet	Cognitive Assessment for Dementia, iPad version (CADi)	MMSE	Clinical Diagnosis
Onoda (2013) (Onoda et al., 2013)	Cross-sectional, Diagnostic accuracy	HC (34), Dementia (50)	Dementia: 75.9 (5.6)	Tablet	Cognitive Assessment for Dementia, iPad version (CADi)	FAB	Clinical Diagnosis
Thompson (2012) (Thompson, Barrett, Patterson, & Craig, 2012)	Cross-sectional, Correlation	Older adults (29)	56.2 (NR)	Tablet	Smartphone-Based Puzzle Games	Neuropsychological tests	Not specified
Inoue (2009) (Inoue, Jinbo, Nakamura, Taniguchi, & Urakami, 2009)	Cross-sectional, Diagnostic accuracy	HC (102), AD (72)	HC: 80.0 (5.1)	Compact touchscreen device	Computerized screening battery for Alzheimer’s disease	N/A	Clinical Diagnosis
	Cross-sectional, Diagnostic accuracy	HC (102), AD (72)	AD: 77.1 (5.8)	Compact touchscreen device	Computerized screening battery for Alzheimer’s disease	N/A	Clinical Diagnosis
Dwolatzky (2004) (Dwolatzky et al., 2004)	Cross-sectional, Diagnostic accuracy	HC (39), MCI (30), mild AD (29)	HC: 73.41 (8.0) MCI: 77.15 (6.43) AD: 80.55 (4.91)	PC	Mindstreams battery	Neuropsychological battery	Clinical Diagnosis
3. Cognitive assessment using digital biomarkers
Grammatikopoulou (2024) (Grammatikopoulou et al., 2024)	Cross-sectional, Pilot study	HC (12), SCD (13), MCI (11)	HC: 63.9 (6.4)	Smart IoT	Activity-based biomarker	N/A	Clinical Diagnosis
			SCD: 64.4 (6.4)
			MCI: 69.7 (6.4)
Wang (2024) (J. Wang et al., 2024)	Cross-sectional, Diagnostic accuracy	HC (38), MCI (42), Dementia 421)	74.7 (7.7)	Wearable device	Motion-based biomarker (gait)	N/A	Clinical Diagnosis
Kim (2023) (S. Y. Kim et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (20), MCI (31)	HC: 70.95 (6.02)	VR	Motion-based biomarker (hand, eye)	N/A	Clinical Diagnosis
Kim (2023) (S. Y. Kim et al., 2023)	Cross-sectional, Diagnostic accuracy	HC (20), MCI (31)	MCI: 72.68 (7.75)	VR	Motion-based biomarker (hand, eye)	N/A	Clinical Diagnosis
Wang (2023) (C. Wang, Liu, Li, & Liu, 2023)	Cross-sectional, Diagnostic accuracy	HC (30), MCI (22)	NR	NR	language biomarker	N/A	Clinical Staging (MMSE, CDR)
Hanczár (2022) (Hanczár et al., 2022)	Cross-sectional, Diagnostic accuracy	HC (48), MCI (22)	HC: 71.18 (5.82)	PC	Motion-based biomarker (hand)	N/A	Clinical Diagnosis
Hanczár (2022) (Hanczár et al., 2022)	Cross-sectional, Diagnostic accuracy	HC (48), MCI (22)	MCI: 66.88 (7.25)	PC	Motion-based biomarker (hand)	N/A	Clinical Diagnosis
Parra (2022) (Parra, Granada, & Fernández, 2022)	Longitudinal, Diagnostic accuracy	HC (42), MCI (65)	HC: 72 (6.7)	PC	Motion-based biomarker (eye)	N/A	Clinical Diagnosis
Parra (2022) (Parra, Granada, & Fernández, 2022)	Longitudinal, Diagnostic accuracy	HC (42), MCI (65)	MCI: 73 (6.1)	PC	Motion-based biomarker (eye)	N/A	Clinical Diagnosis
Krebs (2021) (Krebs et al., 2021)	Cross-sectional, Pilot	HC (13)	70.67 (4.75)	PC	Motion-based biomarker (eye)	MoCA	Clinical Staging (MoCA, TMT)
Krebs (2021) (Krebs et al., 2021)	Cross-sectional, Pilot	HC (13)	70.67 (4.75)	PC	Motion-based biomarker (eye)	TMT-B	Clinical Staging (MoCA, TMT)
Konig (2018) (Konig et al., 2018)	Cross-sectional, Diagnostic accuracy	SCI (56), MCI (44), AD (27), mixed (38)	76.2 (NR)	Tablet	language biomarker	N/A	Clinical Diagnosis
Suzumura (2018) (Suzumura et al., 2018)	Cross-sectional, Correlation	HC (48), MCI (15), AD (31)	HC: 73.6 (8.3)	Tablet	Motion-based biomarker (hand)	MMSE	Clinical Diagnosis
			MCI: 74.3 (6.0)
			AD: 74.2 (6.3)
Muller (2017) (Müller, Preische, Heymann, Elbing, & Laske, 2017)	Cross-sectional, Diagnostic accuracy	HC (20), MCI (30), AD (20)	66.9 (10.3)	Tablet	Motion-based biomarker (hand)	Conventional Clock Drawing Test (cCDT)	Clinical Diagnosis
Tung (2014) (Tung et al., 2014)	Cross-sectional	HC (33), AD (19)	HC: 74.0 (1.2)	Phone	Activity-based biomarker	N/A	Clinical Diagnosis
Tung (2014) (Tung et al., 2014)	Cross-sectional	HC (33), AD (19)	AD: 70.7 (2.2)	Phone	Activity-based biomarker	N/A	Clinical Diagnosis

AD: Alzheimer’s disease, A-IADL: Amsterdam Instrumental Activities of Daily Living, AR: Augmented reality, CANTAB: Cambridge Neuropsychological Test Automated Battery, CDR: Clinical Dementia Rating, CDT: Clock Drawing Test, CERAD: Consortium to Establish a Registry for Alzheimer’s Disease assessment packet, CNB: comprehensive neuropsychological battery, COGTEL: Cognitive Telephone Screening Instrument, CT: Cancellation Test, FAB: Frontal Assessment Battery, HC: Healthy Control, HDSR: Revised Hasegawa’s Dementia scale, MCI: Mild Cognitive Impairment, MMSE: Mini-Mental State Examination, MoCA: Montreal Cognitive Assessment, N/A: Not Applicable, NR: Not Reported, PC: Personal Computer, PACC: Preclinical Alzheimer’s Cognitive Composite, PD: Parkinson’s disease, ROCF: Rey-Osterrieth Complex Figure, SAGE: Self-Administered Gerocognitive Exam, SCD: Subjective Cognitive Decline, SLUMS: Saint Louis University Mental State examination, SD: Standard Deviation, TICS: Telephone Interview for Cognitive Status, TMT: Trail Making Test, VR: Virtual reality.

Tablets were the most frequently used devices for cognitive assessments, followed by smartphones and computers. Some studies also incorporated immersive technologies such as virtual reality (VR), augmented reality (AR), or wearable devices to facilitate real-time or continuous cognitive monitoring. In terms of validation, most studies relied on clinical diagnoses established through expert evaluation or standardized diagnostic criteria as the primary reference standard. In addition, many studies compared digital assessments to widely accepted cognitive screening tools such as the MoCA, MMSE, and the Consortium to Establish a Registry for Alzheimer’s Disease Neuropsychological Assessment Battery (CERAD-NAB) to assess their diagnostic accuracy.

1. Digital adaptation of traditional neuropsychological tests

Ten studies explored the digital adaptation of traditional paper-based neuropsychological tests for use on mobile devices and computers (Table 2). These studies included tests such as the Trail Making Test (TMT), Clock Drawing Test (CDT), MoCA, Cognitive Telephone Screening Instrument (COGTEL), Rey-Osterrieth Complex Figure (ROCF), Self-Administered Gerocognitive Exam (SAGE), digital version of the Saint Louis University Mental Status Examination (CUPED), Cambridge Neuropsychological Test Automated Battery (CANTAB) and Cancellation Test (CT). By digitizing these widely used assessments, researchers aimed to improve accessibility, automate scoring, and enhance reliability while maintaining equivalence with traditional versions. The validity and clinical utility of these digital adaptations were assessed in terms of their correlations with paper-based tests, score differences, and diagnostic accuracy.

Table 2.

Evaluation outcomes of digitally adapted traditional neuropsychological tests.

Author (Year)	Assessment tool	Comparison test	Participants	Outcomes
1. Correlation between traditional paper-based neuropsychological tests and their digital adaptations
Haas (2022)	eCOGTEL	COGTEL	Adults	Correlation between eCOGTEL and COGTEL	rs = 0.70 (p<0.001)
			Older adults (sub-group)	Correlation between eCOGTEL and COGTEL	rs = 0.71 (p<0.001)
Berg (2017)	eMOCA	MOCA	Older adults with primary memory complaints	Correlation between eMOCA and MOCA	CCC = 0.84 (p<0.001) r = 0.84 (p<0.001)
Dahmen (2017)	dTMT	cTMT, TICS, FAB	HC, MCI, PD	Correlation between dTMT and cTMT	r = 0.65 (p<0.05)
				Correlation between dTMT and TICS	r = 0.12 (NR; TMT-A)
				Correlation between dTMT and TICS	r = 0.10 (NR; TMT-B)
				Correlation between dTMT and FAB	r = 0.13 (TMT-A)
				Correlation between dTMT and FAB	r = 0.29 (TMT-B)
Scharre (2017)	eSAGE	SAGE	HC, MCI, dementia	Correlation between eSAGE and SAGE	CCC = 0.8687 (NR) rs = 0.882 (p<0.0001)
Ruggeri (2016)	CUPED (digital version of SLUMS)	SLUMS, SAGE	Older adults	Correlation between CUPED and SAGE	rs = 0.44 (p<0.05)
				Correlation between SLUMS and SAGE	rs = 0.54 (p<0.05)
Wu (2015)	eCT	CT, TMT, digit span	HC	Correlation between eCT and CT Correlation between eCT and TMT Correlation between eCT and digit span	rs = 0.63 (p<0.001) rs = -0.32 (p=0.003; TMT-A) rs = -0.41 (p<0.001; TMT-B) rs = -0.10 (p=0.35; Backward) rs = -0.01 (p=0.91; Forward)
2. Comparison of score differences between digital and paper-based version of test
Haas (2022)	eCOGTEL	COGTEL	Adults	Difference between tests	MD = 1.05 (p=0.034)
				Difference between tests	BF10 = 0.70
			Older adults (sub-group)	Difference between tests	MD = 2.54 (p=0.001)
Katz (2021)	T-MOCA	MOCA-22	HC, MCI	Difference between tests	MD = -0.04 (p=0.76)
Berg (2017)	eMOCA	MOCA	Older adults with primary memory complaints	Difference between tests	RMSD = 2.27, t=0.20, p=0.84
Scharre (2017)	eSAGE	SAGE	HC, MCI, dementia	Difference between tests	MD = 1.21 (p<0.0001)
				Difference between tests (in device familiar group)	MD = 0.83 (p=0.029)
				Difference between tests (in device unfamiliar group)	MD = 1.65 (p=0.0002)
Ruggeri (2016)	CUPED	SLUMS	Older adults	Difference between test groups Difference in SAGE scores between test groups	t = 5.93 (p<0.001) Z=-0.702 (p=0.48)
3. Diagnostic accuracy of digital neuropsychological test
Prange (2022)	dCDT, dTMT, dROCF	N/A	HC/Suspicious	Diagnostic accuracy of AI	AUC: 0.877, Accuracy: 87.5%, Precision: 86.0%, Recall: 87.7%, F1-score: 86.7%
Ruengchaijatuporn (2022)	dCDT, dTMT, Cube-Copying Test	N/A	HC/MCI	Diagnostic accuracy of AI (multi-input model)	AUC: 0.836, Accuracy: 81.2%, F1-score: 65.4%
				Diagnostic accuracy of AI (single-input model; CDT)	AUC: 0.743, Accuracy: 74.8%, F1-score: 35.7%
Katz (2021)	T-MOCA	MOCA-22	HC/MCI	Diagnostic accuracy of test (T-MOCA)	AUC: 0.71, Sensitivity: 72%, Specificity: 59%
				Diagnostic accuracy of test (MOCA-22)	AUC: 0.79, Sensitivity: 79% Specificity: 67%
				Comparison of AUC between tests	p=0.002
Dahmen (2017)	dTMT	cTMT	HC/Neurologic	Diagnostic accuracy of test	AUC: 0.65
Scharre (2017)	eSAGE	SAGE	HC/Cognitive impairment	Diagnostic accuracy of test (eSAGE)	AUC: 0.88, Sensitivity: 71%, Specificity: 90%
				Diagnostic accuracy of test (SAGE)	AUC: 0.83, Sensitivity: 69%, Specificity: 86%
				Comparison between tests	AUC (p=0.14), Sensitivity (p=0.65), Specificity (p=0.56)
Junkkila (2012)	CANTAB-PAL	CERAD	HC/MCI/AD	Diagnostic accuracy of test (CANTAB-PAL)	MCI detection: AUC=0.803
				Diagnostic accuracy of test (CANTAB-PAL)	AD detection: AUC=1
				Diagnostic accuracy of test (CERAD)	MCI detection: AUC=0.906
				Diagnostic accuracy of test (CERAD)	AD detection: AUC=0.872

AD: Alzheimer’s disease, BF: Bayes factors, CANTAB: Cambridge Neuropsychological Test Automated Battery, CDT: Clock Drawing Test, CERAD: Consortium to Establish a Registry for Alzheimer’s Disease assessment packet, COGTEL: Cognitive Telephone Screening Instrument, CT: Cancellation Test, FAB: Frontal Assessment Battery, HC: Healthy Control, MCI: Mild Cognitive Impairment, MD: Mean Difference, MMSE: Mini Mental State Examination, MoCA: Montreal Cognitive Assessment, PD: Parkinson’s disease, SAGE: Self-Administered Gerocognitive Exam, RMSD: Root Mean Squared Differences, TICS: Telephone Interview for Cognitive Status, TMT: Trail Making Test.

1.1. Correlation between traditional paper-based neuropsychological tests and their digital adaptations

Six studies examined the correlations between traditional paper-based neuropsychological tests and their digital adaptations.^25,28–32 Among these, five specifically assessed the direct correlation between the same test in both formats, reporting Pearson’s or Spearman’s correlation coefficients ranging from 0.63 to 0.84. Additionally, some studies investigated the correlation between digital cognitive tests and other traditional assessments.^29,32 The correlations of the digital TMT with the TICS (Telephone Interview for Cognitive Status) and the FAB (Frontal Assessment Battery) ranged from 0.10 to 0.29, reflecting weak correlations.²⁹ Another study³² examined the relationship of the eCT with both the TMT and the Digit span test, showing a moderate association between the eCT and the TMT, whereas the correlation with Digit span remained weak.
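The agreement statistics reported in Table 2 can be illustrated with a minimal sketch. It assumes that "CCC" in Table 2 denotes Lin's concordance correlation coefficient, which the included studies may have computed differently; the paired scores and all function names below are hypothetical and are not taken from any included study.

```python
from math import sqrt

def mean(xs):
    return sum(xs) / len(xs)

def pearson(x, y):
    """Pearson's r: linear association between paired scores."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ranks(xs):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    r = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            r[order[k]] = avg
        i = j + 1
    return r

def spearman(x, y):
    """Spearman's rs: Pearson's r computed on the ranks."""
    return pearson(ranks(x), ranks(y))

def lin_ccc(x, y):
    """Lin's CCC: penalizes both scatter and systematic shift between formats."""
    mx, my = mean(x), mean(y)
    n = len(x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    return 2 * sxy / (sxx + syy + (mx - my) ** 2)

# Hypothetical paired scores for the same participants (illustration only).
paper = [22, 25, 27, 18, 29, 24, 20, 26]
digital = [21, 24, 27, 16, 28, 22, 19, 26]

r = pearson(paper, digital)
rs = spearman(paper, digital)
ccc = lin_ccc(paper, digital)
print(f"r={r:.3f}, rs={rs:.3f}, CCC={ccc:.3f}")
```

Because the CCC also penalizes a systematic shift between formats, it cannot exceed Pearson's r for positively correlated scores, which is one reason a study may report both.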

1.2. Comparison of score differences between digital and paper-based version of test

Five studies compared the score differences between the digital and paper-based versions of the cognitive assessments.^{25,28,30,31,33} However, the findings were heterogeneous, with three studies reporting significant differences between the two versions, and two studies reporting no meaningful differences.

Regarding additional analyses, one study categorized the participants into familiar and unfamiliar groups based on their level of device familiarity.³⁰ The MD between the digital and paper-based versions was 0.83 in the device-familiar group and 1.65 in the device-unfamiliar group; this 0.82-point gap between the groups was not statistically significant. Another study conducted additional analyses focusing on older adults.²⁵ Although the overall mean difference between the eCOGTEL and COGTEL scores was 1.05, this difference was more pronounced in the older adult subgroup, in which the MD increased to 2.54.
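The two difference statistics that appear in Table 2, the mean difference (MD) and the root mean squared difference (RMSD), can be sketched as follows. The sign convention (digital minus paper) is an assumption, since the included studies do not all state theirs, and the scores are hypothetical.

```python
from math import sqrt

def mean_difference(digital, paper):
    """Average signed difference (assumed convention: digital minus paper)."""
    diffs = [d - p for d, p in zip(digital, paper)]
    return sum(diffs) / len(diffs)

def rmsd(digital, paper):
    """Root mean squared difference: typical size of disagreement, ignoring sign."""
    squared = [(d - p) ** 2 for d, p in zip(digital, paper)]
    return sqrt(sum(squared) / len(squared))

# Hypothetical paired scores for four participants (illustration only).
paper = [22, 25, 27, 18]
digital = [21, 24, 27, 16]

md = mean_difference(digital, paper)
spread = rmsd(digital, paper)
print(f"MD={md:.2f}, RMSD={spread:.2f}")
```

An MD near zero can coexist with a sizeable RMSD when positive and negative differences cancel, so the two statistics answer different questions about equivalence.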

1.3. Diagnostic accuracy of digital neuropsychological test

Six studies evaluated the diagnostic accuracy of digital neuropsychological tests.^{29,30,33–36} Among these, four studies focused on the diagnostic performance of the tests themselves, while two assessed the diagnostic accuracy of AI models using these neuropsychological tests. The AI-related studies were published in 2022, whereas the test-based studies were published between 2012 and 2021. Four studies investigated the distinction between healthy controls and patients with MCI, whereas three studies distinguished healthy controls from individuals with suspected cognitive impairment.

Across all studies, the AUC values ranged from 0.65 to 1, indicating moderate to high diagnostic accuracy. The sensitivity ranged from 62.5% to 87.5%, whereas the specificity varied from 59% to 90%. Of the two studies that assessed both digital and traditional tests, one reported that the AUC for the digital test was not significantly different from that of the traditional test (p = 0.14).³⁰ In contrast, the other reported superior diagnostic accuracy for the traditional paper-based version.³³ Notably, one study compared the diagnostic performance of multi-input and single-input AI models.³⁵ The multi-input AI model, which used data from three neuropsychological tests, outperformed the single-input model, which relied on only one test, across all reported metrics.
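For readers less familiar with these metrics, the following minimal sketch shows how sensitivity, specificity, and AUC relate to raw test scores. It assumes that lower scores indicate impairment and that a score at or below the cut-off counts as screen-positive; the scores, labels, and cut-off are hypothetical and do not come from any included study.

```python
def sensitivity_specificity(scores, labels, cutoff):
    """labels: 1 = impaired, 0 = healthy. Score <= cutoff is screen-positive."""
    tp = sum(1 for s, l in zip(scores, labels) if l == 1 and s <= cutoff)
    fn = sum(1 for s, l in zip(scores, labels) if l == 1 and s > cutoff)
    tn = sum(1 for s, l in zip(scores, labels) if l == 0 and s > cutoff)
    fp = sum(1 for s, l in zip(scores, labels) if l == 0 and s <= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def auc(scores, labels):
    """Probability that a random impaired participant scores lower than a
    random healthy one, counting ties as one half (rank-based AUC)."""
    impaired = [s for s, l in zip(scores, labels) if l == 1]
    healthy = [s for s, l in zip(scores, labels) if l == 0]
    wins = 0.0
    for i in impaired:
        for h in healthy:
            if i < h:
                wins += 1.0
            elif i == h:
                wins += 0.5
    return wins / (len(impaired) * len(healthy))

# Hypothetical scores: five healthy controls followed by five impaired participants.
scores = [28, 27, 25, 24, 26, 22, 21, 25, 19, 23]
labels = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

sens, spec = sensitivity_specificity(scores, labels, cutoff=24)
area = auc(scores, labels)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}, AUC={area:.2f}")
```

Sensitivity and specificity depend on the chosen cut-off, whereas the AUC summarizes discrimination across all cut-offs, which is why studies reporting the same AUC can still report different sensitivity and specificity pairs.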

Across the three categories, most tools demonstrated acceptable diagnostic accuracy (AUC = 0.65–0.95), though with variation in feasibility and ecological validity. Digitally adapted neuropsychological tests showed strong correlation with traditional formats but limited innovation in usability. Newly developed digital tools provided higher feasibility through self-administration and automation, while digital biomarkers enabled continuous, real-world monitoring but lacked longitudinal validation. These comparative findings suggest that diagnostic accuracy alone does not fully capture the practical value of each approach; rather, integration of validity, feasibility, and technological adaptability is essential for community application.

2. Newly developed digital cognitive assessments

Twenty-seven studies described newly developed digital cognitive tests designed for older adults.^{4,14–16,37–59} These studies often incorporated a combination of novel tasks and adapted versions of existing cognitive subtests (Table 3).

Table 3.

Diagnostic accuracy and features of newly developed digital cognitive assessment tools.

Author	Participant	Assessment Tool	Cognitive domain	Unique feature of tests	Outcomes
Cattaneo (2024)	HC/MCI	Guttmann Cognitest	• Composite	- Under supervision	AUC=0.84, Sensitivity=72%, Specificity=91%
			- Memory	- Practice trial
			- Executive function
			- Visuospatial
Glenn (2023)	HC/pAD	Neurotrack Cognitive Assessment Battery	• Composite	- Self-administered	Sensitivity=100%, Specificity=100%
			- Memory	- Automated scoring
			- Processing speed	- Digital biomarker integration (eye-tracking)
			- Executive function
Lesoil (2023)	HC/Cognitive impairment	Santé-Cerveau digital tool	• Memory	- Self-administered (with initial supervision)	Sensitivity=96.8%, Specificity=90.7%
			• Attention/Processing speed	- Practice trial
			• Executive function	- Automated scoring and reporting
				- AI-assisted diagnosis
Muurling (2023)	HC/AD	The AR app used from the Altoida Digital Biomarker Platform	• Composite (not specified)	- Self-administered	proAD detection: AUC=0.84, preAD detection: AUC=0.66
				- home-based
				- Augmented Reality (AR)-based
				- Automated digital scoring
				- AI-assisted diagnosis
				- digital biomarker (auxiliary)
Na (2023)	HC/MCI	Inbrain Cognitive Screening Test	• Memory	- Self-administered	Correlation r=0.23–0.72 (p<0.05)
			• Attention
			• Language
			• Executive function
			• Visuospatial
Nie (2023)	HC/MCI/Dementia	Thoven Cognitive Self-Assessment	• Composite	- Self-administered	AUC=0.81–0.94
			- Memory	- Two-step screening
			- Attention/Processing speed	- Automated scoring	Sensitivity=75.0%–89.3%
			- Language	- Gamification-like interface
			- Others (Calculation, Orientation)	- High usability (100% Completion rate)	Specificity=63.4%–95.1%
Wu (2023)	HC/MCI	EOmciSS	• Composite	- Self-administered	AUC=0.912
			- Memory	- Self-paced
			- Attention/Processing speed	- Automated scoring	Sensitivity=84.9%
			- Executive function	- Depressive mood screening	Specificity=85.1% (cut-off 7.90)
			- Visuospatial
Alim-Marvasti (2022)	HC/MCI	M-Cogscore	• Composite	- Self-administered	AUC=0.85
			- Memory	- Automated scoring
			- Attention/Processing speed	- Time-efficient (5 minutes)	Sensitivity=82%
			- Executive function		Specificity=74%
Skirrow (2022)	HC/MCI/mild AD	Automated Story Recall Task	• Memory	- Self-administered	Correlation: mean rho=0.73
				- AI-assisted scoring (speech content analysis)
				- Alternative version
				- Longitudinal assessment
Papp (2021)	HC	Boston Remote Assessment for Neurocognitive Health	• Composite	- Self-administered	Correlation r=0.617 (p<0.001)
			- Memory	- Automated scoring
			- Attention/Processing speed
Mansbach (2020)	HC/Cognitive impairment (MCI, Dementia)	myMemCheck®	• Not specified	- Self-administered	AUC=0.82
					Sensitivity=80%
					Specificity=67% (cut-off 5)
Freedman (2018)	HC/aMCI	Toronto Cognitive Assessment	• Composite	- Automated scoring and reporting	AUC=0.84
			- Memory
			- Attention
			- Executive function
			- Language
					Sensitivity=80%
					Specificity=79%
			- Visuospatial
			- Others (Orientation)
Possin (2018)	HC/Cognitive impairment (MCI/Dementia)	Brain Health Assessment	• Memory	- Automated scoring and reporting	MCI detection: Sensitivity (at 85% specificity)=100%
			• Executive function
			• Language	- Informant survey (optional)	Dementia detection: Sensitivity (at 85% specificity)=84%
			• Visuospatial
Montenegro (2017)	HC/AD	VR test (no name)	• Memory	- VR-based	Correlation r=0.49–0.83 (p<0.05)
			• Executive function	- Adaptive difficulty levels
Jongstra (2017)	Older adults	iVitality	• Memory	- Self-administered	Correlation rs=0.39–0.50 (p<0.001)
			• Attention/Processing speed	- Home-based
			• Executive function	- Longitudinal assessment (6 month)
Rentz (2016)	HC	Computerized Cognitive Composite for Preclinical Alzheimer’s Disease	• Composite (not specified; memory-focused)	- Self-administered	Association R2=0.168 (p<0.03)
				- Alternative version
				- Reaction time measurement
Ruano (2016)	HC/Cognitive impairment (MCI, Mild dementia)	Brain on Track	• Composite	- Self-administered	AUC=0.75
			- Memory	- Home-based
			- Attention/Processing speed	- Adaptive testing elements
			- Executive function	- Randomized tasks
			- Language	- Automated scoring
			- Visuospatial	- Longitudinal assessment
					Sensitivity=54%
					Specificity=90%
Zygouris (2016)	HC/MCI	Virtual Super Market	• Memory	- Self-administered	Sensitivity=94%
			• Attention/Processing speed	- Home-based
			• Executive function	- VR-based	Specificity=89%
			• Visuospatial	- Longitudinal assessment
Scanlon (2015)	HC/Dementia	Computerised cognitive screening	• Composite	- Self-administered	AUC=0.94
			- Memory		Sensitivity=94%
			- Attention		Specificity=60%
			- Visuospatial
Zorluoglu (2014)	HC/Dementia	Mobile Cognitive Screening	• Composite	- Automated scoring and reporting	Association R²=0.57 (p<0.01)
			- Memory
			- Attention
			- Executive function
			- Language
			- Others (Calculation, Orientation)
Brouillette (2013)	HC	Color-Shape Test	• Processing speed	- Self-administered	r=0.515 (p<0.01)
Makizako (2013)	HC	National Center for Geriatrics and Gerontology functional assessment tool	• Memory	- N/A	Correlation r=0.496–0.842 (p<0.05)
			• Attention/Processing speed
			• Executive function
			• Visuospatial
Onoda (2013)	HC/Dementia	Cognitive Assessment for Dementia, iPad version	• Composite (not specified)	- Self-administered	Sensitivity=96%
				- Reaction time measurement (auxiliary assess)	Specificity=77% (cut-off 7)
Thompson (2012)	HC	Smartphone-Based Puzzle Games	• Memory	- Game-based	Correlation
			• Attention/Processing speed
			• Executive function
			• Language
			• Visuospatial
Dougherty (2010)	HC/Cognitive impairment (MCI, AD)	Computerized Self Test	• Composite	- Self-administered (with caregiver support)	AUC=0.99
			- Memory
			- Attention/Processing speed	- Automated scoring	Sensitivity=99%
			- Executive function
			- Language
			- Visuospatial		Specificity=95%
Inoue (2009)	HC/AD	Computerized screening battery for Alzheimer’s disease	• Memory	- N/A	Sensitivity=96%
			• Executive function		Specificity=86%
			• Others (orientation)
Dwolatzky (2004)	HC/MCI/mild AD	Mindstreams battery	• Memory	- Adaptive difficulty levels	AUC
			• Attention/Processing speed	- Automated scoring
			• Executive function
			• Language
			• Visuospatial	- Digital biomarker (auxiliary)

AD: Alzheimer’s disease, AUC: Area Under the Curve, AR: Augmented reality, HC: Healthy Control, MCI: Mild Cognitive Impairment, N/A: Not Applicable.

Most newly developed digital cognitive assessments reported diagnostic performance metrics with AUC values ranging from 0.75 to 0.94. Sensitivity ranged from 54% to 100%, while specificity varied from 60% to 100%, indicating a generally high, but variable, accuracy across different studies. Several studies explored the association between digital test scores and conventional cognitive measures. Reported correlation coefficients with established tools such as MoCA, MMSE, and PACC (Preclinical Alzheimer’s Cognitive Composite) ranged from 0.39 to 0.84 (p < 0.05), depending on the statistical method used.

Many of the newly developed digital assessments incorporated distinctive technological features to enhance usability, adaptability, and diagnostic precision. Of the 27 tools reviewed, 18 were designed for self-administration, enabling older adults to complete the assessments independently, often at home. Automated scoring was incorporated into 13 tools, with some offering provider feedback to facilitate the timely interpretation of results. Five tools featured adaptive testing or parallel versions to minimize practice effects. Reaction times and other digital biomarkers were integrated into three tools, highlighting their potential to capture subtle performance variations.

Cognitive domain analysis revealed that most newly developed digital cognitive assessments emphasize memory, often in combination with executive function and attention evaluation. Several assessments integrated executive function, attention, and processing speed, highlighting a shift toward broader cognitive profiling. A few tools incorporated reaction time-based tasks or complex cognitive processing paradigms. Thirteen of the 27 assessments adopted composite scores that integrated multiple domains.

3. Cognitive assessment using digital biomarkers

Eleven studies were classified as cognitive assessments using digital biomarkers.^{17–20,60–66} This category includes studies that assess cognitive function by analyzing physiological and behavioral data gathered through either passive monitoring or active engagement with digital devices. Table 4 summarizes the various digital biomarkers categorized by type, including motion-, language-, and activity-based. These biomarkers were evaluated in different settings, such as clinics, smart homes, laboratories, virtual environments, and community settings, with timeframes ranging from single sessions to continuous monitoring. Seven out of eleven studies were conducted in controlled settings such as clinics and laboratories, while the remaining four explored home or community environments, including two using browser-based assessments and two using smart home or community-based settings.

Table 4.

Digital biomarkers for cognitive assessment: Types, applications, and diagnostic performance.

Author (Year)	Biomarker Type	Subtype	Task performed	Purpose	Test Environment	Measurement Frequency	AI-based	Outcome
Grammatikopoulou (2024)	Activity-based biomarker	• Activity duration	• ADL tasks (Meal Preparation, Beverage Preparation, Snack Preparation)	Assist in Cognitive Assessment	Smart home	Single session (for a day)	N/A	AUC=0.59–0.86
		• Appliance usage duration
		• Cabinet open time
		• Sensor activation count
Wang (2024)	Motion-based biomarker (gait)	• Dual-task turn velocity	• Walking test with single-task and dual-task conditions	Independent screening tool	Clinic	Single session	N/A	AUC=0.923
		• Gait speed						Sensitivity=85.7%
		• Stride length						Specificity=84.2%
Kim (2023)	Motion-based biomarker (hand, eye)	• Hand movement speed	• The virtual kiosk test	Assist in screening	Virtual Reality	Single session	Machine learning model used	Sensitivity=100%
		• Proportion of fixation duration						Specificity=83.3%
		• Time to completion
		• The number of errors
Wang (2023)	language biomarker	• Vocabulary	• Existing text dialogue	Assist in screening	Browser	Single session	Machine learning model used	AUC=0.80
		• Syntax and grammar						Sensitivity=77.3%
		• Semantics						Specificity=83.3%
Hanczár (2022)	Motion-based biomarker (hand)	• Velocity	• Digital Trail Making Test A (dTMT-A)	Assist in screening	Browser	Single session	Machine learning model used	Sensitivity=83.6%
		• Acceleration
		• Movement distance	• Paired matching game					Specificity=76.0%
Parra (2022)	Motion-based biomarker (eye)	• Saccade	• Visual short-term memory binding (VSTMB)	Assist in screening	Lab	Longitudinal (annual f/u for 40 month)	N/A	AUC=0.95
		• Fixation						Sensitivity=95%
								Specificity=95%
Krebs (2021)	Motion-based biomarker (eye)	• Saccade	• Puzzle game	Assist in cognitive assessment	Lab	Single session	N/A	Correlation rs=0.83–0.87
		• Fixation
Konig (2018)	language biomarker	• Time locations of first words	• Verbal fluency	Assist in screening	Clinic	Single session	Machine learning model used	AD detection: Accuracy=86%
		• Speech Timing & Distribution	• Picture description
		• Silence & Speech Segments	• Counting backward
			• Free speech tasks
Suzumura (2018)	Motion-based biomarker (hand)	• Tap response time	• Finger tapping tasks	Assist in screening	Clinic	Single session	N/A	Correlation r=-0.5 – -0.36 (p<0.05)
		• Rhythm
		• Contact duration
		• Inter-hand coordination
Muller (2017)	Motion-based biomarker (hand)	• Time-in-air	• Clock Drawing Test (CDT)	Independent screening tool	Clinic	Single session	N/A	AUC=0.88
		• Time-on-surface						Sensitivity=81.3%
		• Total time						Specificity=72.2%
Tung (2014)	Activity-based biomarker (GPS)	• The area and perimeter of the territory covered	• Tracking their movement in daily life	Assist in cognitive assessment	Community	Continuous monitoring (3-5 days)	N/A	Difference between HC and AD
		• Average distance and time away from home

AD: Alzheimer’s disease, ADL: Activities of Daily Living, AUC: Area Under the Curve, GPS: Global Positioning System, HC: Healthy Control, N/A: Not Applicable.

3.1. Motion-based digital biomarkers

Motion-based biomarkers were the most commonly studied and included gait, hand-movement, and eye-movement metrics. Assessment of gait parameters, such as turn velocity and stride length, demonstrated strong accuracy in detecting cognitive impairment (¹⁹; AUC=0.923). Similarly, hand movements observed during neuropsychological tests, including virtual or digital assessments, demonstrated acceptable accuracy in distinguishing cognitive impairment (⁶⁴; AUC=0.88) and significant negative correlations with MMSE scores (⁶³; r=-0.50 to -0.36, p<0.05). Eye movement biomarkers, characterized by saccades and fixation, were assessed through puzzle tasks and visual short-term memory binding tasks; the latter, evaluated in a longitudinal design, showed high accuracy (⁶⁶; AUC=0.95).

3.2. Language digital biomarkers

Language biomarkers, such as syntax and semantic complexity analysis in speech (^17,18), have been explored for cognitive impairment screening (AUC=0.80, Alzheimer’s disease detection accuracy 86%). A study has applied machine learning models to analyze language features and reported high accuracy in distinguishing cognitively impaired individuals from HC.¹⁸

3.3. Activity-based digital biomarkers

Activity-based biomarkers included the assessment of daily living tasks and GPS (Global Positioning System) tracking. One smart home monitoring study²⁰ used sensor-based tracking of household activities. The accuracy of this approach in detecting cognitive impairment varied by task, with AUC values ranging from 0.59 to 0.86 across different ADL (Activities of Daily Living) tasks. Another study⁶⁵ monitored spatial movement patterns in community settings over multiple days.

Risk of bias assessment

The risk of bias for the included studies was evaluated using the QUADAS-2 tool (Table 5). In general, the risk of bias across the four domains was low to moderate in most studies. Many studies were rated low risk in the patient selection domain; however, a substantial number showed a risk of bias due to non-random sampling or unclear inclusion criteria. The index test domain was rated comparatively low risk, although a few studies did not clearly report blinding practices. For the reference standard domain, most studies relied on accepted clinical diagnostic criteria, so the risk of bias was low. Concerns also arose in the flow and timing domain, where variations in assessment intervals and incomplete reporting were observed in several studies. Overall, the included studies exhibit acceptable methodological quality, but their variability in study design and reporting calls for larger, standardized, and rigorously designed validation studies.

Table 5.

Study quality assessment using QUADAS-2.

First author	Year	Risk of bias				Applicability concerns
First author	Year	Patient selection	Index test	Reference standard	Flow and timing	Patient selection	Index test	Reference standard
Haas	2022	1	1	1	1	1	1	1
Prange	2022	1	1	1	1	1	1	1
Ruengchaijatuporn	2022	1	1	1	1	1	1	1
Katz	2021	1	1	1	1	1	1	1
Berg	2017	1	1	1	1	1	1	1
Dahmen	2017	2	1	1	1	1	1	1
Scharre	2017	1	1	1	1	1	1	1
Ruggeri	2016	2	1	1	1	2	1	1
Wu	2015	2	1	1	1	1	1	1
Junkkila	2012	2	1	1	1	2	1	1
Cattaneo	2024	2	1	1	1	1	1	1
Glenn	2023	2	1	1	2	1	1	1
Lesoil	2023	2	1	1	1	1	1	1
Muurling	2023	1	1	1	2	1	1	1
Na	2023	1	1	1	2	1	1	1
Nie	2023	1	1	1	1	1	1	1
Wu	2023	1	1	1	1	1	1	1
Alim-Marvasti	2022	2	1	1	2	1	1	1
Skirrow	2022	2	1	1	1	1	1	1
Papp	2021	1	1	1	2	1	1	1
Mansbach	2020	1	1	1	1	1	1	1
Freedman	2018	1	1	1	1	1	1	1
Possin	2018	2	1	1	1	1	1	1
Montenegro	2017	2	1	1	2	1	1	1
Jongstra	2017	1	1	1	1	2	1	1
Dougherty	2010	2	1	1	1	2	1	1
Rentz	2016	2	1	1	1	2	1	1
Ruano	2016	2	1	1	1	1	1	1
Zygouris	2016	2	1	1	1	1	1	1
Scanlon	2015	2	1	1	1	2	1	1
Zorluoglu	2014	2	1	1	1	2	1	1
Brouillette	2013	1	1	1	1	1	1	1
Makizako	2013	2	1	1	1	2	1	1
Onoda	2013	2	1	1	1	1	1	1
Thompson	2012	2	1	1	1	2	1	1
Inoue	2009	1	1	1	1	1	1	1
Dwolatzky	2004	2	1	1	1	1	1	1
Grammatikopoulou	2024	1	1	1	1	1	1	1
Wang	2024	1	1	1	1	1	1	1
Kim	2023	1	1	1	1	1	1	1
Wang	2023	1	1	1	1	1	1	1
Hanczar	2022	2	1	1	1	1	1	1
Parra	2022	1	1	1	1	1	1	1
Krebs	2021	2	1	1	1	1	1	1
Konig	2018	2	1	1	1	1	1	1
Suzumura	2018	2	1	1	1	1	1	1
Muller	2017	2	1	1	1	1	1	1
Tung	2014	2	1	1	1	2	1	1

1 Low Risk, 2 High Risk, 3 Unclear Risk.

Discussion

With the increasing adoption of digital cognitive assessment tools, nursing professionals face critical decisions regarding their implementation. Available options include relying on digital adaptations of traditional neuropsychological tests, adopting newly developed digital tools, and integrating cognitive assessments with digital biomarkers. This systematic review evaluated the reliability and applicability of these three approaches and provides foundational insights into the implementation of digital cognitive assessment tools in nursing practice and community-based settings. Specifically, this review aims to support nurses in integrating these tools into clinical practice and community healthcare to ensure effective and accurate cognitive assessments in older adults.

The validity and limitations of digital adaptations of traditional neuropsychological tests

Our systematic review suggests that the digital adaptations of neuropsychological tests maintain a strong correlation with their traditional paper-based counterparts, indicating that digital platforms can reliably replicate core cognitive assessments. Specifically, correlation coefficients ranging from 0.63 to 0.84 confirm the robustness of these digital adaptations. In addition, the diagnostic accuracy of digital neuropsychological tests, as reflected by AUC values ranging from 0.65 to 1.0, further supports their effectiveness in distinguishing between healthy individuals and those with cognitive impairment. Similar findings have been reported in previous studies, with the digital CDT (AUC = 0.87;⁶⁷) and visual cognitive assessment test (AUC = 0.77;⁶⁸) demonstrating promising diagnostic accuracy for detecting cognitive impairment. This suggests that digital assessments can serve as viable alternatives to traditional methods, particularly for large-scale screening and remote-testing settings.

However, the differences in scores between the digital and paper-based tests were inconsistent. This highlights the importance of considering various contextual factors when developing digital assessments and suggests that new evaluation standards tailored to digital formats may be necessary. One key factor contributing to this variability is the influence of participant demographics and technological experience. For instance, one study found that individuals unfamiliar with digital devices scored lower on the digital version, with a mean difference of 0.82, although this difference was not statistically significant.³⁰ Another study focusing on older adults found a larger score discrepancy (MD = 2.54) than the general sample (MD = 1.05;²⁵), suggesting that age-related factors, including digital literacy, motor function, and cognitive adaptability, may disproportionately affect older individuals’ performance on digital tests. This observation aligns with the findings of a previous scoping review, which also emphasized that participant characteristics such as age, education, and prior technology experience can significantly impact performance on mobile cognitive assessments.⁷

Considering these discrepancies, several studies have proposed score adjustments to better align digital test scores with traditional assessments.^28,30 One approach suggests that applying a 1-point correction to digital test scores may improve comparability. This underscores the critical need to develop new scoring standards and algorithms specifically tailored for digital cognitive assessments rather than directly applying the criteria and benchmarks used for paper-based versions. Thus, while nurses can choose to implement digital adaptations of traditional neuropsychological tests, given their strong correlation and diagnostic accuracy, it is essential to establish appropriate scoring frameworks that ensure valid and reliable assessments in digital formats.
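The idea of a fixed score correction can be illustrated with a minimal sketch. It assumes that digital scores run lower than paper-based scores and that the correction is a simple additive offset applied to every digital score; the direction, the offset value, and the scores are hypothetical, and the cited studies may define their adjustments differently.

```python
def mean_difference(digital, paper):
    """Average signed difference (assumed convention: digital minus paper)."""
    return sum(d - p for d, p in zip(digital, paper)) / len(paper)

def apply_offset(digital, offset):
    """Add a fixed number of points to every digital score (hypothetical rule)."""
    return [d + offset for d in digital]

# Hypothetical paired scores in which the digital version runs one point lower.
paper = [22, 25, 27, 18, 29, 24]
digital = [21, 24, 26, 17, 28, 23]

before = mean_difference(digital, paper)
corrected = apply_offset(digital, 1)
after = mean_difference(corrected, paper)
print(f"MD before={before:.2f}, MD after={after:.2f}")
```

A single offset removes only an average shift. It cannot account for discrepancies that differ by age or device familiarity, which is the case for dedicated digital scoring standards.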

The advantages and challenges of newly developed digital cognitive assessment tools

Newly developed digital cognitive assessments for older adults provide alternatives to traditional neuropsychological tests by enhancing accessibility, reducing the need for professional oversight, and incorporating broader cognitive domains. Consequently, digital assessments may be more suitable for nursing and community-based cognitive screening initiatives. Many newly developed digital cognitive tools offer self-administered testing with automatic scoring and AI-assisted preliminary diagnoses, thereby significantly reducing the need for professional oversight during the early stages of assessment. Moreover, the integration of adaptive testing strengthens the applicability of these tools in nursing practice by tailoring assessments to individual performance levels in real time. This shift enhances accessibility and allows more widespread use in community healthcare settings, including nursing-led initiatives.

In addition to their accessibility, these new digital assessments demonstrate strong diagnostic accuracy. The reported AUC values for these tools range from 0.75 to 0.94, indicating their effectiveness in distinguishing between healthy individuals and those with cognitive impairment. Traditional neuropsychological assessments, including digitized versions, also maintain high diagnostic reliability, with AUC values of 0.65–1.0 in this review, supporting their continued relevance alongside newer tools. Compared with a previous review reporting variable diagnostic accuracy for remote cognitive assessments (sensitivity, 26–100%; specificity, 58–100%;⁸), the current review included studies with higher minimum sensitivity and specificity (sensitivity, 54–100%; specificity, 60–100%). This improvement may reflect recent technological advancements that enable more stable and user-friendly digital environments, such as tablet- and smartphone-based applications. Enhanced interface design and improved processing capabilities likely help minimize technical limitations and optimize test performance.
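For readers less familiar with these metrics, the sketch below shows one way sensitivity, specificity, and AUC can be computed from test scores and a reference diagnosis. It assumes that lower scores indicate impairment and uses a rank-based (Mann-Whitney) definition of AUC; the scores, the cutoff, and the function names are hypothetical and are not drawn from any included study.

```python
def sensitivity_specificity(scores, impaired, cutoff):
    """Treat score <= cutoff as screen-positive; return (sensitivity, specificity)."""
    pairs = list(zip(scores, impaired))
    tp = sum(1 for s, y in pairs if y and s <= cutoff)
    fn = sum(1 for s, y in pairs if y and s > cutoff)
    tn = sum(1 for s, y in pairs if not y and s > cutoff)
    fp = sum(1 for s, y in pairs if not y and s <= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def auc(scores, impaired):
    """Probability that an impaired person scores below an unimpaired one (ties count 0.5)."""
    pos = [s for s, y in zip(scores, impaired) if y]
    neg = [s for s, y in zip(scores, impaired) if not y]
    wins = sum(1.0 if p < n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical scores and reference diagnoses (True = cognitively impaired).
scores = [18, 22, 25, 21, 27, 29, 24, 26]
impaired = [True, True, True, True, False, False, False, False]

sens, spec = sensitivity_specificity(scores, impaired, cutoff=24)
area = auc(scores, impaired)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}, AUC={area:.4f}")
```

Sensitivity and specificity depend on the chosen cutoff, whereas AUC summarizes discrimination across all cutoffs, which is why the two kinds of range are reported separately.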

Digital cognitive assessments introduce additional advantages that improve the flexibility and precision of cognitive evaluations. Adaptive testing is a key innovation that dynamically adjusts task difficulty based on individual performance, leading to a more tailored assessment. Although only a few tools currently employ adaptive approaches,^14,15,59 this method holds promise for improving sensitivity to mild cognitive deficits, particularly in populations with diverse baseline measurements. Additionally, digital platforms mitigate common issues associated with traditional assessments, such as practice effects, in which repeated exposure leads to artificial score improvements.^45,69 By incorporating parallel test versions and randomized item sequences, digital assessments can enhance the reliability and validity of cognitive evaluations.
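The two mechanisms described above can be illustrated with a short Python sketch: a simple one-up/one-down staircase that adjusts difficulty after each response, and a seeded random draw that produces a different but reproducible item set for each session. This is a simplified illustration under stated assumptions; the difficulty scale (1 to 10), the function names, and the item pool are hypothetical, and the cited tools may use other adaptive methods.

```python
import random

def next_difficulty(level, correct, min_level=1, max_level=10):
    """One-up/one-down staircase: harder after a correct answer, easier after an error."""
    step = 1 if correct else -1
    return max(min_level, min(max_level, level + step))

def run_staircase(responses, start_level=5):
    """Return the difficulty level presented on each trial for a list of True/False responses."""
    levels = [start_level]
    for correct in responses[:-1]:
        levels.append(next_difficulty(levels[-1], correct))
    return levels

def randomized_form(item_pool, n_items, seed):
    """Draw a reproducible random subset and order of items for one session."""
    rng = random.Random(seed)
    return rng.sample(item_pool, n_items)

# Hypothetical session: correct, correct, error, correct.
print(run_staircase([True, True, False, True]))

# Hypothetical pool of 20 item identifiers; each session seed gives its own form.
pool = list(range(20))
session_1 = randomized_form(pool, 5, seed=1)
session_2 = randomized_form(pool, 5, seed=2)
```

Seeding each session separately keeps every form reproducible for auditing while still varying item content and order across repeated administrations.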

Another critical advantage of digital cognitive assessments is their ability to evaluate a wider range of cognitive domains than traditional tools. Conventional assessments, such as the MMSE and MoCA, primarily focus on memory, language, and attention, making them effective for detecting moderate-to-severe cognitive impairment, but less sensitive to early cognitive decline.⁹ However, identifying subtle cognitive changes in a community setting is important, as early detection enables timely interventions that can slow progression and support independent living.¹⁰ In this context, newly developed digital assessments are especially valuable as they incorporate broader cognitive domains, including executive function, visuospatial abilities, and processing speed, enhancing sensitivity to mild impairments. Their applicability to community-based screening provides nurses with practical tools for identifying at-risk individuals and supporting their cognitive health.

Despite these improvements, some limitations remain for the implementation of digital cognitive assessments, particularly in detecting subtle cognitive changes in real-world settings and accounting for natural performance variability. Cognitive performance fluctuates over time, making it challenging to draw reliable conclusions from a single test.¹³ As a result, there is growing recognition of the need for performance-based approaches that capture behavioral trends over time, rather than relying solely on isolated scores.⁷ In this context, incorporation of digital biomarkers has been initiated to address the limitations of single-time, score-based cognitive assessments and to provide a more comprehensive understanding of cognitive health.^38,40
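As a simple illustration of a trend-based approach, the sketch below fits an ordinary least-squares slope to repeated scores from one person, so that change over time is summarized rather than any single score. The assessment schedule, the scores, and the function name are hypothetical, and a real monitoring system would also need to account for measurement error and practice effects.

```python
def trend_slope(times, scores):
    """Ordinary least-squares slope of scores against time (score units per time unit)."""
    n = len(times)
    if n < 2 or n != len(scores):
        raise ValueError("need at least two paired observations")
    mean_t = sum(times) / n
    mean_s = sum(scores) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (s - mean_s) for t, s in zip(times, scores))
    return sxy / sxx

# Hypothetical repeated assessments: months since baseline and test scores.
months = [0, 3, 6, 9, 12]
scores = [28, 27, 27, 26, 25]

slope = trend_slope(months, scores)
print(f"estimated change: {slope:.3f} points per month")
```

A negative slope sustained over several sessions carries more information than one low score, which may simply reflect day-to-day variability.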

The emerging role of digital biomarkers as complementary tools for cognitive function monitoring

Through our examination of digital biomarker studies, we found that motion-, language-, and activity-based biomarkers are commonly used to detect cognitive impairment. Diagnostic accuracy varied across studies, with activity biomarkers demonstrating AUC values between 0.59 and 0.86, while eye-tracking biomarkers achieved AUC values up to 0.95 in specific visual tasks. Although biomarker utilization for cognitive assessments has increased, validation studies are limited. As shown in Table 4, most studies were conducted in controlled environments such as clinics and laboratories, often as single-session assessments, with only a few⁶⁶ incorporating longitudinal or real-world designs. These limitations highlight the need for further studies to establish the clinical utility of digital biomarkers in daily clinical settings.

Home- and community-based assessments offer greater potential for scalability and continuous cognitive monitoring than assessments in clinical settings. Technologies such as smart homes and wearable sensors enhance accessibility, particularly for older adults facing barriers to frequent clinic visits.^20,70 Previous studies demonstrated the feasibility of IoT-based systems for monitoring daily activities and behavioral patterns in home environments,⁷¹ supporting their potential for cognitive health management. Community nurses can enhance the use of these technologies by providing patient education and participating in home-based assessments.^72,73 Nurses can also assist with digital biomarker data interpretation to guide personalized care and facilitate early referrals when appropriate. However, it is important to note that direct empirical evidence supporting the role of community health professionals, such as nurses, in implementing digital cognitive assessment tools remains limited. Most of the included studies focused on tool validation in controlled or clinical environments rather than real-world, nurse-led applications. Therefore, while the potential role of nurses in facilitating digital cognitive screening and monitoring is promising, further research is required to establish their effectiveness and practical applicability in community-based settings.

Digital biomarkers offer another distinct advantage: they can provide continuous, non-invasive assessment.¹¹ Rather than capturing cognitive status at a single point in time, digital biomarkers rely on passive tracking that yields insight into cognitive performance over time, without depending on single-session, score-based tests.¹³ Similarly, Koo and Vizer (2019) recognized that longitudinal follow-up of behavioral trends may improve the detection of cognitive impairment in the early stages of cognitive deterioration. With the support of community-based healthcare professionals, digital biomarkers hold considerable promise for promoting proactive and preventive cognitive health strategies in older populations.

Nevertheless, although digital biomarkers offer significant benefits, they are not yet ready to replace existing cognitive tests. Rather, they should serve as adjunctive tools that complement traditional tests in early detection and long-term monitoring. Most studies assessing digital biomarkers aimed to support cognitive screening rather than to provide independent diagnostic capability. Longitudinal, pragmatic studies in real-world community contexts are needed to confirm the clinical utility of digital biomarkers.

Implications for integrated cognitive assessment strategies in nursing practice

This systematic review assessed the strengths and limitations of digitizing traditional neuropsychological tests and of developing new cognitive assessments. Despite advances in both approaches, they still fall short in sensitivity to subtle cognitive changes and in real-world applicability. As digital biomarkers evolve, integrating them with digital cognitive tests could support continuous cognitive monitoring, greater access, and early detection of cognitive decline, which is especially relevant in home- and community-based settings.

Community and public health nurses can play a key role in linking digital cognitive assessment tools to nursing practice. In real-world settings, these tools could be incorporated into routine health checkups, home visits, and chronic disease management programs to support early cognitive screening. Training programs should be developed to strengthen nurses’ digital literacy and their ability to administer assessments, interpret results, and follow up on them. Additionally, collaboration among nurses, primary care physicians, and ICT specialists may yield an interdisciplinary model of early detection and intervention. For instance, community nurses may use self-administered digital screening tools to identify vulnerable older adults, educate them on promoting cognitive health, and refer those showing early signs of decline for additional assessment. Such nurse-led digital screening initiatives can improve access to care at the community level.

Based on these findings, we propose that the choice between digitizing traditional tests and developing new digital cognitive assessments should depend on the needs of the individual and the specific context of use. Either way, nurses have a vital role to play in introducing these digital tools as part of routine cognitive screening and long-term monitoring. Digital biomarkers used in conjunction with cognitive assessment tools may help nursing professionals identify cognitive decline sooner and take prompt action at the clinical and community levels. Although additional empirical studies are needed to corroborate their clinical utility, implementing these combined strategies in nurse-led cognitive health programs may improve dementia prevention and management at both the patient and community levels.

Future studies could pair novel biomedical and data-driven approaches with digital cognitive assessment technologies for older adults to extend established biomarker methodologies, with a focus on improving early detection and ongoing monitoring of cognitive impairment. Recent findings have illustrated the promise of sophisticated biomarker systems and multidisciplinary technological approaches to enhance diagnostic accuracy and address current constraints on real-world application.^74,75 Incorporating these approaches may also enhance the clinical utility and scalability of digital cognitive screening in community-based settings. Moreover, new biomedical technologies, including nanocarrier-based systems designed to bypass blood-brain barrier limitations, could offer adjunctive means of augmenting the sensitivity and ecological validity of cognitive assessment strategies. While these technologies have mostly been studied in therapeutic settings, linking them with digital health interventions might support more accurate and continuous surveillance of neurological disorders. Nevertheless, safety, biocompatibility, and clinical scalability remain important concerns that require translational research.⁷⁵

Limitations

This study has several limitations. First, the majority of included studies were carried out in controlled laboratory or clinical settings, often with small samples and without rigorous real-world or longitudinal designs, which limits the applicability of the findings. This methodological limitation highlights an important gap in the current literature and calls for future validation studies in community-based and naturalistic settings to establish the ecological validity and practical applicability of digital cognitive assessments. Second, the included studies were highly heterogeneous in population, assessment environment, and type of digital cognitive tool, which may restrict the generalizability of the findings. Third, although we noted the potential involvement of community nurses in implementing digital cognitive assessments and biomarkers, empirical studies on the participation of these practitioners are still lacking in the literature. Validation studies in real community-based settings need to be conducted before these tools can be put into practice, and policy frameworks should address the safe use, data governance, and system-level implementation of digital cognitive assessment tools in healthcare practice. Lastly, the review may be subject to publication bias because it included only peer-reviewed articles indexed in nine electronic databases.

Conclusions

This systematic review examined the validity and applicability of digital cognitive assessment approaches, including digitized traditional tests, newly developed digital tools, and digital biomarkers. Although each method offers distinct advantages, limitations remain regarding sensitivity to subtle cognitive changes and applicability in real-world settings. Based on these findings, we suggest that the choice between digitizing existing tests and developing new assessments should be tailored to specific contexts, with digital biomarkers serving as complementary tools to enhance continuous monitoring and early detection. Policymakers and healthcare organizations should establish guidelines to support the safe, equitable, and effective adoption of digital cognitive tools in community and clinical settings. Further research is needed to establish the clinical utility of these approaches, particularly longitudinal studies in community settings. Before these technologies can be integrated into routine care or implemented nationwide, large-scale validation studies across diverse, real-world, community-based populations are needed. Utilizing digital cognitive assessment strategies in nursing practice may pave the way for more proactive cognitive care for older adults.

Footnotes

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (RS-2026-25489560).

ORCID iD

Seon Heui Lee

Ethical considerations

This study is a systematic review and was exempt from ethical approval by the Gachon University Institutional Review Board (IRB). The exemption approval number is 1044396-202408-HR-138-01.

Author Contributions

Hong, Seunghui: Data curation, Methodology, Writing – original draft, Writing – revision. Kim, ChanHee: Data curation, Validation, Writing – original draft. Lee, Seon Heui: Writing – review and editing, Supervision.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Gachon University research fund of 2024 (GCU-202403980001) and the National Research Foundation of Korea (NRF) (RS-2026-25489560).

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Trial Registration

This systematic review is registered in PROSPERO (Registration No: CRD42024564512, registered on July 2, 2024).

References

1. Yoon, Lee, et al. Global trends of dementia policy. National Dementia Center, 2022.

2. Deckert, Chavira, Selzler, et al. Early detection of cognitive impairment in primary care: Implementation challenges and opportunities. Innovation in Aging 2024; 8(Suppl 1): 880. https://doi.org/10.1093/geroni/igae098.2846

3. Floden, Curran, Hogue, et al. Comparing the Brief Assessment of Cognitive Health and Montreal Cognitive Assessment: Test–retest reliability and sensitivity to cognitive change in older adults. Alzheimer’s & Dementia 2024; 20: e094295. https://doi.org/10.1002/alz.090995

4. Onoda, Hamano, Nabika, et al. Validation of a new mass screening tool for cognitive impairment: Cognitive Assessment for Dementia, iPad version. Clinical Interventions in Aging 2013; 8: 353–360. https://doi.org/10.2147/CIA.S42342

5. Lopez. Mild cognitive impairment. Continuum: Lifelong Learning in Neurology 2013; 19(2): 411–424. https://doi.org/10.1212/01.con.0000429175.29601.97

6. Ministry of Health and Welfare. The 4th Comprehensive Plan for Dementia Management (2021–2025). Ministry of Health and Welfare, 2020.

7. Koo, Vizer. Mobile technology for cognitive assessment of older adults: a scoping review. Innovation in Aging 2019; 3(1): igy038. https://doi.org/10.1093/geroni/igy038

8. Beishon, Elliott, Hietamies, et al. Diagnostic test accuracy of remote, multidomain cognitive assessment (telephone and video call) for dementia. Cochrane Database of Systematic Reviews 2022; 2022(4): CD015436. https://doi.org/10.1002/14651858.cd013724.pub2

9. Carton, Calafiore, Cauet, et al. MoCA use in general practice for the early detection of cognitive impairment. BJGP Open 2024; 8(2): BJGPO.2024.0039.

10. Webster-Cordero, Gimenez-Llort. Main dimensions, myriad of assessment tools and new approaches for early detection of subjective cognitive decline and mild cognitive impairment: a systematic review. Frontiers in Aging Neuroscience 2024; 16: 1397139.

11. Meier, Buegler, Harms, et al. Using a Digital Neuro Signature to measure longitudinal individual-level change in Alzheimer’s disease: the Altoida large cohort study. NPJ Digital Medicine 2021; 4(1): 101. https://doi.org/10.1038/s41746-021-00470-z

12. Kim, et al. A preliminary feasibility study on screening cognitive impairment based on multi-modal biomarkers and stacking ensemble AI approach. Alzheimer’s & Dementia 2023; 19: e082466. https://doi.org/10.1002/alz.082466

13. Allard, Husky, Catheline, et al. Mobile technologies in the early detection of cognitive decline. PLoS One 2014; 9(12): e112197. https://doi.org/10.1371/journal.pone.0112197

14. Ruano, Sousa, Severo, et al. Development of a self-administered web-based test for longitudinal cognitive assessment. Scientific Reports 2016; 6(1): 19114. https://doi.org/10.1038/srep19114

15. Montenegro JMF, Argyriou. Cognitive evaluation for the diagnosis of Alzheimer’s disease based on Turing test and virtual environments. Physiology & Behavior 2017; 173: 42–51. https://doi.org/10.1016/j.physbeh.2017.01.034

16. Papp, Samaroo, Chou, et al. Unsupervised mobile cognitive testing for use in preclinical Alzheimer’s disease. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 2021; 13(1): e12243. https://doi.org/10.1002/dad2.12243

17. Wang, Liu, et al. Text dialogue analysis for primary screening of mild cognitive impairment: development and validation study. Journal of Medical Internet Research 2023; 25: e51501. https://doi.org/10.2196/51501

18. König, Satt, Sorin, et al. Use of speech analyses within a mobile application for the assessment of cognitive impairment in elderly people. Current Alzheimer Research 2018; 15(2): 120–129. https://doi.org/10.2174/1567205014666170829111942

19. Wang, Zhou, Cheng, et al. Dual-task turn velocity: a novel digital biomarker for mild cognitive impairment and dementia. Frontiers in Aging Neuroscience 2024; 16: 1304265. https://doi.org/10.3389/fnagi.2024.1304265

20. Grammatikopoulou, Lazarou, Alepopoulos, et al. Assessing the cognitive decline of people in the spectrum of Alzheimer’s disease by monitoring their activities of daily living in an IoT-enabled smart home environment: a cross-sectional pilot study. Frontiers in Aging Neuroscience 2024; 16: 1375131. https://doi.org/10.3389/fnagi.2024.1375131

21. Raza, Hassan ST, Jamil, et al. Advancements in deep learning for early diagnosis of Alzheimer's disease using multimodal neuroimaging: challenges and future directions. Frontiers in Neuroinformatics. Frontiers in Microbiology 2025; 19: 1557177.

22. Chi, Sharma, Gebhard. Predicting Cognitive Decline: A Multimodal AI Approach to Dementia Screening from Speech. In: Proceedings of the 2025 IEEE International Conference on AI and Data Analytics (ICAD). NeuroDigital Advances, Medford, MA, USA, 2025; 2(1): 1–8. doi: 10.1109/ICAD65464.2025.11114053.

23. Wahul, Ambadekar, Dhanvijay, et al. Multimodal approaches and AI-driven innovations in dementia diagnosis: a systematic review. Discover Artificial Intelligence 2025; 5: 96. doi: 10.1007/s44163-025-00358-x.

24. Park, Lee. Clinical integration of digital biomarkers for cognitive decline: opportunities and challenges. Digital Health and Patient Safety 2025; 13: S496924.

25. Haas, Scheibe, El Khawli, et al. Online assessment of cognitive functioning across the adult lifespan using the eCOGTEL: a reliable alternative to laboratory testing. European Journal of Ageing 2022; 19(2): 1–11. https://doi.org/10.1007/s10433-021-00667-x

26. Lim, Kim, Choi, et al. Analysis of barriers and activating factors of visiting nursing in long-term care insurance. Journal of the Korea Contents Association 2012; 12(8): 283–299. https://doi.org/10.5392/jkca.2012.12.08.283

27. Nelson, McClure, Spiegel. Working in resource-challenged environments. In: Spiegel (ed). Pediatric Lower Limb Deformities: Principles and Techniques of Management. Springer, 2024, pp. 405–425.

28. Berg, Durant, Léger, et al. Comparing the electronic and standard versions of the Montreal Cognitive Assessment in an outpatient memory disorders clinic: a validation study. Journal of Alzheimer’s Disease 2018; 62(1): 93–97. https://doi.org/10.3233/jad-170896

29. Dahmen, Cook, Fellows, et al. An analysis of a digital variant of the Trail Making Test using machine learning techniques. Technology and Health Care 2017; 25(2): 251–264. https://doi.org/10.3233/THC-161274

30. Scharre, Chang, Nagaraja, et al. Digitally translated Self-Administered Gerocognitive Examination (eSAGE): Relationship with its validated paper version, neuropsychological evaluations, and clinical assessments. Alzheimer’s Research & Therapy 2017; 9(1): 1–13. https://doi.org/10.1186/s13195-017-0269-3

31. Ruggeri, Maguire, Andrews, et al. Are we there yet? Exploring the impact of translating cognitive tests for dementia using mobile technology in an aging population. Frontiers in Aging Neuroscience 2016; 8: 21. https://doi.org/10.3389/fnagi.2016.00021

32. Vidal, de Rotrou, et al. A tablet-PC-based cancellation test assessing executive functions in older adults. American Journal of Geriatric Psychiatry 2015; 23(11): 1154–1161. https://doi.org/10.1016/j.jagp.2015.05.012

33. Katz, Wang, Nester, et al. T-MoCA: A valid phone screen for cognitive impairment in diverse community samples. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 2021; 13(1): e12144. https://doi.org/10.1002/dad2.12144

34. Prange, Sonntag. Modeling users’ cognitive performance using digital pen features. Frontiers in Artificial Intelligence 2022; 5: 787179. https://doi.org/10.3389/frai.2022.787179

35. Ruengchaijatuporn, Chatnuntawech, Teerapittayanon, et al. An explainable self-attention deep neural network for detecting mild cognitive impairment using multi-input digital drawing tasks. Alzheimer’s Research & Therapy 2022; 14(1): 111. https://doi.org/10.1186/s13195-022-01043-2

36. Junkkila, Oja, Laine, et al. Applicability of the CANTAB-PAL computerized memory test in identifying amnestic mild cognitive impairment and Alzheimer’s disease. Dementia and Geriatric Cognitive Disorders 2012; 34(2): 83–89. https://doi.org/10.1159/000342116

37. Cattaneo, Roca-Ventura, Heras, et al. Investigating the application of “Guttmann Cognitest”® in older adults and people with acquired brain injury. Frontiers in Neurology 2024; 14: 1292960. https://doi.org/10.3389/fneur.2023.1292960

38. Glenn, Bryk, Myers, et al. The efficacy and practicality of the Neurotrack Cognitive Battery assessment for utilization in clinical settings for the identification of cognitive decline in an older Japanese population. Frontiers in Aging Neuroscience 2023; 15: 1206481. https://doi.org/10.3389/fnagi.2023.1206481

39. Lesoil, Bombois, Guinebretiere, et al. Validation study of “Santé-Cerveau”, a digital tool for early cognitive changes identification. Alzheimer’s Research & Therapy 2023; 15(1): 70. https://doi.org/10.1186/s13195-023-01204-x

40. Muurling, de Boer, Vairavan, et al. Augmented reality versus standard tests to assess cognition and function in early Alzheimer’s disease. NPJ Digital Medicine 2023; 6(1): 234. https://doi.org/10.1038/s41746-023-00978-6

41. Seo, Kim, et al. Correlation analysis between subtest scores of CERAD-K and a newly developed tablet computer-based digital cognitive test (Inbrain CST). Frontiers in Aging Neuroscience 2023; 15: 1178324. https://doi.org/10.3389/fnagi.2023.1178324

42. Nie, Yang, Gao, et al. Newly self-administered two-step tool for screening cognitive function in an ageing Chinese population: an exploratory cross-sectional study. General Psychiatry 2023; 36(1): e100837. https://doi.org/10.1136/gpsych-2022-100837

43. Liu, et al. An Effective Test (EOmciSS) for Screening Older Adults With Mild Cognitive Impairment in a Community Setting: Development and Validation Study. Journal of Medical Internet Research 2023; 25: e40858. https://doi.org/10.2196/40858

44. Alim-Marvasti, Kuleindiren, Harvey, et al. Validation of a rapid remote digital test for impaired cognition using clinical dementia rating and mini-mental state examination: an observational research study. Frontiers in Digital Health 2022; 4: 1029810. https://doi.org/10.3389/fdgth.2022.1029810

45. Skirrow, Meszaros, Meepegama, et al. Validation of a remote and fully automated story recall task to assess for early cognitive impairment in older adults: longitudinal case-control observational study. JMIR Aging 2022; 5(3): e37090. https://doi.org/10.2196/37090

46. Mansbach, Mace, Tanner, et al. A path to early diagnosis of mild cognitive impairment and dementia: validity and reliability of the myMemCheck® self-administered screening tool. Family Practice 2020; 37(4): 561–567. https://doi.org/10.1093/fampra/cmaa014

47. Freedman, Leach, Carmela Tartaglia, et al. The Toronto Cognitive Assessment (TorCA): normative data and validation to detect amnestic mild cognitive impairment. Alzheimer’s Research & Therapy 2018; 10(1): 1–18. https://doi.org/10.1186/s13195-018-0382-y

48. Possin, Moskowitz, Erlhoff, et al. The brain health assessment for detecting and diagnosing neurocognitive disorders. Journal of the American Geriatrics Society 2018; 66(1): 150–156. https://doi.org/10.1111/jgs.15208

49. Jongstra, Wijsman, Cachucho, et al. Cognitive testing in people at increased risk of dementia using a smartphone app: the iVitality proof-of-principle study. JMIR mHealth and uHealth 2017; 5(5): e6939. https://doi.org/10.2196/mhealth.6939

50. Rentz, Dekhtyar, Sherman, et al. The feasibility of at-home iPad cognitive testing for use in clinical trials. Journal of Prevention of Alzheimer’s Disease 2016; 3(1): 8.

51. Zygouris, Ntovas, Giakoumis, et al. A preliminary study on the feasibility of using a virtual reality cognitive training application for remote detection of mild cognitive impairment. Journal of Alzheimer’s Disease 2017; 56(2): 619–627. https://doi.org/10.3233/jad-160518

52. Scanlon, O'Shea, O'Caoimh, et al. Usability and validity of a battery of computerised cognitive screening tests for detecting cognitive impairment. Gerontology 2016; 62(2): 247–252. https://doi.org/10.1159/000433432

53. Zorluoglu, Kamasak, Tavacioglu, et al. A mobile application for cognitive screening of dementia. Computer Methods and Programs in Biomedicine 2015; 118(2): 252–262. https://doi.org/10.1016/j.cmpb.2014.11.004

54. Brouillette, Foil, Fontenot, et al. Feasibility, reliability, and validity of a smartphone-based application for the assessment of cognitive function in the elderly. PLoS One 2013; 8(6): e65925. https://doi.org/10.1371/journal.pone.0065925

55. Makizako, Shimada, Park, et al. Evaluation of multidimensional neurocognitive function using a tablet personal computer: Test–retest reliability and validity in community-dwelling older adults. Geriatrics & Gerontology International 2013; 13(4): 860–866. https://doi.org/10.1111/ggi.12014

56. Thompson, Barrett, Patterson, et al. Examining the neurocognitive validity of commercially available, smartphone-based puzzle games. Psychology 2012; 3(7): 525–526. https://doi.org/10.4236/psych.2012.37076

57. Dougherty Jr, Cannon, Nicholas, et al. The Computerized Self Test (CST): an interactive, Internet-accessible cognitive screening test for dementia. Journal of Alzheimer’s Disease 2010; 20(1): 185–195. https://doi.org/10.3233/JAD-2010-1354

58. Inoue, Jinbo, Nakamura, et al. Development and evaluation of a computerized test battery for Alzheimer's disease screening in community-based settings. American Journal of Alzheimer’s Disease & Other Dementias 2009; 24(2): 129–135. https://doi.org/10.1177/1533317508330222

59. Dwolatzky, Whitehead, Doniger, et al. Validity of the Mindstreams™ computerized cognitive battery for mild cognitive impairment. Journal of Molecular Neuroscience 2004; 24(1): 33–44. https://doi.org/10.1385/jmn:24:1:033

60. Kim, Park, Choi, et al. Digital marker for early screening of mild cognitive impairment through hand and eye movement analysis in virtual reality using machine learning: first validation study. Journal of Medical Internet Research 2023; 25: e48093. https://doi.org/10.2196/48093

61. Hanczár, Griechisch, Ovád, et al. Detection of mild cognitive impairment based on mouse movement data of Trail Making Test. Informatics in Medicine Unlocked 2022; 35: 101120. https://doi.org/10.1016/j.imu.2022.101120

62. Krebs, Falkner, Niklaus, et al. Application of eye tracking in puzzle games for adjunct cognitive markers: pilot observational study in older adults. JMIR Serious Games 2021; 9(1): e24151. https://doi.org/10.2196/24151

63. Suzumura, Osawa, Maeda, et al. Differences among patients with Alzheimer’s disease, older adults with mild cognitive impairment and healthy older adults in finger dexterity. Geriatrics & Gerontology International 2018; 18(6): 907–914. https://doi.org/10.1111/ggi.13277

64. Müller, Preische, Heymann, et al. Increased diagnostic accuracy of digital vs. conventional clock drawing test for discrimination of patients in the early course of Alzheimer’s disease from cognitively healthy individuals. Frontiers in Aging Neuroscience 2017; 9: 101. https://doi.org/10.3389/fnagi.2017.00101

65. Tung, Rose, Gammada, et al. Measuring life space in older adults with mild-to-moderate Alzheimer’s disease using mobile phone GPS. Gerontology 2014; 60(2): 154–162. https://doi.org/10.1159/000355669

66. Parra, Granada, Fernández. Memory-driven eye movements prospectively predict dementia in people at risk of Alzheimer’s disease. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 2022; 14(1): e12386. https://doi.org/10.1002/dad2.12386

67. Chan, Bat BKK, Wong, et al. Evaluation of digital drawing tests and paper-and-pencil drawing tests for the screening of mild cognitive impairment and dementia: a systematic review and meta-analysis of diagnostic studies. Neuropsychology Review 2022; 32(2): 357–376. https://doi.org/10.1007/s11065-021-09523-2

68. Hsu, Liu, Chen, et al. Efficacy of the visual cognitive assessment test for mild cognitive impairment/mild dementia diagnosis: a meta-analysis. Frontiers in Public Health 2023; 11: 1293710. https://doi.org/10.3389/fpubh.2023.1293710

69. Goldberg, Harvey, Wesnes, et al. Practice effects due to serial cognitive assessment: implications for preclinical Alzheimer’s disease randomized controlled trials. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 2015; 1(1): 103–111. https://doi.org/10.1016/j.dadm.2014.11.003

70. Fakhimi, Hughes, Gustavson. Evaluating smart home usability and accessibility in early detection and intervention of mental health challenges among older adults: a narrative review and framework. Journal of Ageing and Longevity 2025; 5(1): 3. https://doi.org/10.3390/jal5010003

71. Sheikhtaheri, Sabermahani. Applications and outcomes of Internet of Things for patients with Alzheimer’s disease/dementia: a scoping review. BioMed Research International 2022; 2022(1): 6274185. https://doi.org/10.1155/2022/6274185

72. Ramadan. Community nursing and the digital technology revolution: the past, present and future. British Journal of Community Nursing 2024; 29(9): 447–450. https://doi.org/10.12968/bjcn.2024.0076

73. Luque, Chivite, Serena, et al. Implementation of COGNIVITRA, an information and communications technology-based solution for dual-task training in patients at risk of cognitive impairment. Applied Sciences 2024; 14(17): 7906. https://doi.org/10.3390/app14177906

74.

Jain

Yadav

Bukke

SPN

, et al. Post-operative infection treatment in cardiac surgery: current practices and future directions. Perioper Med 2025; 14: 110. https://doi.org/10.1186/s13741-025-00580-2

75.

Sarawathi

Mothilal

Bukke

SPN

, et al. Recent advances in potential drug nanocarriers for CNS disorders: a review. BioMed Eng OnLine 2025; 24: 137.