Abstract
Background:
Artificial intelligence-based chatbots (AI chatbots) can potentially improve mental health care, yet factors predicting their adoption and continued use are unclear.
Methods:
We conducted an online survey with a sample of U.S. adults with symptoms of depression and anxiety (N = 393) in 2021 before the release of ChatGPT. We explored factors predicting the adoption and continued use of AI chatbots, including factors of the unified theory of acceptance and use of technology model, stigma, privacy concerns, and AI hesitancy.
Results:
Results from the regression indicated that for nonusers, performance expectancy, price value, descriptive norm, and psychological distress are positively related to the intention of adopting AI chatbots, while AI hesitancy and effort expectancy are negatively associated with adopting AI chatbots. For those with experience in using AI chatbots for mental health, performance expectancy, price value, descriptive norm, and injunctive norm are positively related to the intention of continuing to use AI chatbots.
Conclusions:
Understanding the adoption and continued use of AI chatbots among adults with symptoms of depression and anxiety is essential given that there is a widening gap in the supply and demand of care. AI chatbots provide new opportunities for quality care by supporting accessible, affordable, efficient, and personalized care. This study provides insights for developing and deploying AI chatbots such as ChatGPT in the context of mental health care. Findings could be used to design innovative interventions that encourage the adoption and continued use of AI chatbots among people with symptoms of depression and anxiety and who have difficulty accessing care.
Introduction
Anxiety and depression are among the most common mental health symptoms in the United States. 1,2 Despite their prevalence and negative impact, many individuals do not seek help due to stigma, lack of access, cost, and lack of awareness of available resources. 3 –8 Psychotherapy is usually viewed as an effective treatment for depression and anxiety. Treatments include cognitive behavioral therapy (CBT), positive psychology, problem-solving therapy, and acceptance and commitment therapy. 9 –11 While effective, the estimated treatment gap is 46.2% for anxiety disorders and 57.9% for affective disorders in North America. 12
One of the newest developments in the digital mental health field is the emergence of artificial intelligence-based chatbots (AI chatbots), 13 –15 which are gaining recognition with the release of ChatGPT. 16 –18 AI chatbots are conversational or relational large language model programs that imitate human conversations as they naturally occur, and they have been broadly implemented in e-commerce, health care, and entertainment. 13
Particularly, beginning with a computer program, known as ELIZA, developed to emulate the conversational abilities of a psychotherapist, AI chatbots have become increasingly prevalent in modern psychiatry. 14 Commonly integrated with CBT, 19 AI chatbots vary in complexity and are embedded into diverse forms, including mobile apps, websites, and SMS texting. 20
The most prominent example of AI chatbots for mental health is Woebot, which delivers CBT through instant messaging and mimics the relationship between human clinicians and patients. 21 A review found that AI chatbots effectively improved depression, distress, stress, and acrophobia. 22
In some studies, AI chatbots have been found to be as effective as, or even more effective than, traditional mental health interventions as patients are more likely to self-disclose with AI chatbots. 23 Moreover, AI chatbots offer advantages such as accessibility, affordability, anonymity, and convenience. 24,25
Despite AI chatbots' potential to bridge the gap between supply and demand for care, there are few theoretically driven studies regarding the acceptability and continued use of AI chatbots among individuals with symptoms of depression and anxiety. Notably, fewer studies have examined the factors related to the continued use of AI chatbots. 26 –28
As one major issue plaguing health technology use is the abandonment of technology after initial adoption, 29 –31 it is worth examining factors that predict the intention to continue using AI chatbots to further our understanding of AI chatbot usage. This study aimed to identify a range of psychological and technological factors influencing the intention to adopt as well as to continue using AI chatbots, informed by literature on technology adoption, AI perception, and mental health help-seeking behavior.
Previous studies focused on the initial adoption of AI chatbots, 15,21,32 commonly framed by general technology acceptance models (e.g., technology acceptance model, AI device use acceptance model, and unified theory of acceptance and use of technology). 32 –35 Specifically, researchers found that performance expectancy, effort expectancy, and social influence such as descriptive and injunctive norms 36 –38 positively related to the intention to use coaching, medical, or mental health chatbots 32,33,37 among general adults, 37 LGBTQIA+ individuals, 33 and cancer patients. 32
Additionally, while there is less direct evidence, previous studies in the health context have repeatedly found that facilitating conditions 39 –41 and price values 26,42,43 are positively related to behavioral intention to use various health-related technologies. Considering that these factors are expected to positively predict the adoption and continued use of AI chatbots for mental health, we propose the hypotheses below:
H1: (a) Performance expectancy, (b) effort expectancy, (c) facilitating conditions, (d) price value, (e) descriptive norm, and (f) injunctive norm positively predict the intention of adoption and continued use of AI chatbots for mental health.
For incorporating new technologies such as AI into health care, people may exhibit ambivalence about using AI-driven chatbots, given the concerns around the quality, trustworthiness, and accuracy of health information provided by chatbots and the perceived lack of empathy from depersonalized chatbots, which researchers have termed as AI hesitancy to describe the phenomenon. 44
Relatedly, privacy concerns about access to personal data have long been established as a barrier to seeking help online for mental health problems, as commonly discussed in studies on digital mental health. 45 –47 Therefore, to further identify factors particularly relevant to AI technology, we propose the following hypotheses.
H2: (a) AI hesitancy and (b) privacy concerns will negatively predict the intention of adoption and continued use of AI chatbots for mental health.
Specific to the context of mental health help-seeking behavior, one major barrier is stigma, which can usually be categorized into self-stigma and social stigma. 48 Self-stigma refers to when an individual internalizes the negative attitudes people have toward mental health problems, whereas social stigma refers to the perceived public stigma associated with seeking professional help. 49
Researchers reported that stigma about mental illness or mental health services is significantly related to a reluctance of seeking help for mental health problems in the general population. 49 In contrast, a higher level of psychological distress is positively associated with general help-seeking behavior for emotional problems. 50 Therefore, we hypothesize that these factors also apply to using AI chatbots for mental health.
H3: (a) Self-stigma and (b) social stigma negatively predict the intention of adoption and continued use of AI chatbots for mental health, while (c) psychological distress positively predicts the intention of adoption and continued use of AI chatbots for mental health.
Moreover, AI hesitancy may interact with social stigma and privacy concerns, exacerbating or alleviating their effect on adoption and continued use. On the one hand, as AI systems require a vast amount of personal data, users may be concerned with the potential leakage of their personal information. 51 In this case, AI hesitancy may exacerbate the effect of privacy concerns.
On the other hand, the lack of human-to-human interaction and the relatively high level of anonymity that AI chatbots afford may reduce privacy concerns. 24,25 Therefore, lower AI hesitancy may alleviate the effect of privacy concerns and encourage the adoption or continued use of mental health chatbots. Similarly, AI hesitancy may also impact social stigma's effect on using AI chatbots.
For those with lower AI hesitancy, the anonymity afforded by AI chatbots may lower the barrier of social stigma and therefore motivate the use of AI chatbots. For example, one study found that chatbot acceptability was significantly higher for stigmatized health issues. 52 Therefore, we ask the following research question:
RQ1: Will AI hesitancy moderate the relationship between (a) privacy concern and (b) social stigma and the intention of adoption and continued use of AI chatbots for mental health?
Methods
PARTICIPANTS AND PROCEDURES
The institutional review board approved the study at the university where the study was conducted. We conducted an online survey among residents of the United Sates on Amazon Mechanical Turk (Mturk). Data were collected online using Qualtrics in June 2021. To screen individuals with symptoms of depression and anxiety, participants first completed the Patient Health Questionnaire 2 (PHQ-2) and then the Generalized Anxiety Disorder Questionnaire (GAD-2).
For all four statements, participants indicated their frequency of distress (not at all = 0, several days = 1, more than half the days = 2, and nearly every day = 3). The survey was discontinued for individuals who scored less than 3 points as they did not meet our criteria.
Individuals who passed the screening were presented with a consent form for the survey. After providing consent, they were presented with a broad definition of automated e-mental health tools, encompassing AI chatbots: “Automated e-mental health tools for anxiety and depression are automated mental health services and information delivered through the Internet and related technologies with professional input from therapists/clinicians, but without real-time human support. It does not include online therapy in which you talk to a human provider either in real time or asynchronously.”
After reading the definition, they were shown an embedded video introducing Woebot as an example of AI chatbots, followed by a question to indicate whether they have used similar AI chatbots before. Participants who answered “yes” were directed to survey questions that were phrased accordingly to examine continued use intention, and participants who answered “no” were directed to survey questions that were phrased to examine adoption intention.
After deleting responses that failed to pass half of the six attention check questions, our final sample consisted of 393 participants, with 208 participants who had not used AI chatbots before (AI nonusers) and 185 participants who had used AI chatbots before (AI users). The survey took about 25 minutes, and participants who completed the survey were paid $3 through Mturk.
MEASURES
To measure intention to use/continue using AI chatbots, we adopted the validated scale of behavioral intention based on measures of the UTAUT model. 53 Participants rated their intention to use/continue using AI chatbots by indicating their level of agreement on three statements on 7-point Likert scales (1 = strongly disagree and 7 = strongly agree). Cronbach's α was 0.96 for intention to use among nonusers and 0.79 for AI users. Scores were averaged across the three statements to create the score for intention to use/continue using AI chatbots.
Performance expectancy, effort expectancy, facilitating conditions, and price value were measured by adapting the corresponding scales of performance expectancy (⍶ = 0.72 for AI users and ⍶ = 0.90 for AI nonusers), effort expectancy (⍶ = 0.71 for AI users and ⍶ = 0.88 for AI nonusers), facilitating conditions (⍶ = 0.71 for AI users and ⍶ = 0.77 for AI nonusers), and price value (⍶ = 0.71 for AI users and ⍶ = 0.87 for AI nonusers) based on measures of the UTAUT model.
The statements were phrased in the present tense for AI chatbot users and in future tense for AI nonusers. Descriptive and injunctive norms were both measured by a two-item scale adapted from the scale of descriptive/injunctive norms based on previous research. 54
Self-stigma was measured using a validated 10-item scale of self-stigma of seeking help (⍶ = 0.85). 55 Social stigma was measured by a validated 5-item Social Stigma Scale for Receiving Psychological Help. 56 Privacy concern was measured by a three-item scale adapted from previous scales related to online information privacy (⍶ = 0.83). 57,58 AI hesitancy was measured using four items (⍶ = 0.80) adjusted from a technophobia scale, 59 such as “I prefer to have people handle my mental health issues than automated e-health tools.”
Psychological distress was measured by combining participants' scores on PHQ-2 and GAD-2. 60,61 We also measured demographic variables, including age, sex, gender, income, education, and race, for sample description. Participants' demographics are reported in Table 1.
Participant Demographics
Two hundred eight participants have not used AI tools before and 185 have used AI tools in the past. N = 192 for age. N = 193 for all other variables. Not married included those who are single, divorced, separated, or widowed.
AI, artificial intelligence; M, mean; SD, standard deviation.
Results
Two regression models with the intention to use or continue using AI chatbots as the dependent variables were tested. No multicollinearity issue was found, and all analyses were conducted in Stata 14. We reported the descriptive statistics and correlations of all relevant variables in Table 2 for AI nonusers and Table 3 for AI users. Regression model results are presented in Table 4.
Means, Standard Deviations, and Zero-Order Correlations of Relevant Variables for Artificial Intelligence Nonusers (N = 208)
p < 0.05; ** p < 0.01; *** p < 0.001.
Means, Standard Deviations, and Zero-Order Correlations of Relevant Variables for Artificial Intelligence Users (N = 185)
p < 0.05; ** p < 0.01; *** p < 0.001.
Hierarchical Regression Analysis Predicting Intention to Use/Continue Using Artificial Intelligence Chatbots
Unstandardized coefficients (standard errors in parentheses);
+p < 0.1; * p < 0.05; ** p < 0.01; and *** p < 0.001.
Among AI nonusers (Model 1), we found that performance expectancy (H1a, b = 0.60, p < 0.001), price value (H1d, b = 0.30, p < 0.01), descriptive norm (H1e, b = 0.30, p < 0.001), and psychological distress (H3c, b = 0.09, p < 0.01) were positively related to intention to adopt AI for mental health. Meanwhile, effort expectancy (H1b, b = −0.28, p < 0.05) and AI hesitancy (H2a, b = −0.19, p < 0.05) were negatively related to intention to adopt AI among nonusers.
Facilitating conditions (H1c, b = 0.12, p > 0.05), self-stigma (H3a, b = −0.09, p > 0.05), social stigma (H3b, b = 0.12, p > 0.05), and privacy concerns (H2b, b = 0.02, p > 0.05) were unrelated to intention to adopt AI among nonusers. We did not find interaction effects between privacy concerns and AI hesitancy (RQ1a, b = −0.02, p > 0.05) and between social stigma and AI hesitancy (RQ1b, b = 0.02, p > 0.05) on the intention to adopt among AI nonusers.
Among AI users (Model 2), performance expectancy (H1a, b = 0.46, p < 0.001), price value (H1d, b = 0.23, p < 0.01), and descriptive norm (H1e, b = 0.21, p < 0.01) were positively related to the intention to adopt AI for mental health. Unlike AI nonusers, personal injunctive norms (H1f, b = 0.19, p < 0.05) were also positively related to the intention to continue using AI chatbots.
However, effort expectancy (H1b, b = 0.09, p > 0.05), AI hesitancy (H2a, b = −0.11, p > 0.05), and psychological distress (RQ1c, b = −0.0003, p > 0.05) were unrelated to the intention to continue using AI chatbots. The interactions between privacy concerns and AI hesitancy (RQ1a, b = −0.04, p > 0.05) and social stigma and AI hesitancy (RQ1b, b = 0.006, p > 0.05) were not significant predictors of intention to continue using AI chatbots among AI users, indicating the lack of moderation among the three predictors for AI users as well.
Discussion
AI chatbots have become increasingly popular in health care. 62 To examine predictors of both adoption and continued use of AI chatbots for mental health, an online survey was conducted among users and nonusers of AI chatbots who had depression and anxiety symptoms.
We tested critical predictors in the related domains, including mental health help-seeking behavior (i.e., social stigma and self-stigma), AI perception (i.e., AI hesitancy and privacy concern), and technology adoption (i.e., UTAUT2). Findings indicated that UTAUT2 was a viable model to predict the adoption and continued use of AI chatbots for mental health.
One unexpected finding was that effort expectancy negatively predicted adoption intention for people who have never used AI chatbots. A possible explanation is that the expected ease of using mental health chatbots might reduce people's perceived expertise or functionality of AI chatbots, especially for people who have never interacted with or experienced AI chatbots.
In other words, a high degree of effort expectancy might reflect the expectation that mental health chatbots would be limited in their ability to assist users in overcoming mental health issues, as they would only be able to perform simple and basic functions. This expectation may stem from nonusers' experiences with other types of AI chatbots, such as customer service chatbots, which often have limited functions.
However, unlike many simple tasks usually carried out by chatbots, mental health counseling is a complex task that can be challenging even for humans. Therefore, an easy solution using AI chatbots with minimal individual effort may not be perceived as credible and thus reduces individuals' adoption intention. For individuals who have already used AI chatbots, effort expectancy was not a significant predictor of continued use, suggesting that existing users may value other aspects of AI chatbots for continuance.
Social influence was operationalized separately as descriptive and injunctive norms. 26 We found that both the injunctive and descriptive norms predicted intention to continue to use AI chatbots. For nonusers, only the descriptive norm predicted adoption intention. The findings suggest that observing other people using AI chatbots is more critical in the adoption decision than recommendations by others to use AI chatbots for mental health issues. This finding implies that it is more important to demonstrate that many other people with similar problems are using AI chatbots for mental health rather than just encouraging adoption.
None of the concepts we imported from mental health help-seeking literature were found to be significant predictors of adoption or continued use, except that psychological distress predicted adoption intention. Although stigma has been consistently found to be an essential factor preventing individuals from seeking mental health help in an offline setting, 49 the findings suggest that concerns about social or personal stigma may not hinder people from trying or continuing to use AI chatbots, possibly because of the lack of judgment and anonymity provided by AI chatbots.
Consistent with our hypothesis, AI hesitancy was found to lower adoption intention. This finding suggests that in promoting AI-based tools for mental health, it is essential to explain to users how AI chatbots work and why they can be effective in combating their hesitancy. Surprisingly, privacy concern was not related to adoption or continuance intention. Prior research found that privacy was critical in adopting technology-based mental health tools. 63,64
In the case of AI chatbots, it seems that for users and nonusers, the fact that the AI chatbot is not an actual human therapist or counselor might have relieved the privacy concern. However, privacy concern was higher among users than nonusers. Whether the users truly understand the underlying data structure for their privacy concern needs to be further explored.
LIMITATIONS
There are several limitations to the current study. First, given that our sample is a convenience sample, our results are not representative. Second, we did not find any significant moderation, which could be due to our limited sample size rather than a lack of interaction effect. Third, our study was conducted before the release of ChatGPT.
Notably, more people have tried ChatGPT for various daily tasks, such as writing e-mails, summarizing long texts, generating business ideas, or creating software. 65 Their individual experiences with ChatGPT could sway their opinion on adopting AI chatbots for mental health concerns, such that a positive encounter may boost their confidence about AI chatbots' capabilities to provide solid mental health advice. At the same time, a negative one may raise doubt about its reliability.
Experts also reported heightened concerns about issues such as the privacy of personal data 66 and algorithmic bias of ChatGPT. 67 These factors could all complicate individuals' attitudes toward chatbots. Nevertheless, our data contribute insights regarding the use of AI chatbots before ChatGPT, which also serve as a valuable starting point for understanding the evolution of the public's acceptance of AI chatbots.
Conclusions
The coronavirus disease 2019 (COVID-19) pandemic has worsened the mental health of millions of Americans with an overburdened health care system that cannot meet the overwhelming demands. 68 AI chatbots provide a valuable tool for alleviating mental health concerns, especially those remediable by CBT.
As AI chatbots have the potential to provide accessible, affordable, efficient, and personalized mental health support experience, 23 findings of the study could help drive interventions that target the significant factors for predicting the adoption and continued use of AI chatbots, such as performance expectancy, price value, personal description norm, and AI hesitancy.
Footnotes
Authors' Contributions
L.L. was involved in conceptualization, methodology, formal analysis, investigation, writing—original draft, writing—review and editing, project administration, and funding acquisition. W.P. was involved in conceptualization, methodology, writing—original draft, writing—review and editing, and supervision. M.R. was involved in conceptualization, methodology, formal analysis, writing—review and editing, and funding acquisition.
Disclosure Statement
No competing financial interests exist.
Funding Information
This study is funded by the Charles J. Strosacker Foundation Research Fund for Health and Risk Communication.
