Concordance of assessments of clients’ mental and behavioral health with in vivo assessment of work performance

Abstract

BACKGROUND:

Assessing functioning and disability among individuals with mental and behavioral health disorders has historically relied on deriving accurate psychiatric diagnoses and assessing symptoms. However, growing empirical evidence suggests that this approach is inadequate to determine real world performance, particularly with respect to work.

OBJECTIVE:

We examined a performance-based approach to the assessment of work functioning and its relationship to mental and behavioral health status.

METHODS:

A cross-sectional study was conducted at two mental health programs. Trained employment providers conducted performance-based assessments of work function and ratings of mental and behavioral health while study participants self-reported their mental/behavioral health functioning. We hypothesized that participant and provider ratings of mental/behavioral health would be moderately correlated with performance-based assessments of work function.

RESULTS:

We found no significant correlation between participants’ self-report of their mental and behavioral health and performance-based assessments of work. Employment providers’ ratings of participants’ mental/behavioral health were moderately correlated with performance-based measures of work. Finally, we found low concordance between employment providers and study participants with respect to ratings of participants’ mental/behavioral health.

CONCLUSIONS:

Contrary to our hypotheses, ratings of mental/behavioral health were only moderately correlated with performance-based measures of work. Results confirm earlier research suggesting that it is difficult to predict work performance from participants’ self-reports of their mental/behavioral health alone. Performance-based assessments of work capacity as well as ratings of mental and behavioral health may both be needed for a more complete and complementary picture of the ability of individuals with mental and behavioral health disorders to function in the workplace.

Keywords

Work performance, work capacity, individuals with psychiatric disability, in vivo assessment, concordance

1 Introduction

Assessing functioning and disability among individuals with mental and behavioral health disorders is complex. Historically, the medical model, with a focus on measuring symptoms and deriving accurate psychiatric diagnoses, was considered sufficient to understand and quantify functioning and disablement [1]. However, over the past two decades, a growing body of research has sought to better understand the relationship between functioning and psychiatric symptomatology [2, 3]. Empirical evidence has emerged suggesting that functioning in general, and work performance more specifically, cannot be wholly predicted by symptoms, diagnosis, co-morbid conditions, or cognitive impairment [4–11]. In a recent study of almost 500 individuals with psychiatric disabilities receiving employment services, Corbière and colleagues corroborated recent research suggesting that clinical factors, such as psychiatric diagnoses and severity of symptoms, were not useful in predicting employment outcomes [6].

With this accumulating empirical evidence has come a growing emphasis on functional assessments of mental/behavioral health [12] and work performance [13–15] with a concomitant focus on “real-world” functioning, as opposed to psychiatric symptoms and diagnosis alone [16]. In a recent review of performance-based measures, Harvey and colleagues [13] concluded that direct assessment of functional capacity has substantial advantages over other traditional measures of psychiatric status and provides a more valid estimate of functional disability. Leifker [16], along with an expert panel, conducted a review of current performance measures across social, residential, and vocational domains (VALERO; “Validating Measures of Real-World Outcome”). From a total of 59 mental health assessments, recommendations of functional measures included those examining social and interpersonal behavior, self-care, life skills, and quality of life. No assessments focused on work function. Two additional performance-based assessments have been the object of study recently [14, 17] and both centered on self-care and activities of daily living among individuals with psychiatric disabilities. In addition, many of the studies that have been conducted on performance-based measures have been limited to the subpopulation of individuals with a diagnosis of schizophrenia alone. Together, the extant literature suggests that more research is needed to examine performance-based measures of work capacity and how they correlate with or diverge from measures of psychiatric symptoms, diagnoses, and status.

In this study, we sought to obtain empirical data about the relationship between a short form of the Work Disability Functional Assessment Battery, designed as a self-report assessment of behavioral and mental health (BH-FAB) [12], and a performance-based assessment of work capacity conducted by trained employment providers. We also wanted to examine the concordance between employment providers’ ratings of mental and behavioral health functioning and individuals’ concurrent self-reported ratings of the same. We hypothesized that: 1) employment provider ratings and participant self-report ratings of mental and behavioral health would each be moderately correlated with performance-based assessments of work capacity, and 2) participants’ self-report of their mental and behavioral health functioning would be correlated with concurrent ratings by their employment providers.

2 Methods

We conducted a cross-sectional study to address study hypotheses. We recruited employment providers and clients they served to complete ratings of mental and behavioral health and to participate in structured, in vivo work performance assessments. All research procedures were reviewed and approved by two governing Institutional Review Boards (Boston University and the state Department of Mental Health). Data were collected from 2013 to 2016.

2.1 Sites

Research activities were conducted at two mental health programs in Massachusetts. The programs were chosen because they had an active employment services component and employment providers and were willing to participate in a fairly burdensome research study. Employment providers within these programs were recruited for participation by describing the research activities, the training they would receive as part of that research, and the observational and data collection portion of the study. Participation by employment providers was not mandated by the agency, but was voluntary. After recruitment, screening and consenting, employment providers were trained in the performance-based work assessment protocol over several weeks. Clients they served were then recruited for participation in the study. Assessments of work performance were carried out either at a “clubhouse program” for individuals with mental or behavioral health conditions or in community-based employment settings where participating individuals were working and receiving support from their employment providers. A trained research liaison at each site carried out informed consent procedures and coordinated data collection with senior research staff.

2.2 Participants

2.2.1 Employment providers

Six employment providers were recruited and trained to conduct observations of work performance; five were White American and one was African American; one was male and the remainder female. Most employment providers were not licensed clinical professionals. Overall, they had worked between 1 and 11 years as employment providers.

2.2.2 Study participants

A total of 42 mental/behavioral health clients were recruited using flyers distributed by their employment providers. Inclusion criteria included: 1) the presence of a serious mental or behavioral health condition, 2) current engagement in volunteer or competitive employment (sufficient to carry out the in vivo work observations), 3) the ability to provide full and knowing consent, and 4) a willingness to be observed and rated performing tasks during a typical work day. Exclusion criteria included individuals who had a guardian or those with significant cognitive or psychiatric impairments that would prevent full and knowing consent or participation in the research activities. Participants ranged in age from 19 to 68 years. Fifty-two percent of participants were female and 86% were White American. Approximately 50% were working for pay and 83% reported having a high school degree or vocational training. Further demographic information for participants is provided in Table 1.

Table 1
Demographics of study sample

	n	%
Psychiatric Diagnosis
Schizophrenia Spectrum/Psychotic Disorder	20	47.6
Bipolar Disorder	12	28.6
Major Depression/Depressive Disorder	8	19.0
Other (e.g., Anxiety, OCD, PTSD)	2	4.8
Gender
Male	22	52.4
Female	20	47.6
Race
White	34	81.0
African American	5	11.9
Other	1	2.4
Missing	2	4.7
Spanish/Hispanic/Latino
No	40	95.2
Yes	2	4.8
Education
Some High School	7	16.7
High School Graduate	14	33.3
Some College/No Degree	14	33.3
Associate of Arts Degree	5	11.9
Bachelor’s Degree	2	4.8
Working for Pay
No	21	50.0
Yes	21	50.0
# Jobs in Past 2 Years
0	13	31.0
1	21	50.0
2–4	8	19.0
Receiving SSDI/SSI
Yes	29	70.7
No	12	29.3
Current or recent Treatment
Individual Sessions Therapist/Counselor	37	88.1
Group Therapy Meetings	19	45.2
Medication Evaluation/Check-In	30	71.4
Meetings with Case Manager	25	59.5
Day Treatment/Partial Hospitalization	10	23.8
Inpatient Hospitalization	13	31.0
Other	9	21.4

2.3 Study variables and measures

The following instruments were used for data collection:

2.3.1 Demographic background

Basic demographics were captured using an instrument for that purpose and included age, gender, race/ethnicity, marital status, income, current work status, work history, educational attainment, Social Security Disability status, and living situation of study participants. With authorization, we obtained participants’ DSM-IV mental or behavioral health diagnoses from their mental health program.

2.3.2 Mental health function

We used a short form of the Behavioral Health domains of the Work Disability Functional Assessment Battery (WD-FAB) [12] to assess mental and behavioral health. The Behavioral Health subscales of the FAB (BH-FAB) assess critical dimensions of mental and behavioral health and were developed using item response theory (IRT) methodology. Item content was conceptually grounded in the World Health Organization International Classification of Functioning [18] and was developed using a rigorous process that included extensive literature review, cognitive testing, and focus group input from both individuals with work disabilities and work disability experts.

For this study we developed two paper-and-pencil Short Form versions of the BH-FAB, one for the study participants and one for the employment providers (see Footnote 1). Each version consisted of 51 items, providing a score in each of the following sub-domains: Self-Efficacy (15 items), Mood and Emotions (15 items), Behavioral Control (15 items) and Social Interactions (6 items). Approximately half of the items are measured on an “agreement” scale (Strongly Agree, Agree, Disagree, Strongly Disagree), and the remaining items on a scale that captures functioning in the past 7 days (using Never, Rarely, Sometimes, Often and Always). The BH-FAB and its Short Form yield IRT-based standardized T-scores normed on a US general working-age sample (norm-referenced sample) with a mean of 50 and standard deviation of 10, allowing scores of the mental/behavioral health clients to be compared with those of age- and gender-matched working-age adults in the US general population (sample items for each subscale appear in Table 2).
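The T-score metric described above is a linear rescaling of a standardized score onto a scale with a mean of 50 and a standard deviation of 10. The following is a minimal sketch of that rescaling only; it does not reproduce the IRT-based scoring of the BH-FAB, and the function names and example values are hypothetical.

```python
NORM_MEAN = 50.0  # mean of the norm-referenced sample on the T-score metric
NORM_SD = 10.0    # standard deviation of the norm-referenced sample


def to_t_score(z: float) -> float:
    """Rescale a standardized (z) score to the T-score metric."""
    return NORM_MEAN + NORM_SD * z


def to_z_score(t: float) -> float:
    """Express a T-score as a distance from the norm mean in SD units."""
    return (t - NORM_MEAN) / NORM_SD


if __name__ == "__main__":
    # Hypothetical respondent scoring half a standard deviation below the norm.
    print(to_t_score(-0.5))
    # Hypothetical T-score expressed in standard deviation units.
    print(to_z_score(60.0))
```

Read this way, a subscale mean a few points below 50 corresponds to functioning a fraction of a standard deviation below the working-age norm.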

Table 2
Sample items from the Vocational Situational Assessment, the Behavioral Health Functional Assessment Battery, and the Behavior and Symptom Identification Scale-24®

Measure	Subscale	Representative items from subscales
BH FAB Subscale	Behavioral Control	In the past seven days, I was resentful when I didn’t get my way.
		In the past seven days, I had trouble controlling my temper.
	Mood and Emotions	In the past seven days, I felt that nothing could cheer me up.
		In the past seven days, I felt nervous when my normal routine was disturbed.
	Self-Efficacy	If I make a mistake, I know I can deal with it.
		I look for the good in difficult situations.
	Social Interactions	In the past seven days, I looked forward with enjoyment to upcoming events.
		In the past seven days, I could keep up with my family responsibilities.
VSA	Work Skills	Organizes and maintains neat work area.
		Follows through and completes tasks.
		Works steadily but is responsive to environment.
	Interpersonal Skills	Communicates accurately and expresses self clearly.
		Initiates work related conversations with coworkers.
		Listens and responds to feedback provided by co-workers or supervisor(s).

Note: The BH FAB (Short Form) has a mean of 50 and a SD of 10 in the normative data; the VSA is scored on a 1–4 scale with higher scores indicating better functioning.

Higher scores on each scale indicate higher functioning. Internal consistency scores for the BH-FAB in this sample ranged from good to excellent, with coefficients alpha ranging from 0.70 to 0.92 for the participants’ self-report ratings and from 0.70 to 0.95 for the employment provider ratings. There were predictable differences by diagnostic category, which provided evidence for both the scale’s reliability and discriminant validity.
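The internal consistency coefficients reported above are coefficient (Cronbach’s) alpha values. As a minimal sketch of how such a coefficient is computed from item-level responses, assuming complete data and using made-up responses (the function name and data are hypothetical, not drawn from the study):

```python
from statistics import variance


def cronbach_alpha(responses):
    """Coefficient alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores)
    """
    k = len(responses[0])
    items = list(zip(*responses))  # transpose: one tuple of scores per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


if __name__ == "__main__":
    # Hypothetical responses from five people to four items on a 1-4 scale.
    example = [
        [1, 2, 1, 2],
        [2, 2, 3, 2],
        [3, 3, 3, 4],
        [4, 3, 4, 4],
        [2, 1, 2, 2],
    ]
    print(round(cronbach_alpha(example), 2))
```

Alpha approaches 1 when items rise and fall together across respondents and drops toward 0 when item scores are unrelated.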

2.3.3 Performance-based work assessment

The Vocational Situational Assessment Scale (VSA) [19], a 35-item scale developed to measure work performance among individuals with mental and behavioral health disorders, was used to guide in vivo observations and ratings by trained employment providers. The VSA comprises two subscales, Work Skills (21 items) and Interpersonal Skills for the Workplace (15 items), each measured on a 4-point rating scale (from 1 = cause to be fired to 4 = above acceptable work performance) and yielding a subscale score. Each scale point is anchored with descriptors which were carefully developed and evaluated based on the requirements of a minimum wage job (samples of the Work Skills subscale and the Interpersonal Skills for the Workplace subscale appear in Table 2).

The VSA has demonstrated reliability and concurrent and predictive validity in a small study of individuals with psychiatric disabilities [19]. In this sample, internal consistency reliability estimates for the VSA were 0.97 overall, 0.95 for the Work Skills subscale, and 0.94 for the Interpersonal Skills subscale.

2.3.4 Recruitment and training of employment providers

Research staff oriented and consented employment providers (informed consent of the employment providers was required by the governing IRBs). Remuneration was provided for their time and effort. Employment providers were trained extensively on procedures for observing work performance and on the ratings of the Vocational Situational Assessment Scale. They were provided an overview of performance-based measurement and were oriented to each item of the VSA using hypothetical scenarios. During the training, we capitalized on the employment providers’ considerable knowledge and experience in observing and assessing both adequate and problematic work performance, behaviors and skills; training helped them to operationalize their knowledge and skills using the scale items and ratings.

Next, we established inter-rater reliability and validity of the VSA ratings in two steps. First, we recruited and enrolled “practice” participants whom employment providers were able to observe and rate in pairs. Second, towards the end of their training, employment providers were paired with their supervisors to conduct an additional assessment. We took this step with the assumption that the supervisor was the “gold standard” for validity purposes and also to perform another check on inter-rater reliability. Employment providers were instructed to conduct unobtrusive observations and to keep concurrent notes. At the end of each training session, research staff calculated the agreement between raters, and reviewed the ratings and observational notes in detail with employment providers and their supervisors. We detected the most common sources of errors or differences between each pair, and addressed those differences by discussing the ratings and reaching consensus. We continued with practice participants until satisfactory correlations between raters were achieved. These training and consensus building steps allowed employment providers to achieve a satisfactory level of skill in conducting performance-based assessments of work before proceeding with the study.

2.3.5 Data collection

Research staff screened potential study participants based on eligibility criteria; participants were consented and enrolled by research liaisons at each site. Upon enrollment, participants completed the demographic questionnaire and the self-report version of the BH-FAB. Within 1–2 days of the study participant’s data collection, employment providers completed the BH-FAB about their respective client. Within 1–2 days of completing the BH-FAB, the employment provider began the performance-based work assessment by observing the participant in a natural work setting for at least 1 hour per day for a 3–5 day period. Employment providers kept notes about their observations and rated the participants at the conclusion of the observation period.

2.4 Data analysis

After verifying the data and examining it descriptively, we conducted analyses to examine concordance between providers and participants on mental and behavioral health functioning [20, 21] and to examine ratings of mental and behavioral health and performance-based assessments of work using both intra-class and Pearson correlation coefficients. We constructed a “folded empirical cumulative distribution plot” (i.e., a mountain plot) [22] in order to visually depict and examine the magnitude and direction of discordance between ratings conducted by providers and self-report ratings by participants of their mental and behavioral health. A mountain plot provides information about the distribution of the differences between the two rating approaches, in this case, provider ratings vs. self-report ratings. If the two ratings are unbiased with respect to each other, the mountain plot will be centered over zero. Long tails in the plot reflect large differences between the raters.

All data analyses were conducted in SPSS 20.0.
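The analyses themselves were run in SPSS. Purely as an illustration of the folded empirical cumulative distribution that underlies a mountain plot, the computation can be sketched in Python as follows. The function name, the example scores, and the plotting position (i − 0.5)/n are assumptions made for this sketch and are not taken from the study.

```python
def mountain_plot_points(self_scores, provider_scores):
    """Return (difference, folded percentile) pairs for a mountain plot.

    Differences are self-report minus provider rating. Each sorted difference
    gets an empirical percentile; percentiles above 0.5 are folded back
    (1 - percentile) so the curve peaks near the median difference.
    """
    diffs = sorted(s - p for s, p in zip(self_scores, provider_scores))
    n = len(diffs)
    points = []
    for rank, diff in enumerate(diffs, start=1):
        percentile = (rank - 0.5) / n  # assumed plotting position
        folded = percentile if percentile <= 0.5 else 1.0 - percentile
        points.append((diff, folded))
    return points


if __name__ == "__main__":
    # Hypothetical T-scores for four participants and their providers.
    self_report = [40, 45, 50, 55]
    provider = [50, 50, 50, 50]
    for diff, height in mountain_plot_points(self_report, provider):
        print(diff, height)
```

Plotting the folded percentile against the difference gives the mountain shape: a peak to the left of zero would mean self-report scores tend to be lower than provider scores, and a peak to the right would mean the opposite.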

3 Results

Descriptive statistics (means and standard deviations) for both participant and employment provider ratings of the BH-FAB subscales and employment provider ratings of work performance are presented in Table 3. Participants’ self-reports of their mental and behavioral functioning were at or slightly below average for working age adults for all subscales of the BH-FAB. Provider ratings were higher than participant self-report ratings on three BH-FAB subscales; the exception was the Self-Efficacy scale, on which participants rated themselves higher than providers did. In terms of performance-based assessments of work, the mean total VSA score was 3.27 (SD = 0.44), with means of 3.32 for the Work Skills subscale and 3.25 for the Interpersonal Skills subscale. On average, individuals were rated as exhibiting slightly above “adequate work performance”, with interpersonal skills slightly lower than work skills.

Table 3
Means and standard deviations of measures

Measure	Subscale	n	Mean	SD
BH FAB	Subscale
Client Version	Behavioral Control	42	47.5	9.28
	Mood and Emotions	42	45.7	9.19
	Self-Efficacy	42	50.1	9.94
	Social Interactions	42	47.7	6.81
Provider Version	Behavioral Control	40	54.6	10.4
	Mood and Emotions	42	53.1	6.59
	Self-Efficacy	42	43.2	8.87
	Social Interactions	41	50.4	6.66
VSA	Total Vocational Situational Assessment	37	3.27	0.44
	Work Skills	41	3.32	0.46
	Interpersonal Skills	32	3.25	0.46

Note: The BH FAB (Short Form) has a mean of 50 and a SD of 10 in the normative data; the VSA is scored on a 1–4 scale with higher scores indicating better functioning. Missing data on the Interpersonal Skills subscale of the VSA was due in part to some behaviors not being observable in certain work situations (e.g., interactions with the public).

Table 4 presents the correlations among the work performance ratings by the providers, the provider ratings on the BH-FAB, and the participants’ self-report on the BH-FAB. The most striking finding is that there were no statistically significant correlations between participants’ self-report of their mental and behavioral health and employment providers’ ratings of work performance. Moderate correlations were found between the BH-FAB ratings completed by employment providers and their work performance ratings, with one exception (the correlation between the interpersonal subscale of the VSA and the provider rating of Self-Efficacy on the BH-FAB).

Table 4

Correlation coefficients between subscales of the BH FAB and the total score of the Vocational Situational Assessment, Work Skills and Interpersonal Skills subscales

BH FAB (Short-Form) Subscale	Total VSA		Work Skills		Interpersonal Skills
	r	n	r	n	r	n
Client Version
Behavioral Control	0.25	37	0.32	41	0.13	32
Mood and Emotions	0.24	37	0.22	41	0.23	32
Self-Efficacy	0.15	37	0.07	41	0.28	32
Social Interactions	0.16	37	0.16	41	0.28	32
Provider Version
Behavioral Control	0.46**	35	0.50**	39	0.34*	30
Mood and Emotions	0.49**	37	0.43**	41	0.39*	32
Self-Efficacy	0.44**	37	0.39**	41	0.26	32
Social Interactions	0.55**	37	0.45**	40	0.50**	32

Note: N’s vary because not all items of the VSA could be observed (e.g., “converses with co-workers … ” may have been unobservable if the participant was working alone).

* = significant at p < 0.05;

** = significant at p < 0.01.

Table 5 presents the intra-class correlation coefficients (ICC) of employment provider and participant self-report ratings on all subscales of the BH-FAB. Results indicate no significant agreement between the provider ratings and the participant self-report on any of the four subscales (Behavioral Control, Mood and Emotions, Self-Efficacy, and Social Interactions).

Table 5

Intraclass correlation coefficients of employment specialist and client ratings on subscales of the Behavioral Health Functional Assessment Battery

BH FAB (Short-Form) Subscale	Between Employment Specialists and clients
Behavioral Control	0.21
Mood and Emotions	−0.08
Self-Efficacy	0.21
Social Interactions	−0.18
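The article does not state which form of the intraclass correlation was computed. As a hedged illustration only, the sketch below assumes a one-way random-effects, single-rater ICC, often written ICC(1,1), applied to pairs of self-report and provider scores; the function name and the example data are hypothetical.

```python
def icc_oneway(ratings):
    """One-way random-effects, single-rater ICC (an assumed form, ICC(1,1)).

    ratings: list of per-subject tuples, one score per rater.
    ICC = (MSB - MSW) / (MSB + (k - 1) * MSW)
    """
    n = len(ratings)      # number of subjects
    k = len(ratings[0])   # number of raters per subject
    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    subject_means = [sum(row) / k for row in ratings]
    # Between-subject mean square.
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    # Within-subject mean square (disagreement between raters).
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(ratings, subject_means) for x in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


if __name__ == "__main__":
    # Hypothetical (self-report, provider) T-score pairs.
    pairs = [(42, 55), (50, 48), (47, 58), (55, 51), (44, 60)]
    print(round(icc_oneway(pairs), 2))
```

Under this form, values near zero or below zero arise when raters disagree about a given person at least as much as people differ from one another, which is one way to read the small and negative coefficients in Table 5.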

The magnitude and direction of discordance between participants and providers on ratings of mental/behavioral health are depicted graphically in mountain plots displayed in Fig. 1. We note that the median values of score difference between subject self-report scores and provider scores are –6.97, –6.68, and –3.32 respectively for Behavioral Control, Mood and Emotions and Social Interactions subscales, suggesting that providers rated participants as less impaired when compared with self-report ratings for the same subscales. On the Self-Efficacy subscale, the median value is 7.05, suggesting that providers rated subjects as more impaired when compared to self-reported ratings by participants. Next, we examined the difference between 75th and 25th percentiles (interquartile range (IQR)) for each subscale. The Self-Efficacy scale had the largest interquartile range (16.1) compared with the others (Mood and Emotions: 15.35; Social Interactions: 14.18; Behavioral Control: 13.46), suggesting that scores on the Self-Efficacy scale had more variability compared to other subscales.

Fig. 1

Mountain plots of BH-FAB subscales comparing participant self-report to employment provider ratings. Differences with a negative skew indicate that the employment provider rated the participant as functioning better than did the participant (BC, SI, ME). The one difference with a positive skew was Self-Efficacy, suggesting participants perceived and rated themselves more positively than providers did. Note: BC = Behavioral Control; SI = Social Interactions; SE = Self-Efficacy; ME = Mood and Emotions.

4 Discussion

Contrary to our hypotheses, we found no significant correlations between participants’ self-report of their mental and behavioral health and performance-based assessments of work made by trained employment providers. This was the most striking finding and parallels a significant body of extant research. Such discrepancies have contributed to controversies in the mental health field about the capacity of individuals with mental and behavioral health disabilities to work and to conclusions by many clinicians that they cannot work even when clients assert that they can and wish to be employed. Such discrepancies in the perspectives of mental health clients and their providers may help to explain in part the differing views that have been reported in the literature about the role of work in the lives of individuals with mental and behavioral health disorders [23, 24].

We found moderate correlations between provider assessments of participants’ mental/behavioral health and their performance-based assessments of work. However, the variance explained in work performance by multiple subscales of a mental and behavioral health assessment was modest, in the range of 20–25%. These data suggest that work performance and mental and behavioral health status are relatively independent and that mental health functioning is not highly predictive of work performance in this population. The absence of a strong relationship between diagnosis, symptoms and work has been asserted and demonstrated in numerous studies examining the predictors of employment outcomes among individuals with mental and behavioral health disorders [6, 8–11].
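For a single predictor, the proportion of variance explained is the square of the Pearson correlation. The sketch below applies that relation to the provider-version correlations with the Total VSA score reported in Table 4. It gives bivariate values for each subscale separately and is not a reconstruction of the multi-subscale estimate referred to above; the helper name is hypothetical.

```python
# Provider-version BH-FAB correlations with the Total VSA score (Table 4).
PROVIDER_TOTAL_VSA_R = {
    "Behavioral Control": 0.46,
    "Mood and Emotions": 0.49,
    "Self-Efficacy": 0.44,
    "Social Interactions": 0.55,
}


def variance_explained(r: float) -> float:
    """Proportion of variance shared by two variables with Pearson correlation r."""
    return r * r


if __name__ == "__main__":
    for subscale, r in PROVIDER_TOTAL_VSA_R.items():
        share = variance_explained(r)
        print(f"{subscale}: r = {r:.2f}, variance explained = {share:.0%}")
```

Even the largest of these correlations leaves most of the variance in observed work performance unaccounted for, which is the point the paragraph above makes.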

Several factors could be responsible for the finding that there was not a high correlation between work function and ratings of mental/behavioral health. First is the possibility that not all mental and behavioral health symptoms by their nature interfere with an individual’s work performance. Having adequate employment and interpersonal skills in the workplace may supersede difficulties with mood and emotion, behavioral control, social interactions, or self-efficacy, particularly when difficulties in these areas are not severe. Secondly, it is possible that the observations of work performance conducted for this study did not accurately capture the totality of the participants’ work capacity.

In terms of the discordance between participants and providers in ratings of mental and behavioral health, there may be other factors at play. First, the education and training of employment providers do not generally emphasize assessment of mental and behavioral health functioning, which may make concordance between employment providers’ ratings and participant self-report more difficult to achieve. However, this lack of concordance was also noted by Marfeo and colleagues [25] in a study examining concordance between mental health providers and Social Security claimant self-report. It is possible that the 7-day rating window imposed by some of the BH-FAB items was problematic for employment providers who did not necessarily have knowledge of difficulties in that abbreviated time span, particularly for low-incidence but important behaviors, such as those represented in the Behavioral Control subscale (e.g., “In the past seven days, I threatened violence toward people or property”). Further, providers and participants may have had different perspectives about and interpretations of certain mental/behavioral health items, as was noted by Marfeo and her colleagues [25].

Our findings mirror research suggesting that ratings of psychiatric symptoms, functioning, goals, or problems completed through self-report, or from the client’s perspective alone, often do not agree closely with ratings completed by providers [25–30]. In addition, studies suggest that certain symptom and function ratings may be more suitable for accurate self-report than others, such as self-efficacy and depression vs. psychotic symptoms [31] or developmental history vs. current work functioning [32], although that research is neither unequivocal [33] nor robust.

Results of this study confirm earlier research suggesting that it is difficult to predict work performance from participants’ self-reports of their mental/behavioral health. Performance-based assessments of work capacity as well as ratings of mental/behavioral health may be needed for a complete and complementary picture of the ability of individuals with mental and behavioral health disorders to function in the workplace. We conclude that understanding the relation between work function and mental and behavioral health status deserves further study, and that our knowledge to date suggests that a person’s psychiatric diagnosis and symptomatology are likely to be only moderately predictive of their work performance.

This study had several methodological limitations. First, we used a relatively small sample of individuals with mental and behavioral health disorders, about 50% of whom had schizophrenia-spectrum disorders and who evidenced varying degrees of impairment. A larger sample may have provided additional information about the relationship between symptoms, functioning and work performance. We were able to conduct this study only within two mental health programs in one state; a different geographical representation could have yielded different findings. A significant number of the study participants were engaged in simulated work rather than competitive employment. Given the relatively small sample size, we were unable to fully exploit subset analyses of these two types of work, but such differences could lead to differences in performance. Employment providers were trained to criterion to conduct performance-based assessments of work capacity; however, this measure itself may be imperfect in its ability to capture work capacity. Work performance ratings were also relatively high, suggesting possible confounds from a ceiling effect. Our performance-based assessment of work function does not take into account other factors that could affect employment providers’ perceptions and ratings of work performance, such as the participants’ beneficiary status, environmental pressures, and labor market issues. Lastly, it appears that these two measures tap distinctly different aspects of functioning with less correlation than hypothesized.

5 Conclusions

Research suggesting that mental and behavioral health status and symptoms are not highly predictive of work performance in this population [5, 8] is borne out by this study. Taken together, results of this study suggest that observations and ratings of work performance conducted by a trained employment provider are not highly correlated with participants’ self-reports of their mental and behavioral health. The variance explained in work performance by the mental and behavioral health ratings was low, suggesting that capacity to work and work performance cannot be explained by mental health status alone. Each of these measures may provide unique and incremental information. We did confirm our hypothesis that performance-based observations of work correlated moderately with assessments of mental and behavioral health when both were conducted by employment providers. We conclude that multiple modes of observation and assessment of symptoms, function and performance are needed to obtain comprehensive and accurate assessments of “real world” functioning among individuals with mental and behavioral health disorders, particularly with respect to their ability to function in the workplace. These findings, coupled with other studies, have implications for the determination of social benefits and programs that rely on the establishment of work disability.

Conflict of interest

None to report.

Footnotes

1. To assess the extent of concordance between the employment provider and the participant regarding behavioral health/psychiatric functioning, we created a parallel version of the scale. Thus, an item that read “I am good at making new friends” for self-report by the participant was altered to read “The client is good at making new friends” for the provider.

Acknowledgments

This research was supported by Social Security Administration and National Institutes of Health (SSA-NIH) Interagency Agreements (NIH contract # HHSN269201100009I) and by the NIH intramural research program.

References

1. Anthony, Cohen, Farkas, Gagne. Psychiatric Rehabilitation. 2nd edition. Boston, MA: Boston University, Center for Psychiatric Rehabilitation; 2002.

2. Best, Gupta, Bowie, Harvey. A longitudinal examination of the moderating effects of symptoms on the relationship between functional competence and real world functional performance in schizophrenia. Schizophr Res Cogn. 2014;1(2):90–5.

3. McKnight, Kashdan. The importance of functional impairment to mental health outcomes: A case for reassessing our goals in depression treatment research. Clin Psychol Rev. 2009;29(3):243–59. 10.1016/cpr.2009.01.0005.

4. Bush, Drake, Xie, McHugo, Haslett. The long-term impact of employment on mental health service use and costs for persons with severe mental illness. Psychiatr Serv. 2009;60(8):1024–31.

5. Campbell, Bond, Drake. Who benefits from supported employment: A meta-analytic study. Schizophr Bull. 2011;37(2):370–80.

6. Corbière, Lecomte, Reinharz, Kirsh, Goering, Menear, et al. Predictors of acquisition of competitive employment for people enrolled in supported employment programs. J Nerv Ment Dis. 2017;205(4):275–82.

7. Michon, van Weeghel, Kroon, Schene. Person-related predictors of employment outcomes after participation in psychiatric vocational rehabilitation programmes. Soc Psychiatry Psychiatr Epidemiol. 2005;40(5):408–16.

8. Razzano, Cook, Burke-Miller, Mueser, Pickett-Schenk, Grey, et al. Clinical factors associated with employment among people with severe mental illness: Findings from the employment intervention demonstration program. J Nerv Ment Dis. 2005;193(11):705–13.

9. Rogers, MacDonald Wilson. Vocational capacity of individuals with mental health disabilities. In: Schultz, Rogers, editors. Handbook of Job Accommodation and Retention in Mental Health. New York: Springer; 2011, pp. 73–90.

10. Tsang, Lam, Ng, Leung. Predictors of employment outcome for people with psychiatric disabilities: A review of the literature since the mid ‘80’s. J Rehabil. 2000;66(2):19–31.

11. Wewiorski, Fabian. Association between demographic and diagnostic factors and employment outcomes for people with psychiatric disabilities: A synthesis of recent research. Ment Health Serv Res. 2004;6(1):9–20.

12. Marfeo, Ni, Haley, Bogusz, Meterko, McDonough, et al. Scale refinement and initial evaluation of a behavioral health function measurement tool for work disability evaluation. Arch Phys Med Rehabil. 2013;94(9):1679–86. 10.1016/j.apmr.2013.03.012

13. Harvey, Velligan, Bellack. Performance-based measures of functional skills: Usefulness in clinical treatment studies. Schizophr Bull. 2007;33(5):1138–48.

14. Harvey, Helldin, Bowie, Heaton, Olsson, Hjärthag, et al. Performance-based measurement of functional disability in schizophrenia: A cross-national study in the United States and Sweden. Am J Psychiatry. 2009;166(7):821–7.

15. Harvey. Assessment of everyday functioning in schizophrenia. Innov Clin Neurosci. 2011;8(5):21–4.

16. Leifker, Patterson, Heaton, Harvey. Validating measures of real-world outcome: The results of the VALERO expert survey and RAND panel. Schizophr Bull. 2011;37(2):334–43.

17. Brown, Velligan. Issues and developments related to assessing function in serious mental illness. Dialogues Clin Neurosci. 2016;18(2):135–44.

18. World Health Organization. International Classification of Functioning, Disability and Health [Internet]. World Health Organization; 2017. Available from http://www.who.int/classifications/icf/en/.

19. Rogers, Sciarappa, Anthony. Development and evaluation of situational assessment instruments and procedures for persons with psychiatric disability. Vocational Evaluation and Work Adjustment Bulletin. 1991;24:61–7.

20. Baldwin, Murray, Shadish, Pals, Holland, Abramowitz, et al. Intraclass correlation associated with therapists: Estimates and applications in planning psychotherapy research. Cogn Behav Ther. 2011;40(1):15–33.

21. Thayer, Burlingame. The validity of the Group Questionnaire: Construct clarity or construct drift? Group Dyn. 2014;18(4):318–32. 10.1037/gdn0000015

22. Monti. Folded empirical distribution function curves-mountain plots. Am Stat. 1995;49(4):342–5.

23. Millner, Rogers, Bloch, Costa, Pritchett, et al. Exploring the work lives of adults with serious mental illness from a vocational psychology perspective. J Couns Psychol. 2015;62(4):642–54.

24. Secker, Grove, Seebohm. Challenging barriers to employment, training and education for mental health service users: The service user’s perspective. Journal of Mental Health. 2001;10(4):395–404.

25. Marfeo, Eisen, Ni, Rasch, Rogers, Jette. Do claimants over-report behavioral health dysfunction when filing for work disability benefits? Work. 2015;51(2):187–94. 10.3233/WOR-141847

26. Cuijpers, Li, Hofmann, Andersson. Self-reported versus clinician-rated symptoms of depression as outcome measures in psychotherapy research on depression: A meta-analysis. Clin Psychol Rev. 2010;30(6):768–78.

27. Goldberg, Garakani, Ackerman. Clinician-rated versus self-rated screening for bipolar disorder among inpatients with mood symptoms and substance misuse [CME]. J Clin Psychiatry. 2012;73(12):1525–30.

28. Holmqvist, Philips, Mellor-Clark. Client and therapist agreement about the client’s problems-Associations with treatment alliance and outcome. Psychother Res. 2016;26(4):399–409. 10.1080/10503307.2015.10103160.

29. Tryon, Winograd. Goal consensus and collaboration. Psychotherapy. 2011;48:50–7. 10.1037/a0022061.

30. Rush, Carmody, Ibrahim, Trivedi, Biggs, Shores-Wilson, et al. Comparison of self-report and clinician ratings on two inventories of depressive symptomatology. Psychiatr Serv. 2006;57(6):829–37.

31. Carter, Frampton, Mulder, Luty, Joyce. The relationship of demographic, clinical, cognitive and personality variables to the discrepancy between self and clinician rated depression. J Affect Disord. 2010;124(1):202–6.

32. DeFife, Drill, Nakash, Westen. Agreement between clinician and patient ratings of adaptive functioning and developmental history. Am J Psychiatry. 2010;167(12):1472–8.

33. Rampton, Waghorn, De Souza, Lloyd. Employment service provider knowledge of service user assistance needs. 2010;13(1):22–9.

Table 1. Demographics of study sample

Table 2. Sample items from the Vocational Situational Assessment, the Behavioral Health Functional Assessment Battery, and the Behavior and Symptom Identification Scale-24®

Table 3. Means and standard deviations of measures