Abstract
Purpose:
Family support programs commonly use self-sufficiency matrices (SSMs) to measure family outcomes, however, validation research on SSMs is sparse. This study examined the reliability of the Colorado Family Support Assessment 2.0 (CFSA 2.0) to measure family self-reliance across 14 domains (e.g., employment).
Methods:
Ten written case studies were developed. Family advocates (n = 24) independently coded each case study on each domain of the CFSA 2.0. Intraclass correlation (ICC) coefficients were calculated to determine the level of agreement between participants. Rating accuracy was calculated by subtracting the correct score for each domain and case study from the score given by the raters.
Results:
ICCs ranged from .79 to .96. Across all domains and case studies, family advocates were 84.4% accurate. Allowing for minor deviations (off by 1 on the 5-point scale) shows 96% accuracy.
Discussion:
A well-constructed SSM can be used to collect reliable and objective data on family self-sufficiency.
Economic hardship, family stress, poor parenting, and family instability can have detrimental effects on children’s development (Benedetti, 2012; Sandstrom & Huerta, 2013). Significant adversity and toxic stress in a child’s early years can affect the developing brain and can have lasting implications for the mastery of cognitive, language, and social skills (Shonkoff, 2010). In 2013, 20% of children in the United States lived in poverty (U.S. Census Bureau, 2014), and social service institutions are struggling to cope with the rising levels of poverty and inequality (Hout et al., 2014). Many families are struggling to make ends meet, which can interfere with their ability to provide the supportive, nurturing environments in which children thrive. In low-income families, stressors are often compounded, as there are rarely excess resources to draw from in times of crisis (Mills & Amick, 2010). Fortunately, well-implemented prevention programs that have a two-generation approach targeting young children, parents, and families can result in positive outcomes many years later (Manning, Homel, & Smith, 2010). Two-generation approaches build human capital across generations by combining education or job training for adults with early childhood education for their children (Chase-Lansdale & Brooks-Gunn, 2014). Interventions are likely to be most successful when they consider the broader family, neighborhood, cultural, and community contexts (Daro & Dodge, 2009). Thus, community-based interventions that are able to strengthen vulnerable families and protect children from the negative impacts of extreme and prolonged stress have the potential to dramatically improve the well-being of children, families, and communities.
As a means to strengthen families and build healthy communities, several states support Family Resource Centers (FRCs) as a model of service provision in which families can access resources and services in a family-centered manner to address complex issues (Hubble, 2010). The model is designed to reduce fragmentation in social service delivery by providing coordinated, multiservice care. FRCs use a philosophy of strengths-based family development, in which a family advocate acts as a coach and systems navigator to support the family, as they set and meet their own goals. There are a core set of principles and premises that guide family support programs, including the premise that empowering families to build on their own strengths supports the healthy development of children (Harper Browne, 2014). Strengthening family assets leads to growth and development, instills confidence in one’s own skills, and fosters a sense of empowerment (Fernandez, 2004, 2007). This holistic process contributes to the ultimate goal of creating internal motivation for change (Walker, 2008). Although a focus on deficits tends to discourage participants, positivity in relationships with family support workers and a positive outlook helps to maintain high family participation and the effort needed to overcome the inevitable obstacles in making life changes (O’Brien-Strain, Gunther, Rosenberger, & Theobald, 2012). Furthermore, family development specialists are well positioned to provide customized services that help families make long-term life changes.
As society has shifted toward increased accountability for service organizations, family support programs are being asked to demonstrate their value through outcome-based standards. The use of measures that objectively assess family self-reliance is necessary to support valid and accurate measurement of family outcomes. However, in the field, successful measurement implementation depends on tools that are consistent with a program’s underlying philosophy, that are effectively integrated into service delivery, and that do not create unnecessary administrative burden for workers or families. For these reasons, family support organizations have been attracted to self-sufficiency matrices (SSMs) as a means to measure family functioning and stability. SSMs are designed to create a full picture of circumstances for diverse clientele and they provide dual purpose, serving as both a case management and outcome tool (Snohomish County Self-Sufficiency Taskforce, 2004). In practice, family advocates use the SSM to identify and discuss family assets and areas of need. The information is used to document initial functioning (baseline assessment) and to help guide family goal setting, resource and referral provision, and multiservice delivery. After goal setting, advocates follow-up with families and readminister the SSM to assess progress toward changes in self-reliance and further modify, as needed, family goals and the provision of services and supports. However, despite increasing use of SSMs as outcome measures (Culhane, Parker, Poppe, Gross, & Sykes, 2008), rigorous testing of the use of these tools in this capacity lags behind. The goal of the current study was to address this gap in the literature by testing the reliability of a well-constructed SSM-type family assessment.
Research on SSMs
To our knowledge, only one peer-reviewed article has documented evidence of reliability and validity of a SSM: The SSM-D, a Dutch-language SSM adapted from the Utah and Arizona SSM versions, was examined for its use with clients receiving mental health services (Fassaert et al., 2013). In this study, researchers compared scores on the SSM-D to the Health of the Nation Outcome Scale (HoNOS) and the Camberwell Assessment of Need Short Appraisal Schedule (CANSAS). Social workers, psychiatric nurses, and case managers completed the SSM-D, and different raters (psychologists or research associates) completed the criterion measures (the CANSAS or HoNOS) on the same clients. Through a factor analysis of the 11 domains tested, the researchers found that the SSM-D measured a single construct of self-sufficiency. Furthermore, scores on the SSM-D correlated as expected with the HoNOS and the CANSAS, demonstrating that clients who had higher self-reliance scores on the SSM-D had lower scores on the nationally recognized measures of “need for care.” The Dutch study, however, focused on clients receiving intensive mental health services and only examined 11 domains assessing individual-level, rather than family-level, functioning.
In addition to the peer-reviewed Dutch study, there are two publically available reports examining the reliability of SSMs. Specifically, Parker (2008) prepared a report for the Arizona Department of Economic Security Office of Community Partnerships and Innovative Practices, in which he presented findings of a factor analysis on a version of an SSM used in Arizona. He found a two-factor structure: one labeled economic self-sufficiency and the other labeled social–emotional self-sufficiency. The study also found significant improvements in self-sufficiency over time for families receiving services. However, the study did not test the tool on its ability to achieve consistent ratings across different program case managers (interrater reliability).
In another report, in California, Endres, Richardson, and Sherman (1999) tested the reliability of the family development matrix (FDM) model. They developed 10 written narratives based on case histories of program participants and had 25 workers score the narratives using the FDM at two points in time. Results identified some domains that were reliably administered and some that required revisions to improve reliability. This study was not submitted for dissemination in peer review and thus lacked critical methodological information to assess its merits. Nonetheless, it provides an important standard to which assessments can be tested for reliability.
Considering the dearth of scientific studies testing family support assessments, additional research is needed to demonstrate the utility of SSM-type measurement as a reliable and valid method of assessing family outcomes. To accurately measure family change over time, research needs to demonstrate that SSMs can be administered in a way that obtains accurate data across multiple family workers.
Development of the Colorado Family Support Assessment 2.0
The Colorado Family Support Assessment 2.0 (CFSA 2.0) was developed to provide a strengths-based, objective index of family functioning for use by the Family Resource Center Association (FRCA) member FRCs. It is based on the concept of a scaled outcome matrix that documents family functioning across several areas that contribute to self-reliance (e.g., income, housing, employment, transportation, and debt management). Each domain area is rated on a scale from “in-crisis” to “thriving,” using indicators at each level to guide ratings on the scale (Snohomish County Self-Sufficiency Taskforce, 2004; Richmond & Valentin, 2012). Matrices are frequently developed by local groups to reflect community-based indicators of self-sufficiency in each area. To develop the CFSA 2.0, researchers and program staff at the state and local level engaged in a collaborative process.
To begin the development process, researchers reviewed the scant research literature on reliability and validity studies of SSMs, examined the many publically available SSMs to learn how other communities were assessing family outcomes, obtained feedback from FRC staff on the version of the CFSA that was in use prior to the development of the CFSA 2.0, and reviewed national organizations’ materials with deep knowledge of domain areas (e.g., United States Department of Agriculture [USDA] definitions of food security). This information was used to develop a draft matrix that sought to maximize objectivity with what could realistically be obtained during an interview with a family. The draft tool was vetted and revised with input from FRC directors, family advocates, and other stakeholders to ensure relevancy and utility in the field.
The collaborative process led to a family assessment that consists of three components: Part A, the domain matrix; Part B, the Protective Factors Survey (PFS; FRIENDS National Resource Center, 2011); and Part C, family readiness to change and goal setting. Part A, the domain matrix, includes 14 locally relevant categories considered critical to family self-reliance (i.e., income, employment, housing, transportation, food security, child care, child education, adult education, cash savings, debt management, health coverage, physical health, mental health, and substance abuse). Each domain is rated on a scale from 1 to 5, with higher scores indicating greater self-reliance. For each domain, each indicator is defined for scoring (e.g., for employment, 1 = all employable adults in the family are not employed, 2 = at least one adult in the family has temporary or part-time employment and no adult has full-time employment, 3 = at least one adult in the family is employed full time and no adult has stable employment, 4 = at least one adult has full-time stable employment, and 5 = at least one adult has full-time stable employment and access to employer-based benefits). To facilitate accurate scoring, key definitions are provided at the beginning of domains (e.g., for employment, definitions are provided for employable, stable employment, benefits, and full-time status). In addition, each domain includes a not enough information (N/I) option for instances in which the family advocate is unable to obtain the required level of information to accurately code the domain. Three domains also include a not applicable (N/A) option: Employment is N/A when there are no employable adults in the family (e.g., a single parent on disability); child care is N/A when there are no children under 12 years old or the family is adequately able to care for children and does not need child care (e.g., by choice, one parent stays home to care for the children); and child education is N/A when all children in the family are not school aged or have earned a GED or high school diploma. The N/I category was not tested as a part of this study.
Parenting and social support, although commonly included in SSMs, were not included in the domain matrix because the CFSA 2.0 development committee members believed that these areas would be too challenging to assess objectively through the matrix format. Rather, the PFS (FRIENDS National Resource Center, 2011) was included in the tool in its entirety in Part B, along with a single-item question on relationship safety. The PFS assess five factors that protect against child abuse and neglect (family functioning/resiliency, social support, concrete support, nurturing and attachment, and knowledge of parenting/child development). Because the PFS has demonstrated reliability and validity (Counts, Buffington, Chang-Rios, Rasmussen, & Preacher, 2010), the current study only examined the reliability of the domain matrix, Part A, of the CFSA 2.0.
The Current Study and Hypotheses
The current study was designed to fill a gap in the research literature by testing whether a well-constructed SSM can objectively and reliably measure family self-reliance, which is a necessary condition to support its use as an outcome tool. Current SSMs have not been adequately tested to determine whether two different family workers would score the same family in the same way. To test the objectivity of the tool’s construction, we used Endres and colleagues (1999) evaluation as a guide and designed a study in which family advocates scored written case studies on each domain of Part A of the CFSA 2.0. This approach ensures that each family advocate receives the exact same information when using the tool. High reliability estimates provide evidence that the domain matrix is sufficiently objective to collect consistent information across family advocates. Evidence of high reliability in a controlled environment will lay the groundwork needed in the future for a more resource-intensive study examining the psychometrics of the tool when used in the field.
We tested the following hypotheses: (1) family advocates will rate case studies similarly to each other on each domain of the CFSA 2.0 and (2) family advocates will be accurate in their ratings, compared to the advisory group who developed the tool. We also explored whether family advocate characteristics predicted rating accuracy.
Method
Study Population and Recruitment
The study population consisted of staff providing family development services in Colorado FRCs. In state statute (Colorado General Assembly, 2009), FRCs are tasked with “providing comprehensive, intensive, integrated, and collaborative state and community-based services.” Each of the 24 FRCs that are members of the FRCA provided a list of staff currently providing support services to families. Using this list, researchers sent an e-mail invitation to approximately 120 FRC staff. The invitation provided a brief description of the study and requested that participants be available at two preset times for data collection. Some individuals indicated that they were unavailable during the study window and others indicated that they would not have time with their current work schedules. In the end, 31 individuals expressed interest in participating and 24 consented and completed the study protocols.
The 24 participants represented 12 FRCs across urban and rural regions of the state. Table 1 provides descriptive information on participant sociodemographic characteristics. In brief, most participants were female, more than one third had a bachelor’s or master’s degree, about one fifth were of Hispanic/Latina origin, most were White, and about 15% spoke Spanish as their primary language. Participants were between 30 and 72 years old, with a mean age of 42 (SD =12.5). There was variability in length of employment at the FRC and length of time working with families. Similarly, participants varied in the number of CFSAs (version 1.0) that they administered in the month prior to participation. Due to lack of information on nonparticipants, we were unable to identify the degree to which participants reflect the larger population of family support workers.
Participant Demographic Characteristics as a Percentage of the Sample.
Note. FDC = family development credential.
Most (71%) participants had received their Strengthening Families Certificate (California Network of Family Strengthening Networks, 2014) and more than two-thirds had their Family Development Credential (FDC; Forest, 2003). The Standards of Quality for Family Strengthening & Support Training Certificate prepares social service providers with an understanding of each protective factor that helps build parental resilience and strengthen family support systems, while offering concrete programmatic strategies for implementation. The FDC is a research- and evidence-based family support training and credentialing program developed in 1996 at Cornell University and has expanded to 17 states and Washington D.C. The 90-hr training covers core competency areas for social service providers such as effective communication skills, strengths-based case management, cultural responsiveness, goal setting, home visitation, and facilitated family meetings. Both trainings help to equip family support staff with the tools, techniques, and resources needed to work with families in a strengths-based, family-centered approach.
Development of Written Case Studies
To support a rigorous test of the CFSA 2.0 domains, we sought to develop case studies that (1) provided sufficient information for accurate coding of each of the domains, (2) contained situations that were realistic and consistent with families seeking services in Colorado FRCs, and (3) captured sufficient variation so that each case study represents a different combination of problems and assets and defines a unique case for evaluation by participants.
To achieve these objectives, we first obtained the original 10 case studies used in California evaluation (Endres, Richardson, & Sherman, 1999). The 10 case studies were developed from authentic case histories of program participants from agencies in California participating in the study and contained a diverse set of family circumstances. FRCA staff reviewed the 10 case studies to verify that each scenario reflected the situation of a family that would be served by a Colorado FRC. FRCA staff determined that 9 of the 10 studies reflected situations similar to those of Colorado families receiving FRC services. One case study was deemed unsuitable for the study because it was a high-need individual who was not caring for children. To find a replacement, one of the advisory team members created a new case study based on a family previously served at a Colorado FRC.
The nine California cases studies were adapted to meet study goals. Specifically, a team of two FRCA staff, two researchers, and one family advocate reviewed and revised case studies using the following process: (1) each case study was organized to mirror the CFSA 2.0 interview progression so that the case studies provided information in the same order it would be obtained during the family interview; (2) California-specific language was replaced with Colorado-specific language (e.g., references to public assistance programs); (3) extraneous information that was not relevant to CFSA 2.0 coding was removed; (4) additional content was developed when key information was missing (e.g., “ … Cathy borrowed some money from the older children’s father that she is unable to pay back” was created for a case study in which no information on debt management was provided); (5) case study language was reviewed to ensure sufficient separation from CFSA 2.0 descriptors (e.g., one of the employment indicators in the CFSA 2.0 states “at least one adult in the family has temporary or part-time employment.” Thus, in one case study, we changed original wording of “temporary employment” to “Jerry works construction jobs when they are available, but it can be slow during the winter months”); and (6) as needed, case studies were revised so that each domain rating was captured at least once in a case study.
After the 10 case studies were developed, researchers examined the coding of each case study across each domain to determine whether the set of studies contained sufficient variability for a strong test of each of the domains. First, researchers calculated the percentage of responses across all domains and cases with domain codes of 1 (25%), 2 (18%), 3 (22%), 4 (11%), 5 (20%), and N/A (4%). Maximum variability would be 20% for each category (with no N/A ratings). Similarly, an even distribution of scores of 1–5 would yield a mean of 3 and an standard deviation (SD) of about 1.5. Looking across case studies and all domains, the mean was 2.85 and the SD was 1.49. Finally, we correlated scores for each of the domains with scores for the other domains across all case studies (N = 10). Some domains had high correlations, but we determined that this reflects realistic variability as problems occur in clusters in real life—fully randomizing characteristics within a case study would be unrealistic. In addition, given that high, positive correlations with some domains were balanced by negative correlations in other domains, the case studies avoided simple scenarios in which all domains were scored very high or very low. Based on these checks, we determined that the degree of variability was sufficient weighed against further revisions to the case studies.
As a final test, two family advocates and one FRCA staff member, who were not involved in case study development, reviewed the case studies. One advocate reviewed each case study and rated the degree to which she perceived the case study to be realistic (very, somewhat, and not at all) and why. The other two individuals each coded three case studies to test the data collection protocols. Few changes were needed at this point. Some minor edits were made to further refine and improve case study relevance, without adjusting the rating codes. These three individuals were excluded from further participation in the study.
Data Collection
Researchers hosted two web-based data collection sessions, 1 week apart in September 2014, during which participants independently read and scored 10 case studies (four case studies were coded in the first webinar and six were coded in the second). Participants were asked to complete a demographic survey prior to the first webinar. During the first webinar session, researchers provided a brief training (approximately 30 minutes) on scoring the CFSA 2.0. After the training, participants read and scored four case studies while researchers remained available on the webinar to answer questions. The second webinar session was similar to the first, with the exception of the training, and included coding the six remaining case studies.
Participants received a US$100 gift card at study completion. The study received Exempt status from the University of Colorado, Institutional Review Board.
Analysis
Intraclass correlation (ICC) coefficients were calculated to identify the degree to which family advocates agreed with each other on case study ratings. For the ICC calculations, the data file was structured so that the 24 raters of the 10 case studies were compared separately for each of the 14 domains. ICC values of .60–.74 are considered good, and values between .75 and 1.0 are considered excellent (Hallgren, 2012). For the three domains that include the N/A option (employment, child care, and child education), we excluded N/A responses from the ICC calculations because the code of N/A does not fit the 1–5 scale. However, we separately examined the appropriate and inappropriate use of the N/A code.
The advisory committee identified the correct score on each domain rating for each of the 10 case studies. Participant accuracy ratings were calculated by subtracting the correct score for each domain and case study from the score given by the participants. Using this method, the most accurate value is zero—no difference between the family advocate rating and the correct rating. A positive value means the rater scored the case study too high on the domain, whereas a negative value means the rater scored the case too low on the domain. Values of 1 or −1 reflect small error, but values of 2 or more or −2 or less reflect more serious deviations from the correct score. Given 14 domains by 10 case studies, each rater provides 140 scores. With 24 raters and 140 scores, the maximum sample size for the accuracy measure equals 3,360. When a case study did not apply to a domain, such as when the correct response for child care or child education was N/A, we present a separate accuracy analysis.
Finally, to investigate whether family worker characteristics predict accuracy, we first computed the mean error for each participant (N = 24). Then, the absolute value of the error served as the outcome variable. We used the absolute value because the direction of the error is not relevant. A score of 0 is best and each higher score reflects greater error. The analysis then used the mean absolute error to calculate bivariate correlations and multiple regression coefficients for the sociodemographic variables (with listwise deletion of missing data on the sociodemographic variables, the N drops to 22).
Results
Table 2 lists the ICC coefficients for each of the 14 domains, in order from highest to lowest. The results show high interrater reliabilities that are all in the excellent range. The case study Ns are reduced for the three domains in which the N/A code was assigned (for results of N/A analyses, see subsequently). The domains with the lowest reliabilities—although still in the excellent range—are transportation, adult education, housing, food security, and physical health. Domains with the highest reliabilities include debt management, cash savings, and health coverage.
Interrater Reliability by Domain.
Note. ICC = intraclass correlation.
Accuracy analyses are presented in Table 3. Focusing on scored ratings of 1–5 across the 14 domains, 84.4% of the ratings are correct (error score of 0) and 95.8% are within one category of the correct score (−1 or 1). Only 4% of the ratings err by two or more categories. The errors are symmetric rather than systematic, with positive errors about as common as negative errors. The maximum number of ratings is 3,360, but excluding those assigning an N/A, the N falls to 3,215.
Distribution of Errors in Case Study Ratings.
Separate analysis of the use of the N/A code shows similarly high accuracy. The N/A code can be used correctly or incorrectly. It is used correctly when the case study has no relevance to the domain (e.g., the child is too young for school). Incorrect usage of the N/A code occurs when it is wrongly used for a case study with information to assign a score or when it is not used for a case study with no information to assign a score. Of the 154 ratings that involved use of the N/A response, 83.1% were used correctly and 17% were used incorrectly. The accuracy rate is quite similar to the 84.4% accuracy for the scores.
Examining errors across domains in Table 4 shows that raters did poorest in correctly scoring mental health (70% correct), transportation (71% correct), and physical health (77% correct). All other domains were 80% accurate or higher. Scores on transportation tended to be too high, scores on mental health tended to be too low, and scores on physical health were balanced between too low and too high. Note that the accuracy of the scores for each domain does not correspond exactly to the ICC measures of interrater reliability. For example, the ICC for adult education is comparatively low, while the accuracy rate is relatively high. In this case, the errors are few but are dissimilar across raters. Conversely, the ICC for mental health is relatively high, while the accuracy is relatively low. In this case, it appears that the raters are similar to each other in the errors they make.
Distribution of Errors by Domain.
Table 5 first shows the correlations between sociodemographic characteristics and accuracy for variables with sufficient variation to be included in the analyses (there are too few males, minorities, and Spanish speakers to include gender, race, ethnicity, and language). Most correlations do not reach statistical significance (p < .05), but those with a FDC have significantly lower errors, whereas those with a high count of CFSAs in the last month show greater error. However, the positive correlation with the count variable stems from a single outlier and disappears when deleting the outlier (results not presented). The multiple regression results listed in Table 5, which exclude the measure of the length of time working with families because it is nearly identical to the length of time of FRC employment (r = .96), show much the same as the correlations. FDC lowers error, while the count of CFSAs increases the error due to a single outlier. In addition, a higher level of education reduces error, when examined with other variables in the model.
Bivariate Correlations and Standardized Multiple Regression Coefficients of Sociodemographic Characteristics With Mean Absolute Value of Rater Error.
Note. CFSA = Colorado family support assessment; FRC = Family Resource Center.
*p < .05.
Discussion and Applications to Practice
This study subjected a locally developed SSM to a systematic test of interrater reliability. Reliability is a necessary requirement of measurement validation, and this study suggests that SSMs can be constructed to capture reliable information across a wide range of family support workers. These findings are important to the social work field, as program funders are increasingly demanding evidence of program impact. Measures that are locally relevant, that can be realistically implemented in the field, and that have strong psychometric properties are critical for family support organizations to measure and document family-level program outcomes.
As hypothesized, results of this study indicate that the CFSA 2.0 is constructed so that family advocates will obtain consistent and objective ratings of family self-sufficiency across the 14 areas examined. We examined both the consistency across coders (ICCs) and the accuracy of coders compared to a consensus rating determined by an advisory group. Across both tests, the CFSA 2.0 achieved strong psychometric properties—all ICCs were in the excellent range, supporting agreement across family support workers, and workers achieved an exact accuracy rate of 84% overall. Allowing for some deviation (off by 1 on the 5-point scale) shows 96% accuracy, suggesting that, provided sufficient information, workers will achieve accuracy when using the tool. Thus, findings support the reliability of the CFSA 2.0.
Findings also suggest the value of using case studies as a training mechanism to “certify” family advocates after service training and prior to administration of the CFSA 2.0 with families. In the study, a brief web-based training was sufficient for accurate coding of the CFSA 2.0—most family advocates achieved high accuracy compared to the advisory group ratings. This supports the idea that the CFSA 2.0 is objective, easy to administer, and does not rely on extensive training to achieve accurate ratings. However, supplemental analyses indicate that three workers did not achieve an overall 80% accuracy rate and highlight the potential of using the case studies as a means to check whether family advocates are demonstrating a sufficient understanding of how to use the tool before using it with families. In addition, researchers received anecdotal feedback from family advocates that scoring the case studies was a beneficial exercise and would be a useful component of training protocols.
Training in providing family-centered services may also support strong implementation of the assessment. In particular, results of sociodemographic predictors indicated that having a FDC was associated with more accurate coding. Thus, FDC status may not only be important to providing strong family support services but it may also help with coding of the CFSA 2.0. In addition, we found that one participant who reported administering a high number of the prior version of the CFSA had high error. When this participant was removed from the analyses, the association between the number of prior CFSAs administered and coding error disappeared. Thus, we do not consider this finding robust, and it is not clear, at this stage, whether the amount of experience using a prior tool is a hindrance to accurate coding. Still, it is worth considering the possibility, as is often the case with use of instruments like this one, that raters become more mechanical, and potentially less careful, after long-term use. In addition, it may be that those with heavy caseloads are less careful when administering assessments. Monitoring the number of families advocates are serving, and supporting reasonable case loads, may also help ensure careful administration of assessments.
Despite strong results, findings from domain-specific analyses and anecdotal feedback offered opportunities to further improve the tool. Specifically, compared to the advisory group rating, three domains achieved lower than 80% accuracy—transportation, mental health, and physical health. The advisory committee reconvened, reviewed these three domains in particular as well as reviewed study participant feedback. Using this information, the committee recommended minor modifications to the tool, including additional definitions to provide further clarity and guidance.
A next step in the CFSA 2.0 validation process is to test the psychometric properties of the tool when used by family advocates in their work with families. Specifically, the current study was unable to examine family advocates’ ability to obtain, during the family interview, the necessary information for accurate coding. Training on how to ask guiding questions in a supportive manner that facilitates the provision of accurate information will be critical for successful implementation of the tool. In addition, while care was taken to develop case studies that reflect complex and real-world family conditions, family circumstances often are multifaceted and nuanced, and family members may describe their circumstances in sometimes vague or inconsistent ways. The tool needs to be tested in its ability to measure family situations across a broader range of circumstances and in real-life situations than the 10 provided in the case studies. Future tests should also examine the sensitivity of the CFSA 2.0 to measure changes in family functioning over time. Nonetheless, this study demonstrates that the domain index is well constructed and sufficiently objective to obtain reliable information, thereby laying the groundwork for the next validation phase of field testing.
Footnotes
Acknowledgment
We thank Jerry Endres, MSW, Family Development Matrix, for supplying the California case studies, and Kristy Beachy-Quick, MPP, for her work on early versions of the CFSA 2.0. We also thank the family advocates for their participation in the study and their dedication to supporting Colorado families.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded by the Family Resource Center Association.
