Abstract
Using Item Response Theory (IRT) and Confirmatory Factor Analysis (CFA), the goal of this study was to select a reduced pool of items from the French Canadian version of the Self-Directed Search—Activities Section (Holland, Fritzsche, & Powell, 1994). Two studies were conducted. Results of Study 1, involving 727 French Canadian students, showed that the psychometric qualities of the 66-item French Canadian version are equivalent to those of the original English version. Based on IRT and factor loadings derived from a CFA, 24 items were selected from the original 66 items. In Study 2 (n = 339 French Canadian young adults), we tested and obtained support for the construct validity of the 24 selected items using CFA and correlational analyses among interests’ dimensions. We concluded that the selected pool of items accurately captured Holland’s theoretical framework and showed adequate psychometrics qualities and construct validity.
Making a career choice is an important step in human development (Gati & Asher, 2001; Vondracek & Porfeli, 2003), given the significant consequences that it can have on personal and professional lives (Guay, Ratelle, Senécal, Larose, & Deschênes, 2006). Vocational interests, which have been found to be important determinants of career choice (Lent, Brown, & Hackett, 2002; Osipow & Fitzgerald, 1996), are defined as patterns of personal likes, dislikes, and indifferences regarding various career-relevant activities (Lent et al., 2002). Multiple inventories have been developed to assess vocational interests (Guglielmi, Fraccaroli, & Pombeni, 2004; Hood & Johnson, 2007; Kapes, Mastie, & Whitfield, 1994), most of which are based on Holland’s theory of vocational choices (Holland, 1992, 1997).
Holland’s theory (1992, 1997) states that most people can be classified according to six personality types: Realistic, Investigative, Artistic, Social, Enterprising, and Conventional (RIASEC). Each of these types is characterized by specific beliefs, values, abilities, and interests. These six personality types are theoretically situated on a hexagon that represents the expected relations among them (Holland, 1997). For example, two adjacent types on the hexagon, such as Realistic and Investigative, share more characteristics than two opposite types, such as Artistic and Conventional.
The Self-Directed Search ([SDS] Holland, Fritzsche, & Powell, 1994), based on Holland’s theory, is one of the most widely used interest inventories because it can be self-administered and self-interpreted, and the scoring does not require electronic tools (Spokane & Holland, 1995). In addition, the factorial structure of the SDS obtained empirical support (Khan & Alvi, 1991; Oosterveld, 1994). Finally, the SDS has been translated in 20 different languages (see Glidden-Tracey & Greenwood, 1997; Muck, 2005; Yang, Lance, & Hui, 2006 for examples), with one of the versions being a French Canadian one (Holland, 1991).
Like its original version, the French Canadian version of the SDS contains 228 items divided into 4 sections (Activities, Competencies, Occupations, and Self-Estimates) measuring several career choice determinants. Vocational interests are measured by the Activities section of this inventory, in which participants indicate whether they would like to do each of the 66 proposed career activities. These activities can all be categorized according to the six personality dimensions. Summing the scores obtained on each dimension yields a measure of the participant’s vocational interest profile.
Counselors usually have sufficient time to administer the SDS-Activities section to the counselee. Unfortunately, when it comes to research, investigators are usually limited by time such that they cannot administer lengthy inventories like the SDS, in addition to assessing other important constructs. Because researchers generally aim to further their understanding of the determinants or consequences of career interests, they need to include other measures in their survey. Knowing that longer surveys are expected to affect the response rate and amount of missing data (Graham, 2009), researchers often seek out a reduced pool of items with adequate psychometric properties. Hence, with 66-item, the SDS-Activities scale can be relatively long for research purposes. Despite the fact that using a restricted pool of items from a longer scale typically tends to yield worst psychometric qualities than longer versions, they have the advantage of diminishing item redundancy, which in turn reduces participants’ fatigue, frustration, and boredom (Robins, Hendin, & Trzesniewski, 2001). Furthermore, they can be considered to have acceptable, albeit not as high, psychometric qualities (e.g., factor loadings above .50, Cronbach’s αs .70 and higher, etc.). In fact, selecting a pool of items from an original scale has gained in popularity over the past few years. Notably, this is the case for the Inventory of Children’s Individual Differences (Deal, Halverson, Martin, Victor, & Baker, 2007), the Big Five Factor (Gosling, Rentfrow, & Swann, 2003; Muck, Hell, & Gosling, 2007; Rammstedt & John, 2007) and the Dyadic Adjustment Scale (Sabourin, Valois, & Lussier, 2005).
The Present Research
The main goal of the present research was to select a reduced pool of items from the French Canadian version of the Self-Directed Search—Activities section (Holland, 1991). The selected pool of items has to be small enough to allow a rapid assessment of vocational interest but also have sufficient items to ensure more accurate parameter estimates and greater reliability (Marsh, Hau, Balla, & Grayson, 1998). Based on Marsh et al.’s (1998) recommendation that a factor should include a minimum of 4 items, our goal was to select a total of 24 items (i.e., 4 items for each RIASEC dimension). This goal was pursued in two studies. Study 1 comprised two steps: (a) we verified whether the psychometric properties of the 66-item French Canadian version were equivalent to those of the English version, and (b) we selected a pool of items from the 66-item French Canadian version using Item Response Theory (IRT) and factor loadings from a Confirmatory Factor Analysis (CFA). Study 2 comprised three steps: (a) we tested the construct validity of the selected pool of items via a CFA, (b) we examined whether the factorial structure of the selected pool of items was invariant across gender (see Guglielmi et al., 2004; Turner et al., 2008), and (c) we verified if the correlations among subscales of this pool were comparable to those of the 66-item English version, as reported in the SDS technical manual (Holland et al., 1994).
The advantages of this research are twofold. First, it will provide useful information on the cross-cultural validity of the French Canadian version of the SDS, which, to the best of our knowledge, has never been assessed. Second, the validity and reliability of the selected items could be further tested by researchers working with the 66-item English-speaking populations as well as those speaking other languages (e.g., Spanish, Arabic, Vietnamese, and Chinese; Glidden-Tracey & Greenwood, 1997; Muck, 2005; Yang et al., 2006 for examples), especially when researchers do not have enough time to administer the full-length version.
Study 1
Method
Participants
A total of 727 French Canadian students (211 men, 511 women, and 5 unspecified) participated in the study. Participants’ mean age was 21.43 years (SD = 4.11; range = 16−51 with 38 participants being over 30). Of the 727 participants, 490 were university students and 237 attended college. In the province of Quebec, high school graduates can attend college in either a 2-year pre-university program or a 3-year technical program, leading to the labour market. A total of 0.08% of college students had academic achievement scores above 90%, 91.9% had scores between 70% and 80%, and 8.02% had scores lower than 70%. Among university students, 8.3% had academic achievement scores above 90%, 87.1% had scores between 70% and 80%, and 4.6% had scores lower than 70%. Overall, 94.3% of the participants were born in the province of Quebec, 1.4% in another Canadian province, and 4.3% in another country. A total of 97.2% of them spoke French as their first language, 1.1% had English as a first language, and 1.7% spoke another language. Half of the college students were in their third semesters (50.6%), while the remaining (49.4%) were in their first, second, fourth, fifth, or sixth semesters. Among university students, 27.8% were in their first semester, 29% in their third semester, whereas 43.2% were in their second, fourth, fifth, sixth, seventh, eighth, or ninth semester. In terms of domain, 45% of college students studied social sciences or humanities, 20.5% arts, and 34.5% other areas. Among university students, 35.6% majored in psychology, 21.8% in school guidance and counselling, and 42.6% in other programs. A total of 34.5% of the participants lived with both parents, 23.1% lived with one parent, and 42.4% lived away from their family (35.2% in apartment and 7% in residence). Finally, the mean family income ranged between Canadian $40,000 and $49,999 CAN. On average, participants’ mothers had at least a high school diploma and fathers, a college degree.
Procedure
Research assistants took appointments with 3 college teachers and 2 university professors in order to ask students in their classes to partake in a study on vocational development. Research assistants explained to students the goals of the study and asked them to raise their hands if they wanted to participate. Those who accepted received a consent form and a questionnaire. Those who declined were asked to work quietly. No compensation was offered to students for their participation.
Measures
Vocational interests
The Activities section of the French Canadian version of the SDS was used to assess students’ vocational interests. The scale asks participants to indicate whether they would like to engage in each of the 66 proposed career-related activities. The original version includes 11 activities per dimension of the RIASEC. Examples of activities include paint (Artistic), teach (Social), and sell (Enterprising). In the original version, participants use a Yes−No response format. In order to increase the variability in participants’ answers (Hogan, 2007), we modified the original scale—with the approval of Psychological Assessment Resources, Inc.—to a 5-point Likert-type scale ranging from I really don’t like or wouldn’t like to do this activity (1) to I really like or would like to do this activity (5). Specifically, for research purposes it is better to use a 5-point scale instead of a Yes−No forced choice scale since this increases the variability of the scores, which can increase reliability estimates (Hogan, 2007). Using such a response format can also increase the probabilities of obtaining better fit indices in subsequent CFA analyses.
Statistical Analyses
IRT
IRT (see Embretson & Reise, 2000) is a type of nonlinear factor analysis and can be used to complement traditional scale development methods when selecting an optimal sample of items from a larger pool (Kristjansson, Breithaupt, & McDowell, 2001). One of the goals of IRT is to estimate the probability of endorsing an item as a function of a latent variable (θ). This probability is estimated by a curve (Hambleton, Swaminathan, & Rogers, 1991; Santor, Ramsay, & Zuroff, 1994). In our study, the level of interest for each RIASEC dimension represents the latent variable (θ). Several models of IRT exist, but with the use of Likert scales, the Graded-Response Model (GRM; Samejima, 1969, 1997) was deemed the most appropriate (Embretson & Reise, 2000; van der Linden & Hambleton, 1997). In this study, we used the Excel Item Response Theory assistant (EIRT) program (Germain, Valois, & Abdous, 2007) to compute the parameter estimates under the GRM.
GRM was first used to estimate which items best differentiated participants’ answers as a function of their respective interest level. First, we examined the item slope (ai ) parameter, which indicates the location of the category response curves (CRCs). The higher the ai , the more response categories should differentiate among interest levels. We also evaluated the bij parameters (one for each CRC) and their standard errors. The bij parameters represent a given interest level as a function of the maximal probability of choosing each response option. A good item will have bij parameters that match their respective response categories and a low error level. We also used item informativeness (Ii ) to assess the accuracy of the interest level estimation (Baker, 2001). When the amount of information is large, the latent trait level (interest) is more accurately estimated (Baker, 2001). Informativeness can be computed for each latent trait level and represented graphically as an item information curve (IIC).
Structural equation modeling (SEM)
We also used SEM (EQS program, version 6.1) to perform a CFA in order to select items. The adequacy of the CFA solution was assessed with different goodness-of-fit criteria, notably the chi-square. With large sample size, the chi-square is usually significant. For this reason, we also evaluated model goodness of fit with other fit indices such as the Comparative Fit Index (CFI), Non-Normed Fit Index (NNFI), Root Mean Square Error of Approximation (RMSEA), and the chi-square-to-degree of freedom ratio (χ2/df). The NNFI and CFI vary along a 0-to-1 continuum in which values greater than .90 are considered acceptable (Schumacker & Lomax, 1996). Browne and Cudeck (1993; also see Jöreskog & Sörbom, 1993) suggested that RMSEA values lower than .05 are indicative of a close fit and values up to .08 represent reasonable errors of approximation. There is no clear guideline about the value of the χ2/df, but usually a value lower than 3 reflects acceptable goodness of fit (Kline, 1998).
Missing data
A total of 700 participants completed all items of the questionnaire. Only 26 participants (3.58%) had missing values. We used the Little’s Missing Completely at Random (MCAR) test to verify whether the pattern of missing values was completely at random (MCAR). This test is based on a chi-square distribution with F degrees of freedom. If the test is nonsignificant, the pattern of missing values is MCAR. The Little’s MCAR test, χ2 = 1569.079, df = 1533, p = .26, was nonsignificant, thereby justifying a listwise deletion of missing cases. Specifically, a listwise deletion of cases produces adequate parameters’ estimates under MCAR (see Peugh & Enders, 2004).
Results
Is the 66-item French Canadian version equivalent to the 66-item English version?
To test whether the 66-item French Canadian version was equivalent to the English version, as presented in the SDS technical manual (Holland et al., 1994), we compared reliability coefficients for both gender and correlations among RIASEC dimensions. Kuder-Richardson-20 coefficients (KR-20) were used with the American sample because the response format was dichotomous. In the present study, we used Cronbach’s coefficients because responses used a Likert-type format. For more information regarding these coefficients, see Kaplan and Saccuzzo (1993). As can be seen in Table 1, all reliability coefficients for English and French versions were acceptable (i.e., higher than .70; Nunally, 1978). To compare reliability coefficients, we use Feld’s test for independent samples, W = (1 − α2/1 − α1), where W can be interpreted as an F statistic with (N 1 − 1), (N 2 − 1) degrees of freedom; Feldt, 1969). Also, because we conducted several tests on the same data (6 tests for each gender), all p values were adjusted using a Bonferroni correction (Holm, 1979). Overall, we obtained three significant differences between reliability coefficients of the French version and those of the English one with those of the French version being highest.
Study 1: Comparisons of Reliability Coefficient Between the 66-item English and French Versions
Note: Kuder-Richardson-20 coefficients for the 66-item English version were reproduced by special permission of the Publisher, Psychological Assessment Resources, Inc., from the Self-Directed Search Form R Assessment Booklet by John L. Holland, Ph.D., Copyright 1970, 1977, 1985, 1990, 1994, 1995. Further reproduction is prohibited without permission from PAR, Inc.
* p < .008; Two-tailed test.
a N = 399.
b N = 211.
c N = 716.
d N = 511.
Next, we compared correlations among the RIASEC dimensions of the French version to those presented in the SDS technical manual using a Z score difference and a Cohen’s q effect size (Cohen, 1977; Rossi, 1985). The Z score tests whether the difference between two correlation coefficients is significant, whereas the Cohen’s q effect size specifies the magnitude of this difference. Because we computed several tests (15 tests for each gender), we applied a Bonferroni correction to adjust the significance level (α/n; Shaffer, 1995). The adjusted significance level was therefore set at p < .003 (.05/15 = .003). Of the 30 correlations, 9 (30%) were significantly different (see Table 2) although the effect size of these differences where small for most of them (only 2 were moderate). No large effect size was observed. In addition, few of the significant differences were problematic in light of Holland’s theory. Specifically, most significant differences among correlations paralleled the organization of the six personality types on the hexagon where distant types of interest were more weakly correlated than proximal ones. However, two significant differences between correlations appeared problematic for the French version. Specifically, the S-E and S-C correlations for the men sample were, respectively, .36 and .22 in the English version (r = .05; p = .45), but only .05 and −.07 in the French version (r = −.07; p = .31). Nevertheless, we conclude that overall the 66-item French Canadian version is relatively equivalent to the English version in terms of reliability and correlations, thereby allowing the possibility to shorten the 66-item French version.
Study 1: Comparisons of Correlation among RIASEC Dimensions for the 66-item English and French Versions
Note: Correlations for the 66-item English version were reproduced by special permission of the Publisher, Psychological Assessment Resources, Inc., from the Self-Directed Search Form R Assessment Booklet by John L. Holland, Ph.D., Copyright 1970, 1977, 1985, 1990, 1994, 1995. Further reproduction is prohibited without permission from PAR, Inc.
R = Realistic; I = Investigative; A = Artistic; S = Social; E = Enterprising; C = Conventional.
a N = 1002.
b N = 210.
c N = 1600.
d N = 511.
* p < .003.
Which items should be selected?
To determine which items to select, we used three parameters of the GRM, namely ai , bij , and Ii , as well as factor loadings from a CFA. In the first step, we used the GRM parameters to choose items that best differentiated the participants in terms of interest level. The item slope (ai ) parameter was first examined. As mentioned above, the higher the ai , the better response categories differentiate the interest level. Thus, based on Baker (2001), we kept items with ai > 1.35, which corresponds to a high differentiation level. We also used the bij parameters and their standard errors. We kept items with bij parameters having an adequate location on the interest continuum and the lowest standard error (<.30). Following a joint analysis of the ai and bij parameters, we eliminated 23 items. Table 3 presents the items that were dropped in this first step. Next, we examined participants’ IICs for the 43 remaining items and kept, for each dimension, the 5 items that most accurately estimated interest levels. We thereby eliminated 13 additional items (see Table 4).
Study 1: ai Parameters for the 66 Option Characteristic Curves and bij Parameters and Standard Errors of the 66 Category Response Curves
Note. Crossed-out items are the ones deleted from the entire pool of items.
Study 1: Maximal Informativeness of the 43 IIC
Note. Because item reproduction is prohibited without permission from PAR Inc., the labels of the items cannot be written. Nevertheless, their numbers are presented in the order in which they appear in the Activities section of the Assessment booklet, which can be purchased from PAR Inc. Crossed-out items are the ones deleted from the entire pool of items.
In the second step, we used a CFA to examine the factor loadings of the 30 remaining items, Satorra-Bentler (S-B) χ2 = 1 578.43; df = 391; S-B χ2/df = 4.04; NNFI = .87; CFI = .89; RMSEA = .07. The low values of these fit indices suggest that some items were problematic. In line with our goal to create a 24-item scale, we selected the 4 most optimal items to assess each dimension. The 6 items with the lowest factor loadings were thus eliminated (see Table 5). In sum, based on the parameters ai , bij, and Ii and the factor loadings, we selected 24 items that best captured the s6 vocational interests’ dimensions.
Study 1: Factor Loadings of the 30 Items From the CFA Solution
Note. Because item reproduction is prohibited without permission from PAR Inc., the labels of the items cannot be written. Crossed-out items are the ones deleted from the entire pool of items.
Summary
Results showed that the 66-item French Canadian version of the Activities section of the SDS is relatively equivalent to the original English version in terms of reliability and correlations. In addition, using IRT and CFA analyses, we selected 24 items. However, we have yet to determine whether the factor structure of these selected items is adequate, whether it is invariant across gender, and whether the correlations among the subscales are in line with Holland’ theory. These will be the goals of Study 2.
Study 2
Method
Participants
A total of 338 French Canadian young adults (95 men; 243 women) agreed to partake in this study. Of these participants, 252 attended college, 58 attended university, and 28 had a full-time job. Participants’ mean age was 19.82 years, SD = 2.91, range = (16−44), with 5 participants being over 30). Among college students, 24.2% were in humanities or social sciences, 19.8% in natural sciences, 7.1% in arts, and 48.8% in other domains. Among university students, 27.5% were in a teacher education program, 17.2% in administration, 12.3% in engineering, 10.3% in a business program, and 32.7% were in other programs. A total of 6.1% of college students had academic achievement scores above 90%, 82% had scores between 70% and 80%, and 11.9% had scores lower than 70%. For university students, 6.9% of them had academic achievement scores above 90%, 81% had scores between 70% and 80%, and 12.1% had scores below 70%. For the 28 participants who worked full time, 82.1% had a college degree, 14.3% a high school diploma, and 3.6% a bachelor’s degree.
A total of 52.2% of the students lived with both parents, 19.9% lived away from their family (18.1% in apartment and 1.8% in residence), and 27.9% lived with one parent. Overall, 96.7% were born in the province of Quebec, 1.5% in another Canadian province, and 1.8% in another country. Most of them (98.2%) spoke French as their first language (0.9% English and 0.9% another language). Parents’ average annual income was between Canadian $40,000 and $49,999 CAN and the majority of mothers (25.7%) and fathers (21.3%) had at least a college degree.
Procedure
Participants for this study were a subsample from a larger study on vocational development. All participants from this larger sample (n = 975) were contacted by phone and asked to participate in the present study, which consisted in completing an online survey that took approximately 20 minutes to fill out. They were told that 25 pairs of movie tickets would be drawn at random among those who had completed the survey. Upon participants’ acceptance, we gave them an ID and a password to access the online survey. Additional phone calls were made to remind participants to complete the survey. The 338 participants who accepted to participate in the study had no missing values (they answered all items of the survey). This could be explained by the fact that participants were unable to submit their answers if they had not completed all items.
Statistical Analyses
As in Study 1, we performed a CFA but, here, we tested the invariance of the CFA solution across gender. We first computed separate covariance matrices for men and women. The minimal condition of factorial invariance is the invariance of the factor loadings (Model 1). Thus, separate tests were conducted to test the invariance of the factor loadings (Model 2) as well as the invariance of variances and covariances (Model 3; Little, Preacher, Selig, & Card, 2007). We used the chi-square difference test at p < .01 to verify whether the factor structure was invariant across gender. As recommended by Cheung and Rensvold (2002), we used the .01 difference in CFI among invariance models. Specifically, if the difference in CFI between two models was lower than .01, we concluded that both models were equivalent.
Measures
Vocational interests
The 24 items selected in Study 1 were used to assess vocational interests. Cronbach’s αs were adequate, αR =.93; αI =.87; αA =.79; αS =.85; αE =.92; αC =.92, thereby providing support for the reliability of each dimension.
Results
Do the selected items have adequate construct validity?
To determine whether the selected items presented adequate construct validity, we first examined whether the factorial structure underlying the RIASEC model was reproduced by the data. To do so, we performed a CFA on the 24 selected items and examined the goodness-of-fit criteria. Results showed adequate fit indices, S-B χ2 = 495.55; df = 236; S-B χ2/df = 2.10; NNFI = .94; CFI = .95; RMSEA = .06. In addition, all factor loadings were above .50 (see Table 6).
Study 2: Factor Loadings of the 24 Items From the CFA Solution
Note. Because item reproduction is prohibited without permission from PAR Inc., the labels of the items cannot be written.
Are the selected items invariant across gender?
We tested the selected items for factorial invariance across gender. We computed three models. In the least restrictive model (Model 1), no parameter was constrained to be equal across gender. This model provided a relatively good approximation of the data, S-B χ2 = 836.22; df = 472; χ2/df = 1.77; CFI = .94; NNFI = .93; RMSEA = .06. In Model 2, factor loadings were constrained to be invariant across gender, S-B χ2 = 855.43; df = 490; χ2/df = 1.75; CFI = .94; NNFI = .93; RMSEA = .06. The chi-square difference between these two models was not significant, χ2 diff = 22.04; df diff = 18; p > .01. In Model 3, variances and covariances were constrained to be invariant across gender, S-B χ2 = 881.40; df = 505; χ2/df = 1.75; CFI = .94; NNFI = .93; RMSEA = .06. The chi-square difference between models 2 and 3 was not significant, χ2 diff = 25.74; df diff = 15; p > .01. In addition, there were no differences among the CFI of the three models, ΔCFI = .00. Based on these results, we can conclude that factor loadings, variances, and covariances were invariant across gender.
Are correlations among interest dimensions of the selected items equivalent to those of the original English version?
We compared correlations among the RIASEC dimensions for the 24 selected items to those of the 66-item English version presented in the SDS Technical Manual. Because correlations are presented in the technical manual as a function of gender, we compared correlations for each gender separately, even though we found no differences in correlational patterns in the invariance analyses. As in Study 1, we used a Z score difference and a Cohen’s q and we adjusted the α level via a Bonferroni correction. As we can observe in Table 7, five correlations reached significance (one for men and four for women). Based on Cohen’s criteria for estimating effect size (Cohen, 1977), we can qualify these differences as varying from small to moderate. No effect size was observed. Some differences were noteworthy. For men, the correlation between I-S was .26 for the 66-item English version, but −.24 (p = .02) for the 24-item French version. For women, correlations between I-C, A-E, S-E, and E-C were, respectively, .21, .22, .26, and .36 for the 66-item English version but .40 (p = .00), −.01 (p = .93), −.01 (p = .93), and .55 (p = .00) for the 24 selected items.
Study 2: Comparisons of Correlation among RIASEC Dimensions for the Original 66-item English and the 24 selected items from the French-Canadian Version
Note. Correlations for the 66-item English version were reproduced by special permission of the Publisher, Psychological Assessment Resources, Inc., 16204 North Florida Avenue, Lutz, Florida 33549, from the Self-Directed Search Form R Assessment Booklet by John L. Holland, Ph.D., Copyright 1970, 1977, 1985, 1990, 1994, 1995. Further reproduction is prohibited without permission from PAR, Inc.
R = Realistic; I = Investigative; A = Artistic; S = Social; E = Enterprising; C = Conventional.
* p < .003.
aN = 1002.
bN = 95.
cN = 1600.
dN = 243.
Summary
The findings of Study 2 showed that the selected pool of items of the Activities section of the SDS is adequate and that its factor structure was invariant across gender. In addition, correlations among subscales were in the expected direction, except for 5 differences between the original 66-item English version and the 24 selected items (one for men and four for women). One may argue that these differences originate from the 24 selected items. However, it should be noted that the differences for I-C, A-E, and E-C for the women sample also appear when we compared the 66-item of both French and English versions (see Study 1). These unexpected findings might consequently have more to do with the original, longer version. Future studies are needed to replicate those differences and to explain them.
General Discussion
The main goal of the present research was to select a restricted pool of items from the French Canadian version of the Self-Directed Search—Activities section (Holland, 1991). In Study 1, results showed that the 66-item French Canadian version was equivalent to the original English version regarding reliability and correlations among the 6 types of interests. Using IRT and CFA, we selected 24 items from the 66-item version. In Study 2, results showed that the selected items had adequate factorial validity and that they were invariant across gender. In addition, correlations among subscales were in the expected direction, although some differences emerged between the original 66-item English version and the 24 selected items. The implications of these results are discussed below.
Implications for Research, Theories, and Career Counseling
The findings we obtained lead to a number of implications for research, theories, and career counseling. First, by testing the equivalence of the 66-item French Canadian version of the Activities section to the 66-item English version, the results of Study 1 provided further support for the French version of this SDS subscale. To the best of our knowledge, this is the first study that directly compared the psychometric properties of these two versions. It thus provides some support for using the SDS French Canadian version with college and university students. However, as we noted, some differences among correlations exist between both versions, specifically for men regarding social, enterprising, and conventional types of vocational interests. That is, in the American men sample, the correlation between social and enterprising dimensions was moderate and positive whereas, in the French Canadian men sample, this correlation was quite low. Moreover, in the American men sample, the correlation between social and conventional was positive, but negative in the French Canadian men sample. Because these types of interests are adjacent on the hexagon, one would expect higher, positive correlations among these dimensions for the French Canadian sample. However, these differences in correlation were not significant when we compared American women and French Canadian women samples. At this point, we cannot explain such differences in the French Canadian male sample. Are they due to cultural differences (i.e., occupational activities in certain countries might be different from those in the United States, where the original scale was developed), sample characteristics, or gender differences in interpretation of the items? Future research is needed to reproduce these unexpected correlations and explain them. More research is also required to replicate the RIASEC structure with Canadian samples.
Second, results from IRT and CFA allowed us to select 24 items. As mentioned earlier, selecting a reduced pool of items when researchers face time constraints has the advantages of diminishing item redundancy and reducing participants’ fatigue, frustration, and boredom (Robins et al., 2001). We must, however, keep in mind that the selected items focus on vocational interests but not other aspects of the vocational development such as competencies, occupation, or self-estimates. In addition, the 24 selected items might be different from those that would be selected in another language (i.e., English, German). Although the strength of this research was that the 24 items were selected through strict statistical analysis, it is possible that using different samples in different countries might yield different choices of items. This possibility need to be investigated in future studies.
Third, results from Study 2 showed that the selected items were reliable and had an adequate factorial structure that it is invariant across gender. These results parallel those of Nagy, Trautwein, and Lüdtke (2010), which demonstrated in a sample of German students that the structure of vocational interests is invariant across gender. However, before using those selected items for research purposes, it would be important to test it on various populations including high school students, young adults, and adults on the job market. Although we found a psychometrically sound reduced pool of items in the SDS Activities section, we encourage researchers and practitioners to use the complete version when they have sufficient time.
Fourth, most correlations for the selected items were in line with those obtained with the original 66-item English version. It thus appears that having fewer items did not affect the direction or magnitude of correlations among interests’ dimensions. The pattern of the correlations was thus in line with Holland’s model in which interest types can be represented in a hexagonal fashion where the distance between each type represents the relative magnitude of the correlations. That is, correlations between adjacent types are greater than those between all nonadjacent types (e.g., r R-I > r R-S). However, as it was the case with the 66-item French Canadian version, some correlations with the 24 selected items appeared problematic, especially, in Study 2, those involving the enterprising dimension for the women sample. Future research is thus needed on this issue.
Limitations
Despite the innovative nature of our findings, we need to consider some limitations when interpreting them. First, we did not verify extensively whether the SDS 66-item French Canadian version was equivalent to the English one. Conducting invariance analyses between English and French Canadian versions on factor loadings, variances, covariances, and uniquenesses appeared warranted. We were unable to conduct such an analysis because in the SDS English technical manual covariances among items were not reported.
Second, although we concluded that the psychometric qualities of the selected 24 items were adequate, other tests are needed to further test the psychometric properties of these selected items. For example, we did not test the scale’s reliability or construct validity over time. One advantage of using multiple time measurements would be to compute a multitrait−multimethod (MTMM) matrices to assess the scale’s convergent and divergent validities (see Campbell & Fiske, 1959; Guay, 2005; Marsh & Grayson, 1995 for examples). In addition, it could be useful to test the validity of the selected items using more sophisticated method of analyses such as the Randomization Test of Hypothesized Order Relations ([RTOR] see Nagy et al., 2010), which would test more fully the relations among the 6 dimensions as well as to test its construct validity with the 5-factor model of personality.
Finally, the samples used in Studies 1 and 2 were not representative of the entire college and university student populations. Our findings should be replicated in studies using larger, more representative samples that include students from ethnic minorities as well as those coming from diverse socioeconomic backgrounds.
Conclusion
Although there need to be additional tests of the validity of these 24 items selected from the Activities section of the SDS, the present study nevertheless underscores their psychometric properties. Thus, researchers working with French Canadian students could use these items to test their hypotheses concerning the determinants and consequences of career interests, when they do not have the required time to administer the 66-item version.
Footnotes
The Self-Directed Search was modified and reproduced with special permission of the Publisher, Psychological Assessment Resources, Inc., 16204 North Florida Avenue, Lutz, Florida, 33549, from the Self-Directed Search Form R Assessment Booklet by John L. Holland, PhD, copyright 1970, 1977, 1985, 1990, 1994, 1995. Further reproduction is prohibited without permission from PAR, Inc.
The author(s) declared no conflicts of interest with respect to the authorship and/or publication of this article.
The author(s) disclosed receipt of the following financial support for the research and/or authorship of this article: SSHRC and by the financial support of the Canada Research Chair on Motivation and Academic Success.
