Abstract
The present study addresses a lack of literature examining contextual factors when investigating gender norms and leader effectiveness. Supervisor–subordinate dyads were assessed for gender dissimilarity effects on ratings of leadership effectiveness. The effect of organizational culture as a moderating force on these effects was also investigated. Results bolster relational demography theory, such that supervisors who fit existing norms (i.e., male supervisors) were rated higher than supervisors that do not (i.e., female supervisors). However, lower ratings for female supervisors were mitigated by effective organizational cultures.
Keywords
While women have made up ground in their representation in managerial positions (U.S. Department of Labor, 2010), the fact remains that women are still highly underrepresented in the upper echelons of organizations. The term “glass ceiling” (Barreto, Ryan, & Schmitt, 2009; U.S. Department of Labor, 1991) describes this phenomenon of women routinely facing an upward battle in their aspirations of obtaining managerial and executive-level positions, especially in roles that engender a masculine stereotype (Eagly, Karau, & Makhijani, 1995). Negative effects of the glass ceiling include disadvantages in terms of selection (Koch, D’Mello, & Sackett, 2015), promotion (Ortiz-Walters, Schneider, Sheikh, & Simione, 2012), and perceptions of effectiveness of performance (Thoroughgood, Sawyer, & Hunter, 2013).
The present study is rooted in stereotype research generally and relational demography theory specifically. Researchers continue to study the disparity between male and female leaders, reasons for the lack of females in the top levels of organizations, and how females are evaluated once they break the glass ceiling. However, one perspective has not received much attention: the gender congruence between supervisors and their subordinates (i.e., relational norms) together with the role of contextual factors (i.e., organizational culture). This article aims to (a) answer questions around whether female leaders are still perceived as less effective than male leaders; (b) determine if, consistent with past research, norms regarding males as leaders affect ratings of female leaders; and (c) extend theory by examining contextual factors, specifically the extent to which organizational culture may mitigate or exacerbate negative evaluations of female leaders.
Stereotypes of Male and Female Leaders
In a recent publication comparing the stereotypes of males and females from 1983 to 2014, Haines, Deaux, and Lofaro (2016) concluded that, with one exception, stereotypes have remained consistent. The exception? Females today are stereotyped more in terms of gender roles than they were three decades ago. With regard to leadership, males are commonly stereotyped as leaders, while females are not (Eagly & Karau, 2002; V. E. Schein, 1975). In fact, even when female leaders are perceived as competent as male leaders, they still face backlash in the form of bias and are less liked than male leaders (Rudman, Moss-Racusin, Phelan, & Nauts, 2012). In 1973, V. E. Schein proposed that individuals create prototypes of what an effective leader should be. Lord, Foti, and De Vader (1984) expanded on the idea of effective leadership profiles (i.e., implicit leadership theory), and suggested that individuals develop prototypes of an ideal leader based on experience with various leaders (Epitropaki & Martin, 2005). Indeed, research suggests the general male prototype more strongly relates to that of a successful leader prototype (e.g., Heilman, 2012; Heilman, Block, Martell, & Simon, 1989; Junker & van Dick, 2014; V. E. Schein, 1973, 1975).
More recent studies suggest that while there may be a perceived male advantage, the effect might be more complex than originally thought. This notion has been supported by the likes of Eagly and Carli (2003, 2007) who assert that the context of the organization (e.g., power distance, gender composition), societal norms, and organizational norms are causing the prescribed leadership roles of the past to change. In turn, compared with how female leaders were viewed in the past, female leaders today are looked at differently and in a more favorable regard (Duehr & Bono, 2006), though this effect is further complicated by the perceived masculinity/femininity of leaders in certain industries (e.g., Cabrera, Sauer, & Thomas-Hunt, 2009). Furthermore, the literature on leadership ratings is dependent on who is providing the ratings. Overall, early research on the discrepancies between stereotypes and perceived effectiveness of male and female leaders allude to a predominately masculine preference. Over time, it seems, these preferences have been diluted (Koenig, Eagly, Mitchell, & Ristikari, 2011). Not only that, but there seem to be variables that moderate preferences and ratings of female leaders (Duehr & Bono, 2006; Eagly & Carli, 2007). Relational demography theory may help better understand these complexities, as it focuses less on the descriptive and prescriptive roles of leaders by gender and focuses more on the interaction between gender (dis)similarity and context to explain leadership ratings.
Relational Demography Theory
Relational demography is the study of demographic characteristics (e.g., gender, age, ethnicity) that aid in the understanding of group similarities and dissimilarities, and their effects on various organizational outcomes (Bakar & McCann, 2014; Tsui & O’Reilly, 1989). According to researchers (e.g., Lau, Lam, & Salamon, 2008), prescribed categories (i.e., relational norms) influence individuals’ perceptions of others. In the present study, we focus on subordinates’ perceptions of effectiveness based on the gendered nature of the supervisor–subordinate dyad.
According to the relational-norm approach of relational demography, the outcomes of demographic (dis)similarity are largely contingent on social or cultural norms (Lau et al., 2008). Said another way, when the demographic characteristics of a group or dyad fit the societal, cultural, or organizational norm, there is status congruence, and a more positive relationship between a supervisor and subordinate will likely ensue (Lawrence, 1984). When these groups or dyads do not fit the norm, there is status incongruence within the group, and a less positive relationship will likely follow (Goldberg, Riordan, & Zhang, 2008). These statuses or roles are based on shared beliefs of society, that is, normative perceptions about what is expected by someone with a particular demographic attribute. Years of research support the notion that perceptions about how one gender, especially females, ought to act influence ratings (Koch et al., 2015).
In terms of the present study, it is expected that gender differences in dyads result in differential ratings, but also that those ratings depend on the pairing within the supervisor–subordinate dyad. Specific to individuals’ profiles of an ideal leader aligning more closely with a male, it is expected that demographic dissimilarities will have a positive influence on supervisor effectiveness ratings only when that dissimilarity in the dyad fits a societal or organizational norm.
Scant research has been done to understand the complexities of the relational-norm approach to relational demography research, especially within a gendered context (for an exception, see Goldberg et al., 2008). As such, it is important to test the role of gender in dyads on leadership ratings and determine which contexts may help shape these perceptions of effectiveness. Though previous research has educated our understanding of these relationships, there is still a gap in the literature. Our aim is to address this gap and aid in the understanding of how the gender of supervisors and subordinates interact with various norms to predict perceptions of supervisor effectiveness. Consistent with the research reviewed above, we expect to find context of the dyad to act as a relational norm and affect subordinates’ ratings of their supervisors. Namely, we expected that supervisors in dyads that are consistent with relational norms about males as leaders would be rated as more effective than supervisors in dyads that are inconsistent with relational norms about males as leaders. Thus, the following hypotheses are presented:
Organizational Culture as a Moderator
Relational demography research is not without its criticism. That is, the crux of this research is that it assumes an intervening variable, yet often does not directly measure any (Lawrence, 1997). Instead of assuming an intervening variable, and consistent with previous research integrating organizational context into the study of perceived gendered differences in leadership effectiveness, we argue that contextual factors (specifically organizational culture) may moderate the relationship between gender contrasts and leadership effectiveness ratings. In doing so, we argue that traditional gender norms are less important in organizations with a more effective culture and, thus, may affect how supervisors are rated in such organizations. Analyzing culture as a boundary condition is consistent with calls from leadership theorists and researchers for investigations into how context plays into leadership. Dinh et al. (2014), in the review of trends and changing perspectives in leadership, state: “ . . . while progress has been made, we still consider [contextual factors in which leaders operate] to be an under-researched topic, given the central importance of context to the emergence and manifestation of leadership processes.”
Organizational culture is commonly defined as the shared set of values, beliefs, and assumptions held by the constituents within an organization (E. H. Schein, 1992). It is these values, beliefs, and assumptions that provide a roadmap for leaders’ behaviors (Hartnell & Walumbwa, 2010) and, accordingly, followers’ perceptions of those behaviors. Indeed, research shows that culture plays into the beliefs of employees regarding how effective leaders ought to behave as well as leaders perceived effectiveness (e.g., Kirkman, Chen, Farh, Chen, & Lowe, 2009).
Certain culture models take a prescriptive (rather than descriptive) approach that measures culture in terms of cultural effectiveness, where higher culture scores suggest greater cultural and thus organizational effectiveness (Denison & Mishra, 1995). There is considerable support for these models of culture, finding that higher culture scores are related to better organizational performance (e.g., Boyce, Nieminen, Gillespie, Ryan, & Denison, 2015). Relatedly, effective leadership is also inextricably tied to organizational culture (e.g., Judge, Piccolo, & Ilies, 2004).
Given the importance of (a) context in the relational demography literature and (b) the relationships between culture, leadership, and organizational effectiveness, we expect organizational culture to moderate the relationship between both of the abovementioned gender contrasts and perceptions of leadership effectiveness as rated by supervisors’ subordinates. We expect that organizations with stronger shared perceptions of an effective culture likely have fewer barriers for supervisors in dyads that do not fit traditional norms about males as leaders. As organizational culture scores increase, we anticipate supervisors in nonnormative dyads (i.e., female supervisors rated by male subordinates or female subordinates) to be rated higher.
However, due to the scant theoretical or empirical evidence to bolster differential effectiveness ratings by dimensions of culture, we do not make hypotheses about specific culture dimensions. Because we thought it important to investigate such relationships, we, instead, pose the research question of which organizational culture dimensions moderate the relationship between gender contrasts and leadership effectiveness ratings.
Method
Participants and Procedure
Participants in this study were selected from a larger, global archival data set consisting of nearly 14,000 supervisors from approximately 1,000 organizations who had completed the Denison General Leadership Effectiveness Survey (DGLES). Supervisors were invited to participate in the survey via e-mail at which time they were also given the option to invite additional raters (i.e., their supervisor(s), direct report(s), peer(s), and other raters deemed appropriate) to rate their performance. Because the primary interest of the current study was ratings of supervisors as provided by direct reports, only those ratings were analyzed.
From the larger database of leaders, only those in organizations whose members had also completed the Denison Organizational Culture Survey (DOCS), the culture survey used in this study, were included. As a result, the final data set consisted of 3,510 supervisors being rated by 12,374 direct reports from 87 organizations. Of these supervisors, 11,497 reported their gender (77.7% male). Each supervisor had, on average, 3.53 direct reports (SD = 1.77) rate their effectiveness. The gender breakdown for the direct reports were 4,614 females and 7,760 males. In total, dyadic pairs were consistent with the notion of a male leadership norm. See Tables 1 and 2 for a breakdown of dyad representation and additional demographic information for those supervisors who provided such data.
Breakdown of Dyadic Pairs.
Leaders Demographic Information.
Data for perceptions of organizational culture were taken from the organizations from which the supervisors worked. In total, culture scores were gathered from 24,420 employees across the 87 organizations in the sample. The average number of employees across organizations was 283.95 (SD = 587.97), while the median was 104.00 employees per organization. Organizational culture data were either census (i.e., all members surveyed) or representative of the organization as a whole (i.e., randomly sampled across organizational units). Organizations represented a diverse range of sectors and industries. As explained in more detail below, female representation in each industry ranged from 5.20% to 76.80%.
Measures
Relational Demography
In order to test relational norms, it was necessary to first create dyadic pairing consistent with our hypotheses. That is, we coded dyadic pairs specific to the contrasts we were testing. This method of analysis was selected over an analysis of variance framework with supervisor gender, subordinate gender, and the interaction between them as predictors of effectiveness because such a framework fails to account for important normative information specific to dyadic pairs. Others have taken a similar approach to the study of relational norms (e.g., Tsui, Porter, & Egan, 2002).
Denison Organizational Culture Survey
Culture was measured using the DOCS (Denison & Neale, 1996). The DOCS was born out of a line of research by Denison et al. in which organizational culture was linked to organizational effectiveness (Denison, 1984; Denison & Mishra, 1995; Fey & Denison, 2003). More specifically, the DOCS was designed as a prescriptive measure of culture. Participants rated 60 items on a 5-point Likert-type scale. The items fit within 12 lower order indexes nested within four higher order traits (Involvement, Consistency, Adaptability, and Mission). For the sake of this study, culture was measured as the aggregate of trait-level factors. An example item of Involvement is “Most employees are highly involved in their work.” An example item of Consistency is “The leaders and managers ‘practice what they preach.’” An example item of Adaptability is “The way things are done is very flexible and easy to change.” And finally, an example item of Mission is “There is a long-term purpose and direction.” In the current sample, scale reliability (α) estimates for each of the higher order culture traits were as follows: Involvement = .91, Consistency = .89, Adaptability = .86, and Mission = .92.
Denison General Leadership Effectiveness Survey
The outcome variable for this study was leadership effectiveness, which was assessed using subordinates’ ratings of their supervisors on the DGLES. The DGLES is a seven-item Likert-type measure assessing descriptive leadership effectiveness. Responses ranged from 1 (strongly disagree) to 7 (strongly agree). Though the DGLES has never been used in peer-reviewed research, use of ad hoc leadership effectiveness scales is not uncommon (e.g., Bono & Ilies, 2006; Edelman & van Knippenberg, 2017; Kerr, Garvin, Heaton, & Boyle, 2006). In the current sample, coefficient α estimates for the DGLES were adequate (α = .96), and interitem correlations ranged from r = .67 to r = .86.
In addition to tests of internal consistency, we also conducted a multilevel confirmatory factor analysis (c.f. Hox, 1993) on the DGLES. Results from the multilevel CFA indicate that the seven items adequately fit a one-factor structure (χ2 = 935.23, degrees of freedom = 28; root mean square error of approximation = .05; comparative fit index = .98).
Control Variables
Three variables were used as covariates in the present study: organizational level, tenure with organization, and percentage of women in the industry. We controlled for these three variables because recent studies (e.g., Ko, Kotrba, & Roebuck, 2015; Paustian-Underdahl, Walker, & Woehr, 2014) found gender differences on ratings of leadership effectiveness across those measures. To calculate percentage of females within each industry, we matched industry data provided by the U.S. Bureau of Labor Statistics (2017) to the corresponding North American Industry Classification System industry codes provided by the U.S. Census Bureau (2017).
Results
Descriptive statistics and bivariate correlations are reported in Table 3. Please note that the correlations reported are at the individual level and do not consider the nested structure of the data.
Correlations, Means, and Standard Deviations of Model Variables.
Note. MF = male–female; FM = female–male; MM = male–male; FF = female–female; DGLES = Denison General Leadership Effectiveness Survey. Reliabilities in parentheses are reported along the diagonal.
*p < .05. **p < .01.
Analysis
Due to the nested, nonindependent structure of the data, that is, multiple subordinates rating supervisors across organizations, all hypotheses were tested using hierarchical linear modeling (Bryk & Raudenbush, 1992). Analyses suggest that 34% of the variance in effectiveness ratings are due to supervisor of the subordinate, intraclass correlation coefficient(1) = .34, and that subordinates sharing the same supervisor rated similarly (mean rwg(j) = .91). Moreover, organizations account for between 12% and 16% of the variance in culture scores, with adequate within-organization agreement (see Table 4).
Variance and Agreement Statistics.
Note. ICC = intraclass correlation coefficient.
Hypothesis 1a testing the relational-norm approach to relational demography predicted that female subordinates would rate their male supervisors higher than male subordinates would rate their female supervisors. Results of this hypothesis were supported (γ = −.05, p = .01; see Table 5). Hypothesis 1b predicted that male subordinates would rate their male supervisors higher than female subordinates would rate their female supervisors. Results suggest no significant difference between the male supervisor–male subordinate dyads and the female supervisor–female subordinate dyads (γ = .03, p = .14). Thus, Hypothesis 1b was not supported.
Results of HLM Analyses: Effects of FM and MF Dyads on Ratings of DGLES.
Note. HLM = hierarchical linear modeling; SE = standard error; DGLES = Denison General Leadership Effectiveness Survey. MF (male supervisor–female subordinate) dyads were coded “1.” FM (female supervisor–male subordinate) dyads were coded “−1.” The DGLES is rated on a 7-point scale.
p < .05. **p < .01.
Hypothesis 2a predicted that perceptions of effective organizational culture would moderate the relationship between the male supervisor–female subordinate versus female supervisor–male subordinate contrast and leadership effectiveness ratings such that female supervisors are rated higher in organizations with more effective cultures. We hypothesized that subordinates in female supervisor–male subordinate dyads would report higher leadership effectiveness ratings in organizations with higher culture scores than they would in organizations with lower culture scores.
Results of the Hypothesis 2a comparison suggest that two of the four culture traits act as moderators. Male subordinates rated their female supervisors higher in organizations with effective Involvement (γ = −.10, p = .09 1 ) and Mission (γ = −.10, p = .07) culture, as opposed to organizations with low scores for Involvement or Mission culture. Therefore, Hypothesis 2a was partially supported, with perceptions of Involvement (see Table 6; Figure 1) and Mission (see Table 7; Figure 2) acting as moderators for the relationship between gender contrast and leadership effectiveness ratings.
Results of HLM Analyses: Effects of MF and FM Dyads on Ratings of DGLES as Moderated by Ratings of Involvement Culture.
Note. HLM = hierarchical linear modeling; SE = standard error; DGLES = Denison General Leadership Effectiveness Survey. MF (male supervisor–female subordinate) dyads were coded “1.” FM (female supervisor–male subordinate) dyads were coded “−1.” The DGLES is rated on a 7-point scale.
p < .10. *p < .05. **p < .01.

Cross-level moderating effect of organizational involvement culture on the relationship between FM (female supervisor–male subordinate) compared with MF (male supervisor–female subordinate) dyads and DGLES (Denison General Leadership Effectiveness Survey).
Results of HLM Analyses: Effects of MM and FF Dyads on Ratings of DGLES as Moderated by Ratings of Mission Culture.
Note. HLM = hierarchical linear modeling; SE = standard error; DGLES = Denison General Leadership Effectiveness Survey. MM (male supervisor–male subordinate) dyads were coded “1.” FF (female supervisor–female subordinate) dyads were coded “−1.” The DGLES is rated on a 7-point scale.
p < .10. *p < .05. **p < .01.

Cross-level moderating effect of organizational mission culture on the relationship between FM (female supervisor–male subordinate) compared with MF (male supervisor–female subordinate) dyads and DGLES (Denison General Leadership Effectiveness Survey).
Hypothesis 2b hypothesized that culture would moderate the relationship between the male supervisor–male subordinate versus female supervisor–female subordinate contrasts and leadership effectiveness ratings. Regardless of the culture trait, perceptions of culture did not act as a moderator between supervisor–subordinate gender contrast and leadership effectiveness ratings. Thus, Hypothesis 2b was not supported.
In sum, Hypothesis 2b was partially supported, with results varying depending on the contrast in question. Culture interacted with the female supervisor–male subordinate versus male supervisor–female subordinate contrast, but not the male supervisor–male subordinate versus female supervisor–female subordinate contrast. In response to our research question, it seems different aspects of culture have varying effects on ratings of leadership effectiveness depending on the contrast.
Of note, we acknowledge that a plausible alternative explanation for our results being significant may be the result of male leaders, regardless of context (dyad or culture), being rated higher than female leaders. That is, one could reasonably argue that our significant interactions were driven by male supervisors being rated higher than female supervisors in the overall sample. To eliminate this alternative explanation, we conducted an additional follow-up analysis to test whether males were rated higher than females in the overall sample. We found no significant difference between the leadership effectiveness ratings of males (M = 5.92) compared with females (M = 6.00), t(11,495) = 3.32, p = .986. Thus, concordant with our results above, the significant finding that male supervisors are rated no differently than female supervisors lends support to the important role of context, both in supervisor–subordinate dyads and across organizational cultures.
Discussion
Females continue to face barriers that their male counterparts do not. This has been studied from multiple theoretical perspectives, but, to our knowledge, we are the first to study the lack of women in leadership positions through the interactive lens between relational norms and the cultural context of the organization.
Our first hypothesis asserted differences in ratings specific to the relational-norm hypothesis of relational demography. As expected, female supervisors were rated lower by their male subordinates than male supervisors were rated by their female subordinates. However, when comparing male supervisors rated by male subordinates with female supervisors rated by female subordinates, we failed to find support for the relational-norm hypothesis. Our results about males’ ratings of females may speak to the troubles female leaders face in attempting to break the glass ceiling. Namely, with promotions being tied to performance appraisals in many organizations, women are forced to hurdle a roadblock that their male colleagues are not (Lyness & Heilman, 2006). This result, combined with the fact that male leaders, in general, are not rated any higher than female leaders, provides support for the idea that discrimination of female leaders cuts across borders and is influenced by relational norms about males as leaders.
Although the relational-norm hypothesis held for male’s ratings of females versus female’s ratings of males, the same was not true for the comparison of male’s ratings of males to female’s ratings of females; male subordinates rate male supervisors as effective as female subordinates rate female supervisors. Though this result is not in line with our hypothesis about the effects of relational norms, it does conform to findings from less context-specific studies that suggest females are less discriminatory in their ratings of leaders (Duehr & Bono, 2006; V. E. Schein, 2001).
We also measured the extent to which organizational culture had an impact on ratings of effectiveness across both of our gender contrasts. Where female supervisors were rated lower by their male subordinates than male supervisors were rated by their female subordinates (Hypothesis 1a), adding the effect of culture yielded interesting findings. As hypothesized, organizational culture acted as an important boundary condition of the negative effects of gendered relational norms. In organizations with less effective Involvement and Mission cultures, female subordinates rate their male supervisors higher than male subordinates rate their female supervisors, but in organizations with more effective Involvement and Mission cultures, male subordinates rate their female supervisors higher than female subordinates rate their male supervisors. Similar to the main effect findings of male–male compared with female–female dyads (Hypothesis 1b), though, we found no support for culture as a moderator with that contrast.
Implications for Research
Taken as a whole, the implications of our research are twofold. First, it seems that males still perceive female leaders as less effective than they perceive male leaders. Considering all raters together (i.e., devoid of context), female supervisors were rated just as effective as male supervisors. However, as our results show, when considering the dyadic nature of the supervisor–subordinate relationship and organizational culture in which those dyads are embedded (i.e., contextual variables), results suggest male subordinates rate female supervisors lower than female subordinates rate male supervisors.
Second, by introducing organizational culture as a contextual variable in the relationship between supervisor–subordinate dyad and leadership effectiveness ratings, we show that culture may be able to mitigate some of the negative effects of the ways female leaders are rated. By accepting the call of previous researchers to include contextual variables in the study of leadership (e.g., Day, 2001; Dinh et al., 2014), we provide initial evidence that organizational culture is an important contextual variable that acts as a boundary condition for how leaders are perceived in different supervisor–subordinate dyads. Generally speaking, and considering the gendered nature of the supervisor–subordinate dyad, the more effective the culture, the more favorable female supervisors are viewed. These findings have important implications for practice, especially for organizational leaders seeking to enact change.
Limitations and Future Directions
This study, though providing a new lens in which to observe gender discrepancies in leadership and observing a boundary condition for those negative ratings, is not without limitations. One limitation of the study is the cross-sectional, archival nature of the study. Unfortunately, we were limited in our access to certain data that would have made our study stronger. Another limitation is that this study measured only one demographic variable with regard to relational demography. Also, we used a prescriptive measure of culture (i.e., the DOCS). Other researchers might use descriptive measures of culture.
Organizational culture is typically measured at a very broad level (c.f. Hartnell, Ou, & Kinicki, 2011). Thus, we were limited by the lack of a priori rationale for each of the individual culture traits. Where we did find the Involvement and Mission culture traits to moderate the relationship between supervisor–subordinate dyad and leadership effectiveness ratings, we failed to find support for the intervening effect of Consistency or Adaptability culture traits. Future studies could benefit from a more fine-grained lens of organizational culture to better assess how specific aspects culture affects employee perceptions. For example, one might assess how various values differentially affect male and female employee’s perceptions of their leaders’ effectiveness.
Conclusions
Our study shows that gender in supervisor–subordinate dyads matters when predicting leadership effectiveness ratings. The primary contribution of this article was the identification of contextual boundary conditions between gendered dyads and leadership effectiveness ratings. Specifically, we found that the negative perceptions of female leaders by male subordinates may be mitigated when an organization has a more effective Involvement or Mission culture. We suggest that organizations wishing to improve perceptions of female leaders, and possibly increase the number of female leaders, work on improving their cultures especially with regard to Involvement and Mission.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
