Abstract
The heuristics strategists use to make predictions about key decision variables are often learned from only a small sample of observations, which leads to a risk of inappropriate generalization when strategists misjudge regularities. Building on the statistical learning literature, we show how strategists can mitigate this risk. Strategies to learn heuristics that accept a bias, that is, a systematic deviation of predictions from actual outcomes, can outperform unbiased strategies because they can reduce the variance component of prediction error: the degree to which random fluctuations in observational data are inappropriately generalized. We demonstrate how strategists who are aware of the trade-off between bias and variance can learn heuristics more effectively if they are also aware of the relevant characteristics of their learning environment. We discuss the implications of our results for our understanding of heuristics, (dynamic) capabilities, and managerial cognitive capabilities, and we outline opportunities for empirical work.
Introduction
A strategist confronted with a decision of strategic importance—such as whether to allocate resources to a new venture, to go forward with a proposed acquisition, or to hire a candidate for a leadership position—must assess the key parameters that affect the outcome of the decision. For example, the expected performance of the new venture, of the acquired firm, or of the candidate hired are all important variables that must be predicted before making a decision based on the information that is available at that point. Rumelt (1979) and Nelson and Winter (1982) early on suggested that strategists should use simple heuristics to make predictions about such decision variables. More recently, strategy scholars have provided empirical evidence that strategists indeed use heuristics as “simple rules” (e.g. Bingham and Eisenhardt, 2011; Maitland and Sammartino, 2015), that these are often learned from and honed with experience (e.g. Bingham et al., 2019; Bingham and Eisenhardt, 2011), and that their use often leads to better decisions (e.g. Astebro and Elhedhli, 2006; Luan et al., 2019).
While strategy scholars have noted that the heuristics strategists use in a given decision situation have often been developed through a learning process (e.g. Bingham and Eisenhardt, 2011), making the conceptual distinction between learning a heuristic, on the one hand, and using it, on the other hand, explicitly has consequences that prior work on heuristics in a strategy context has not examined. Because strategists encounter typical decision problems only infrequently, they have to learn the relationship between available information and decision variables for a given type of decision from only a small sample of observations (Zollo, 2009; Zollo and Winter, 2002). As a consequence, when learning heuristics that capture this relationship, strategists are at risk of making “inappropriate generalizations” (Haleblian and Finkelstein, 1999)—seeing regularities where none exist or misjudging those that do exist (Gilovich et al., 1985; Taleb, 2001).
In this article, we build on the literature on statistical learning (Hastie et al., 2008) to formally analyze how accepting biases in learning and using heuristics can mitigate the risk of making inappropriate generalizations and thus lead to more accurate predictions in typical strategy contexts (Bingham and Eisenhardt, 2014; Loock and Hinnen, 2015; Vuori and Vuori, 2014). In doing so, we make use of the work of Gigerenzer and colleagues from the psychology literature and their research on “fast and frugal” heuristics (Gigerenzer and Brighton, 2009; Gigerenzer and Gaissmaier, 2011), whose core argument is that heuristics will be biased when they ignore available information, but surprisingly—rather than, therefore, being inferior to more exhaustive prediction strategies that do not ignore information—the very fact that they ignore information allows for more accurate predictions even if the information is freely available and can be processed at no cost. We thus define a heuristic as a prediction strategy that accepts a bias (e.g. by ignoring available information) with the goal of making decisions more accurately than unbiased prediction strategies. 1 We then build on a key premise of the fast and frugal program, namely that the accuracy of a prediction from any prediction strategy depends on the characteristics of the decision situation or environment (which is a point made by Herbert Simon long ago; for example, Simon, 1956). That is, a heuristic is “ecologically rational” (Gigerenzer and Selten, 2002; Goldstein and Gigerenzer, 2002) in a given type of environment if it leads to a more accurate prediction than alternative prediction strategies in that environment. In our analysis, we make a fundamental distinction between environments that are stable (where the same type of decision problem is repeatedly encountered and the regularities can be learned; for example, Bingham and Eisenhardt, 2011) and those that are changing (where a new decision problem is different from, yet similar to, one which has been previously encountered; for example, Eisenhardt and Martin, 2000). Based on this distinction, we demonstrate that biased heuristics can improve prediction accuracy in the following two ways.
First, consider a strategist in a stable environment, who evaluates acquisition targets and must predict how difficult it will be to integrate an acquired firm’s technology with the acquiring firm’s technology. He has observed three acquisitions of small technology firms. In two of these, the CEO of the acquired firm had an MBA, and integrating the acquired firm’s technology was easy. In the third one, the CEO had an engineering degree, and technology integration proved very difficult. Based on these observations, should the strategist conclude that CEO educational background is generally predictive of integration success, or would that be an instance of inappropriate generalization?
Ignoring it will systematically bias predictions about ease of technology integration if CEO educational background and ease of integration are in fact correlated. Say that on average technology integration is easier if CEOs have an MBA than if CEOs have an engineering degree. Then ignoring CEO background will lead to too optimistic predictions if the CEO has an engineering degree and to too pessimistic predictions if the CEO has an MBA. But the strategist does not know the true relationship and therefore needs to learn it from her observations. Because easiness of integration will also be determined by a number of other factors, the strategist—given a sample of just three observations—may well (simply by chance) observe two cases of successful integration with a CEO who had an engineering degree and one case of integration failure with a CEO with MBA background. Given such a pattern, concluding that integration will be easier for firms with an engineering CEO would be an instance of inappropriate generalization. The variance component of prediction error describes the average error a strategist makes as a result of such inappropriate generalization.
Generally, there is a bias–variance trade-off (Hastie et al., 2008): Determining which observable attribute to include in a prediction strategy requires a strategist to weigh the risk of inappropriate generalization against the bias from ignoring an attribute that actually matters. As we show, the question of whether the strategist should generalize about a relationship he observes in a small sample depends on the characteristics of the learning environment. For example, the risk of inappropriate generalization is generally the larger the fewer observations are available and the greater the variability between situations of the same type (variability is the degree to which the decision variable is determined by factors unique to the specific situation).
Second, assume that the same strategist is exposed to a changing environment: The firm he is working for has recently started to acquire much larger firms. In the meanwhile, he has seen dozens of acquisitions of small technology firms and has (correctly) learned that on average technology integration will be easier for small technology firms whose CEO has an MBA background. After having observed three large acquisitions, a similar pattern as above emerges: In the two acquisitions where the CEO had an engineering background, technology integration was difficult, whereas in the third one where the CEO had an MBA integrating the acquired firm’s technology was much easier. The strategist could conclude that it may be better to ignore CEOs’ educational background because he has too few observations to reliably generalize the observed relationship for large firms. However, he could also transfer his ample prior experience with small firms. Doing so will, however, lead to a bias if there are systematic differences between technology integration in small and large firms (Ellis et al., 2011).
This example shows that a similar trade-off exists in changing environments when a strategist has only few observations from the new environment but ample experience from the previous environment. Specifically, transferring earlier experience also has two effects: On the one hand, it leads to a systematic bias in predictions due to the dissimilarity between contexts (e.g. March, 1991). On the other hand, it can also reduce the risk of inappropriate generalization because it increases the number of observations from which the relationship between attributes and the decision variable has been learned. As we show, a “transfer heuristic” that makes predictions by combining what the strategist learns from the small number of observations in the changed environment with his experience from the previous environment can address this bias–variance trade-off. Interestingly, the bias–variance trade-off in changing environments has the opposite direction as the one in stable environments: Making use of biased experience from the previous context can improve prediction accuracy, while ignoring it can increase the risk of making inappropriate generalizations.
We build on the statistical learning literature (e.g. Hastie et al., 2008) to formally examine how the characteristics of the learning environments affect the prediction accuracy of heuristics that strategists learn from small samples in both stable and changing environments. Formally, we treat the learning process as the process of determining weights for different observable attributes that strategists will use in making predictions (in our above example, the educational background of the CEO is one such attribute). Ignoring an attribute means giving zero weight to it, and transferring experience means calculating a weighted average of the attribute weights determined from observations in the previous environment and the weights determined from observations in the changed environment. In addition to distinguishing between stable and changing environments, we also examine several characteristics of learning environments that have been analyzed in the fast and frugal heuristics literature (Gigerenzer and Gaissmaier, 2011).
A key insight of our article is that there is a conceptual difference between rules for learning heuristics, on the one hand, and heuristics as prediction strategies, on the other hand. Learning rules include, for example, rules to decide whether to generalize (e.g. do not generalize after only three observations) or rules to determine how much weight to give to prior experience that is transferred to a new context (e.g. weigh prior experience highly if the new environment is very similar to the previous one), whereas heuristics are the resulting prediction strategies (e.g. if the CEO has an MBA background, integration will be easier).
The key contribution our study makes is to show how such learning rules that capture the key characteristics of the learning environment affect the prediction accuracy of the heuristics strategists learn from experience. An implication of our findings is that to learn and use heuristics effectively, strategists must correctly assess the relevant environmental characteristics. Therefore, strategists who are aware of these characteristics will learn heuristics that will allow them to make more accurate predictions. This, in turn, implies that knowledge about environmental characteristics is an important but understudied aspect of managerial and dynamic capabilities (Helfat and Peteraf, 2015). In addition, our findings also raise the important question of how fungible and thus transferable heuristics are across different types of situations (Levinthal and Wu, 2010).
In the following, we formalize our argument. We then use this formal approach to explain the bias–variance trade-off in both stable and changing environments and show how the characteristics of learning environments affect the prediction accuracy of the heuristics that are learned from small samples. We then discuss the implications of our arguments and provide guidance for further work on heuristics in a strategy context.
Heuristics, information, and learning environments
The strategist’s prediction task and available information
Our starting point is a strategist who must evaluate a decision in terms of a key decision variable (which we call the “outcome”). In our introductory example, ease of technology integration in an acquisition is such a key variable. In the context of internationalization (Bingham et al., 2007), a key variable could be some measure of success of a market entry decision under consideration (e.g. market share or total sales). For predicting the outcome, the strategist has a number of observable “attributes” available. For example, for a market entry decision the observable attributes include, among many others, market language or country gross domestic product (GDP). In addition to the specific decision problem at hand, the strategist has a number of prior observations of the same “type” of decision problem from which he can learn the relationship between attributes and outcome. Being of the same type not only means that they are about the same task or process (e.g. acquisition, new product development, new market entry), but we assume that they also have the same observable attributes and that the outcome to be predicted is also observable for all instances.
We can formalize this situation as a discrete stochastic process. Formally, at each time
where both
The strategist’s task is to predict the outcome of the decision variable
Heuristics and learning rules
As defined above, a heuristic is a prediction strategy that accepts a bias (e.g. by ignoring information) with the goal of making decisions more accurately than unbiased prediction strategies. If the true relationship
However, we assume that the strategist only knows the functional form of
A prediction strategy is the result of applying a rule to learn
Characteristics of learning environments
Whether and by how much a biased heuristic will improve prediction accuracy depends on the characteristics of the learning environment. Gigerenzer and Gaissmaier (2011) list four characteristics of learning environments that have been identified in the fast and frugal program, which influence the potential superiority of biased heuristics: the number of observations of prior instances available to the strategist
However, there is also one aspect in which a typical strategy context can differ from the types of environments studied by Gigerenzer and colleagues. Specifically, the basic argument in the fast and frugal research program why heuristics work is that they are effective at exploiting regularities across instances of the same type of decision problem (Gigerenzer, 2008; Goldstein and Gigerenzer, 1999). To take an example for a regularity from Bingham and Eisenhardt (2011), in the context of internationalization, using language spoken in a country as a cue to determine its attractiveness can only work if the relationship between language and attractiveness is the same across countries. On the contrary, however, it has been argued that a key characteristic of the environments faced by strategists is that they are often changing (Vuori and Vuori, 2014). Thus, one needs to account for whether and to what extent regularities persist across instances of a decision problem encountered by a strategist.
A stylized distinction to account for differences in the persistence of regularities is to distinguish between two basic types of environments: those in which regularities persist across instances of the same type of decision problem (stable environments) and those in which the regularities are subject to a one-time change (changing environments). Clearly, both will be relevant in strategic decision contexts. For example, there are likely regularities across decision situations that require evaluating proposals for new ventures or for evaluating acquisition targets. On the contrary, the entry of a competitor may change the regularities for new product development (as has happened when Apple launched the iPhone into the mobile handset market), or a restaurant chain that needs to decide about opening restaurants in a new country may be confronted with regularities that are different from, yet similar to, those in its home country.
We formalize this distinction as follows. In a stable environment, the relationship
In the following two sections, we explain—for both stable and changing environments—the respective bias–variance trade-off and how the relevant environmental characteristics enable strategists to make more accurate predictions. For stable environments, we examine the four characteristics from Gigerenzer and Gaissmaier (2011) listed above, and for changing environments we examine
Characteristics of strategic environments and their effect on learning heuristics.
Heuristics and learning rules in stable environments
Typical decision problems faced by strategists in which regularities are stable and thus can be reliably learned by using appropriate learning rules concern, among others, evaluating proposed projects and new ventures as well as allocating resources to them (Bower, 1970; Burgelman, 1983), selecting markets for entry as well as mode of entry (Barkema and Vermeulen, 1998; Bingham and Eisenhardt, 2011), acquisition target evaluation and selection (Capron and Shen, 2007), or evaluating candidates for leadership positions (Barney and Wright, 1998). The “flow of opportunities” in the work of Eisenhardt and colleagues (e.g. Eisenhardt et al., 2010) can also be seen as repeated exposure to the same type of decision problems.
The bias–variance trade-off in stable environments
Consider the following stylized example to illustrate the challenge strategists face in learning about the relationship between attributes and outcome in a stable environment due to the bias–variance trade-off. Suppose that a company’s strategist must decide whether or not to offer the position of head of a new venture to a manager who has recently been hired from a competitor to lead another project for the company. The outcome variable to be predicted is the performance of the manager as head of the venture. To keep it simple, suppose the strategist has only two pieces of information available to predict the outcome: the manager’s performance on the project that he had been hired to lead and his performance on his last project working for the competitor. Assume that both can be perfectly observed and that both attributes are informative about the performance of the employee: On average, higher performance on in-house projects and higher performance on projects at other firms are both associated with higher performance as head of a new venture. Also assume that the former is a stronger predictor of new venture performance than the latter.
The strategist must now consider which of these attributes to use and which to ignore. Say he decides to only use performance on the in-house project in predicting the outcome. This is an instance of a “one good reason” heuristic, which uses the one attribute that best predicts the outcome of the target variable while ignoring all other attributes (Gigerenzer and Gaissmaier, 2011). It has been shown that this type of heuristic can be superior in correctly predicting outcomes compared to multiple linear regression models that use all available information, and it can even outperform complex non-linear prediction strategies (Czerlinski et al., 1999). Clearly, this heuristic leads to a systematic bias in predicting the manager’s performance compared to using both pieces of information. For example, if the candidate’s performance at the competitor would be very high, ignoring it would lead to underestimating expected performance on the new venture.
On the contrary, the fact that the strategist does not know whether and how much specific attributes explain an outcome of interest introduces a second component of prediction error. Say the strategist has observed three prior instances of this type of hiring decision and must learn the relationship between manager attributes and outcome from these three instances. Clearly, any generalization from just three observations about the relationship between manager attributes and outcome will be subject to variation (i.e. for any set of three different prior observations, the strategist will likely arrive at a different estimate for the relationship). For example, by sheer coincidence among the three instances, there may have been a negative association between performance at the competitor and performance as head of new venture even though the true relationship is positive. Ignoring an attribute in making a prediction will lower the risk of inappropriate generalization.
In deciding which attributes to pay attention to and which to ignore, the strategist thus faces a trade-off. To formalize this trade-off, the prediction accuracy of any prediction strategy is given by the expected squared prediction error (ESPE), which describes the expected difference between predictions and actual outcomes (see, e.g. Hastie et al., 2008: 223). It is given as follows (in addition to bias and variance, there is an irreducible error in making predictions because the new decision situation for which the outcome is predicted has idiosyncratic characteristics that are not captured by the observable attributes)
In our example, ignoring the attribute of performance on projects at other firms can be justified if the systematic bias from ignoring it is smaller than the prediction error due to inappropriate generalization. In addition, the decision to include performance on the in-house project but excluding the other attribute may be justified by the former being a stronger predictor than the latter. More generally, to decide which attributes to include and which to ignore, the strategist must assess how reliably he can learn about whether and how much each of the attributes matters to the outcome. This, in turn, depends on the characteristics of the learning environment.
Learning rules and characteristics of stable environments
To show how the characteristics of the learning environment affect how strategists can effectively learn heuristics, we examine the four characteristics of stable environments listed by Gigerenzer and Gaissmaier (2011). Appendix 1 provides an overview of the literature that contains the formal results.
Number of available prior observations
Strategic decision situations are often relatively novel or strategists encounter specific decision situations (like acquisitions) only infrequently (Zollo, 2009; Zollo and Winter, 2002). Therefore, the number of observations of prior instances of the specific type of situation is often relatively small. As any student of statistics knows, the smaller the sample size from which predictions are made, the larger the chance of over- or underestimating the true relationship from the specific observed values (e.g. Wooldridge, 2013). This effect also applies to strategists: Attempts to identify which attributes matter and how they relate to an outcome from a small number of available observations are fraught with a risk of making inappropriate generalizations.
In such situations, learning rules that ignore some or most of the available attributes lead to simple heuristics that have higher prediction accuracy than more comprehensive prediction strategies, because they reduce the risk of making inappropriate generalizations (Gigerenzer and Goldstein, 1996; Hogarth and Karelaia, 2005). As an example, consider a company that seeks to select markets for entry and that has already entered a few markets. In selecting additional markets, the company may do better by applying a learning rule that identifies the one or two attributes that are most strongly associated with success in prior entries than by performing in-depth studies of the markets in question and, for example, using a scoring model with a large number of attributes (e.g. Calantone et al., 1999).
Bingham and Haleblian (2012) provide evidence for a specific learning rule that strategists seem to actually use. They show that whether an attribute is included in a heuristic is affected by whether managers agree or disagree about its relevance. In the light of the bias–variance trade-off, using mutual agreement to infer that an attribute matters in a specific type of situation can be seen as a learning rule that reduces the risk of inappropriate generalization by increasing the sample size from which strategists learn about what is important and what is not.
Variability
Strategic decision situations differ in their degree of variability across instances (Davis et al., 2009), where high variability means that the outcome is to a large extent determined by factors unique to the specific situation that do not generalize to other situations of the same type. For example, new product launches in the fashion industry may be high in variability, whereas new product launches in the chemical industry may exhibit relatively low variability. A high degree of variability makes it difficult to discern the effect of the observable attributes that are stable over time from the effect of the idiosyncratic characteristics of the situation (Haleblian and Finkelstein, 1999). Therefore, variability exacerbates the problem of learning from small samples noted above and thus increases the risk of inappropriate generalization. As a consequence, a strategist working in the fashion industry will have a higher risk of inappropriate generalizations after having observed 10 new product launches than a strategist working in the chemical industry, who has also observed 10 new product launches. Learning rules that account for the degree of variability can lead to heuristics that have higher prediction accuracy.
Relative importance of attributes and redundancy among attributes
Among the observable attributes, some are more important and thus more informative about the outcome than others. The less important an attribute, the smaller the increase in bias from ignoring it relative to the reduction in the variance component of prediction error. This means that the distribution of the importance of attributes affects which combination of attributes minimizes expected prediction error. An important aspect of learning rules is, therefore, to identify the relative importance of attributes and, in particular, to decide which of them can be safely ignored.
If the distribution of attributes is highly skewed, heuristics that ignore less important attributes will be ecologically rational (Gigerenzer and Goldstein, 1996; Hogarth and Karelaia, 2005). In particular, when attributes are non-compensatory, which means that one attribute is more informative about the outcome than all other attributes combined, using only this one attribute can lead to lower expected prediction error than using multiple attributes (Gigerenzer and Goldstein, 1996). In addition, the more skewed the distribution of the importance of attributes, the easier it is to rank or prioritize them (Gigerenzer and Gaissmaier, 2011) and, as a consequence, the more easily the most important attributes can be reliably identified in small samples.
If two or more attributes are highly correlated, they largely contain the same information about the outcome. They are thus (imperfect) substitutes in predicting the outcome and at least partly redundant (Bingham and Eisenhardt, 2014; Hogarth and Karelaia, 2005). High correlations among attributes make it more difficult to reliably learn the relationship between these attributes and the outcome because it makes it difficult to discern their individual effects on the outcome. Therefore, ignoring one of two correlated attributes can lower the risk of inappropriate generalization. As a consequence, an important aspect of learning rules is to assess whether any two attributes are highly correlated so that one of them can be safely ignored.
Heuristics and learning rules in changing environments
Strategists must also often predict decision-critical variables in an environment in which the regularities are undergoing change. For example, a stable environment may be exposed to a discrete “shock” that leads to a change in regularities for a given type of decision problem. Take the example of Apple’s launch of the iPhone, which changed the characteristics that make a mobile phone successful. Similarly, strategists may encounter a new type of decision situation that is similar to one they have already frequently encountered in the past. As an example, consider Cisco, which in the 1990s grew through making a large number of acquisitions of small companies, but later switched to also acquiring large companies. In both cases, the strategist is faced with the question of whether and to what extent what he has learned about the relationship between available information and the decision variable in the original context is transferable to the new situation, whether adjustments are necessary, or whether it is better to discard prior experience altogether (Finkelstein and Haleblian, 2002).
The bias–variance trade-off in changing environments
We examine the simplest possible setup of a changing environment, namely one in which regularities are stable over time except for a one-time change after which the regularities are different from the situation before the change. While this is a highly stylized type of change, it allows to nicely illustrate the bias–variance trade-off in changing environments.
As above, the strategist wants to make an as accurate prediction as possible. Assume he knows that the environment has changed and that he has a small number of observations from the environment after the change has occurred. Intuitively, to make accurate predictions, he should only use the observations from the environment after the change to learn about the relationship between attributes and outcome and ignore his experience from before the change, because the latter is biased with respect to the true relationship after the change (March, 1991). For example, Bettis and Prahalad (1995) have argued that firms and strategists need to unlearn or forget outdated lessons once the environment has changed. Of course, the magnitude of the bias depends on the degree of similarity between the relationship between attributes and outcomes before and after the change (what we call the magnitude of change). Thus, outdated experience or experience from a different but similar decision situation may nevertheless contain useful lessons for the situation at hand (Gentner, 1983), and the more so the smaller the magnitude of change or the larger the degree of similarity (Finkelstein and Haleblian, 2002).
This does not, however, explain why using outdated experience can lead to more accurate predictions but only that the bias induced by doing so may sometimes be relatively small (but a bias nonetheless). What is missing—as in stable environments—is to account for the variance component of prediction error when strategists have to learn the relationship between attributes and outcome. Specifically, ignoring experience from before the change will eliminate systematic bias in learning this relationship but at the same time reduce the total number of observations used in learning. As a consequence of ignoring this experience, the variance component of prediction error may be large. That is, ignoring experience from before the change will increase the risk of inappropriate generalization.
Taken together, this means that transferring outdated experience to a changed or different environment has two effects: It leads to a systematic bias in making predictions but can reduce the variance component of prediction error. Thus, there is also a bias–variance trade-off in changing environments: Strategists have to weigh the risk of making wrong predictions because their prior experience does not apply to the changed or new environment against the risk of making wrong predictions from inappropriately generalizing from a small sample of observations in the changed or new environment. This bias–variance trade-off works in the opposite direction to the one in stable environments: Ignoring prior (outdated) experience will lower the bias but increase the variance component of prediction error. As a consequence, not ignoring biased experience can lower the risk of inappropriate generalization. The implication is that, counter-intuitively, it may be better to use experience that is known to be biased if doing so will reduce the variance component of prediction error by more than it increases the bias.
Learning rules, experience transfer, and characteristics of changing environments
A transfer heuristic
As in stable environments, the bias–variance trade-off in changing environments provides a general explanation why it may be beneficial to use a biased heuristic to predict an outcome. Similarly, whether this is the case depends on the characteristics of the learning environment. We thus examine this question with respect to three specific characteristics of changing environments: the number of observations
We formalize the transfer heuristic as follows. Assume the environment has changed from
where
Learning rules in changing environments
In our stylized model, after a change has occurred, the strategist essentially finds himself in a stable environment except that he can now draw on experience from the situation before the change has occurred. Therefore, a key aspect of learning in a changed environment or in a new environment that is similar to one that one is familiar with is to decide to what extent to start learning afresh based on using observations from the new or changed environment and to what extent to rely on and transfer prior experience.
In the context of our model, this means that a strategist who is confronted with a new decision problem that he knows is drawn from
For learning rules that help decide about experience transfer, it is important to identify situations in which strategists will want to give strong weight to prior lessons learned (and thus discount what they have learned in the environment after change). For example, in the context of a US firm’s entry into Israel, Szulanski and Jensen (2006) found that initially selecting a higher transfer rate was associated with higher performance. Intel’s famous “copy exactly” rule—which rules out that any learning about the new environment will be used in setting up new facilities—can be seen as a learning rule that is effective because it avoids the risk of misjudging what matters in the new environment. As observations in the changed or new environment accumulate, the strategist’s prediction error can be reduced by gradually reducing the transfer rate and increasingly relying on the learning from the new environment. Similarly, a higher transfer rate can increase the accuracy of predictions in the new environment if variability is high, because higher variability increases the risk of inappropriate generalization.
Variability and change as distinct environmental characteristics
In our model, variability and change are two distinct parameters. The strategy literature often confounds them in verbal treatments (e.g. Brown and Eisenhardt, 1997; Eisenhardt and Martin, 2000), whereas formal treatments typically assume a stable environment. For example, the environment in the simulation study by Davis et al. (2009) is stable, because the underlying distribution from which the opportunities are drawn does not change over time, but exhibits variability (thus, in principle, the parameters of the process that generates the opportunities could be learned by the agents, even though Davis et al. (2009) do not model this).
Our analysis shows that distinguishing variability from change is important because they impose two different types of constraints on how strategists may be able to deal with situations in which they have to learn the relationship between available information and the decision variable they must predict. While variability makes learning difficult because it increases the risk of making inappropriate generalizations, change means that what has been learned earlier will not necessarily be applicable anymore. This has consequences for the types of learning rules strategists may best use to reduce prediction errors: In stable environments, higher variability increases the benefits of simple heuristics that ignore observable attributes. In changing environments, higher variability increases the optimal rate at which the lessons learned in prior environments are transferred to a similar, new context.
Discussion
In the following, we discuss implications of our findings for research on heuristics, managerial cognition, and (dynamic) capabilities. We conclude by highlighting limitations of our approach.
Heuristics, simple rules, and learning rules
The distinction between learning and using heuristics allows for conceptual clarity and thus aids in interpreting results of empirical research on heuristics in a strategy context (Vuori and Vuori, 2014). For example, while Bingham and Eisenhardt (2011) argue that “firms learn heuristics” (p. 37), they only describe simple rules as the outcome of a learning process. More recent research by Bingham and colleagues (Bingham and Haleblian, 2012; Bingham et al., 2019), on the contrary, sheds light on the underlying learning process. Prior research on heuristics also provides hints that strategists indeed use learning rules to reduce the risk of inappropriate generalization. As stated earlier, an example can be found in the work of Bingham and Haleblian (2012), who show that managers decide whether an attribute is included in a heuristic based on whether managers agree or disagree about its relevance. Other learning rules (for example, concerning under which conditions to generalize or to transfer prior experience, or to select one of the multiple observable attributes, or how to weigh them) may exist and be frequently used by strategists.
However, our finding that the learning process exposes strategists to a bias–variance trade-off has yet to be accounted for in empirical research on heuristics. In particular, the question of whether and to what extent strategists take into account that they incur a risk of inappropriately generalizing from a small number of observations, and what mechanisms they use to mitigate this risk, remains largely unanswered. The implication for empirical research on heuristics is that—in addition to identifying and documenting the heuristics strategists use (e.g. Bingham and Eisenhardt, 2011)—researchers also need to dig deeper into the underlying learning processes and seek to systematically identify the learning rules which strategists use to learn heuristics in the face of the bias–variance trade-off. Here, our work provides the basis for examining whether and why such learning rules and the resulting heuristics can be considered ecologically rational.
Heuristics, learning rules, and dynamic capabilities
While heuristics as simple rules (as outcome of the learning process) can be considered an aspect of a capability, the underlying learning processes (including learning rules and decisions about transfer of prior experience) can be considered an aspect of dynamic capabilities (Winter, 2003). Bingham et al. (2019) provide initial empirical evidence for this link by showing how strategists develop and adapt heuristics over time and turn them into capabilities. However, the distinction between learning and using heuristics also has broader implications for the link between heuristics and dynamic capabilities at the level of the organization and can thus shed light on how learning rules can serve as microfoundations for dynamic capabilities (Di Stefano, Peteraf, and Verona, 2014; Felin and Foss, 2005).
Because their applicability depends on the characteristics of the environment, effectively applying learning rules and transferring prior experience from one context to another related one requires an understanding and a correct assessment of the relevant environmental characteristics. For example, situations with high variability require having more observations or weighing prior experience more strongly when transferring experience than situations with low variability. Similarly, deciding about transfer of experience requires an assessment of the similarity between contexts (Finkelstein and Haleblian, 2002). As a consequence, knowledge about environmental characteristics may be an important aspect of dynamic capabilities. It is a kind of “meta-knowledge” (Foss and Jensen, 2018), and examining its role in learning heuristics and transferring prior experience are important avenues for further research.
Our analysis of the transfer heuristic implies that the characteristics of environments may also constrain the extent to which heuristics are fungible across situations (Levinthal and Wu, 2010). In fact, the lessons from one context may simply not be applicable in another context, and such lack of transferability means that strategists may be forced to start learning afresh, which in turn increases the risk of making inappropriate generalizations from small samples. The question of how fungible and thus transferable heuristics are across different types of situations is therefore highly important for our understanding of dynamic capabilities, not the least because it also affects the extent to which heuristics may be idiosyncratic to specific firms (Eisenhardt and Martin, 2000).
Avoiding inappropriate generalization and managerial cognitive capabilities
Our results also have implications for understanding the sources of heterogeneity among individual strategists in their ability to make good decisions in novel and changing environments. Such heterogeneity is typically explained through differences in managerial cognitive capabilities (Adner and Helfat, 2003; Helfat and Peteraf, 2015), which are, in turn, considered to stem from either differences in relevant prior experience (e.g. Castanias and Helfat, 1991; Hambrick and Mason, 1984) or differences in specific mental faculties (e.g. Helfat and Peteraf, 2015). Our findings imply that understanding the bias–variance trade-off and being aware of the risk of inappropriate generalization in combination with knowledge of the relevant environmental characteristics are important aspects of managerial cognitive capabilities because they lead to better learning rules and, by consequence, to heuristics with higher prediction accuracy.
This leads to empirically testable predictions concerning the relationship between managers’ characteristics and the learning rules they use or the accuracy of prediction from learned heuristics. For example, managers trained in statistics (e.g. those who have an engineering degree) may be more aware of the risk of inappropriate generalization and thus use more appropriate learning rules. Similarly, managers with certain experience may be better at assessing the similarity between contexts and thus be better at deciding about whether and to what extent to transfer prior experience. An intriguing question for further research is whether differences between strategists can be reduced by training them about the bias–variance trade-off. Laboratory experiments can be used to answer that question. Finally, empirical research may also examine whether individual strategists deliberately and consciously avoid inappropriate generalization or whether they are intuitively aware of the bias–variance trade-off and what explains differences among strategists in that respect. Using think-aloud protocols may help to answer that question.
Limitations and conclusions
The bias–variance trade-offs that we explained in this article provide an explanation for how the characteristics of decision environments affect the accuracy of predictions strategists make when they have to learn the relationship between available information and the decision variable they must predict from available observations. While this is an important explanation for why simple and biased heuristics are associated with more accurate predictions, it is not the only one. For example, Bettis (2017) has argued that a key benefit of heuristics is that they are effective in dealing with complexity in strategic decision-making. In addition, Vuori and Vuori (2014) have argued that a key benefit of heuristics is that they facilitate coordination among individuals within an organization.
We have also made a number of simplifying assumptions in our formal approach. For example, we have assumed a linear relationship between attributes and outcomes and a continuous outcome variable. However, the fundamental explanation why simple heuristics allow strategists to make more accurate predictions (the bias–variance trade-off and the risk of inappropriate generalization) applies more generally, and thus, the results we have presented above are valid beyond the heuristics that can be modeled with the functional form we use. In addition, in studying changing environments, we have exclusively focused on the most simple instance of change, namely a one-time shift. However, in reality, often more complex patterns of change exist. For example, while some regularities may be stable across instances of the same decision type, others will change more frequently. In particular, there may be higher order regularities which give patterns to changing environments (e.g. industry lifecycles; Klepper, 1997). These, in turn, will affect how strategists may learn heuristics effectively. Further insights may thus be gained through formal work that relaxes some of the simplifying assumptions we have made.
We have highlighted some areas for further research above. The most important implication of our findings for future research is that both theoretical and empirical research must account for how the characteristics of strategic environments affect how strategists learn and use heuristics.
Footnotes
Appendix 1
Appendix 2
Acknowledgements
The authors would like to thank Pantelis P. Analytis, Eckehard Olbrich, Mikko Rönkkö and Jan Woike as well as our editor Ann Langley and three anonymous reviewers for their highly valuable comments.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
