Abstract
Scholarship on (mis)information does not easily translate into recommendations for policy-makers and policy influencers who wish to judge the accuracy of science-related truth claims. This is partly due to much of this literature being based on lab experiments with captive audiences that tell us little about the durability or scalability of any potential intervention in the real world. More importantly, the “accuracy” of scientific truth claims is much more difficult to define than many scholars in this space acknowledge. Uncertainties associated with the nature of science, sociopolitical climates, and media systems introduce compounding error in assessments of claim accuracy. We, therefore, need a more nuanced understanding of misinformation and disinformation than those often present in discussions of the “infodemic.” Here, we propose a new framework for evaluating science-related truth claims and apply it to real-world examples. We conclude by discussing implications for research and action on (mis)information, given that distinguishing between true and false claims is not as easy as it is sometimes purported to be.
Scientists and policy actors have long attempted to categorize the forms and attributes of misinformation and disinformation (Soprano et al. 2021), but this work is difficult to integrate along dimensions that serve both research and on-the-ground efforts to deal with inaccurate truth claims. Classifications variously center on aspects of (or combinations of) intent (Freelon and Wells 2020), facticity (Tandoc, Lim, and Ling 2017), authenticity (Glasser, Varma, and Zou 2018), falsifiability (Berinsky 2017), and other factors. Often, these classifications aim to typologize claims as misinformation, disinformation, pseudoscience, conspiracy, rumor, native advertising, and so on (for an overview, see Scheufele and Krause 2019). As this work has proliferated, hopes for a unifying logic to these categorizations and typologies have dimmed. For example, a criterion like “intent” (Wardle 2017) is not obviously relevant to classifying the accuracy of a claim, although it may be a useful criterion for assessing relative threat, albeit a blunt one. Meanwhile, others have argued that “some types of misinformation hold more consequence for public health behavior and outcomes than others” (Southwell et al. 2019, 282), thus highlighting variability in the relative threat claims might pose.
This lack of clear attention to the need to simultaneously assess both the accuracy of claims and their relative threat contributes to disjointed academic insights (Jerit and Zhao 2020), creates difficulty applying research in practice, and leaves us unable to judge whether emerging meta-analyses are treating conceptually distinct studies of “misinformation” as interchangeable (e.g., Walter and Tukachinsky 2019). “Misinformation” is also becoming a near-meaningless catch-all term for “studies focused on the proliferation and impacts of false information” (Cacciatore 2021, 1), with many studies (including some of our own) utilizing oversimplified operationalizations or offering no definition of misinformation whatsoever (for examples, see Vraga and Bode 2021; Freiling, Krause, Scheufele, and Brossard 2021; Nyhan et al. 2019).
Academics risk spawning an industry of solution-focused work that assumes that “false” science-related claims are easily identifiable—a flawed assumption that has already led to studies “correcting” false beliefs with misleading claims (e.g., Bode, Vraga, and Tully 2021). Although the limitations we highlight also apply to aggregate-level work on the scope of misinformation (e.g., Grinberg et al. 2019), our points are geared more urgently toward research on individual-level interventions, where the need for nuance is especially important. Without precision, we waste time and energy—or, worse, we may do collateral damage (Scheufele, Krause, and Freiling 2021).
Among policy-makers and policy influencers, fuzzy conceptualizations leave actors unable to discern the relative threat a claim might pose or the harm it might do to differentially vulnerable populations. Should we focus on inaccuracy, on characteristics of the source, or both? Interventions are also difficult to enact if the research that informs them asserts that the solution is effective only for claims that are not politically polarizing (e.g., Weeks 2018), when politically polarizing claims are precisely the ones that often concern policy-makers and pose greater threats. When a framework for classifying truth claims cannot account for forms of sociopolitical distortion (e.g., elite salience) or science-related distortion (e.g., fluidity of evidence about COVID-19 masking) and the intersections between these distortions, the scientific community becomes an easy target for politically motivated attacks. A good example of this problem is so-called Lancetgate, which occurred in summer 2020. When former president Trump touted the efficacy of hydroxychloroquine as a treatment for COVID-19, an outpouring of factual correctives appeared in the New York Times and elsewhere. These correctives to Trump’s statements were based largely on scientific evidence published in The Lancet, which was subsequently retracted, quickly sparking accusations by right-wing outlets like Breitbart of political bias in the scientific community (Delingpole 2020).
Integrating existing work in ways that inform productive pathways for the research and practice of misinformation, therefore, requires attention to the ecosystem in which (mis)information emerges and evolves, where assessing “accuracy” and “threat” is complicated by intersecting distortions arising from the nature of science, sociopolitical climates, and media systems. These distortions introduce error—and, therefore, uncertainty and variability—in the many criteria we use to assess accuracy and threat. While some existing frameworks (see above) do classify truth claims in terms of multiple, intersecting attributes (some of which are indeed conceived of as variable), none of them explicitly acknowledges that intersecting attributes can compound the possible error associated with our judgments in ways that influence the costs of intervention. In this article, we argue that evaluation of claims as misleading and worthy (or not) of intervention should be based on (1) a set of attributes that are variable, not dichotomous; (2) the relevance of any given attribute to a particular claim; and (3) the ways in which relevant attributes intersect and interact, perhaps leading to compounding sources of error.
Intersecting Attributes of Information: An Integrated Framework for Evaluating Truth Claims
Scientific truth claims are rarely clear-cut. In part, this is because scientific insights arise through collective scrutiny across many studies and many methodologies, as well as across time. The claims that science can make are necessarily limited in scope and domain, and they change as the evidence base shifts through scientific processes of self-correction and falsification. Even laws like gravity are explanatory models that cannot technically be “proven.” So why are we not concerned about fringe publics clinging to misguided beliefs that are at odds with the scientific consensus about gravity? The answer is twofold: First, the topic of gravity is not surrounded by sociopolitical debate that would introduce additional uncertainties or distortions to scientific truth claims. Second, and related, false claims about gravity are unlikely to gain significant traction in legacy media or on online platforms.
This example highlights three broad categories of uncertainty and distortion that can surround scientific truth claims and that form the organizational structure of our framework: (1) scientific, (2) sociopolitical, and (3) distributional uncertainties. As we show, each of these three types of uncertainty or sources of error dictates the classification of a scientific truth claim as more or less accurate and more or less threatening.
Within each category of uncertainty are specific claim attributes (see Table 1), each of which is itself surrounded by variability, either across issues or over time. Our list of attributes is inspired in part by a long legacy of work in the risk and decision sciences demonstrating the need to understand any risk—including the risk posed by dubious claims (Krause et al. 2020)—not only in terms of factors related to uncertainty, but also in terms of societal relevance, the form and magnitude of possible harm, and the liability of social amplification or distortion (Krimsky and Golding 1992). By evaluating truth claims in terms of the degree to which any of these attributes is relevant on its own but also when intersecting with other attributes, researchers and policy actors can assess a claim’s accuracy and potential threat simultaneously, as is necessary when deciding how (and whether) to mitigate (mis)information.
Intersecting Attributes Introducing Variability in Evaluation of Claim Accuracy and Threat
Intersecting Attributes Introducing Variability in Evaluation of Claim Accuracy and Threat
Attainability of knowledge
Some claims will face epistemological limitations that necessarily introduce uncertainty—for example, methodological limitations make it difficult to define a kilogram (Freiling, Krause, Scheufele, and Chen 2021), and earthquake forecasting is plagued by high error rates. A more extreme version of this attribute, however, appears in attempts to debunk unlikely and explosive claims, such as the COVID-19 virus escaping a research lab in China (Kessler 2021). Key information about this claim may never become available, or it may be lost.
Fluidity of evidence
To philosophers of science, the fluidity of scientific “truth” is well known—it is the deliberate and self-correcting nature of science (Hansson 2021). However, the uncertainties associated with self-correcting science are intensified during fast-moving crises like COVID-19, during which science and policy face the impossible task of countering claims they know are wrong with science that might not turn out to be right, either (Scheufele et al. 2020).
Stability across policy contexts
A claim can be accurate in one policy context but misleading in another. Standardized testing for college is a useful example. Discussions about predictive or criterion-related validity of standardized instruments like the GRE or MCAT have long questioned the ability of these instruments to accurately predict future academic success. Instead, many argue, these instruments tap candidates’ social class, socioeconomic background, and other sociodemographic confounds rather than an ability to succeed in different educational settings. The GRE, therefore, might be an accurate and reliable tool for assessing structural inequities, but it might inspire misleading claims if it is used to predict students’ real-world success in graduate school. Does this mean that standardized tests were not able to offer any valid information about applicants and we mostly admitted the wrong students? No. Does it mean that standardized tests were used without a full appreciation of the distortions and uncertainties surrounding their claims? Yes.
Fallibility of experts
Defining accurate claims (in part) as those aligned with the best available science at the time they are made (e.g., Southwell et al., this volume) can often be useful but will fail when science is simply wrong. In the 2010s, a paper widely hyped by NASA (2010) triggered over a year of global news coverage before being debunked by other scientists. While this could be construed as the self-correcting nature of science, it also demonstrates why intersectionality matters for assessing truth claims—that is, sources of error originating in the scientific community can be amplified by intersecting sociopolitical influences. As West and Bergstrom (2021, 1) noted, “Most analyses of misinformation focus on popular and social media, but the scientific enterprise faces a parallel set of problems—from hype and hyperbole to publication bias and citation misdirection, predatory publishing, and filter bubbles.” A detailed discussion of these points and the ways that “misinformation creeps into science” are available elsewhere in this volume (see Swire-Thompson and Lazer, this volume).
Consistency of evaluation standards
Whose standards of accuracy evaluation do we use, and when? Early in the 2010s, the results of a single study that some called “an intellectual foundation” in support of austerity politics shaped the Obama administration’s policy to curb the domestic deficit, as well as Germany’s austerity push for the Eurozone (Herndon, Ash, and Pollin 2014, 259). Yet attempts to replicate it revealed errors and omissions that undermined the evidence for these policies (Herndon, Ash, and Pollin 2014). The original authors claimed that the findings were valid and attributed the “kerfuffle,” as they diminutively called it, to differential evaluation standards, arguing that “politicians, commentators and activists” had “exaggerated or misrepresented” their results (Reinhart and Rogoff 2013).
Acceptability of source error
President Trump made 30,573 dubious claims in almost four years in office (Kessler, Rizzo, and Kelly 2021). Comparatively, the New York Times posts multiple daily corrections to its own reporting. Comparing Trump’s false claims to corrections by the New York Times is flawed, but it raises the question, What rate of error or timeliness of correction is sufficient for a claim to be seen as an “honest mistake” or as “misinformation?” This question is instructive for politicians, but it is even more relevant for sources of scientific claims. A 2021 National Academy of Medicine report, for instance, defined credible sources as “offering information that is consistent with the best scientific evidence available at the time and employing processes to reduce conflict of interest and promote transparency and accountability” (Kington et al. 2021, 6–7). But what error rates are acceptable for a “credible source” to (not) be aligned with the best available scientific evidence? How many retractions or nonreplications are acceptable for a peer-reviewed journal to be seen as a broker of credible science?
Associated harm
The potential harm associated with false belief in some claims will be much greater than it is for others. Misinformation might be considered particularly dangerous and worthy of interventions if it demonstrably influences attitudes or behaviors in ways that cause individual, societal, or institutional harm. For example, the individual and societal consequences of believing that the Earth is flat arguably situate this claim much lower on the associated harm spectrum than a claim such as “childhood vaccinations cause autism,” which can lead both to individual death and to the reactivation of a previously eradicated disease.
(Elite) salience
Decades of work on news values (Galtung and Ruge 1965) and agenda building (Cobb and Elder 1971) show that elite endorsements of claims increase their reach. Similarly, “influencers” in social networks have become the truth-defying equivalent of the “two-step flow” when they perpetuate misinformation (Katz and Lazarsfeld 1955). When it comes to identifying claims that warrant attention from academic or policy communities, elite endorsements matter. They are what move claims out of obscurity and into mainstream debate.
Algorithmic traction
Truth claims of the future will circulate via media that are capable of mutating, as algorithm-based dissemination platforms powered by artificial intelligence (AI) come to know us better than we know ourselves (Youyou, Kosinski, and Stillwell 2015). Companies that own these algorithms are incentivized to exploit human biases, to predictively model behavior, and to curate our content with unprecedented precision (Cacciatore, Scheufele, and Iyengar 2016). When we design machines to outperform humans, we should not be surprised when they do (Scheufele, Krause, and Freiling 2021), and we should recognize that our efforts to arm humans against nefarious forms of content curation may be limited in the short term, possibly futile in the long term, or even used against us by systems that are designed to anticipate and respond to human behavior. Further, we should expect these platforms to evolve in ways that we do not understand, either because we lack access to proprietary data (Freiling, Krause, Scheufele, and Chen 2021) or because the AI has advanced beyond our knowledge of its activities.
Intersecting Attributes: Using the Framework
Here, we use our framework to briefly trace three real-world science-related claims:
Claim 1: “Mask wearing can safely mitigate the spread of COVID-19”
The science of mask wearing amid COVID-19 was highly variable and contested. This fluidity of evidence made accuracy evaluation difficult, and these difficulties were amplified by both elite salience and different standards of evaluation among science and policy actors. Meanwhile, the associated harm linked to misperception on this topic was high. Uncertainty about the science of mask wearing interacted in this case with associated harm and other forms of sociopolitical distortion to produce a combustive mix that demanded attention.
Claim 2: “Injecting bleach can help to cure COVID-19”
The fluidity of evidence for this claim is virtually zero. But several other intersecting attributes have been more variable. This claim’s algorithmic traction was extremely high, partly because of its nonsensical novelty and partly because of excitement among liberal elites (elite salience) to catch former president Trump flat-footed, given that the “injecting bleach” claim was often falsely attributed to him (McGraw and Stein 2021), including by then-candidate Biden on the campaign trail (Lybrand 2020). A claim that should have been written off as press conference nonsense with little real-world impact was catapulted into a false statement by Biden due to intersecting distortions that fed each other over time.
Claim 3: “The Earth is flat”
Recently, scholars of misinformation have turned their attention to claims that the Earth is flat. As with the injecting bleach example, the fluidity of evidence is extremely low. Still, scholars are calling on “scientists and science communicators . . . to engage [flat earther] communities where they are” (Landrum, Olshansky, and Richards 2019, 160). Our framework clarifies why this intuition to act might be misguided: elite salience and algorithmic traction are low, so the claim is less likely to become widely repeated or endorsed in a manner that distorts its perceived accuracy. Beyond this, the associated harm is arguably quite low as well.
Nuance Matters . . . for Research and Action on Misinformation
We have argued that existing research on misinformation insufficiently accounts for intersecting sources of variation in judgments of claim “accuracy” and, further, that it does not easily facilitate the simultaneous assessment of accuracy and potential threat, opening a gulf between research and practice. Many claims languish in a “gray area,” conceptually and operationally. Although some may argue that we can still intervene in black-and-white claims, we neglect improving our understanding and handling of gray claims at our peril, especially if correcting them can have unintended consequences. (See the “Lancetgate” example we mentioned earlier, where politically salient fact-checks that relied on a shifting scientific evidence base resulted in accusations of bias against the scientific community.) In addition to providing a blueprint for evaluating the relative accuracy and potential harm posed by a given scientific truth claim, our framework, therefore, also has implications for the trajectory and shape of research and action on misinformation.
Researchers can do more coherent and policy-relevant work
Researchers who want to either explain how (mis)information works or to investigate interventions can use our framework to more systematically account for the complexity of claims and can operationalize them in more nuanced ways in their study designs. Researchers will also be better able to compare results across studies and to connect bodies of literature—both theoretical and empirical—as addressing similar kinds of claims. For example, studies presenting empirical evidence about the efficacy of correctives to false claims that lack elite salience or algorithmic traction can situate themselves as distinct from studies attempting to correct claims where such attributes are relevant. Further, this framework helps researchers to admit that we cannot (and typically do not) focus on all attributes at once. Specifying which attributes form the focus of a given study (and which are left out by necessity) will help us to assess, for instance, which controls to include in analyses. We can then talk with more clarity about our results, so they can better inform policy.
Interventions can better account for sources of distortion
Current interventions to mitigate misinformation are fraught with landmines. Fact-checking and accuracy prompts, for example, are not only sometimes agnostic to the scientific, sociopolitical, and distributional uncertainties that contextualize claims and form the basis of our framework, but they sometimes also assume that problems would be solved if only people knew the truth. There is, however, mounting evidence contradicting this “knowledge deficit model,” as well as other alleged public “deficits” that become rallying points for seeming “crises” of public opinion about science (Krause et al. 2021). Therefore, while the urge to “do something” about false claims is understandable, we need to think carefully about how to best use our resources.
A good starting point for refining intervention research is to ask which kinds of truth claims are indeed dangerous, and our framework enables this task. Recognizing both the complexity of claims and the variation in threat they might pose should direct efforts toward the most problematic claims and ensure that those efforts are tailored to the factors that make the claim threatening. For example, if algorithmic traction is a key threat-maximizing factor for many claims, then we should design interventions accordingly. We might deploy algorithms to flag claims that accelerate through new media, or intentionally optimize correctives for algorithmic prioritization. Of course, as long as companies make money by privileging claims that resonate with preexisting beliefs instead of promoting accuracy, this is a futile argument.
We can (and should) avoid creating an infodemic about the infodemic
The popular assertion that we are facing a novel misinformation crisis is an empirical claim, the veracity of which would require that misinformation has (1) not existed before, (2) not existed at this scale, (3) not existed in its current form, or (4) not influenced behaviors in ways that are as harmful as what we see now. In all cases, claiming that the threat is “new” necessitates a multipoint time comparison and evidence of a delta, as well as reason to believe that the measurement of “misinformation” at each point is comparable.
Yet as argued above, scholars of misinformation have had limited success in integrating and refining our conceptualizations to create clear “standards” for operationalization, which means that such analyses would be difficult, or at least beset by significant error. It is reasonable to suggest, then, that claims about the novelty, nature, or scope of misinformation problems—including an “infodemic” about COVID-19 or other topics—should be understood as carrying more uncertainty than some researchers are perhaps willing to admit. Shortcomings in how we understand what misinformation is (and wide variation in how we define it, if we define it at all) necessarily limit how confidently we can claim that an “infodemic” exists, what its contours are, and how problematic it is.
Footnotes
Nicole M. Krause is a doctoral candidate in the Department of Life Sciences Communication at the University of Wisconsin–Madison. Her work focuses on public perceptions of controversial science and issues of misinformation in science communication contexts.
Isabelle Freiling is a postdoctoral fellow in the Department of Communication at the University of Vienna. Her research focuses on media use and effects, as well as science and political communication, with a particular focus on social media and misinformation.
Dietram A. Scheufele is the Taylor-Bascom Chair in Science Communication and Vilas Distinguished Achievement Professor in the Department of Life Sciences Communication at the University of Wisconsin–Madison and the Morgridge Institute for Research. His work examines the social effects of emerging science and technology.
