Between a logic of disruption and a logic of continuation: Negotiating the legitimacy of algorithms used in automated clinical decision-making

Abstract

In both popular and academic discussions of the use of algorithms in clinical practice, narratives often draw on the decisive potentialities of algorithms and come with the belief that algorithms will substantially transform healthcare. We suggest that this approach is associated with a logic of disruption. However, we argue that in clinical practice alongside this logic, another and less recognised logic exists, namely that of continuation: here the use of algorithms constitutes part of an established practice. Applying these logics as our analytical framing, we set out to explore how algorithms for clinical decision-making are enacted by political stakeholders, healthcare professionals, and patients, and in doing so, study how the legitimacy of delegating to an algorithm is negotiated and obtained. Empirically we draw on ethnographic fieldwork carried out in relation to attempts in Denmark to develop and implement Patient Reported Outcomes (PRO) tools – involving algorithmic sorting – in clinical practice. We follow the work within two disease areas: heart rehabilitation and breast cancer follow-up care. We show how at the political level, algorithms constitute tools for disrupting inefficient work and unsystematic patient involvement, whereas closer to the clinical practice, algorithms constitute a continuation of standardised and evidence-based diagnostic procedures and a continuation of the physicians’ expertise and authority. We argue that the co-existence of the two logics have implications as both provide a push towards the use of algorithms and how a logic of continuation may divert attention away from new issues introduced with automated digital decision-support systems.

Keywords

cancer and palliative care chronic illness and disability ethnography technology in healthcare theory

Introduction

With healthcare revolving more around data (Hoeyer, 2016; Ruckenstein and Schüll, 2017), and as the magnitude of data made available by digital tools far exceeds the analytical capacities of the human mind, algorithms are gaining a central and an increasingly inevitable role in healthcare (Alston, 2019; Danish Ministry of Finance, 2016; Ministry of Finance and Ministry of Industry, Business and Financial Affairs, 2019; NHS Digital, 2019). One area within healthcare in which algorithms play a significant role is in relation to clinical decision-making on issues of triage, diagnosis, prognosis, and treatment.

Within both popular and academic literature the use of algorithms in healthcare is widely discussed. Recently discussions have revolved around the controversial character of algorithms being a way of automating human actions, with an interest in how digital algorithms ‘somehow mysterious and inevitable, are contributing to the shape of our lives in ways both big and small’ (Dourish, 2016). In common with many previous socio-technical imaginaries, algorithms in clinical care are infused with both high hopes and anxiety (Jasanoff and Kim, 2015). In popular accounts of the algorithmic future of healthcare, the substitution of the physician for an algorithm is already a well-established image (Wachter, 2017). Headlines such as ‘The algorithm will see you now’ (Balicer, 2018) and ‘A.I. versus M.D.’ (Mukherjee, 2017) feature on TED talks and news articles, and depict algorithms as powerful new autonomous actors in healthcare: disrupting existing roles and authority, potentially furthering more rational health systems (Topol, 2019).

In academic discussions within the social sciences, algorithms are discussed along with the more general and increased focus on ‘the datafication of health’ (Ruckenstein and Schüll, 2017). As in the popular account, these discussions also often seem to emphasise the disruptive character of new digital technologies, including algorithms. As such, within the field of critical digital health studies, introduced in 2016 by Lupton in this journal, we see how digitalisation within healthcare and the use of algorithms are addressed in the light of the ‘apparent disruptive and revolutionary opportunities’ they present (Lupton, 2016: 53). Similarly, in a recent special issue of this journal, the papers included were introduced by stating that they each contributed to the common question of ‘what the digital revolution in fact means for the present and future health and day-to-day lives of the people’ (Petersen et al., 2019: 382). In both these examples the disruptive character of digital health tools and algorithms seem to form the research interest.

In this paper, we argue that these contemporary accounts of algorithms in healthcare are often associated with what we call a logic of disruption. However, algorithms may also constitute a continuation of an established practice around standardisation and evidence-based medicine, and hence be associated with what we call a logic of continuation. The latter observation draws both on our early empirical findings and classical studies of evidence-based medicine within the field of Science and Technology Studies (STS). We apply the idea of the two logics as an analytical framing and examine how the delegation of decision-making to algorithms is negotiated in practice. On what grounds does it become legitimate to delegate, fully or partly, a clinical decision to an algorithm? And to what extent can the analytical framing of a logic of disruption and a logic of continuation help us understand such negotiations around the role(s) of algorithms in contemporary healthcare?

Empirically, we study the negotiations and enactments appearing during the development, adaptation, and application of a specific type of new algorithmic tool used in clinical decision-making in the context of Danish public healthcare: Patient Reported Outcome (PRO) tools. We followed two different disease areas: heart rehabilitation, and breast cancer follow-up care. In both cases, algorithms were introduced along with the PRO-tools, and used for clinical decision-making. Using PRO-tools, patients are requested to respond to digital questionnaires about their physical and mental wellbeing as part of their clinical treatment trajectory (Black, 2013; Trygfonden and ViBIS, 2016). The PRO-data produced are then processed by an algorithm and applied in clinical decision-making. Based on the algorithmic processing, PRO-data may be given specific colour codes according to the severity of the problems that the data indicate, and healthcare professionals may then apply this information in decision-making. In other cases, the patients are automatically screened based on the data, selecting those to be contacted by a health professional in person, and those to receive only an automated response. In a Danish context, algorithms used in PRO-tools may be based on validated and published thresholds or determined at a local level (The Danish PRO Secretariat, n.d.), and have, as far as we are aware not (yet) been coupled with machine learning. Thus, algorithmic sorting constitutes an essential part of digitised PRO-tools applied in clinical practice (Langstrup, 2019; Snyder and Aaronson, 2009).

In Denmark, there is currently a movement towards an increased clinical use of PRO-tools at a national level (Langstrup, 2019). This is partly driven by the health authorities who have initiated an institutionalisation and standardisation of the clinical use of PRO-data (Langstrup, 2019). The clinical use of PRO-tools is also tested and adopted in a number of local initiatives (see e.g. de Thurah et al., 2018; Hansen et al., 2019; Schougaard et al., 2016). The extent of the Danish movement towards using PRO-tools, and the central role that algorithms play in relation to PRO-tools, therefore makes this a highly relevant empirical case through which to study how the delegation of clinical decision-making to algorithms is negotiated in practice, and on what grounds legitimacy is established.

Algorithms as disruption and continuation

As described above, contemporary accounts of the use of algorithms in healthcare, both popular and academic, hold projections of how algorithms will revolutionise and disrupt healthcare. Other scholars have pointed to algorithms often being approached in terms of their opacity and ‘blackboxing’ (Lee and Björklund Larsen, 2019; Schwennesen, 2019). Thus, some critical scholars have an analytical interest in ‘going under the hood’ of algorithms, as Lee and Björklund Larsen (2019) characterise the approach of analytically exposing the assumptions and hidden politics inscribed in the ‘machinery’ of algorithms. This could for example be by drawing attention to the extent to which algorithms ‘blackbox’ the grounds on which decisions are made and results reached, with negative implications for human autonomy, public oversight and justice (Obermeyer et al., 2019; Prainsack, 2017). Along the same lines, the use of advanced big data analytics in healthcare (Lupton, 2018; Prainsack, 2017) and machine learning systems such as for example IBM’s Watson for oncology diagnostics (Tupasela and Di Nucci, 2020), have in particular prompted discussion of algorithms and their ‘authority’ (Lupton and Jutel, 2015; Schwennesen, 2019) vis-à-vis that of the human clinical expert. While this analytical interest takes its point of departure, to some extent, in the popular discourse regarding the disruptive potentials of algorithms, most studies of algorithms in the field of STS have an interest in understanding algorithms as unstable objects (Seaver, 2017) embedded in, and enacted as part of, practices (Lee and Björklund Larsen, 2019; Neyland, 2019: 151; Schwennesen, 2019). In the context of health, Schwennesen (2019), for instance, shows how the authority of an algorithm instructing patients in physical rehabilitation is contingent on active delegation and continuous human ‘repair work’. Lomborg et al. (2020) similarly describe how heart patients engaging with Fitbit’s algorithms for managing their health, mobilise and make sense of the data output in constant interaction with the care infrastructure in which these patients are embedded (Langstrup, 2013). And with regard to PRO-tools, Torenholt and Tjørnhøj-Thomsen (forthcoming) have shown how algorithmic sorting and hence the decontextualised nature of PRO-data entail that nurses mobilise complementary information in order to make PRO-data actionable in clinical decision-making. As pointed out by Lee and Larsen, studies like these offer an alternative to only ‘going under the hood’ and instead consider algorithms as ‘contingent upshot of practices, rather than [as] a bedrock reality with a focus on the intertwining of human and non-human actors’ (Lee and Björklund Larsen, 2019).

Similarly, as an alternative to approaching algorithms as disruptive per se, we suggest a need to consider also the history and the institutionalised practices within healthcare – in which algorithms have, for a considerable period of time, played a significant role – as part of the algorithmic system (Seaver, 2017). Hence, the expectation and hope that automation – an implicit part of applying algorithms in clinical decision-making – will lead to an improvement in healthcare, are not new. On the contrary, Berg in his work from 1997 presents these hopes and discussions, which at that time were already a decade old: how standardised tools, such as protocols and clinical guidelines for decision-support would rectify the problem of large variations and increasing costs in healthcare (Berg, 1997a). A different name for such protocol or guideline – even if on paper – is an algorithm. An algorithm can be defined as ‘an abstract, formalised description of a computational procedure’ (Dourish, 2016: 3). In conversation with early STS work on standardisation (e.g. Berg, 1997b; Bowker and Star, 1998; Star and Ruhleder, 1996) Timmermans and Berg (2003), studied how standards and digitised ‘expert systems’ affect medical practice, and highlighted how the ‘construction and application of standards transform the practices in which they become embedded’ (p. 22). The studies of Timmermans and Berg thereby demonstrate that algorithms, if understood as tools for automated decision-making/support, are not new in healthcare, nor in sociological studies. Rather, a strong tradition of delegating agency to standardised tools in clinical decision-making exists and current developments should be seen in this light. On an analytical level, this challenges the narratives of algorithms as per se disruptive when talking about healthcare and suggests that a sensitivity to the possibility of more continuation-oriented narratives is needed.

To align our analytical sensitivity and allow for both the disruptive and the more continuation-oriented narratives around algorithms in healthcare, we developed an analytical framework. This framework was not solely based on contemporary studies of algorithms within the social sciences and classical STS on standardisation and automation as described above. It also emerged through our empirical observations carried out in clinical practice, in which we observed that algorithms were often enacted as ‘normal procedure’ and not as ‘disruptive’. Thus, in a conversation between the existing theory and these early observations, we developed an analytical frame approaching algorithms in the light of two co-existing logics: a logic of disruption and a logic of continuation.

In applying the term ‘logic’ we follow Mol, as we intend to explore ‘what it is appropriate or logical to do in some site or situation, and what is not’ (Mol, 2008: 8). Hence, with ‘logics’ we do not refer to a stabile coherence, but rather to a coherence that ‘is not necessarily obvious to the people involved (. . .) It may be implicit: embedded in the practices, buildings, habits and machine’ (Mol, 2008). Thus, as opposed to consider a logic as a thought system expressed through for instance a managerial technology (as in the idea of institutional logics) (Allen, 2014), drawing on Mol’s approach we consider how logics come into being, can be identified in, and have an impact on the everyday activities involved in implementing algorithms in clinical work.

Fieldwork and data analysis

Fieldwork was conducted by both authors. Fieldwork carried out by Henriette Langstrup was centred on political documents and the developmental work of selecting and constructing national PRO-tools for cross-sectoral heart rehabilitation in a working group consisting of representatives of the health authorities, clinicians from different sectors and regions, patient representatives and consultants facilitating the work. Fieldwork carried out by Rikke Torenholt was centred on the development and the clinical use of a local PRO initiative for breast cancer follow-up. The two fieldwork studies are described separately in previous publications (Langstrup, 2019, Torenholt et al., 2020), but in this paper they are considered as a joint case, representing the different stages involved in the Danish movement around implementing algorithmic tools in clinical practice. Although the two fieldwork studies were originally designed as separate projects, we (the two authors) met regularly both prior to and during fieldwork, (including at scheduled meetings every second week). As we met and exchanged observations and experiences a consistent theme arose: the negotiations around legitimacy. This urged us to analyse data as a joint enquiry into how the legitimacy of delegating to algorithms was established. The joint fieldwork thus produced a unique opportunity to consider how legitimacy is negotiated at various levels throughout healthcare: the political level; the national level in working groups; and at local hospital level. And together, the ethnographical work covers a broad spectrum of relevant actors in different settings, including healthcare professionals, patients, researchers, IT-developers, stakeholders and public servants; and it comprises different methods including observations, interviews, document analysis and participation in relevant conferences.

Fieldwork on the political and national debate around the implementation of PRO-tools (not related to any specific disease), along with observations of the national coordination group on PRO for heart disease, which involved participant observations in eight workshops arranged by the Danish Health Data Authorities, were carried out during autumn 2018. Access was given to written material such as mail correspondence, and PowerPoint slides made and used by the clinical coordination group. Fieldwork carried out in relation to the local PRO initiative (Saltbæk et al., 2019) involved four intervals of observational studies, each lasting 5–8 days, at an oncology outpatient clinic in Region Zealand in Denmark. Observations were carried out between October 2016 and March 2019 and a total of 36 interviews (24 interviews with 18 patients, and 12 interviews with 6 healthcare professionals) were conducted. Additionally, observations were carried out in relation to the development of this local initiative by observing meetings conducted by the research team monitoring the use of PRO-tools, and by interviewing members of this research team. Besides this, we also conducted interviews with various stakeholders, reviewed related official documents, and participated in conferences of relevance to the national initiative around PRO.

All informants were fully briefed about the project and written consent was obtained prior to all interviews. Informants were also assured anonymity. Regarding observations conducted in the clinical coordination group, written consent was obtained both from the project manager from the Health Data Authorities, and from the clinician leading the work. Oral consent was obtained from other members of the clinical coordination group. Observations in the clinic focused solely on the healthcare professionals, who all gave their written consent prior to the observations. When recruiting, all healthcare professionals and patients were informed about our overall interest in examining how PRO-tools were developed and applied in practice. Fieldnotes were written by hand and afterwards written out and extended on computer. All interviews were audio-recorded and transcribed verbatim.

The analysis was made by the two authors in collaboration. Fieldnotes and interview transcripts were revisited and coded according to the analytical framing. First, we each revisited ‘our own part’ of the empirical data and coded it according to the two logics. In presenting our findings and examples to each other we then continued the analysis by comparing and situating the empirical examples. In developing and applying the analytical framework, we followed the request by Timmermans and Tavory (2012) to produce a ‘new hypothesis (. . .) based on surprising research evidence’. In the analysis, we also took inspiration from abductive analysis (Timmermans and Tavory, 2012), as we tested the emergent hypothesis by sorting data according to whether legitimacy was gained based on ideas about the algorithm’s disruptive character (logic of disruption), or based on the connection with existing practices (logic of continuation). In doing so, we both looked for examples that could verify the idea about the two logics, but simultaneously and in line with the idea about abductive analysis, we were also searching for examples that could challenge the hypothesis and provide alternative explanations (Timmermans and Tavory, 2012).

Findings

In this section, we demonstrate how logics both of disruption and of continuation appeared in the negotiations and enactments of PRO-tools. We do so by first presenting how the Danish PRO-movement began and how, based on a logic of disruption, political attention towards PRO-tools was awakened. We show how in political documents PRO-tools were presented and discussed in relation to their disruptive potentiality with regard to alleviating inefficiency in the organisation of work and the unsystematic involvement of patients. In the following two subsections we present empirical examples showing how closer to the clinical practice, PRO-tools were enacted as a natural continuation of the existing clinical practice. In the final section, we also show how tensions occurred when the algorithm’s transparency was at times questioned. However, despite these tensions, a logic of continuation seemed to contribute to legitimacy.

Disrupting inefficient work and unsystematic patient involvement

To set the scene, we start by describing how PRO-tools were first introduced in Denmark and how it was especially the framing of the disruptive potential of PRO-tools that led the propagation. Thus, the optimistic coupling of algorithms, automation, and efficiency, which forms part of the socio-technical imaginary we initially described, was also seen in relation to the Danish national investment in PRO-tools. This investment took off after a report in 2015 (PA Consulting Group, 2015), ordered by the Government, Danish Regions, and Local Government Denmark, argued that PRO-data had great potential to be used in automated triage-systems (see also Langstrup, 2019). The report showed that with PRO-data provided by patients through online systems, clinical visits could be scheduled based on individual needs. This differs from how, for instance, follow-up care had been, and still is, typically arranged, where patients are seen, for example, every 3 months based on a generic calendar system. According to the report, PRO-tools could provide a resource-saving and patient-involving alternative: Using algorithms to automate the processing of PRO-data, patients would be classified according to need (green, yellow or red) preselecting which patients should be seen in person for a clinical check-up and which patients should only receive an automated response or a phone call. The main message of the report was a business case, arguing that automation of follow-up visits held immense potential for saving money in a health system with an increasing number of patients in chronic care trajectories or in follow-up care after cancer:

A national use of PRO would imply a reorganisation of the clinical procedures at the outpatient clinics, which means that patients can be referred to PRO-based care; whereupon it is the patients’ responses that constitute the point of departure for the clinical judgement (. . .). In other words, it is a premise that an appointment in the ambulatory assessment unit is not made until a need is identified, on the basis of the PRO [and the algorithmic sorting, ed.]. (Our translation) (PA Consulting Group, 2015)

The consultancy report thus demonstrated how PRO-tools could disrupt previous inefficient ways of organising healthcare, bringing financial benefits. Hence, in line with other reports and strategies on the digitalisation of healthcare produced by health authorities in collaboration with consultancy bodies (Danish Ministry of Health, 2018; Danish Ministry of Health, Danish Regions, Local Government Denmark, 2013), the narratives found here are those of disruption: healthcare needs to change radically: digitalisation, automation, and algorithms provide the key. The consultancy report on PRO-tools influenced and mobilised stakeholders. Thus, from the onset of the Danish PRO-movement, the economic potential and a logic of disruption came to the fore.

However, in the public presentations and strategic documents that followed after the first national introduction of PRO-tools, the value of the data input – patients’ own experience of their health and wellbeing – became the central figure. Economic arguments could not in and of themselves provide legitimacy. As expressed in a report published by the Danish Knowledge Center for User Involvement in Health Care, part of the nongovernmental organisation Danish Patients: ‘With the implementation of PRO-data in clinical practice and in the quality of care development, the aim is to put into reality the vision of involving patients more in their own treatment’ (Trygfonden and ViBIS, 2016: 37).

The potential economic value of automation and the automation itself were downplayed, and little attention given to the algorithms involved. PRO-data and algorithms were consequentially recast, no longer being tools for ‘process automation’, as they appeared to be in the consultancy report, but rather tools for ‘systematic’ and ‘efficient’ patient involvement. These digitised capabilities are referred to in various and less conspicuous ways: ‘data are sent to the database for further processing’; data are used for ‘advanced decision-support’, or in ‘weighting’, or as ‘a smart selection of patients’ – here quoting from a number of internal reports, policy documents and public meetings. The objective around the use of PRO-tools for automated triage therefore seemed to change when PRO-tools were presented closer to clinical practice.

In the so-called clinical coordination groups, consisting of both clinicians and patient representatives who had the responsibility of developing national, cross-sectoral PRO packages including algorithms, the tension between the potential for saving time and money and the potential for patient involvement also appeared. Some clinicians hailed the disruptive potential in economic terms: one clinician stated ‘PRO is smart! We can use it to sort our patients. It’s more rational’. Yet most would only consider an automatic prioritisation of patients if aligned with a goal of increased patient-centredness. The public servant responsible for the workshops was very attentive to this tension:

She [the public servant] says to the group that she has asked in the ministry, and they have ensured her, that this is not a ‘money-saving exercise’. ‘It is to help prioritise – some patients will need more [clinical visits], some will need fewer, she says to the group. (Fieldnote).

Even clinicians who had experienced that the application of algorithms – as the first point of contact with patients – could reduce numbers of controls, were worried that PRO would never gain support from patients and health professionals if perceived as a ‘money-saving tool’. Instead, PRO-tools and the use of algorithm gained its legitimacy based on the assumption that it would ensure and systemise patient involvement.

Despite the gradual shift in rationale for the use of PRO-tools to sort patients, what the examples above demonstrate is that from the onset of the Danish PRO-movement a logic of disruption played a significant role. Both the consultancy report, the political documents and the clinical coordination group expressed a belief that PRO-tools and the delegation to an algorithm would disrupt existing inefficient and unsystematic ways in which healthcare was organised. However, in the following sections, we will demonstrate how this logic of disruption was accompanied by and partly or fully replaced by a logic of continuation as PRO-tools were developed and implemented closer to, and within, clinical practice.

PRO as a continuation of standardised and evidence-based diagnostic procedures

Members of the clinical coordination group at other times discussed PRO-tools not only regarding their potentiality to disrupt unsystematic work procedures, but also in terms of their similarity to established clinical practices. ‘It’s not much different from what we already do’ a physician and member of the clinical coordination group said, referring to standardised procedures and guidelines for interviewing and triaging patients. The algorithmic sorting of patient data is seen as a ‘natural’ prolongation of existing guidelines and best practices, thus expressing a logic of continuation. When PRO-tools are not discussed as something new, but rather as a continuation, it should be seen in the light of standardised and evidence-based procedures which already define clinical work (Allen, 2009; Timmermans and Berg, 2003). In a Danish setting, national guidelines are developed by the Danish Health Authorities with the participation of representatives from relevant scientific societies so that both for heart rehabilitation and breast cancer follow-up, national evidence-based guidelines are developed in order to ensure standardised procedures. Within breast cancer, Danish Breast Cancer Group and their work on preparing evidence-based and national guidelines regarding diagnostic procedures and treatment closely defines the clinical work around breast cancer. Comparable groups exist within other cancer areas.

The same references to ‘business as usual’ were seen in the local PRO-initiative. Here patients’ PRO responses were sorted by an algorithm based on predefined thresholds. To the physician leading the daily management of the project, the automatic sorting performed by the algorithm was comparable to the work she would carry out in face-to-face consultations, when considering how she would habitually ask the same clarifying questions and pay attention to the same symptoms. The physician’s enactment of the PRO-tools as a continuation of regular medical practice, indirectly appeared in terms of what the physician did not tell the patients. In the initial consultation, when patients were briefed about their follow-up care, the physician very rarely mentioned how the task of sorting the PRO responses and the following triage were performed on the basis of an algorithm, with no human involvement. Extra time had been allocated to this consultation as it contained much information and served several purposes including summarising the patient’s treatment, explaining about new preventive medication and radiotherapy, and advising about side effects. Lastly, the physician introduced the follow-up treatment and explained the programme, most often saying:

We’ll follow up on you through questionnaires which you’ll receive by email and where you’ll be asked about the symptoms you’re experiencing. Then a nurse compares your responses with the responses you gave the last time.

The automatic sorting of the responses and automated triage were not mentioned. The physician described how, due to the comprehensiveness of this specific consultation and its many purposes, she was very careful not to give too much information. For her, explaining about the use of automated triage was considered less important information. She instead emphasised the greater importance of informing patients about their treatment, and potential side effects. The physician appeared very dedicated and thorough in her work, and attentive towards the patient’s understanding and any questions raised. When asked why she did not mention or explain the automated screening, the physician said:

The ones who are experiencing symptoms which we are worried about, we get to call those (. . .) But you’re completely right, I consistently didn’t do it [mention the algorithm]. But I don’t believe there’s anything to hide, absolutely not, at any time I would [explain it] if the patient asked. There are a few who have asked about it; ‘How do you do it?’ And then I’ve told them. In that sense, I don’t think there is any jiggery-pokery in it.

The physician prioritises information she considers more important to the patient’s complex and challenging situation. As representing standardised care, the algorithm is established as good clinical practice. The example demonstrates how in clinical practice the algorithm is considered a natural extension of what the physician normally does when applying other standardised procedures.

PRO as a continuation of the physicians’ expertise and authority

In the local PRO-initiative, the PRO-tool – including its algorithm – was developed and introduced into clinical practice as part of an intervention study. The physician leading the daily management of the PRO-initiative had herself led the work of developing the PRO-tool. At first, PRO questionnaires were chosen and formulated based on her professional knowledge and experience. The number of response categories and headings were decided upon based on an already-validated scales. Using her clinical expertise, the physician designed the algorithm by determining what the thresholds should be for each question. The PRO-tool and its predefined thresholds were afterwards sent to six experienced oncologists who commented and made suggestions for improvements in both wording and thresholds. The PRO-tool was then adjusted in discussion between the physician and the six oncologists (Saltbæk et al., 2019). Therefore, to the physician, the algorithmic sorting constituted an extension of her own, and her experienced colleagues’ expertise, which ensured the authority of the algorithm:

I think that it’s a kind of inserted clinical judgement (. . .) I consider it a clinical judgement, which we made when we decided upon the thresholds (. . .) we had six consultants (physicians) contributing by looking at it, and if they didn’t want to call the patient at that time, then I don’t need to know anymore.

Here, the physician emphasises the role that the six experienced physicians played when developing the PRO-tool. To the physician, the algorithm constitutes a legitimate stand-in not only for herself, but also for six other experienced physicians. It is interesting that she uses the word ‘judgement’ as this has traditionally been seen as being in contrast to rule-based decision-making (Dreyfus, 1992; Simon, 1973). Hence, PRO-tools are enacted as an extension and continuation of her own clinical expertise and authority, and not in contrast to expert judgement. At another point in the interview, the physician, using a peculiar expression, described the algorithm as ‘a brain a long time ago’. However, as the quote above illustrates, the algorithm not only comes to represent one brain but the brains of all the healthcare professionals involved in developing the algorithm.

Likewise, the primary investigator behind the project also expressed the algorithm as being more than a representation of one physician. He considered the algorithm as an advance, as it is the result of an agreement between several leading experts. During an interview, Henriette questioned the algorithm’s complexity, referring implicitly to its fixed thresholds and the lack of machine-learning capabilities, something that she as an outsider considered the hallmark of complex algorithms. The primary investigator countered this by saying:

(. . .) we actually made an algorithm for recurrence, which is fairly complicated. Because (regarding) recurrence, we want to see those who we believe have recurrence or dissemination, right? So, it’s rather complicated I believe. It’s been pretty difficult to make. We’ve taken a long time. But I think that we got away with it pretty well.

In this quote, the primary investigator – a physician by training – considers the efforts of the six experienced consultants, and also projects their expertise and authority into the algorithm: its authority again rests on the clinical expertise behind the algorithm. It may take some power of judgement away from individual clinicians, but it rests on and constitutes a continuation of the authority from the expertise of a collective of clinicians (Timmermans and Berg, 2003).

However, when describing the needs for initiating the local PRO-project, the primary investigator simultaneously seemed to draw on algorithms’ more disruptive characters, showing that the two logics should not be seen as mutually exclusive. This appeared at the initiation of the project when he argued strongly for a need to change follow-up practices in oncology as, according to him and supported by research results, these practices rested too heavily on pre-scheduled control-visits and too little on objective indications of recurrences and other needs. He argued that using PRO and algorithms was a necessary disruption of established but costly and inefficient practices and directed a mistrust towards the physicians, who believed that they were effective at finding recurrences, but who – according to scientific studies – did not have a great success rate in spotting recurrences during clinical check-ups. Here, a logic of disruption seems very present. However, as demonstrated in the previous quote from the primary investigator, he simultaneously emphasised the close link between the algorithm and the physicians. This example demonstrates two things: on the one hand, it shows how the physician can tap into more logics, further strengthening the legitimacy for delegation to the algorithm. On the other hand, it may also suggest that the division of the two logics is in some cases too streamlined: that it is too simplistic to believe that a logic of disruption is present only at the political level, or that closer to clinical practice legitimacy is established based only on a logic of continuation.

Moments of tension: Negotiating the transparency of algorithms

Where the examples above demonstrate how the primary investigator seemed to juggle both logics, in this section we will show how during the developmental phase of the local PRO-initiative the logic of continuation was challenged, which momentarily caused some tension and uncertainty in the legitimacy of delegation. Hence, as we took part in the finalising meetings about the design and planning of the PRO-intervention, we observed how questions about the transparency of the algorithm were negotiated. During one of these meetings the IT developers, the physician leading the daily management, and researchers discussed if and how the delegation of decision-making power to the algorithm should appear to the patients, and hence how recognisable it should be that PRO-responses were automatically screened by a computer. The negotiations around transparency emerged as the researchers tried to decide whether patients should receive an auto-generated email shortly after submitting their responses, or if there should be a delay. The following discussion took place during a meeting with the technical company designing the software system running the PRO-tool:

Researcher 1:

[addressing IT-developer] We still need those automatic response emails to the patients, which I consider rather important. And they should come at least (. . .) 24 hours after [the patient has responded], [and say that] ‘We have now checked your responses, and everything looks fine. If anything appears before the next control, you are welcome to contact us.’

Physician:

Why do you say minimum of 24 hours after?

Researcher 1:

Well because, well it just seems a little (. . .) sometimes it’s just a little, you know, impersonal. [Imitating a patient:] ‘Then there’s no one who has checked this, they just send an automatic response. . .’

Physician:

But we’re not going to lie to people, right? We shouldn’t make patients believe that we’ve been sitting and looking at their responses. I suppose we can say: ‘There is nothing dangerous [indicated] in your responses, [so] you should not expect to receive a phone call,’ without suggesting that we’ve studied their response and thoroughly discussed it.

Researcher 1:

Well, it’s that thing, when you introduce machines [i.e. computers] like this between the practitioner and the patient (. . .). You’re of course right; we should be honest about what we do.

Researcher 2:

But they’ll know [that it is based on an algorithm] if they receive a response right away.

Researcher 1:

But maybe you should be very clear about that, when you inform [the patients] about the project.

In the example, the physician (the developer of the algorithm) expresses how they should be honest with patients, not letting them assume that she or the nurses had already examined the responses. Her trust in the algorithm, and her associated desire for full transparency, rest on a logic of continuation, as the algorithm represents the authority of a physician and a continuation of how to perform standardised care. Researcher 1 does not represent the same logic of continuation, being concerned with patients’ emotional response to having a ‘machine’ rather than a healthcare professional doing the screening – yet it is not the automatic screening itself that she considers problematic: she feels uncomfortable with how automation will be experienced by patients – such as whether patients will consider this a break with existing practices, and with the trusted relationship between patient and healthcare professional. Nevertheless, Researcher 1 is then convinced by the argument of the physician and tries to make it into a matter of continuation. In this situation, a logic of continuation becomes more persuasive, and it is decided that the email will be sent to the patients immediately after their responses, with no attempt to conceal the algorithm’s involvement.

However, contrary to what had been assumed in the meeting mentioned above, patients did not perceive the immediate-response email as a sign that an algorithm had screened their data rather than a human being. Instead, in interviews with patients it became clear, that the email was often perceived as an autoreply indicating that their data had been received, not indicating anything about who or what had been involved in evaluating their data. And as the physician only very rarely informed them about it, most patients had an inaccurate understanding of how their data were evaluated. A few patients had figured out that some sort of automatic screening probably took place, but none of the patients Rikke interviewed were fully aware of the two stages of the screening process: the first screening by an algorithm and the second by a nurse, and that the nurses only saw data from the patients not screened out in the first stage. This was also the case with Janne, who Rikke asked as to what she thought happened to her responses after submitting them.

I have the impression that a nurse looks at them [the responses], I don’t know if the physician maybe also looks at it, just to see if there is anything [the responses] that stands out, is there anything we [the healthcare professionals] should pay attention to? And then, if it’s something that we’ve already talked about, well, then that’s just it. And if there’s anything new, well, then they get hold of me. That’s sort of my impression of what happens.

This perception, that a nurse always received their response, was present among most of the patients interviewed; the delegation to algorithm remained unknown to them. To the physician and the nurses, the use of algorithms is enacted as a continuation based on their professional experiences and beliefs; while to the patients, treatment is perceived as ‘business as usual’, since they are unaware of the algorithmic sorting. It could be that patients too would consider the use of the algorithm as a matter of continuation, if they were aware of its function and the reason for delegation to an algorithm. However, it may also be that patients would actually perceive it in more disruptive terms, as predicted by Researcher 1.

Concluding remarks

We have demonstrated how at the political level PRO-tools in clinical practice are enacted primarily as tools for organisational improvement, drawing on a logic of disruption. At this political level, it is the algorithmic tools’ potential of improving inefficient and unstructured ways of organising healthcare that is being highlighted. However, in the national clinical coordination group and in clinical practice, PRO-tools are to a large extent enacted as a continuation of existing practices and physicians’ authority. Here, it is not the algorithm’s disruptive character that paves the way, rather it is its reproduction of existing practices. The analytical framework thereby provides an understanding of how at a political level the delegation to algorithms gains legitimacy through a logic of disruption, whereas at a more local level this delegation, generally speaking, gains legitimacy on the basis of a logic of continuation. What therefore emerges is a picture of a dual impetus for the use of algorithms, based on two seemingly opposing logics. However, and as we have tried to show, the two logics should not be seen as mutually exclusive ‘thought systems’ – rather, they constitute situated and sometimes co-existing ways of making sense in continuation with historically specific, local practices (Mol, 2008). At the same time, this co-existence of seemingly opposing logics may be explained by an interaction between locally enacted logics and a larger seductive, cultural narrative of promise and hope often associated with medical technological solutions and quantification (Good, 2001; Hoeyer, 2019; Merry, 2016).

Methodologically, combining the two sets of fieldworks has allowed for a unique opportunity to observe and find confirmation for the presence of more logics associated with the clinical use of algorithms. However, given that the two fieldworks relate to various areas of disease and were conducted in diverse settings this calls for reflections on how the contextual circumstances may have influenced our observations. Despite being diverse diseases, heart disease and cancer in this context share some essential common characteristics, both being diseases receiving immense attention when it comes to standardisation and organisational optimisation in an attempt to reduce number of deaths and economic burden on the health system. This for example has included the implementation of standardised patient pathways (Allen, 2009; Pedersen, 2017). Whether a logic of continuation is as present in legitimising algorithms within less standardised disease areas remains a question open to empirical enquiry. We therefore encourage future study to challenge our hypothesis and include disease areas characterised by various degrees of standardisation. Although this study concerns a specific algorithmic arrangement, we believe that the analytical approach is also of relevance to studies of the application of other algorithms, as well as of more advanced machine-learning algorithms within healthcare. Applying such analytical approach, taking a symmetrical point of departure grounded both in the past and the present, would at the same time involve a response to the request for more empirically grounded explorations of these emerging arrangements (Lee and Björklund Larsen, 2019).

Paying analytical attention to a logic of continuation provides an understanding of how at the clinical level the delegation to the algorithm progresses almost frictionlessly. Our findings suggest how this seems to hinder the open discussion and addressing of potentially problematic new issues introduced with automated decision-support. However, we did observe moments of tension, indicating that questions about the transparency of the algorithm were not fully settled. This tension also appeared during interviews and observations with patients where some patients were surprised to know that their responses were automatically sorted. These moments of tension caused an ethical dilemma. Should Rikke be the one who ‘revealed’ the delegation to the algorithm? In the first interviews, she had not expected that patients would be unaware of the delegation. If asked, or if patients expressed uncertainty about the procedure, Rikke explained how responses were handled. She made sure that patients knew that they could contact the nurses or the physician leading the project if they felt uncertain about anything and made sure that patients had the telephone number of the clinic. Rikke also told patients that they could contact her. As such she became ‘a-human-in-the-loop’ (Citron and Pasquale, 2014) engaging in some of the ‘repair work’ (Schwennesen, 2019) which, looking to the future, is likely to be increasingly in demand as algorithms proliferate in healthcare and beyond.

The observed moments of tension disclose questions that are not fully addressed when it comes to the delegation of clinical decision-making power to algorithms. How will patients experience this delegation? What will it mean to the relationship between healthcare provider and patient, and to credibility within that relationship? With delegation to algorithms being already such an intrinsic part of healthcare, it is difficult to determine in what situations patients are to know about it and in what situations they are not. The need to consider and discuss these questions seems especially relevant in relation to clinical-decision support tools such as PRO, as these are implemented with the purpose of also attaining more patient involvement, and therefore constitute critical questions for future studies to address. More well-rehearsed, but still highly important discussions to explore in more detail are the implications of the proliferation of algorithmic tools for clinical expertise and for the working conditions of the individual clinicians. Questions already pointed out in relation to general standardisation efforts (Berg, 1997a; Pedersen, 2017; Timmermans and Berg, 2003; Traynor, 2009) concern how standardisation shape a particular understandings of professional and clinical autonomy (Timmermans and Berg, 2003) and whether it leads to an impairment of clinical judgement (Pedersen, 2017) and may result in a ‘de-skilling’ or a ‘re-skilling’ of professionals (Berg, 1997a). Considering how notable a change the use of algorithms is to healthcare professionals individual judgement (Justesen and Plesner, 2018) and how in most cases the algorithmic sorting will not leave room for individual adjustment, we once again need to consider the implications for clinical expertise and for the working conditions of the individual clinicians. This leads to the question of whether the use of algorithms will erode the development of clinical judgement in the future? As algorithms – as also argued by our informants – build on current expertise, what will this mean to the development of expertise and clinical judgement among less experienced healthcare professionals? And since healthcare professionals only gets to handle the special and complex cases sorted out by an algorithm (Justesen and Plesner, 2018) – meaning that in their daily work they are prevented from following the more ‘normal’ cases – what does this means in terms of ‘knowing one’s patients’ and making clinical decisions? Such questions need to be examined through future empirical studies of the practices around healthcare professionals’ clinical practice and decision-making when carried out in relation to or being supported by algorithmic systems.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Danish Cancer Society (Grant number R133-Rp12097).

ORCID iD

Rikke Torenholt

Author biographies

Rikke Torenholt has an MSc in Public Health and is a PhD candidate at the Centre for Medical Science and Technology Studies at University of Copenhagen. In her PhD study, she explores the data practices around the clinical use of Patient Reported Data seen in the light of the intensified use of data within healthcare.

Henriette Langstrup is Associate Professor at the Centre for Medical Science and Technology Studies at University of Copenhagen. She has extensively explored the social implications of digital health innovation during the last 15 years. Her main research interest concerns the ways in which emergent personal health technology enables, changes or inhibits responsible infrastructures of care.

References

Allen

(2009) From boundary concept to boundary object: The practice and politics of care pathway development. Social Science & Medicine 69(3): 354–361.

Allen

(2014) Lost in translation? ‘Evidence’ and the articulation of institutional logics in integrated care pathways: From positive to negative boundary object? Sociology of Health & Illness 36(6): 807–822.

Alston

(2019) Report of the Special rapporteur on extreme poverty and human rights (UN report). Geneva: United Nations Digital Library.

Balicer

(2018) Medicine transformed: The algorithm will see you now. Available at: https://www.ted.com/talks/ran_balicer_medicine_transformed_the_algorithm_will_see_you_now (accessed 17 February 2021).

Berg

(1997a) Problems and promises of the protocol. Social Science & Medicine 44(8): 1081–1088.

Berg

(1997b) Rationalizing Medical Work. Decision-Support Techniques and Medical Practice. Cambridge: The MIT Press.

Black

(2013) Patient reported outcome measures could help transform healthcare. BMJ 346: f167–f167.

Bowker

Star

(1998) Building information infrastructures for social worlds—The role of classifications and standards. In: Ishida

(ed.) Community Computing and Support Systems. Berlin: Springer, pp.231–248.

Citron

Pasquale

(2014) The scored society: Due process for automated predictions. Washington Law Review 89(1): 1–33.

10.

Danish Ministry of Health, Danish Regions, Local Government Denmark (2013) Digitalisering med effekt: National strategi for digitaliserings af sundhedsvæsenet. Copenhagen: Danish Ministry of Health, Danish Regions, Local Government Denmark.

11.

Danish Ministry of Health (2018) A coherent and trustworthy health network for all. Digital health strategy 2018-2022. Available at: https://sundhedsdatastyrelsen.dk/da/diverse/download (accessed 17 February 2021).

12.

de Thurah

Stengaard-Pedersen

Axelsen

, et al. (2018) Tele-health followup strategy for tight control of disease activity in rheumatoid arthritis: Results of a randomized controlled trial. Arthritis Care and Research 70(3): 353–360.

13.

Dourish

(2016) Algorithms and their others: Algorithmic culture in context. Big Data & Society 3(2): 205395171666512.

14.

Dreyfus

(1992) What Computers Still Can’t Do: A Critique of Artificial Reason. Cambridge: Mit Press Ltd.

15.

Good

MJD

(2001) The biotechnical embrace. Culture, Medicine and Psychiatry 25(4): 395–410.

16.

Government, Local Government Denmark & Danish Regions (2016) A Stronger and More Secure Digital Denmark: The Digital Strategy 2016-2020. Copenhagen: [Danish] Government.

17.

Hansen

Kjerholt

Christensen

Hølge-hazelton

, et al. (2019) Haematologists ‘experiences implementing patient reported outcome measures (PROMs) in an outpatient clinic: A qualitative study for applied practice. Journal of Patient-Reported Outcomes 3(1): 74.

18.

Hoeyer

(2016) Denmark at a crossroad? Intensified data sourcing in a research radical country. In: Mittelstadt

Floridi

(eds) The Ethics of Biomedical Big Data. Cham: Springer International Publishing, pp.73–93.

19.

Hoeyer

(2019) Data as promise: Reconfiguring Danish public health through personalized medicine. Social Studies of Science 49(4): 531–555.

20.

Jasanoff

Kim

S-H

(2015) Dreamscapes of Modernity: Sociotechnical Imaginaries and the Fabrication of Power. Chicago: University of Chicago Press.

21.

Justesen

Plesner

(2018) Fra skøn til algoritme: Digitaliseringsklar lovgivning og automatisering af administrativ sagsbehandling. Tidsskrift for Arbejdsliv 20(3): 9–23.

22.

Langstrup

(2013) Chronic care infrastructures and the home. Sociology of Health & Illness 35(7): 1008–1022.

23.

Langstrup

(2019) Patient-reported data and the politics of meaningful data work. Health Informatics Journal 25(3): 567–576.

24.

Lee

Björklund Larsen

(2019) How should we theorize algorithms? Five ideal types in analyzing algorithmic normativities. Big Data & Society 6(2): 205395171986734.

25.

Lomborg

Langstrup

Andersen

(2020) Interpretation as luxury: Heart patients living with data doubt, hope, and anxiety. Big Data & Society 7(1): 205395172092443.

26.

Lupton

(2016) Towards critical digital health studies: Reflections on two decades of research in health and the way forward. Health: An Interdisciplinary Journal for the Social Study of Health, Illness and Medicine 20(1): 49–61.

27.

Lupton

(2018) How do data come to matter? Living and becoming with personal data. Big Data & Society 5(2): 205395171878631.

28.

Lupton

Jutel

(2015) ‘It’s like having a physician in your pocket!’ A critical analysis of self-diagnosis smartphone apps. Social Science & Medicine 133: 128–135.

29.

Merry

(2016) The Seductions of Quantification: Measuring Human Rights, Gender Violence, and Sex Trafficking. Chicago: The University of Chicago Press.

30.

Ministry of Finance and Ministry of Industry, Business and Financial Affairs (2019) National Strategy for Artificial Intelligence. Copenhagen: The Danish Government.

31.

Mol

(2008) The Logic of Care: Health and the Problem of Patient Choice. Oxon: Routledge.

32.

Mukherjee

(2017) A.I. versus M.D. What happens when diagnosis is automated? The New Yorker, 3 April. Available at: https://www.newyorker.com/magazine/2017/04/03/ai-versus-md#

33.

Neyland

(2019) The Everyday Life of an Algorithm. London: Palgrave Macmillan.

34.

NHS Digital (2019) Health and Social Care Information Centre (HSCIC) Annual Report and Accounts 2018-19. Leeds: Dandy Booksellers.

35.

Obermeyer

Powers

Vogeli

, et al. (2019) Dissecting racial bias in an algorithm used to manage the health of populations. Science 366(6464): 447–453.

36.

PA Consulting Group (2015) Analyse af Patientrapporterede Oplysninger (PRO). Copenhagen: PA Consulting Group.

37.

Pedersen

(2017) Organisatorisk trumfkort eller reduceret dømmekraft?: Et pragmatisk perspektiv på kræftpakker. In: La Cour

Waldorff

Højlund

, et al. (eds) Når teknologier holder mere, end de lover: kritiske perspektiver på ledelse af velfærd. Frederiksberg: Nyt fra Samfundsvidenskaberne, pp.141–166.

38.

Petersen

Tanner

Munsie

(2019) Citizens’ use of digital media to connect with health care: Socio-ethical and regulatory implications. Health: An Interdisciplinary Journal for the Social Study of Health, Illness and Medicine 23(4): 367–384.

39.

Prainsack

(2017) Personalized Medicine, 1st edn. New York: New York University Press.

40.

Ruckenstein

Schüll

(2017) The datafication of health. Annual Review of Anthropology 46(1): 261–278.

41.

Saltbæk

Karlsen

Bidstrup

, et al. (2019) MyHealth: Specialist nurse-led follow-up in breast cancer. A randomized controlled trial–development and feasibility. Acta Oncologica 58(5): 619–626.

42.

Schougaard

LMV

Larsen

Jessen

, et al. (2016) AmbuFlex: Tele-patient-reported outcomes (telePRO) as the basis for follow-up in chronic and malignant diseases. Quality of Life Research 25(3): 525–534.

43.

Schwennesen

(2019) Algorithmic assemblages of care: Imaginaries, epistemologies and repair work. Sociology of Health & Illness 41(S1): 176–192.

44.

Seaver

(2017) Algorithms as culture: Some tactics for the ethnography of algorithmic systems. Big Data & Society 4(2): 205395171773810.

45.

Simon

(1973) The structure of ill structured problems. Artificial Intelligence 4(3–4): 181–201.

46.

Snyder

Aaronson

(2009) Use of patient-reported outcomes in clinical practice. The Lancet 374(9687): 369–370.

47.

Torenholt

Saltbæk

Langstrup

(2020) Patient data work: Filtering and sensing patient-reported outcomes. Sociology of Health & Illness 42(6): 1379–1393.

48.

Torenholt

Tjørnhøj-Thomsen

(forthcoming) Is this something I should be worried about?: A study of nurses’ recontextualisation work when making clinical decisions based on patient-reported outcome data. Social Science and Medicine.

49.

Trygfonden and ViBIS (2016) Program Pro-Anvendelse af PRO-data i kvalitetsudviklingen af det danske sundhedsvæsen. Trygfonden og Videncenter for brugerinddragelse. Trygfonden: ViBIS.

50.

Star

Ruhleder

(1996) Steps toward design an ecology and access of infrastructure: For large spaces information. Information Systems Research 7(1): 111–134.

51.

The Danish PRO Sectretary (n.d.) Available at: https://skema.pro-danmark.dk/ (accessed 12 June 2020).

52.

Timmermans

Berg

(2003) The Gold Standard. The Challenges of Evidence-Based Medicine and Standardization in Health Care. Philadelphia, PA: Temple University Press.

53.

Timmermans

Tavory

(2012) Theory construction in qualitative research. Sociological Theory 30(3): 167–186.

54.

Topol

(2019) Deep Medicine: How Artificial Intelligence Can Make Healthcare Human Again. New York: Basic Books.

55.

Traynor

(2009) Indeterminacy and technicality revisited: How medicine and nursing have responded to the evidence based movement. Sociology of Health & Illness 31(4): 494–507.

56.

Tupasela

Di Nucci

(2020) Concordance as evidence in the Watson for Oncology decision-support system. AI & Society 35:811–818.

57.

Wachter

(2017) The Digital Doctor: Hope, Hype, and Harm at the Dawn of Medicine’s Computer Age. New York: McGraw-Hill Education.