Abstract
In this paper, we offer an original framework to study Artificial Intelligence (AI). The perspective we propose is based on the idea that AI is a system technology, and that a useful description of AI cannot abstain from mapping the components of the system, their interdependence, and how the synergies they create shape at the roots the directions of AI development. We adopt the concept of Large technical systems (LTS) to give substance and structure to our idea. Using LTS, we are able to scaffold AI and the forces at work steering its production, deployment, and evolution. We find that AI as a system shares essential features with infrastructural technologies such as the Internet. The LTS framework proves very useful to capture important nuances of the technology, and it allows us to trace the connections and cross–influences among its constituting domains—algorithms (software), compute (hardware), and data. We compare our proposed framework with other concepts usually associated with radical innovations, and suggest in which respects AI differs from these ideal–types. We consider ours a timely exercise, as we witness the formation of an AI industry. While in the making, this industry is rapidly ossifying, together with its specific problems, power imbalances, and development scenarios; the focus on the system–ness of AI allows uncovering the deeper structure of this technological breakthrough.
Keywords
Introduction
Current Artificial Intelligence (AI) promises a profound discontinuity in the way economic and non–economic actors organise production, employ labour, provide services, and intermediate social and business interactions. This discontinuity is grounded in the re–domaining of human activities as instances of prediction. Once an activity – for example, ad recommendation, text generation, or product design – is described as a case of prediction, AI-powered prediction ‘engines’ can be set in motion, automating partly or in full the activity in question, if that is economically viable. As the prediction engines improve over time in performance, complexity, and capabilities, an increasing range of activities has been touched by AI, so much to suggest that we are witnessing a technological revolution in the making, and that AI is without doubts the ‘next big thing’ in the innovation landscape.
The fact that AI is becoming seamlessly integrated in processes, products and services throughout the economy is certainly due to its capabilities – as well as to its almost ‘magical’ appeal to business and final users alike (Campolo and Crawford, 2020). However, we posit that another reason for AI taking deep roots in our societies lies in its very nature as a technology. In fact, in a manner that reminds infrastructures such as the Internet, AI solutions become ubiquitous because they ‘weave themselves into the fabric of everyday life until they are indistinguishable from it’ (Weiser, 1991). We see clearly the signs of AI weaving into the societal fabric: AI is everywhere, accessible from our browsers as services like chatbots or coding ‘autopilots’, and integrated in our handheld devices through tools ranging from photo-editing software or social networks’ recommendation algorithms. At the same time, AI is always ‘elsewhere’, with computing occurring in remotely located facilities, data acquisition and processing ran in the cloud, and solutions supplied as pre-packaged products by vendors. In other words, AI production and diffusion depends on the joint work of a distributed network of actors, technologies, and domains.
We summarise all this by suggesting that AI is a system technology. The ‘system-ness’ of AI is the fundamental property that drives its evolution; however, to our knowledge, such property has not been taken yet as a priority angle of analysis. Hence, in this paper we dissect this idea and outline a conceptual framework to study AI from a system perspective. The framework we propose is drawn from the theory of Large technical systems (LTS; Hughes (1987)). The value of LTS for the study of AI lies in the fact that it facilitates a ‘thick description’ that is both technological and social: the vocabulary of notions introduced by LTS theory is useful to unpack and understand the complementarities and tensions that characterise AI.
We provide an analysis to highlight how the LTS framework captures important features of AI as a system technology. This allows us to rationalise dynamics in the AI field, to explore fundamental properties of the system, and to use our perspective to present policy issues. However, the system perspective is only one way to look at AI. Hence, we also compare AI to other epistemic devices used in the literature, in order to assess the fit of alternative concepts as conceptual grids to describe AI. In particular, we discuss the parallels between AI and the notions of General Purpose Technologies (GPT; Goldfarb et al. (2019, 2022)) 1 and Invention of a Method of Inventing (IMI; Agrawal et al. (2018); Griliches (1957)).
The key contribution of the paper is the introduction of a novel organising framework to study and ‘scaffold’ AI. This can guide further research and policy design. The complex but tractable view we propose bridges concepts and frames from the fields of information systems, economics, and sociology of technology. In this sense, it can be seen as the cornerstone of a multidisciplinary research programme on AI as a system technology.
The paper proceeds as follows: in The ‘next big thing’: Artificial Intelligence we build the foundations of the analysis by providing our working definition of AI (Our object of study) and showing the many forces and dynamics shaping AI system components using the example of the data domain (Unpacking the AI data domain). In Artificial Intelligence through the Large Technical System lenses, we first introduce the LTS framework and we use it to map current AI essential characteristics (Defining AI as LTS) and to provide a thick description of the AI system (Speaking of AI in LTS terms). In Comparing AI to other Epistemic Devices, we discuss the fit of alternative epistemic devices (GPT and IMI) to describe AI. Discussion and Outlook concludes the paper by discussing the policy implication of our proposed framework and summarising the analysis in a final outlook.
The ‘next big thing’: Artificial Intelligence
Our object of study
As a first step of the analysis, we discuss AI through different lenses, in order to build a common understanding of AI with the reader. This is a necessary step, as AI is a complex and dynamic field which has multiple layers, from theoretical underpinnings and questions that cut deep into the existential domain of humankind to AI-based devices for everyday application. As the readers of this paper might have different vantage points being more familiar with one or the other ‘face’ of AI, we leverage several perspectives.
Definitions
There exist several definitions of AI. We start with OECD (2019) definition, as we find it the most encompassing: ‘an AI system is a machine-based system that is capable of influencing the environment by producing an output (predictions, recommendations, or decisions) for a given set of objectives. It uses machine and/or human-based data and inputs to (i) perceive real and/or virtual environments; (ii) abstract these perceptions into models through analysis in an automated manner (e.g. with machine learning), or manually; and (iii) use model inference to formulate options for outcomes. AI systems are designed to operate with varying levels of autonomy’. Note that the OECD definition is method-agnostic, meaning that it doesn’t prescribe how the AI system achieves the result. This allows encompassing various approaches to AI such as symbolic, cellular automata and connectionism. (Boden, 2016)
We take this definition and specify the methods that currently prevail in AI discourse, systems and solutions, resembling the notion of ‘modern AI’ introduced by Russell (2019). Our working definition of current AI is as follows: a variety of systems each consisting of virtual machine(s) (algorithm(s)) performing statistical learning and inference, using data of different modalities (e.g. visual and audio) and types (e.g. text, cross–section and panel) and relying on dedicated computing capacity. Taddy (2019) develops a similar definition to outline the technological dimensions of AI. Our definition is also in line with what Aleksander (2017, 2021) calls ‘smart computing’: a well-bounded (though evolving) category of algorithmic processes that yet require human design as well as interconnected supporting technologies. Current AI, as smart computing, is unrelated to intelligence at large as found in humans. Importantly, rather than a monolithic technology, current AI is a label that envelops a range of information technologies working according to a common principle: that of predicting output based on patterns inferred from input data. In sum, current AI systems are statistical engines with unprecedented capacity to identify and encode correlations in training data.
State of AI
Artificial intelligence, being a technological mirror of ourselves, is inevitably compared to natural intelligence. A seemingly philosophical question of whether or not AI possesses a ‘true’ intelligence has very tangible technical and socio–economic implications. For instance, cognition and understanding are among the criteria as well as fields of ongoing research that separate the so-called weak AI from strong AI. The distinction is based on the fact that the former only emulates intelligent behaviour, while the latter aims at re-creating it. Emulation of intelligence can be achieved through various methods; when a set of methods with common theoretical underpinnings gains traction, it forms an approach to AI, such as the symbolic and connectionist approaches mentioned earlier on. However, the question of how to re-create intelligence to reach emergence of cognition and understanding in algorithms remains yet unanswered. Hence, we can claim that the current state of AI belongs to the weak type.
For what concerns the technical side, to be considered strong, AI must exhibit understanding. In what follows, we dissect this concept within the natural language domain due to its inherent familiarity to any reader. Understanding, as found in humans, is the capability to infer communicated meaning given natural language expression (Bender and Koller, 2020). This definition of understanding clearly separates meaning (the former) and the form of language (the latter). In turn, Language Models (LM) (regardless of their architectural complexity) are systems that perform string prediction – that is, statistical prediction of a character, word, or phrase – given preceding and/or surrounding text (Bender et al., 2021). The combination of these two insights makes it evident that LM are trained solely on the linguistic form (vast collections of text) while meaning remains beyond their grasp. Progress in leveraging context has been made to capture increasingly longer sequences – from few preceding tokens (e.g. N-gram model) to paragraphs of text (e.g. Transformers) – with architectural innovations in LM as well as with increasing size of models and training datasets that allows the detection and encoding of finer and finer details of linguistic form. In other words, LMs are getting increasingly good at reproducing the form of language, but come nowhere near its meaning and, hence, to understanding. The limits of the statistical approach appear even more prominent when put in perspective: if capturing longer dependencies in text based on co–occurrences and distributions to emulate understanding (i.e. reproduce linguistic form) is a challenge, the construction of a robust and functioning representation of an even simple world using this approach is an intractable task. Instead, human-like understanding ‘is based on concepts – internal mental models of external categories, situations, and events, and of one’s own internal state and “self”. In humans, understanding language (as well as nonlinguistic information) requires having the concepts that language (or other informa-tion) describes, beyond the statistical properties of linguistic symbols’ (Mitchell and Krakauer, 2023).
There is no doubt that statistical learning and inference play some role in the process of achieving understanding. However, human understanding is by far more parsimonious and prudent: humans do not require such vast data and do not update the entire mental model of the world every time new data arrives; it seems to be more modular and relational. Apparently, new approaches and their combinations with existing ones are required to achieve genuine understanding by AI systems and, thus, strong AI. Perhaps, the conclusion that AI needs various approaches to modeling understanding and eventually intelligence to become stronger can be conveyed in the following quote of Marvin Minsky: ‘The power of intelligence stems from our vast diversity, not from any single, perfect principle’. (Minsky, 1988)
Given that we discuss the socio-economic side of AI at length in the remainder of the paper, here we limit ourselves to only few points that directly descend from the technical side. As weak AI inherits the same limits of the statistical methods it relies on, it poses challenges for development and creates risks of harm once deployed. One example of a challenge current AI faces is dealing with non-transitive alternatives, that is, dilemma, in various applications. One of the proposed solutions in such cases is of probabilistic nature: the usage of mixed strategies and/or fuzzy ranking. However, if an application is characterised by high-stake loss function, that is, the cost of mistake is high, probability-based solutions are dubious both legally and ethically. The Moral Machine experiment illustrates this challenge perfectly (Awad et al., 2018). In a nutshell, a large scale survey was conducted to elicit preferences of people across the globe with regard to variations of the trolley problem in the context of autonomous driving. Here, we want to clarify that the problem in question is not the incapability of AI to find the solution of a fundamental moral dilemma; instead, it is the potential to scale up a certain level of damage that is inherent to the probabilistic solution as the deployment scales up. For what concerns risks, current AI systems are sensitive to representativeness and quality of data (the ‘garbage in, garbage out’ principle) and (adversarial) attacks aimed at distorting or ‘polluting’ statistical co-occurrences in the data teaching the system to behave oddly and/or to produce harming output (Marjanovic et al., 2021; Shumailov et al., 2021). Even in presence of high-quality data and no malicious intervention, optimisation along a ‘flat’ target function might lead to a sub–optimal or detrimental outcomes (Lambrecht and Tucker, 2019). The identification of unintended harms and the monitoring of purposeful attacks in order to remedy the situation require (i) commitment on the side of AI–proprietors and (ii) high costs in terms of time, programming effort, computing power and energy, new tests, and even environmental toll (Bender et al., 2021; Strubell et al., 2019).
System components
Regardless of the methodological approach, any AI technology necessarily consists of the following components: (i) algorithms or virtual machines, (ii) computing power (and related physical devices delivering it), (iii) data, and (iv) domain structure as the problem environment and search space of actions an AI system is working with. This description builds on Taddy (2019) with some variations, and is in line with the OECD definition provided earlier, now stressing the components the AI system consists of.
Each domain is characterised by its own market structure, business models, regulations and pace of development. For instance, the semiconductor industry as the producer of hardware for (ii) is distinguished by well-defined but complex value chain (VC) with pockets of market power concentration and hence high entry barriers; it leverages clear technical opportunities (e.g. Dennard scaling principle) 2 and economic mechanisms such as capacity races and economies of scale (Prytkova and Vannuccini, 2022).
The data domain is less defined and, therefore, less constrained (Koutroumpis et al., 2020a). Data markets are younger, less regulated, and by the virtue of intangibility more dynamic. In the increasingly digital economy, the importance of data as a commodity or production factor is growing so does the need to define data markets and understand their mechanisms. This is particularly visible in the context of and due to (i) privacy (Sveinsdottir et al., 2020) and (ii) competition in and for data markets (Jenny, 2021). For what concerns the latter, it is worth stating explicitly that data is frequently the source of supply–side network effects, spreading gains across related markets.
Last but not least, the software domain of AI resembles something in–between the hardware and data domains in terms of its market conjuncture. On the one hand, software markets existed before, so the mechanisms at work are well-studied; however, AI programs are not a usual download–install piece of software; at the moment, they require a whole stack of libraries and interfaces to be produced, integrated, and utilised. 3 On the other hand, this domain is still less monopolised and more open-sourced than the other two, even though this property might vanish as the presence of big firms induces smaller players to pursue the so-called ‘innovation-for-buyout’ strategy (namely, innovating just for the sake of signalling value and be acquired) (Cabral, 2018) to scale-up effectively.
Outlook
Concluding the entire Section, it is worth repeating our main claims: our object of study is current AI, which belongs to weak AI with rather statistical underpinnings above all other existing approaches. The system-ness of AI must be accounted for as a research priority because it has far reaching technical, social and economic implications, which we continue to unpack and discuss in the remainder of this paper.
Unpacking the AI data domain
It is useful to start our analysis by describing challenges of AI ‘in the wild’. In this subsection, we unpack the AI data domain to illustrate the incentives and bottlenecks it creates, the strategic challenges for AI development linked to it and, in general, the complexities related to AI deployment and value capture that involve data. This will provide a contextualised ground to our claim that a system perspective can offer novel and important insights on AI and will shed a light on the properties of AI that our organising frameworks can help scaffolding. Furthermore, it allows us to introduce the theme of AI policies, in particular the aspects concerning data.
Over the last decade, we observe proliferation of business models that are reliant on data monetisation (and in particular Big Data – see Wiener et al. (2020)). The diffusion of the Internet and the globalisation of markets made possible at the same time an unprecedented expansion of the consumer base, a boom in the amount of offers from businesses of all kinds, and drastically lowered the related (information) search costs and the cost of tracking the consumption behaviour (content, goods, services, etc.) of online users (Goldfarb and Tucker, 2019). Atop of this abundance of data, new market opportunities for businesses that collect, store, structure, and elaborate the data grew rapidly: online databases, search engines, consulting firms, digital platforms, software management systems and many other examples of data-fuelled businesses. This is a key transformation: where there is data, there will be AI. AI has the potential to spread into applications where (i) data is generated and can be collected in sufficient amounts and (ii) its structuring and elaboration creates value added for the business.
Getting the data
First, in order to deploy current AI to support any given application, an established and systematic process of data collection is required. In other words, the implementation of AI requires a good representation of business processes (essential or not for a firm) in data – namely, their digitisation. This is why pioneering industries in AI adoption have been the likes of Fintech and logistics, which are characterised by highly digitised and measurable processes and had forms of algorithmic automation and optimisation already in place. The current AI systems expanded the set of ‘digestible’ data by improving capabilities of processing of sensory data such as images, video, audio, making activities that involve these capabilities cheaper (and thus economically viable), and less labour and time consuming. Thus, the current AI expanded the frontier of automation.
The existence of data does not automatically make the case for an AI application. Sometime data might exist but its accessibility could be either hindered, inefficient or even welfare-damaging. This is partially due to unresolved data ownership and absence of mechanisms such as data markets to coordinate data supply and demand which would ensure the lawful and effective exchange of data ownership rights. An insightful summary of the situation with data markets is expressed in a quote of Edward Snowden: ‘there is no property less protected and yet no property more private than data’ (Snowden, 2019). In some applications, data is a mere representation of an environment’s state or processes (e.g. temperature control in data centres). However, when data is an imprint of activities conducted by actors, individuals or organisations that are external to the owners of AI systems, then data might be considered as a property of the actors that created it (Jones and Tonetti, 2020). Said differently, when data is a public good, ownership issues do not emerge, while the elaboration of data, which has the nature of a private good, requires solutions that address simultaneously consensual data transfer and privacy concerns (personal data that owners might either sell at a very high price or not to sell at all).
In sum, the collection of data that reflects business processes including demand’s feedback loops and the establishment of data markets is a necessary, though not sufficient, prerequisite for AI deployment.
Choosing monetisation strategy
Second, to persist as a useful technology within an economic activity, data elaboration performed by AI has to bring returns. The value of data elaboration can lie in harnessing vast amounts and/or complexity of data detecting patterns in time infeasible for humans. Retrieving information about, for example, highly non-linear relations between a set of covariates and whether or not a person has clicked on an ad is undoubtedly a useful insight, but in order to systematically turn this information into profit a firm has to build a sustainable business model to monetize on it. Monetisation strategies can vary across applications, which in turn are characterised by different payoffs from the implementation of AI. For example, for online retail, the monetisation strategy would involve the structuring of pricing and versioning of the offer given the association revealed by data elaboration. This strategy allows obtaining profit directly and from each offer independently. Differently, an AI algorithm that controls an industrial robot through the processing of sensory data and producing an adequate response in order to perform a task creates value added that is more implicit and grows in a non-linear way with the scale of deployment of the AI technology.
In sum, all kinds of data elaboration done by AI has to produce either valuable/unique result in the firm’s production process or contribute to a valuable offer to the consumers, in both B2B and B2C markets, to ensure retention and generate profit.
Investing in complementary assets
Third, sustaining the monetisation strategy requires investments into complementary assets that constitute and/or support the AI system. The costs of primary collection or acquisition of data from third parties (e.g. the purchase of database licences, cookies or data appends – see Bergemann and Bonatti (2019)), the storage within a firm or purchasing cloud space in order to further elaborate the data with AI, or even contracting micro-work to conduct data annotation (Tubaro et al., 2020), constitute yet another part of the data–related decision-making. Depending on the revenue stream from the activity that involves AI, a firm has to choose between investments into the development of AI systems at least in part in–house (including all domains – data, hardware and software) or in partnership with AI solutions providers at various links of the AI value chain. The choice between the two alternatives shapes the distribution of market power among actors in the nascent AI industry. Obviously, small and medium-sized enterprises gravitate towards outsourcing option to minimise costs. Even big companies for which AI performs a side function would be prone to purchase customised but ready–made AI solution, benefiting from sharing the risks and legal responsibilities with the developer. Indeed, among AI–users the emergent strategy of ‘join-and-share’ AI-as-a-service solutions due to the high costs of every component of AI systems steers AI development towards a form of infrastructure, with the most powerful actors (AI–producers) meticulously building and piecing together the AI stack. Moreover, the burden of high costs is coupled with cross-domain network effects. For example, depending on the application, the nature of data might vary – pixel matrix for images, text corpus for legal disputes, or panel data for consumer databases. This affects the choices and developments in the hardware domain (bandwidth capacity, memory size and placement, parallel or sequential processing and so on), programming framework (programming language, libraries), and algorithms themselves (loss function, optimization procedure). Together, the initial costs of implementation and cross-domain network effects increase the switching costs of an alternative to any component and lead quickly to hard lock-ins for both supply and demand in the software and hardware domains.
In sum, AI adopters make a choice on how to deploy AI-based solutions and invest in the respective complementary assets. This creates a demand–pull effect steering the innovative efforts of AI–producers further along existing technological trajectories. The opportunity costs in this situation might be substantial, as alternative trajectories are locked out by prohibitively high switching costs.
Outlook
Now we have an understanding of the role the data domain plays in businesses that aim to use AI systems. An economic activity performed with AI (partially or entirely) immediately implies value creation tied to data through the several channels we described above. The degree of reliance on AI systems differentiates users in their preferred mode of AI consumption. Policies that aim to ensure a balanced and beneficial AI development have to account for the preferences of various user markets and, simultaneously, of AI producers. The task for policy-makers is to make sure that arguments related to high costs and strong network effects (both supply– and demand–side) are not used as a justification to tilt the development of AI systems towards an inefficient and/or monopolised realisation. Inefficient in technological sense might mean avoiding following the principle ‘the bigger the better’ in terms of ever-increasing size of data and algorithms, amount of compute, number of processors, and so on for the sake of marginal improvement in performance. In socio–economic terms, an inefficient instantia-tion would drain resources and resemble a skewed representation of stakeholders’ interests – AI–users, AI–producers, society, public institutions – creating dead-weight losses, violating rights, damaging competition, and producing an asymmetric distribution of gains. A variety of measures is available to policy–makers to address the challenges: from incentivising R&D investment in scalable AI techniques (e.g. federated learning and neural network compression) to promoting compatibility among already existing components of the system (e.g. consortium of AI software developers). The choice of measures to maintain an healthy competition and divert from hard lock-ins is crucial and should account for both (i) the mechanisms at work in AI component markets and (ii) the mechanisms that connect AI component markets; in other words, the system nature of AI is an eminent property that substantially influences the way AI should be governed.
In the sections to follow, we move from the specificity of the data domain and return to the entire AI system to assess how a framework used to map and study infrastructural technologies fares in capturing the essence of current AI.
Artificial Intelligence through the Large Technical System lenses
Large Technical System
Large technical systems are a specific category of technologies: they are large infrastructural and production systems (Van der Vleuten, 2009), or ‘large–scale, capital–intensive infrastructures’ (Sovacool et al., 2018) that are structured as ‘spatially extended and functionally integrated socio-technical networks’ (Myntz et al., 2019; Hughes, 1987). In other words, LTS describe system technologies, rather than isolated artefacts. LTS originality is precisely in moving the focus from single elements to their alignment into integrated structures. Examples of LTS are telecommunications, railways, energy supply and distribution systems. The prevalence of physical infrastructures does not exclude system technologies characterised by a higher degree of intangibility to be classified as LTS. In fact, Ewertsson and Ingelstam (2004) identify information-based LTS that contain both ‘hard’ and ‘soft’ components, such as radio and television distribution networks.
The notion of LTS belongs to the fields of sociology and history of technology and science and technology studies. Since the very introduction of the concept (Hughes, 1983; Hughes, 1987), LTS is also considered a specific approach to the study of technology that places emphasis on connectedness and ensembles. For the aim of this paper, the value of LTS lies in two aspects: first, in the interpretative power it offers to describe the features of specific technologies (in our case, AI). Second, in the possibility to use the phases along which any LTS evolves as a foresight device to explore how a given system technology can unfold over time. These two dimensions are related, as different LTS driving forces play a different role and have different relevance in different phases. Overall, LTS is a powerful epistemic device because it has a combinatorial nature: it endows researchers with a rather flexible scheme that permits to place under the same family of technologies different configurations of a given system, as these are the result of specific combinations of LTS elements and phases.
An LTS is the result of a series of driving forces and features: system builders, reverse salients, load factor, technological style, and momentum. We describe these next.
System builders
System builders are the actors that strive to extend the reach of the system and perform the ‘sociotechnical integration’ necessary to its deployment (Van der Vleuten, 2009). These can be inventors–entrepreneurs or managers with engineering capabilities, individual actors or large firms. In different phases, system builders can work to align the interests and objectives of the different actors involved, allowing an LTS to grow and achieve its goal(s).
Reverse salients
Reverse salients ‘are components in the system that have fallen behind or are out of phase with the others. Because it suggests uneven and complex change, this metaphor is more appropriate for systems than the rigid visual concept of a bottleneck. Reverse salients are comparable to other concepts used in describing those components in an expanding system in need of attention, such as drag, limits to potential, emergent friction, and systemic efficiency’ (Hughes, 1987). Reverse salients, emerging from the uneven development of the system’s components, are sources of critical problems and, given that problems are typically focusing devices to allocate innovative efforts (Rosenberg, 1969), they are also potential loci of innovation.
Load factor
Load factor is ‘the ratio of average output to the maximum output during a specified period’ (Hughes, 1987) and, hence, an indicator of performance, here meant as use or deployment of the technology at full potential over time. The distribution of load factor indicates when and where the system is under stress. Knowing that can guide investments in capacity expansions or adjustments, as well as policy interventions.
Technological style
As for the common use of the word, style indicates a type of fashion: the specific design of a particular LTS that descends from choices regarding which features are emphasised, and in which way. An LTS technological style emerges from the particular choice and combination of its elements, given their relative importance and the specific role they play in the whole system. LTS executing the same function and aiming at the same goal can differ in style in different contexts. For example, the organisation and control structure of energy distribution systems can change across countries while the fundamental function and goal they pursue are comparable.
Momentum
Momentum, or dynamic inertia, is the degree of autonomy the LTS acquires once it reaches a certain stage of development and a ‘mass’ in terms of relevance for the economic system. Systems with high momentum are less sensitive to pressures for change – they continue their motion undisturbed.
The LTS elements we listed have an eminent socio-technical flavour. For example, the concept of system builder has mostly a social dimension, while reverse salient and load factor are aspects related to engineering and technology. Many of these concepts have closely related siblings in the other sub-fields of innovation studies. For example, reverse salients approximate bottlenecks; momentum approximates path dependence and cumulative change. However, their engineering or social angle makes them more sophisticated categories to label complex phenomena, and useful to capture the features of system technologies that are uniquely embedded in specific epistemic communities, regulatory settings, and cultural contexts. A system builder can be an entrepreneurial actor, but also a carrier of a rare combination of technical and social skills (and, potentially, power). Momentum is close to path dependence, but path dependence is a process that emerges from chance and choices, while momentum is a later-phase property of a system that keeps existing and functioning due to mass and acquired autonomy, thus refusing any role to chance.
In terms of the phase an LTS experiences from its birth to maturity, Hughes (1987) lists (i) invention, (ii) development, (iii) innovation, (iv) growth, competition and consolidation, and (v) technology transfer. The latter is characteristic of LTS: technology transfer occurs when a system developed in a given context is replicated in other environments, and can happen in parallel to other phases. More recent work added new phases experienced by mature LTS, such a stagnation, reconfiguration and decline (Sovacool et al., 2018). Gokalp (1992) stresses another key property of LTS: they develop by layering up over existing systems, creating a superposition of systems that influences the LTS configuration. The superposition of systems is characteristic of infrastructural projects and is, therefore, an essential feature to look for when assessing whether a technology can be considered an LTS.
In summary, the LTS categories can guide the analysis of a given system technology. For example, one might want to know: where is the ‘locus of control’ in the system? Which actors store and hold the relevant technological (and market) knowledge to ‘produce’ the system technology? Who advances and builds the system out of its components? Who has power on the factors constraining the development of the LTS? Which elements of the systems and related actors can facilitate the process of convergence around standards and protocols in order to improve communication and control at large? What happens if the LTS becomes so large to be unmanageable? Joerges (1988) quotes Aristotle, reminding us that when things get too small or too large ‘they either wholly lose their nature, or are spoiled’. A very timely point, when endless accounts of misuses, biases, discriminatory and malicious deployments suggest that we might be already spoiling AI.
Defining AI as LTS
We claim that LTS framework is useful to scaffold AI and describe its system–ness. Using the example of the data domain in Unpacking the AI data domain, we showed that the construction of AI depends on a complex circuit of interdependencies and complementarities. Now, to support our claim more generally we identify, element by element, the LTS features in AI. This exercise goes in line with other recent attempts to develop a language to study AI as a system or an infrastructure. For example, Crawford (2021) defines machine learning as a range of technical approaches ‘which are, in fact, social and infrastructural as well, although rarely spoken about as such’. We begin from the essential properties of LTS (being ‘large’ and being a technical system). Then, we proceed with the mapping of LTS elements onto AI ones.
Artificial Intelligence is large
LTS as a framework draws its specificity from the use of the attribute large. Following Joerges (1988) and Gokalp (1992), large can be considered in terms of territorial or user coverage, involving large-scale actors in the production of technology, or generating far– reaching socio-economic and/or environmental impacts. In this sense, large is used to label a technology that is encompassing, infrastructural, impactful, costly or global, or a combination of these properties. The attribute large is partially overlapping with that of pervasiveness, which is usually associated to General Purpose Technologies, and that we discuss in Comparing AI to other Epistemic Devices. However, largeness of LTS is a broader concept, more malleable and adaptive, and more apt to describe an infrastructural technology.
The way in which large is measured in the LTS framework allows for a degree of flexibility with respect to the dimensions employed to measure the largeness of current AI. Current AI spreads large in terms of user base and territorial coverage given the diffusion of its end-user applications such as recommendation systems, IoT devices (e.g. Amazon’s Alexa), or prompt– able interfaces. Stand-alone artefacts can have a large user base as well – think about classic rival private goods as a fridge or a hammer. However, in the case of AI, as in the case of the Internet, users ‘append’ themselves to AI solutions, in the same manner as they rely on other infrastructural services such as utilities (Steinhoff, 2018). AI is large also because it is developed and promoted by a large community of actors (developers and vendors), across its constituting domains, in complex technology stacks that feature both vertically integrated and specialised companies. The frontier of this large community is represented by large actors – large companies (the tech giants), national and supranational institutions, industrial consortia, global networks of Universities and organisations, as well as dedicated academic conferences. This creates a situation in which a large actor invests substantially into AI and provides access to it to a large user base for sharing. For example, this is the case of sharing computing facilities and storage via cloud, AI–powered software–based services (AI–as–a–service) such as visual recognition systems for airports, or API access to proprietary image and language models for developers. The economies of sharing (Shapiro et al., 1998) at work with AI make it similar to classic LTS, such as transport and energy supply systems. Finally, the societal traction of AI is large: ‘AI has seen itself elevated from an obscure domain of computer science into technological artefacts embedded within and scrutinised by governments, industry, and civil society’ (Mohamed et al., 2020); the AI debate has been popularised and is universal and touches all changes that AI might bring to contemporary societies, from its immediate effects on labour, inclusiveness, and exploitation, to more medium and long terms issues of sustainability and environmental toll, and even the value ‘alignment’ of AI systems.
In sum, AI is large according to various criteria identified by the LTS framework. This characteristic is extensively defined, encompassing and, hence, convenient for both the identification of AI as a system and its empirical analysis.
Artificial Intelligence is a technical system
The system nature of AI is the cornerstone to understand the working of the technology, and the rationale for us to propose the LTS framework to encapsulate it. The system–ness of AI is already suggested by the fact that the technology is shaped and evolves as a joint product of the three constituent domains we introduced in Our object of study: (i) the domain of AI algorithms and models that, in terms of actors involved and specific system builders engaged, is a subset of the software industry; (ii) the domain of computation, in practice constituting a subset of the hardware industry; and (iii) the domain of data generation, collection, storage, analysis, and transaction. As in a Venn diagram, at the intersection of these three domains one can find current AI. These three domains are large in their own right, along the criteria outlined in the previous paragraph. The functioning of the AI system relies on compatibilities and standards precisely as in the case of the Internet, railway networks, and energy distribution systems. Recent examples of how AI workloads are managed and services delivered globally are Microsoft’s Singularity service (Shukla et al., 2022), Meta’s AI Research SuperCluster, and Amazon’s AWS Lambda. Flexible AI workload delivery captures well the distributed, and technology–stack–embedded nature of AI.
Beyond the techno-economic interdependence among its constituting domains, the system–ness of AI arises from additional features of the technology. The first is the inner complexity of its components. The hardware and computation domain of AI is certainly complex. Consider, for instance, the decision-making involved in production of microchips capable to deliver on the increasing demand of AI compute: chipmakers need to resolve a complex technical trade-off among delivering processing speed, energy efficiency, and heterogeneous computing (Prytkova and Vannuccini, 2022). The algorithm domain is complex as well, with an emerging ecosystem of actors competing by developing proprietary AI solutions or, alternatively, relying on hubs that make available for re-use pre-trained models. As we have seen, the data domain is also complex. For example, the configuration of data marketplaces depends on whether the different actors (i.e. data buyers, sellers, exchanges and third–party service providers) are connected via arms’ length transaction or are merged in a handful of players (Koutroumpis et al., 2020a; Spiekermann, 2019; Srinivasan, 2019).
The second feature is that AI development clearly relies on a superposition of systems (Gokalp, 1992), a property that is characteristic of LTS. Similarly to infrastructural technologies as well as utilities (Steinhoff, 2018), current AI layers up on a broader technical stack of ICTs, such as telecommunication and Internet infrastructure. Within public and private organisations, elements of their information systems are upgraded by integrating AI while remaining integrated in existing organisational routines. For example, AI solutions integrate with existing database technologies, a trend already started decades ago (Brodie, 1989). Specific AI solutions might be seen as stand-alone products or services. However, their production requires the joint work of the software, hardware, and data domains, whose interdependence is made possible by the underlying digital and ICT infrastructure. The notion of superposition of systems is useful to illustrate the layered construction of the AI system: AI domains are complex systems in their own right, linked to produce the AI system technology which, in turn, is enabled by strata of system infrastructures.
The third feature relates to how AI solutions are implemented within actors’ existing processes. Using the term introduced by Arthur (2009), AI implementation is a form of ‘re-domaining’, which happens when a certain activities continues to be performed as usual, while the way in (the inputs through) which it is delivered changes. Re–domaining for AI means bringing certain activities under the umbrella of prediction technologies. This is achieved through capital deepening, that is, through the replacement of existing (digital) capital goods – software technologies – with more sophisticated (digital) capital goods – AI solutions. Capital deepening implies that, inside organisations, AI works as an ‘upgrade’ to ICT capital that enables the provision of better or new capabilities. Bresnahan (2021) elaborate on this idea, suggesting that AI solutions are implemented through system-level substitution. System-level substitution conveys the fact that entire modules of a business process (hence, a system enabling given functions) are replaced in block with different ones. In other words, substitution takes place mostly between production systems, rather than in production systems. For example, online retail replaces brick–and–mortar one, apps substitute interaction via websites, automated user support or algorithmic fraud check replace the computer-aided but human-controlled version, recommendation and search engines are substituted by chatbot–based text conversations. System level substitution naturally aligns with our arguments, as it illustrates that for what concerns AI, the system dimension permeates all levels of analysis.
Speaking of AI in LTS terms
We have established the fact that, when studying AI as a system, it maps the definitional characteristics of a LTS. Now, we can proceed with a more fine–grained analysis, discussing how the socio-technical elements and drivers of LTS presented in LTS can be useful to describe AI and rationalise its dynamics.
Artificial Intelligence system builders
AI and its constituting domains are constructed by a variety of actors that actively initiate, support and shape developments of the system. The AI system builders are AI–producing and AI–using companies, dedicated regulatory bodies, industrial consortia, non-profit research organisations. Every system builder exerts efforts to influence the selection of their priorities and problems for the system to implement and address. This can be done by trying to weave in a particular manner the network of elements in the systems (an example are tech giants hiring AI pioneers and leading figures to lead their AI programmes) or by forcing the very system to converge on new standards, protocols, and shared practices. The latter can be achieved by making obsolete or ineffective the status quo through, for instance, forking decisions (Simcoe and Watson, 2019).
Consider AI–producers. They have the power to design the system and to decide which bridges between actors and subdomains to build or cut–off. AI–producing companies are an ecosystem of firms that conduct AI research, develop AI solutions, and participate in the AI value chain. 4 Among them, key system builders are the already mentioned tech giants, and established software and hardware companies, 5 vendors, startups, and platforms. 6 In terms of main line of business activity and industrial classification (NAICS), the majority falls under the codes ‘software publishers’ (49%) and ‘Computer Systems Design and Related Services’ (17%) 7 . Jacobides et al. (2021) offer a typology of AI actors along different modes of production and consumption. Next to the tech giants, the AI ecosystem features ‘AI creators’, that produce AI solutions but rely on some basic input from the giants, such as pre-trained models or programming frameworks; ‘AI integrators’, intermediaries that re–sell off–the–shelf AI solution adding some value in this step; ‘AI-powered operators’, that produce AI for their internal operations; and ‘AI takers’, that source AI solutions to be used in-house. This typology is useful to capture the ongoing process of specialisation in the industry – a signal of the ‘coming of age’ of the commercial side of AI. One example that illustrates how system builders exert their power is the ongoing issue revolving around handling harmful AI. In this context, current commercial system builders have supported the establishment of ethical boards and voluntary guidelines rather than regulation. While regulation would impose common and accountable rules on the development of the system, commitment–based solutions can be considered strategic concessions 8 to other stakeholders (in particular regulators, consumers of AI services, and the society at large). In practice, these initiatives represent a ‘seductive diversion’ that allow AI–producing companies to show engagement while retaining full power over the design of the system. Another leverage AI system builders acquire with their role is their ‘knowledge holding’ (Steinmueller, 2006). In the production of AI solutions, novel know–how is created and knowledge about it settles in the hands of AI–producers. Through this process, AI system builders become knowledge gatekeepers that can facilitate the diffusion of knowledge and expertise through co-invention activities as well as strategically withhold it. For example, machine learning platforms – open or closed source (Isdahl and Gundersen, 2019) – might improve access to and reproducibility of AI solutions, however at the cost of product development knowledge being held for a larger share by the platform owner. The mode of access to AI input is strongly linked to monetisation and value capture strategies: for instance, API access favours the appropriability of the AI engines behind specific commercial solutions and, thus, the concentration of knowledge holding upstream.
System builders in AI are changing over time, and their variety is increasing as the system gains complexity. AI companies superseded individual AI pioneers and universities’ computer science departments; at the same time, they are accompanied by governments and ‘lateral’ organisations. The latter are, for example, advocating to make the system more inclusive and less harmful (e.g. AI Now and DAIR research), pursuing technical advancements through non-profit or research organisations (e.g. Anthropic and Adept), facilitating coordination on principles and standards (e.g. the Partnership on AI), focusing on the existential risk linked to the possible emergence of strong AI (e.g. the Future of Humanity Institute), or acting as basic research labs for larger actors (e.g. OpenAI for Microsoft or DeepMind for Google), in a fashion resembling the Bell Labs (Gertner, 2012). Another type of system builders, much less empowered than the ones mentioned earlier on, are the (platform) workers that support the deployment of AI systems and that are subjects of processes of ‘heteromation’ (Tubaro et al., 2020). 9 These workers operate at the margins of AI and fill gaps in the working of the technology – they run the so-called ‘AI last mile’, labelling or moderating the data necessary for the training of algorithms, verifying their performance or even emulating the functioning of AI systems when these are not yet functional or cost-effective to use.
Artificial Intelligence reverse salients
As the system scales and becomes larger, tensions appear. These fault lines are the reverse salients of the system.
One recurring source of reverse salients in AI is the scarcity of resources in AI’s constituent domains: being a nascent industry, the provision of AI capabilities can be subject to shortages of input resources. The shortage is relative among domains, that is, the worst performing domain is a source of reverse salient, which can be of quantitative or qualitative kind, that is, respectively, delivering an insufficient amount of an input resource, or a qualitatively inadequate input. This might hold back or derail the evolution path(s) of AI. Quantitatively speaking, AI is data-hungry, but other resources whose demand grows faster than supply can become constraining factors as well: among them, AI talent, and management trained to lead AI-powered companies. Qualitatively speaking, computation is a reverse salient because the delivery of compute might not be tailor-made to the needs of AI algorithms (Hooker, 2020; Prytkova and Vannuccini, 2022). Atop of the purely technical challenge, there is also a techno–economic tension: the competition between cloud and edge modes of organisation of the computing infrastructure. These two modes of delivery of compute have different degrees of appeal to different system builders. Actors already specialised in value capture from devices might bet on AI built around edge features and constraints; actors that already invested in cloud business models could push for AI development tailored to cloud–based service provision. The relative economic success of one or the other model will feed back on the direction of progress in AI. More specifically, the cloud mode entails allocation of resources with priority on coverage and speed of access networks, and on computing capabilities of cloud–providers. The edge mode emphasises connectedness (which is not the same as coverage), compatibility, and computing capabilities of edge devices. The resolution of this qualitative reverse salient from the computation domain will shape AI system with regard to the organisation of computation: inside computing devices (chips) among its components, as well as inside the industry among producers and users.
Another source of qualitative reverse salients is the absence of compatibility and interoperability within and among domains. One example is the lack of a well-designed and regulated architecture for data troves. AI systems can be trained on (i) public dataverses and open data, (ii) AI–producers’ proprietary data, or (iii) data supplied by data marketplaces. These alternatives carry with them different implications in terms of reverse salients. In the case of (i), the reverse salient of data supply entails setting up clear rules for fair access to (and profiting from) data commons. For example, if to the exploitation of publicly available data by a private actor (i.e. a company training image generation AI models) corresponds the release of AI models to developers and final users either open source or via API, AI developments might accelerate on the algorithm domain, following a non-linear circuit that is characteristic of a complex, system technology. In the case of (ii), the limiting factor to AI development is the siloing of the data domain and the emergence of an oligopolistic market structure that can give rise to collusive behaviours, concentration of benefits, and even slowdown of AI developments if the data assets are hoarded and made un–accessible. In the case of (iii), the brake to AI progress can be related to the strategically extractive behaviour of data intermediaries.
Other examples of compatibility–based reverse salients can be found in the domain of AI algorithms. One lies in the proliferation of AI models, as more models equal multiple lines of research, duplication of efforts, less pooling of resources, and possibly coordination failures in the direction of technical advances. The reverse salient can be addressed in different ways, many of which are bottom–up initiatives, such as the organisation of contests 10 , or the launch of open-source initiatives to assist the coherence of the community and the development of cross–compatibilities, the establishment of standardised libraries and programming frameworks, and more fundamental theoretical and technological advances (Ben-David et al., 2019; Geirhos et al., 2020; Marcus, 2020). This reverse salient creates a disequilibrium but also an opportunity for entry in the AI value chain. For example, model hubs such as the platform Hugging Face 11 favour experimentation and evolvability, lower the cost of coordination, and induce convergence towards a smaller set of underlying model architectures. This is precisely what seems to be happening in the domain of large language/foundation models, with the Transformer architecture becoming the established underlying structure of many widely used models (Amatriain, 2023).
The discussion around reverse salients in the data and algorithm domains makes a good case for the idea of treating AI as a system technology. An even stronger case is made by stressing the cross-domain (hence, systemic) influence of reverse salients. This was already implicit in the discussion so far; we now discuss it explicitly. For instance, despite the stubborn focus of the industry on ‘the bigger the better’ principle behind the launch of ever–larger neural networks, performance improvement is function of both model scale and data size. Hoffmann et al. (2022) find that, for Transformer–based large language models, the focus on model scaling (increase in parameters) shifts away attention from the required amount of training data. The authors were able to outperform some of the largest models at the time (e.g. Gopher or GPT–3) with a compute–optimal, much smaller model (dubbed Chinchilla) trained on a data set four times larger than those used by the competitors. Hence, improvements in one AI domains (algorithms performance) are bounded by the data reverse salient.
Similarly, reverse salients originated in the domain of algorithms have implications for the hardware domain: ad hoc AI algorithms appeal to smaller demand and have short-lived returns, quickly becoming obsolete. At the same time, the design and production of hardware that caters the needs of an ad hoc AI solution has high sunk costs. Therefore, the resolution of reverse salients in the algorithm and hardware domains is entangled, and both remain in a turbulent state until a dominant design emerges in either of the domains. Though connected, the development of the two domains remains driven by rather separate criteria, treating each other as a source of constraint for its own performance rather than a tandem partner with a common goal. The current AI shock shed light on this issue, exposing the shortcomings of such divided approach; several studies call for an expansion of performance criteria into various joint efficiencies, beyond separately accuracy in the software domain and processing speed in the hardware domain (Chen et al., 2019; Hooker, 2020; Prytkova and Vannuccini, 2022; Strubell et al., 2019).
The resolution of cross–domain reverse salients will produce positive feedback and virtuous cycles of AI progress: for example, the already mentioned foundation models (Bommasani et al., 2021) based on the Transformer architecture seem to be an area of convergence across AI communities – and, thus, as a potential dominant design – given their performance in treating multimodal data such as text and images combined, so much that some research groups envisage the emergence of ‘generalist agents’ grounded on multi-modality (Reed et al., 2022). The more these models area adopted and developed, the less binding the algorithm–related reverse salient. As a result, this increases the incentive for researchers and companies to compile multimodal dataset, which addresses in part the data related reverse salient, and open room for the advent of new AI applications, such as generative AI (Radford et al., 2021; Ramesh et al., 2021) or the embedding in physical technologies such as robots (Ahn et al., 2022; Driess et al., 2023).
Artificial Intelligence load factor and momentum
A system evolves by solving tensions created by reverse salients, producing what Sahal (1985) defines learning by scaling. Learning occurs by accumulating understanding on which changes (innovations) to implement, and how to redistribute the load in the system while it scales up. This means that mapping the load factor in the system can guide directed interventions to address salients and release resources to accelerate the system’s evolution, as well as monetisation. For example, AI companies can design third–party access to their models in order to maximise value extraction from the load distribution. One way to do that is to offer price differentiated services, with free access with queuing, and paid access for premium performance. Nightingale et al. (2003) suggest the notion of ‘economies of system’ to explain the gains a system can enjoy by redistributing activities according to the load factor, dynamically balancing the stress. 12 Economies of system in AI would occur by rearranging the structural dependencies among its elements when some of them develop unevenly or are overloaded. The shift to federated learning architectures (Li et al., 2020) represents a system re-arrangement towards a design potentially capable of addressing the computation–related reverse salient: this would be done by distributing workload over the edge components of the system rather than leaving few giant actors to route (and control) finite computing power in the cloud. Scaling up an infrastructural technology is not a trivial task. For the case of the Internet (the LTS that in our opinion resembles current AI), Monteiro (1998) described the ‘sociotechnical negotiations’ required to revise the Internet Protocol IP as the whole information infrastructure scaled. Similar sociotechnical negotiations take place between AI system builders, as their roles and powers to shape the system change together with the nature and application of AI itself.
Concerning momentum, the growing number of actors jumping on the bandwagon of AI successes, as well as the grandiose media coverage of AI advances and the expectations of further ubiquitous diffusion of AI, build up a considerable one. However, expectations can work in both positive and negative direction. On the one hand, they channel large investments in public and private AI R&D and venture. On the other hand, the expectations of a large and ubiquitous impact of AI risk remaining unfulfilled: sustained commercialisation and growing competition among system builders make them race against each other, undertaking myopic steps in AI development leading to short-term payoffs. Stagnating diversity of AI research is among the early signs of such dynamics (Klinger et al., 2020). When the expectations that a new AI winter might be at the horizon will start to be considered plausible and the AI hype will begin slowing down, the momentum of the system might follow a similar path.
Current phase
The ‘invention’ phase of AI is a contested territory, as the very understanding of what AI is shifts over time; this is why in Our object of study we drew the boundaries of our unit of analysis to current AI. However, following the impressive results in the ImageNet visual recognition competition in 2012 and the subsequent media interest in AI – mostly due to the shadows AI seemed to cast on the future of work – the AI system was launched into the phases of intensive invention and development. From the perspective of compute alone, Sevilla et al. (2022) distinguish three phases: the pre Deep Learning era, the Deep Learning era, and the Large-Scale era. From the overall LTS perspective, we can place AI in-between the phases of innovation and growth, competition and consolidation, with commercialisation accelerating its pace and increasing technology transfer from academia to business, including a sizeable talent drain of professors and graduate students (Zhang et al., 2021). The process of growth by expanding to novel application fields generates continuous feedback into the phases of innovation and development. Technology transfer has also accelerated with the increasing efforts of national, supranational and sub-national institutions to govern AI developments as the technology has acquired geo-strategic significance. 13 An interesting question is whether AI experiences a process of so-called ‘technological convergence’. Technological convergence, a concept introduced by Rosenberg (1963), is a form of ‘upstreaming’, a process occurring when an activity embedded within diverse sectors or/and tasks exhibits some common features and principles that eventually mature and unbundle into a fully-fledged sector on its own. 14 We see signs of this process at work in the evolution of the AI: AI–producing companies – key system builders – emerge as specialised suppliers of AI–as–a–service tools, business automation services (e.g. recruitment), scientific discovery tasks (‘science–as–a–service’) and data analytics tasks. Traces of technological convergence signal that we are entering the consolidation phase of the AI system: the system reached the critical size that induces vertical specialisation along the supply chain. The latest stage of this process is the development of interfaces to facilitate the direct interaction of final users with AI solutions. This is the natural consequence of the emerging prevalence of language models amongst AI applications, as interactions based on natural language are key to the seamless integration of AI into ‘the fabric of everyday life’. The discussion of the phases of the AI system is important as it allows to place the technology in context, along an evolutionary trajectory. Moreover, as we already mentioned, LTS phases can be used as a predictive device to foresight possible scenarios. Sovacool and co-authors (2018) point out how LTS can experience later–stage phases of stagnation, reconfiguration, and decline. In these phases, some system builders might decide to adopt extractive strategies, sacrifice product quality for the sake of immediate returns, and concentrate further the control over the key input factors. Hence, anticipating the possible outcomes of upcoming phases of the LTS might inform the actions of policy makers and other system builders that prioritise the rebalancing and relaunching of the system.
Control mode
A challenge for any LTS is the possibility that diseconomies of scale appear during the growth phase, leading the system to lose internal coherence and to face ‘crises of control’ (Beniger, 1986). Under control we understand mechanisms of coordination on progression and management of the deployed applications. These mechanisms can feature the participation of different actors, and among these policy makers can play a fundamental, facilitating role. The issue of control arises when these mechanisms are non-existent or do not function efficiently. Nightingale et al. (2003) study and provide examples of innovations in control technologies as tools to retain control of LTS as they scale up. The issue of control does not appear as prominent in AI as in LTS such as railway or telecommunication networks. This is due to the heterogeneity of AI system’s components, which is therefore less coherent as an ensemble, but also due to the fact that AI is yet a ‘young’ infrastructure. At the moment, from the control mode perspective, AI is closer to the Internet than to integrated transport systems. As with the Internet, AI displays a mix of centralised and decentralised mechanisms of control and a layering up of commercial and non-commercial areas of development (Greenstein, 2020). Coordination among the system builders is achieved through the convergence on standards and interfaces, which is a non-frictionless process. In AI, there are not yet fully agreed standards for progression, nor an essential need for the centralised maintenance of coherence among the system’s parts; also, the actors do not have to be explicitly aware of the infrastructural nature of AI in order to conduct their operations. Therefore, the minimum requirements to make the AI system work and to avoid the system falling apart are lower than for other infrastructural technologies, while the social impact is potentially larger for AI. This does not mean that AI evolution will continue to follow the same loosely coordinated path: as the AI system consolidates and technological convergence takes place, control will become an increasing relevant challenge.
In the current phase, the system already gained significant momentum, but ‘diseconomies of scale’ have not occurred and internal coherence has not been upset due to the compartmental structure of AI. Instead of the ‘death’ of the AI system, the failure of control mechanisms could steer AI towards compartmentalisation, proprietary integration of layers of the technological stack, and detrimental turns at crucial points of its evolution path.
Technological style
AI will display different technological styles in different environments. This is evident when considering national (and supra–national) implementations of AI systems. The ‘division of labour’ and the direction of AI development and deployment depend in part on the structure of AI– and (as discussed in Unpacking the AI data domain) data value chains, but is also strongly affected by government strategies, resulting in rather distinctive styles.
Let’s consider the role of the latter in giving AI its style. Technological style can emerge as a result of the particular policy levers and priorities the regulator decides to pursue. This is reflected in public budget allocations, which can channel funds to AI through the university system, the military sector 15 , or directly to private actors – for example, in form of financial support to AI startups. Beyond the sheer amount of expenditures and the broad direction imposed on the system’s evolution, different technological styles for AI can emerge as a result of the specific tools of technology policy used (Steinmueller, 2010). Here, top–down command and control policy actions share the stage with more bottom–up governance initiatives. Horizontal interventions, such as the design of regulatory frameworks against AI harms, misuses and biases, are part of a specific style. The creation of new dedicated institutions (Calo, 2014) and intermediate bodies to facilitate coordination in the system is another, potentially complementary, option. Policy designs such as the proposal to establish a National Artificial Intelligence Research Resource (NAIRR) in the US, namely, a cloud infrastructure to pool and share computing power for AI compute workloads, is a further direction. 16 Examples of policies that can influence technological style include the efforts made by governments to attract, retain and develop AI talent through the visa regime 17 , or the alignment of macro policy levers (e.g. immigration and trade policy) with AI–related strategic priorities. A relevant case of the latter option are export controls policies targeting the semiconductor industry, as this is the producer of key components for AI–tailored hardware and its productive capabilities are a fundamental strategic asset. The use of policy levers in strategic technologies such as AI is not a novelty: the just cited semiconductor industry has been subject of trade policy interventions to shield domestic companies against emerging competitors (Langlois and Steinmueller, 2000). This point highlights the tight link existing between the actions leading to the development of a technological style and the competition between institutional actors at the international level. AI becomes one of the territories over which geopolitical forces compete, so much that some authors discuss whether an AI ‘arms race’ is ongoing (Asaro, 2019). In particular, Ding and Dafoe (2021) highlight how the ‘infrastructure–strategic logic’ underlying the identification and provision of strategic assets is relevant to frame AI because AI solutions are used as a platform to upgrade other important technologies or services.
Goal orientation
We can exploit the LTS framework to assess whether AI as a system technology has an identifiable goal orientation. LTS are goal–oriented systems; this means that all components coordinate – easily or at the cost of frictions, negotiations and forced adjustments – to achieve an overarching aim. This characteristic is easy to detect in LTS such as transport systems or water distribution networks (the goals being, respectively, mobility and water supply), while it is less evident for system technologies such as telecommunications, the Internet, or AI. One reason for that is these are highly heterogeneous technology systems, less prone to have centralised control being exerted on their applications and market conditions. We posit that AI as a system has a characteristic goal–orientation, even if latent yet. The goal of the AI is a ‘cybernetic’ one: to re–domain the ‘fabric of control’ of socio–technical systems based on human decisions into an automated one, starting with the transformation of existing activities into instances of adaptive prediction. In other words, the end game of current AI deployment is not the reproduction of cognition in vitro, as discussed in Our object of study. Rather, it is the expansion of the reach of autonomous system to every aspect of social life. This idea goes in line with recent explorations in the field of information systems, where the emergence of ‘metahuman systems’ as ‘new, emergent, sociotechnical systems where machines that learn join human learning and create original systemic capabilities’ (Lyytinen et al., 2021).
If we consider the necessary step to complete any given task as described by Agrawal et al. (2019), namely the elaboration of input data, judgement, and decision-making, and action and feedback, the AI system must be able to permeate all of them and perform them in a closed loop in order to pursue its cybernetic goal. This requires capital deepening, as we discussed earlier on, and depends on coordination and advances in all AI domains. An example of task controlled by AI along all stages is the industrial control system of cooling facilities in Google’s data centres that went completely autonomous in 2018. 18
In summary, to use the terminology of Flueckiger (1995), the goal of AI as a system is to shift further the balance from economies based on operations of transformation to economies based on operations of control – and to automate these. Whether this is a desirable outcome in terms of societal welfare as well as human progress is a normative question – one that requires democratic and inclusive discussion and evaluation.
Outlook
The study of AI we conducted through the LTS lenses is useful in at least three ways. First, it allows to take snapshot of the forces at work that are shaping AI as we write. This exercise can be repeated in the future, and further studies can go more in depth in specific aspects of the AI system, while maintaining a networked view on the interlocking nature of AI domains. For instance, it is possible to focus on those system builders advancing open source business models in the algorithm domain to single out the competitive challenges they introduce in the system, and the imbalances they can create in the other domains.
Second, our analysis provides a tool to identify emerging dynamics that descend from the fundamental mechanisms we outlined. For example, cross–domain constraints might relax or tighten-up, but will not disappear as they are essential features of the system. Consider the following scenario: what if a few system builders integrate their activities across all domains, relaxing cross–domain reverse salients but siloing AI developments and shaping them to mirror the priorities set by their business models and monetisation strategies?
Third, the bird’s eye view analysis we introduced can help detecting signals on possible future scenarios. Again, the progressive consolidation of the industry and the co–existence of vertical specialisation along the AI value chain with many specialised companies together with a full-stack integration as a strategic move by the largest actors suggests that the AI system might move towards a bi-modal structure. This will be characterised by a handful of dominant actors shaping the rules of the game across all domains of AI, and a parallel universe of small actors capturing (less) value by delivering specific AI solutions. The ‘language turn’ of AI sets the stage for an increasing focus on the interfaces between algorithms and final users. This might hide in the background the dynamics happening in the other components, giving leeway to the most powerful system builders to craft the system while the presence of AI becomes an accepted fact, an essential facility indistinguishable from the fabric of everyday life.
Comparing AI to other epistemic devices
In Artificial Intelligence through the Large technical system lenses we showed how the LTS framework can be useful to map the system–ness of AI, and in particular the dynamics shaped by the interlocking parts that produce AI solutions. In this Section, we discuss whether other epistemic devices, usually associated with radical innovations, capture as well certain essential features of AI, and how they fare in comparison with the LTS framework.
General Purpose Technology
A rather consensus idea in innovation studies is that AI is a so-called General Purpose Technology (Goldfarb et al., 2019) — a particular kind of radical innovation with pervasive diffusion and enabling capabilities. 19 This idea is predicated upon the wide range of applications of AI, its expected pervasive diffusion, it rapid-paced improvements, and its innovation-inducing effect.
To some extent, AI features seem to be in line with the characteristics of GPTs. According to Bresnahan and Trajtenberg (1995) definition, a GPT is a technology that displays (i) general applicability, (ii) technological dynamism, and (iii) innovational complementarities. General applicability refers to the pervasiveness of GPTs. A GPT can be used as input or core component by a wide array of downstream (user) economic activities. Technological dynamism suggests that a GPT should display a steep learning curve in performance and/or efficiency – namely, a fast ongoing improvement pulled by the growing expenditure in the technology coming from downstream. The property of innovational complementarities, or innovation spawning, captures the fact that GPTs are enabling mechanisms, as they induce or reinforce innovation incentives in the industries that use them as an input; in turn, GPT–induced user innovation feeds back in higher innovation incentives at the GPT level.
GPT and its diffusion is an ideal type of a particular kind of technological change: one leading to the adoption of a radical innovation, used as a core component, across very diverse economic activities – in the limit, to ubiquitous adoption and possibly to a technological revolution (Perez, 2010). For this reason, the label of general–purpose has been assigned to many emerging technologies with transformative potential. 20
AI can be considered through the GPT framework: AI introduces a different logic (grounded on statistical inference) to address existing tasks and to create new ones – in other words, it is a type of radical innovation – it is used in a wide range of applications, it evolves rapidly, and it enables innovative activities in user sectors. The general mechanisms of GPT development and diffusion can be traced in the way AI evolves. Consider innovational complementarities: better and cheaper AI models enable product and service innovation and lower entry barriers in AI–using sectors; this feeds back on the incentives for AI–producers to invest in further developments of AI. The enabling nature of AI is increasingly supported by firm–level evidence (Rammer et al., 2022). However, compared to the stylised GPT dynamics, AI’s innovational complementarities follow a more complex, less–linear circuit, with the set of AI domains (hardware, software, data) connected to downstream user sectors, as well as among themselves. For instance, the design of autonomous vehicles craves equally for more accurate algorithms because of their high stake loss function, faster processing, and less energy consuming chips because of cars’ battery capacitance, while more static applications like virtual assistants prioritise heterogeneity of computing and scalability. Within the hardware domain, the established technological trajectory of semiconductors is being derailed because of misaligned preferences among an increasing number of downstream sectors (Prytkova and Vannuccini, 2022); this reverberates on innovation incentives in the algorithms domain Hooker (2020). In sum, AI displays innovational complementarities, but of a rather systemic nature, which stress the networked structure of the AI system.
When looking at technological dynamism, considering AI as a GPT means focusing on the software domain. There, the performance of AI algorithms compared to different benchmarks is certainly improving rapidly, so much to achieve above–human scores in many tasks, even though signs of saturation and levelling-off in dynamism are increasingly discussed (Zhang et al., 2022). However, this insight would fail to recognise how the sources of successes and failures in AI dynamism lie outside the software domain: algorithms’ capabilities display biases and are yet limited to a series of tasks that are the same AI pioneers envisioned in the 1950s and 1960s: pattern recognition, some perception (vision, speech), learning, bruteforcing of search through combinatorial spaces (Minsky, 1961). 21 This is due to problems arising in the computing and data domains: the efficiency of compute as well as data scarcity and limited representiveness are fundamental brakes to AI performance’s improvement. As the GPT framework does not properly consider these domains, it would not help addressing issues arising there.
In AI, dynamism takes the form of a systemic technological dynamism: it is the result of joint advances in all its constituting domains. In a sense, this mechanism is the reciprocal of the cross–domain reverse salients limiting AI advance we discussed in Defining AI as LTS. Let’s consider again the algorithm domain: there, technological dynamism can be captured also by improvements in algorithm design. Algorithms are information goods; given the low replication costs characterising information goods, once developed old algorithms can co-exist with newer ones, multiplying the directions in which the effort devoted to improvement can be allocated and, thus, slowing down technological dynamism. In this sense, progress in algorithm design might be close in dynamism to programming languages, that display both persistence and continuous forking. The kind of improvements of algorithm design are strongly driven by all sorts of efficiencies – computation per time, improvement in prediction per data batch, and trade-offs like bias–variance, complexity–explainability, and other factors such as the ease of scaling up AI models and costs of replication rather than by classic physical economies of learning (Nagy et al., 2013). Hence, the constraints for the improvement of algorithm design have a systemic nature. Furthermore, if the possibility to improve the performance of algorithms depends on the programming environment used and/or the hardware chosen, then algorithms evolution is function of strategic choices of the actors that control the programming environment and hardware production. From this perspective, the GPT metaphor for AI can hold at an essential level, but becomes a straight–jacket, as it simplifies what is in practice a networked interplay of cross–inducements.
Another feature of GPTs is that they are expected to produce non-linear impacts on economic outcomes, in particular productivity (Jovanovic and Rousseau, 2005). General Purpose Technologies do not necessarily produce these macroeconomic effects (Bekar et al., 2018). However, given their novelty and appeal to a variety of uses, it is possible that they display implementation lags. The reason for it is that in order to exploit in full the pervasive potential of a GPT, resources already employed in productive uses need to be temporarily foregone and allocated to develop complementary assets (Brynjolfsson et al., 2021). As GPTs, AI solutions are also expected to be characterised by implementation lags. However, these are not necessarily driven by the same mechanisms predicted by the GPT literature. For GPTs, implementation lags are demand–driven: in order to adopt it, GPT users need to incur adjustment costs, among which those for organisational changes, capital investments, and development of skills to handle the new technology. In the case of AI, The bottlenecks delaying AI implementation are mostly supply–driven: AI producers need to obtain required inputs (data, hardware, and skills), set up production process and deliver a minimum viable product. The typical example is the acquisition of datasets to train AI models, which can take time and postpone the launch of AI solutions in the market.
The major issue in equating AI with GPT arises when assessing the feature of pervasiveness. This property of GPT has always spurred debates in the literature, with scholars discussing whether in order to be pervasive a technology must have many uses (pervasive in scope), or also being widely used (pervasive in scale) (Bekar et al., 2018). At the moment, AI has a limited penetration in the economy; this holds at the level of the labour market, with retardation in job postings (Zhang et al., 2022) and limited evidence of labour substitution 22 , as well as at the firm level (Rammer et al., 2022). However, as discussed in the previous paragraph, the pervasiveness of AI might be held back by implementation lags. AI capabilities and use cases are expanding without doubts, even if it is a matter of discussion whether the set of tasks that AI can perform really expands: is every command that a language model can address a new task acquired by AI, or just an instance of the same problem – the statistical processing of natural language? The LTS perspective comes to the rescue here: instead of discussing the span of pervasiveness of AI solutions, it is important to highlight the mode of AI diffusion. As we discussed, this occurs through system–level substitution at the process level, and as system superposition at more aggregate levels: AI ‘thickens’ the information technology stack with effective prediction engines, and as it delivers its capabilities and services as an infrastructure, it grows large, rather than pervasive in GPT sense.
They key remark when discussing the parallels between AI and GPT is that GPT as a category refers to well-bounded, identifiable component technologies and devices, such as steam engines, electric dynamos, lasers, or integrated circuits. In order to increase the explanatory power of GPT framework for AI, we suggest two possibilities.
The first option consists in limiting the GPT–parallel to one very specific component of the AI system, Artificial Neural Network (ANN) algorithms. ANN are the key software engine powering AI products and services and, hence, a technological component. They are also stand-alone, in the sense that they represent portable packages of code. As ‘quants’ of algorithm, they can be applied to any situation in which problems are represented in a digital form, prediction is involved, and information processing can influence decision–making. This has been made evident by the embedding of LMs in web applications thanks to the introduction of plugins and user-friendly interfaces. 23 Given that, more than AI, it is the ANN ‘construct’ the unit of analysis closely fitting to the GPT description, as it is an identifiable, though intangible (because digital), structure or device.
The second option to reconcile AI with the epistemic device GPT is to expand the reach of GPT theory. Studies following this direction exist. For example, Lehrer et al. (2016) distinguish between ‘mega GPTs’ and ‘anchor technologies’. Mega GPTs are broad technological areas (such as ICT or nanotechnologies), while anchor technologies are identifiable technologies (like semiconductor chips, or enterprise management software packages) nested into the broader mega GPTs. Similarly, Bresnahan and Yin (2010) introduce the notion of GPT clusters to encompass the many components that are part of a GPT such as computer platforms. These perspectives provide a taste of a layered view on GPTs. However, their very introduction highlights the limits of the GPT framework, which needs to be stretched beyond its original purpose in order to accommodate the complexity of system technologies – all while an original framework dedicated just to that, LTS, already exists.
To conclude, using an econometric metaphor, the classic GPT concept is a misspecified model of AI when seen as a system. The GPT misspecification originates from a potentially incorrect use of the included variables (functional misspecification) and, most importantly, due to omitted variables. The latter has two implications: first, it under– or overestimates of the importance of the included factors and, second, it misses a number of dimensions to represent AI adequately. In a nutshell, the problem of equating AI and GPT lies not in the GPT features that are identifiable in AI, but in those that GPT does not include, which can be summarised in the system–ness of the technology. If we were to focus exclusively on the software domain, AI would fit better the GPT frame. However, that would come at the cost of a partial view of AI.
Invention of a Method of Inventing
Another epistemic device that seems relevant to AI is that of invention of method of inventing (IMI). Being an IMI is a feature of a ‘generative’ class of technologies (Koutroumpis et al., 2020b) that introduce a new logic to search, organise, and discover knowledge. In other words, IMIs are ‘meta–technologies’ – technologies for the production of new knowledge (Agrawal et al., 2018). IMIs usually refer to specific techniques – the most famous example being the double—cross hybridisation of corn studied by Griliches (1957) – research tools and instruments. IMIs need not display general applicability as GPTs: their defining feature is that they lower the cost (or increase the precision) of certain measurements, and in doing so they alter qualitatively the way in which invention or innovation activities are conducted. The measurements affected by IMIs can be physical (think of fMRI scanners) as well as digital. The latter could be the case, for example, of the measurement of precision of a (statistical) prediction involving the recombination of knowledge. AI intervenes in the production process of knowledge precisely in this manner. In fact, leveraging compute and data, AI algorithms lower search costs by handling the complexity of combinatorial spaces and exploring knowledge combinations in an automated manner. This is particularly relevant in what Agrawal et al. (2018) call ‘needle–in–a–haystack’ problems, characterised by high dimensionality of the search space. By bruteforcing the knowledge space (such as, for example, corpora of annotated medical text), AI can identify potentially valuable associations and guide exploration.
When used in the fields of science and innovation, AI solutions seem to fit the IMIs frame. In fact, they can participate actively in invention and innovation processes by creating information input to be further processed in the ‘production function’ of knowledge generation. This has practical applications in business and in science. In science, AI is increasingly used to aid the discovery of new drugs, materials, or biological structures such as the folding of proteins (Senior et al., 2020). In business, AI can intervene in product design and prototyping. Given the potentially wide spectrum of use cases of AI for discovery, Cockburn et al. (2018) and Agrawal et al. (2018) suggest that AI combines the features of GPT and IMI in what they label a ‘general–purpose technology for invention’. Their focus is on the specific component of AI that can work as widely applicable research tool – artificial neural networks running deep learning. Bianchini et al. (2022) show that even the narrow conceptualisation of neural networks as a GPT for invention has limitations – at least at the current stage of development of these techniques: neural networks are a versatile research tool, but do not qualify yet as an autopilot for science. Rather, they are an emerging general IMI, whose potential deployment is function of the evolution of complementary domains, which include the very organisation of scientific activities; in other words, ANN impact on knowledge creation is dependent on its embeddedness in broader technology and socio-economic systems.
Outlook
Both GPT and IMI epistemic devices capture some features of AI: its enabling capability (for innovation and invention, respectively), as well as its breakthrough uses as a tool. However, these are limited to the software domain. The LTS framework has the unique merit to envelop the system–ness and complexity of AI, which is crucial to understand its nature and evolution.
Discussion and outlook
General implications for policy
The production and diffusion of AI and its constituent components is a key issue for technology and industrial policy, such that in recent years we have witnessed a ‘Cambrian explosion’ of AI policies and strategies at all levels of government, from the supranational to the local in an emerging AI federalism (Jobin et al., 2021). As we anticipated using the case of data in The ‘next big thing’: Artificial Intelligence, our framework can guide the discussion around AI policy and its instruments. In fact, policy intervention in the mode of formation and provision of a system technology responds to a different rationale compared to the stylised policy case for stand-alone radical innovations. In the latter case, market failures usually lead to a lack of incentive to invest in and, thus, to underproduction of, the technology. If technology development depends on distributed efforts of heterogeneous users (as it is the case for GPTs), coordination failures can emerge as well, which in turn have the effect of decreasing the demand for the component. In this context, public procurement and contract spending can emulate, substitute, or subsidise the unrealised demand for the technology.
For system technologies, coordination issues extend beyond simple incentive formation, and become a matter of joint design and production of the whole network of components and domains involved in the system. Failures take the form of system or orchestration failures, with actors failing to develop the ties and alliances necessary to strike a balanced development of the system. Reverse salients can appear locally and slow down or disable the whole system, making it work inefficiently or even miss its goal(s) entirely. In system technologies, the source of failure might be located within one component, distributed among several components, or be the very disconnectedness of the system itself. In this scenario, the correct identification of reverse salients and the detection of their composition and reach across the system is a key step to undertake. Once diagnosed, the task for policy markers and system stakeholders becomes that of devising a strategy to tackle the problematic areas of the LTS network.
Before that, technology policy for a system technology such as AI requires actors to learn the specificities of the system under consideration: who are the system builders, where are the boundaries of the system, which mode of control is at work at a given moment and context, how the load factor is measured and distributed. Policy makers must map the state of the system, its current phase and potential paths of evolution, in order to inhibit detrimental or catalyse dormant useful activities, components and actors, facilitate connections, re-balance control or redistribute load factor, and in general to decide if to opt for command–and–control types of interventions or indirect ones. Depending on which reverse salient is addressed, policy can build on a different recipe of science, technology, industrial, and competition policy tools (Steinmueller, 2010).
From the LTS angle, AI policy design should descend directly from the identification of the framework’s building blocks and driving forces. For example, policy makers can focus: (i) on the balanced construction of the system, that is, by supporting the development of AI talent, suggesting fair and welfare–enhancing use cases, or providing resources and facilities for experimentation; (ii) on curbing the monopolisation of resources and control in the hand of a few actors across the fundamental domains of AI by ensuring the conditions for fair competition and access among system builders, by lowering the cost of exploration and support of alternative technological solutions and partnerships, and by supporting open business models for the provision of AI products and services; (iii) on pushing for inclusive or public models of governance by pursuing the identification of technical and non–technical standards.
Concluding remarks
Artificial Intelligence is considered a breakthrough that is technologically revolutionary and also philosophically existential; it is capable of transforming societies and economies while at the same time offering a mirror to look inside ourselves and our human intelligence.
It is indeed the case that AI has the potential to influence many real world processes. But the very nature of current AI is not as romantic as usually depicted. The human–level performance displayed by AI in certain tasks hides the ‘Clever Hans’ nature of these technology, that is very far from displaying understanding. In reality, AI is essentially smart computing, a new wave of information technologies. At the moment, notwithstanding rapid advances and the expansion of their capability set, AI solutions are brittle prediction engines – powerful tools that facilitate the automation of data processing. However, while AI programs are statistics–based ‘vehicles’ (Braitenberg, 1986), AI production and diffusion depends on the joint work of a distributed network of actors, technologies, and domains. As AI is increasingly in the wild, commercialised and embedded in products and services, there is need for researchers and policy makers alike to develop a framework to study AI that focuses less on its romantic aspects and, instead, captures in details how the technology is developed and deployed.
In this paper, we addressed this need by suggesting that the key feature of AI is its being a system technology. Building on this perspective, we offer an original analysis of AI that highlights its system aspects. We drew the boundaries of our object of study to what we labelled current AI, and showed that focusing on the system–ness of AI delivers useful insights. The system nature of AI emerges from the interdependence of its fundamental components: the software, hardware, and data domains. In other words, AI is a synergistic system which is more than the sum of its parts.
We adopted the language and framework of LTS – a theory developed precisely to study infrastructural technologies – to map in details the elements and forces at work in the development of AI. The LTS scheme is useful to scaffold AI features, and to discuss issues such as the very governance and goals of the technology. We compared our novel perspective with alternative epistemic devices that are commonly adopted to describe radical innovations – General Purpose Technology and Invention of a Method of Inventing. Certain aspects and mechanisms of AI development are considered by these alternative ideal types; however, we found out how they also limit our capability to represent the complexity and peculiarities of AI. Through our analysis, we could describe with a fine-grained level of detail how AI is implemented as a new layer in existing technological stacks. AI develops into an infrastructural technology, large and superimposed on other infrastructures; through that, it becomes interwoven into the economic fabric. In sum, AI is more Internet than integrated circuit.
With our study, we contributed to the growing strand of research on AI in innovation studies and in the fields of information systems and economics of technological change. Furthermore, we extend the reach of analysis to the fields of sociology of technology and science and technology studies. As a result, we offered a novel, interdisciplinary angle to look at system technologies through the case study of AI, which we trust will be useful to scholars interested in uncovering the deeper structure of technological breakthroughs.
Footnotes
Acknowledgements
We are grateful to Ed Steinmueller for his comments and the inspiring discussions shaping many of the arguments contained in this paper. We are also thankful to two anonymous reviewers of the Journal of Information Technology for their feedback and reports, which helped us to find a better balance for the paper. We also thank two anonymous referees of the SPRU Working Paper Series for their useful suggestions.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
