Abstract
Since the ruling of the European Court of Justice, the right to be forgotten has provided more informational self-determination to users, whilst raising new questions around Google’s role as arbiter of online content and the power to rewrite history. We investigated the debate that unfolded on Twitter around the #righttobeforgotten through social network analysis. The results revealed that latent topics, namely Google’s role as authority, alternated in popularity with rising and fading flare topics. The public sphere, or Öffentlichkeit, that we observed resembles the traditional one, with elite players such as news portals, experts and corporations participating, but it also differs significantly in terms of the underlying mechanisms and means of information diffusion. Experts are critical to comment, relay and make sense of information. We discuss the implications for theories of the public sphere and examine why social media do not serve as a democratising tool for ordinary citizens.
Introduction
Every day millions of individuals worldwide produce and upload user-generated content in the form of comments, likes, images and posts. For most users, the creation of content and the engagement with others’ content are a fun and rewarding routine that is integrated into their everyday rhythms and practices (Hogan and Quan-Haase, 2010). Third parties also generate copious amounts of data on users such as automatic credit ratings and location data. While users are often oblivious to how and where personal information is stored, the data can have significant consequences for an individual’s reputation and career, as they may have uploaded unflattering or compromising images or content (Yang and Albers, 2013).
The European Union (EU) has recognised the potential threat of data disclosure to an individual’s reputation and the need to provide users with the right to informational self-determination, that is, the right of the individual to decide about what personal information should be public and under what circumstances (Boguslaw and Westin, 1968). In our hyperconnected society (Quan-Haase and Wellman, 2005), this right is frequently violated because information spreads easily and quickly at little cost (Yang, 2015). As a countermeasure, the European Court of Justice (ECoJ, 2014) introduced a new ruling on 13 May 2014, the right to be forgotten (RTBF), which allows citizens to request personal information to be removed from a search engine’s result page. 1 This decision is transformative in terms of how data ownership and stewardship are understood, and as such it has received much scholarly attention (e.g. Gilbert, 2015; Shoor, 2014). However, there is a gap in the literature in terms of understanding how citizens, the media and key stakeholders discuss, interpret and make sense of the ruling. Understanding how this debate takes place in the public sphere is particularly relevant for future developments of the RTBF, as public opinion is one central mechanism for gaining knowledge about the influence of legislation on people’s digital lives and can serve as a catalyst for future revisions to the law.
This article focuses on the debate that took place on Twitter a year after the RTBF ruling. Much debate has focussed on how social media open new opportunities for reviving and revitalising the public sphere, not as a single, one-dimensional space, but rather as ‘a more complex system of distinct and diverse, yet interconnected and overlapping, publics […] which represent different topics and approaches to mediated communication’ (Bruns and Highfield, 2016: 124). We will examine how Twitter could function as part of this alternative form of public sphere in which information, ideas and debates can diffuse fast and in an unfettered way (Colleoni et al., 2014). The study gathered real-time data from Twitter users to learn about attitudes and opinions surrounding the debate and key topics that emerged. We aim to reveal public concerns and provide knowledge that could help lawmakers to improve the legal framework to empower citizens and to help companies in their implementation of the RTBF. The following three research questions guide the study:
What are the central topics and sub-topics discussed by Twitter users employing the #rtbf? How do central topics and sub-topics relate to one another?
Who are the key players in the RTBF debate?
What can we learn about the nature of the public sphere on Twitter from analysing tweets related to the RTBF?
Related work
Mayer-Schönberger’s (2009) work has sparked an intense debate on the potential harm of compromising data on an individual’s reputation and the need for informational self-determination. In his view, data need to have an expiration date – once the data have reached their shelf life, they should be deleted automatically. In this context, the RTBF’s role is to enable users to control their personal history to achieve so-called social forgiveness (e.g. Korenhof et al., 2014; Mayer-Schönberger, 2009) and allow them to reinvent their digital persona (Novotny and Spiekermann, 2014).
The European Commission (2012) integrated part of Mayer-Schönberger’s ideas into its 2012 draft of the General Data Protection Regulation (GDPR) to aid individuals in enforcing their informational self-determination. The European Commission did not integrate an expiration date for data, but it did give individuals the right to request that their personal information be deleted. While the GDPR is based on Mayer-Schönberger’s ideas, it is quite different in that it is a ‘control right’ meant to give users control over their personal data and further extend this right to third parties such as Google. The RTBF applies not only to search engine providers, but also to ‘other intermediaries […] whenever the conditions established in the ruling are met’ (Art. 29 WP, 2014: 8). So far, the definition of ‘other intermediaries’ remains unclear and unspecified (Gilbert, 2015). The EU institutions published the GDPR on 4 May 2016, explicitly mentioning the RTBF, which in the GDPR is combined with the right to erasure (European Parliament, 2016). In the GDPR, the EU institutions decided not only to integrate the RTBF as per the previous ECoJ decision but also to extend its scope by allowing individuals and companies to request the erasure of their personal data from all websites and other storage locations (European Parliament, 2016, Article 17, 2). The GDPR necessitates data processors and controllers to ‘take reasonable steps, including technical measures, to inform [other] controllers which are processing the data, that the data subject has requested the erasure by such controllers of any links to, or copies or replications of that personal data’ (European Parliament, 2016, Article 17, 2). This article focuses on the RTBF decision as outlined by the ECoJ, as this is as of today the only legally binding ruling. The ECoJ’s RTBF decision and the RTBF in the GDPR have similar aims. Therefore, we follow the EU’s understanding of the RTBF, which sees it as a person’s right to ‘obtain from the controller the erasure of personal data concerning him or her without undue delay’ (European Parliament, 2016, Article 17, 1).
Mayer-Schönberger’s (2009) arguments and the GDPR draft have received support among the research community. Castellano (2012), in his analysis of existing data protection regulations in the EU, concluded that updated laws were needed in the context of Web 2.0. In this environment, the traditional means of control of content, via gatekeepers such as news agencies, broadcast corporations and standards of journalism, cease to exist. Ausloos (2012) analysed the benefits and disadvantages of the RTBF for individuals and proposed a version of the RTBF that strikes a balance between individuals’ right to informational self-determination and freedom of speech for the public’s interest.
Shoor (2014) investigated non-legislative approaches to privacy and data protection and concluded that the RTBF is probably the most suitable solution because other technical solutions are hard to implement due to the decentralised nature of the Internet. Ambrose (2012) suggested adding a time component to information, a similar argument to Mayer-Schönberger’s (2009), and thereby introduced the notion of a data lifecycle. Other researchers have been more critical of the RTBF and have suggested examining the risks it presents to society, instead of focussing only on its benefits. Rosen (2012) argued that the RTBF can be a double-edged sword: either an effective tool that empowers users and helps them to enforce their right to informational self-determination or the single biggest threat to the Internet, as it has the potential to undermine freedom of speech. Scholarship has expanded the debate to legal frameworks in other countries (e.g. Japan), and similar concerns to Rosen’s have been voiced (e.g. Gilbert, 2015). Overall, the literature suggests that debate needs to continue around the potential benefits of the RTBF as well as its possible detrimental effects on society, as the RTBF could shape the future of the Web.
Twitter activity around #righttobeforgotten
From its early inception, the Internet has been exalted for its potential to foster political debate and serves as an alternative and more inclusive public sphere (Brundidge, 2010). The public sphere is a critical forum for open debates, as it constitutes ‘a constellation of communicative spaces in society that permit the circulation of information, ideas, debates – ideally in an unfettered manner – and also the formation of political will (i.e. public opinion)’ (Dahlgren, 2005: 148). In the context of political discussion, social media platforms are seen as particularly well suited for fostering open debate due to their high level of interactivity, fast diffusion of information and capability for facilitating discussions in real time without geographical constraints (Colleoni et al., 2014). Among social media platforms, according to Colleoni et al. (2014), Twitter is perceived in the domain of political communication as a central space for the exchange of information and the debate of current topics because of four unique characteristics. First, all tweets are visible to everyone by default, except when a tweet or an account has been set to private. Second, hashtags, as a key feature of Twitter, facilitate discussions around specific topics without explicitly creating bounded groups. This feature also enables users to follow a debate anonymously without being noticed, that is, users can lurk. Third, Twitter’s retweet function allows information to be diffused quickly and effortlessly. Finally, and important in the context of political communication, Twitter, in comparison to other sites like Facebook, does not require reciprocal relationships among users, allowing for the easy emergence of central nodes of information diffusion.
To address the three research questions guiding this study, we collected real-time data from Twitter over a 2-month period. Anderson (2008) has claimed that ‘with enough data, the numbers speak for themselves. […] There’s no reason to cling to our old ways. It’s time to ask: What can science learn from Google?’ Despite this, we did not rely solely on big data for our analysis, we also employed the following three strategies to supplement our data: we examined individual Twitter accounts, we drew on mainstream news reporting to gain an understanding of the social and informational contexts of the debate and we looked for background information in the data that could provide context for the findings and trends. Our approach, thus, relied on large amounts of data, but was not blind to the nuances available in these data and their meaning. In particular, we followed Monroe et al. (2015) and recognised the value of big data analytics in the following areas:
Helping scientists to design better experiments for testing hypotheses that could not be examined without large amounts of data;
Making more precise comparisons between two or more sub-populations by providing fine-grained samples;
Identifying behaviour that could not be observed before by uncovering behaviours that are otherwise actively hidden by individuals or organisations (e.g. censorship).
The study relied on Netlytic 2 to collect data from Twitter. Netlytic is a cloud-based research tool designed to collect, manage and analyse data from social media platforms. Its key strengths are that it allows scholars to explore emerging themes and trends and facilitates the examination of interactions among Twitter accounts as they take place in real time (Gruzd et al., forthcoming). In Netlytic, researchers can control what data are collected by adjusting and modifying keywords and hashtags as needed. It also provides flexibility for data cleaning by filtering irrelevant or duplicate data and includes features that allow scholars to drill down and examine individual records and investigate how a particular record was used to create the various data visualisations (Gruzd et al., forthcoming). Hence, investigations of data can occur at various scales, which is important for sensemaking.
As discussed above, on 13 May 2014, the ECoJ ruled against Google in the RTBF case. Our data collection started about a year later and took place from 20 May to 18 July 2015. As a result of the late start date, the initial Twitter exchanges following the RTBF decision were not included – exactly 12 months and 7 days of data are missing. Future research can attempt to purchase this dataset from such providers as Gnip and expand the present investigation. While the initial tweets are absent, past research by Yang (2015, 2016) suggests that a time lag can be beneficial to capture engagement by a broader public instead of limiting the investigation to the initial reaction. Hence, two important considerations related to the time frame of data collection, and which help put our findings in context, are as follows:
As the data collection was started 1 year after the ECoJ’s decision, we are looking at the debate once it has been established and consolidated, rather than at the initial flurry of activity. This means that opinions are more thought-through and issues have been considered from different points of view.
Furthermore, due to the elapsed time, we hoped that information regarding the decision had reached a broader audience and that this audience would have had the chance to consider the options available to search engine providers as to how to practically implement the RTBF ruling and the consequences for users.
The end date of data collection was determined by volume and the fact that the discussion had started to dwindle. After the peak of the discussion was reached in the fourth week with 9954 tweets, only 1099 tweets were gathered in the last week.
Our initial data collection focussed on tweets using the keyword ‘right to be forgotten’ and the hashtag #rtbf. After 10 days, we examined the preliminary data and determined that two additional hashtags – #2beforgotten and #righttobeforgotten – were often utilised as well. We added these to retrieve as comprehensive a dataset as possible. Twitter hashtags evolve organically as users interact with one another, and hence multiple hashtags can emerge around a single topic (Conover et al., 2011). When we manually scrutinised the data for tweets unrelated to the RTBF discussion, we realised that the hashtag #rtbf collected a lot of irrelevant tweets because it was used not only for discussing the RTBF ruling but also in relation to a Belgian television company that broadcasted the Eurovision Song Contest during the same time frame. Consequently, we excluded tweets from the data collection which included any of the following hashtags or keywords, which were frequently used simultaneously with #rtbf: #eurovision, #eurovisionrtbf, #eurovision2015 and ‘eurovision’. Tweets in many different languages were collected, including German, Russian and Korean, with English being the most prevalent language.
In total 30,894 relevant tweets produced by 18,959 user accounts were gathered in the specified time frame that provided insight into how the discussion unfolded and how various groups and individuals engaged with the topic. Looking at tweets over time was critical to the aim of this study, as it allowed us to observe the development of trends in the corpus, the emergence of new sub-topics and how the community evolved (boyd and Crawford, 2012). Additionally, the amount of collected data from Twitter provided a broader data foundation than other data collection methods, such as surveys, experiments or interviews. A key advantage of drawing data from Twitter was the large and diverse sample size encompassing perspectives, opinions and interactions of individuals from different countries, cultural backgrounds, socioeconomics and, most importantly, for this specific study, legal frameworks (boyd and Crawford, 2012). In contrast to other means of data collection, where individuals are aware of being part of a study, data gathered from Twitter provide insight into individuals’ perspectives and opinions on topics, in this case the RTBF, in a more naturalistic setting; at least one that is not influenced by reactivity to the study (Ansolabehere and Hersh, 2012). To protect the privacy of users, we anonymised the data.
Using social network analysis to analyse Twitter data
We employed social network analysis (SNA) to analyse and visualise the Twitter data. SNA is not only an important theoretical lens but also a methodological approach for investigating connections and the flow of information in social media, as it generates rich and complex data as users discuss, debate and exchange their opinions in real time (Quan-Haase and McCay-Peet, forthcoming). SNA was ideal for this study because it allowed us to identify central actors as well as examine the overall structural features of the network. We used the following two types of open-source software: OpenRefine 3 and Gephi. 4 OpenRefine was used to process and prepare data for further analysis in Gephi (e.g. extract mentions and hashtags into separate columns, create a node and an edge table). We used Gephi for visualising the results and its integrated features for calculating network structural measures.
Central keywords
To provide some context for interpreting the Twitter data, Table 1 lists some of the key pieces of news published prior to and following the data collection period. The British technology news website The Register reported on 4 July 2014 that legal experts were discussing Google’s role as sole decision maker with regards to whether information requested for deletion falls under an individual’s right to privacy or if the information is considered of public interest. On 13 May 2015, Wall Street Journal Digits reported on France’s intention to request that Google extend globally the RTBF’s scope.
Timeline of important news published before and after data collection.
EU: European Union; RTBF: right to be forgotten; WSJ: The Wall Street Journal; GDPR: General Data Protection Regulation.
The impact of the two news reports on the RTBF debate on Twitter is visible in the word cloud depicted in Figure 1. 5 Size reflects the keyword’s ranking, with bigger words occurring more often in the dataset. The number in the right upper corner shows the absolute frequency of each keyword.

Main keywords based on frequency of usage.
The keywords ‘France’, ‘requests’ and, in particular, ‘Google’ were the most frequently used, with 4905, 5087 and 20,064 mentions, respectively. There were three reasons why ‘Google’ and #Google were so frequently mentioned in tweets related to the RTBF. First, due to Google’s predominance, it is seen as a synonym for conducting web searches, reflected in the verb ‘to google’. Second, the RTBF decision was specifically directed at Google because it is the number one search engine in Europe and has the largest market share worldwide (Statista, 2015). Google’s share in Europe is 90% in comparison to Bing and Yahoo with 2.7% and 2.3%, respectively (Rosoff, 2014). Finally, the RTBF could impact people’s access to information and thus threaten freedom of speech online (Rosen, 2012), which would run against Google’s code of conduct, summarised in the slogan ‘Do the right thing’ (Google Inc., 2015). No other search engine was mentioned in relation to the RTBF, although the decision affects all search engines operating within the EU.
The keywords ‘France’, ‘French’ and ‘global’ were frequently mentioned in relation to the RTBF because the French government played a central role in shaping the debate: it requested that Google expands the scope of the RTBF beyond the borders of the EU to the rest of the world within a 15-day time frame (Statt, 2015). The term ‘French’ was often used in conjunction with ‘regulator’, indicating that France was not going to settle on an agreement it felt did not provide citizens with sufficient self-determination. No other European nation was so highly involved in shaping the debate. The terms ‘Canada’, ‘order’ and ‘global’ also appear in Figure 1 because a judge in British Columbia ordered on June 2015 that Google removes data from its search results not only in Canada, but worldwide. Again, these cases show how the debate on Twitter focussed on specific aspects of the RTBF and its scope.
Much of the debate on Twitter also focussed on the keyword ‘requests’, which reflected a discussion regarding information included in Google’s annual transparency report released July 2015: the report revealed that less than 5% of requests (of about 220,000 requests) to remove information from its search results come from criminals, politicians and high-profile public figures; the majority come from regular citizens (Tippmann and Powles, 2015). Tweets focussed primarily on the consequences, for both regular citizens and the media, of Google being the sole decision maker when it comes to differentiating whether information requested for removal is of interest to the public and falls under freedom of speech, or if it violates personal privacy and is eligible for removal (Tippmann and Powles, 2015).
Latent and flaring sub-topics
The mapping of central keywords to published news revealed some important trends (see Figure 2). Initially, the discussion was centred around the French government setting a 15-day limit for Google to implement the RTBF on a global scale (Schechner, 2015; Statt, 2015). Despite the fact that the keywords ‘France’ and ‘Google’ were intensely discussed early on, after they peaked the discussion flattened and the number of mentions of the words quickly dropped. At this point, the attention focussed on the keyword ‘Russia’, which dominated the discussion for a few days after a new bill that introduced the RTBF in that country (GlobalVoices, 2015a). The RTBF as introduced by the ECoJ prohibits public figures such as politicians from removing information deemed of public interest, whereas the Russian ruling extends its scope to allow this (Shaftan, 2015).

Thematic relevance and news.
On Twitter, the Russian ruling sparked much debate. Discussions focussed around how public figures could abuse the law by removing compromising reports to enhance their online reputation. In this case, the RTBF would function as a mechanism to restrict the right to free speech instead of its originally intended purpose (e.g. Rosen, 2012). Within the ECoJ (2014) ruling, public figures cannot remove information that is deemed of public relevance, but can ask for information to be removed that applies to their private lives. However, it is hard to find a balance between freedom of speech and an individual’s right to privacy because there is no clear line as to where public life ends and private life starts. Russia’s interpretation and version of the RTBF circumvented this problem by extending the scope to all individuals in society including public figures (Doyle, 2015). The ruling can potentially also empower politicians by allowing them to increase their informational self-determination beyond the private sphere.
Looking at a snapshot of the RTBF debate, Figure 3 shows a rapid increase in posts around 13 June 2015 resulting from the news regarding the Russian government’s interpretation of the RTBF (GlobalVoices, 2015b). The radically different interpretations of the RTBF decision have the potential to polarise future debates on attaining the right balance between freedom of speech and privacy rights, particularly in relation to its application to public figures such as politicians, celebrities or athletes.

Peak around the Russian right to be forgotten (RTBF) debate.
Figure 2 shows that the keywords ‘Google’ and ‘Russia’ were inversely related, that is, when ‘Russia’ emerged as a central keyword, the debate no longer included the keyword ‘Google’ and vice versa. This could potentially be related to the fact that Google’s market share and popularity is low in Russia, where other search engine providers, such as Yandex, dominate the search engine market (Clayton, 2013). Therefore, concerns about how Google operates and how it will implement the RTBF ruling are less relevant to the discussion surrounding the RTBF in Russia.
Influencers in #righttobeforgotten
How influence occurs on social media is a central topic. Influencers can be defined in terms of their centrality in a network – major influencers are referred to as stars (Heider, 1958) or opinion leaders (Merton, 1968) – or in terms of their ability to move information between otherwise disconnected networks; major players in this category are referred to as gatekeepers (Rogers, 1962), cosmopolitans (Merton, 1968) or bridging actors (Burt, 1992). We first look at central individuals in terms of volume of contributions and network centrality and then discuss those players who control the flow of information and help the network remain interconnected.
Figure 4 shows that the top 10 posters comprised nine user accounts representing corporations or news outlets. Only one personal account ( @Account_A ) was among the top 10; this user was very engaged and posted 33% of all posts. From the user’s account description, it was evident that the user worked in the communications industry as the head of communications of a non-profit organisation. Postings may have been motivated both by the user’s employment and personal interest.

Top 10 accounts by posts in the right to be forgotten (IRTBF) network.
The news account @Account_K was the most frequently mentioned account, with over 450 mentions (see Figure 5). In contrast to the top 10 posters, the top 10 accounts by mentions contained three personal accounts:
A British Broadcasting Corporation (BBC) North America Technology reporter (Account_M),
A lawyer and technology researcher from Cambridge University (Account_Q) and
A law lecturer, who focuses on privacy and human rights (Account_S).

Top 10 accounts by mentions in the right to be forgotten (RTBF) network.
The frequent mention was due to each user’s knowledge and prominence in the areas of privacy and law. While all three were not among the top posters, they regularly posted tweets with their opinions, links to resources and other relevant posts. News portals focussed on tech-related topics, such as @Account_K , were often mentioned in tweets, and their content was frequently retweeted because they regularly posted updates and comments relevant to the operations of search engines. @Account_L (Google’s company account) was also frequently mentioned in tweets because the RTBF decision affected its operations the most (ECoJ, 2014), but @Account_L did actually not post content nor participate in the debate.
Network structure of #righttobeforgotten
The network structure indicated a sparsely knit and decentralised network consisting of 9192 separate clusters, which is a large number of clusters for a network consisting of 18,959 nodes. The average cluster coefficient was .04, which means that a few nodes already formed a cluster (Cherven, 2015). That is, each cluster is rather small in relation to the overall network size. The modularity of the network was .876, which means strong interconnections among nodes within a cluster but sparse interconnectivity to other clusters (Newman, 2006). There were few connections among nodes, with the network density value being .0000377179 (Coleman and Moré, 1983). To better visualise and examine the topological features of the network, we relied on the ‘giant component’ filter. This filter allowed us to focus on large connected clusters by fading out smaller clusters (Cherven, 2015) and resulted in 102 clusters, each with many bridging nodes facilitating the flow of information.
Betweenness centrality measures how often a node is on the shortest path between two nodes and thereby indicates how often a node connects with otherwise disconnected clusters (Cherven, 2015). Five nodes serve as bridges with high betweenness scores connecting various clusters (for their individual betweenness scores, see Table 2). One of the bridges, @Account_X , is run by the Stanford Center on Democracy, Development and the Rule of Law. The other four accounts are all personal accounts: @Account_Q is a lawyer and technology researcher at Cambridge University, @Account_V is the personal account of Organization for Security and Co-operation in Europe (OSCE) representative on Freedom of the Media, @Account_U is a senior lecturer at the School of Computing at the University of Kent and @Account_A is an Italian cyber-utopian. Figure 6 illustrates the clusters and functions of bridging actors. The size of the account labels represents the size of the betweenness score of the node.
Centrality measurements of the top five bridging actors.

Clusters and bridges in the right to be forgotten (RTBF) network.
Table 2 depicts the betweenness scores, in- and out-degree centrality for the top five bridging actors. Taking into account that 18,959 nodes comprised the overall network of tweets exchanged in the time frame, the influence of the five bridging actors cannot be ignored. Moreover, their level of engagement was also visible in that they were exchanging information not only with multiple clusters but also with each other.
@Account_Q and @Account_X have the largest in-degree values with 174 and 35, respectively. @Account_A and @Account_Q have the largest out-degree values with 448 and 44, respectively. Considering that the average degree centrality in our sample is .639, it is apparent how influential and significant these top five bridging actors are in the network. We can conclude that @Account_A is an information hub, with high out-degree value, that distributes information about the RTBF, while @Account_Q can be considered a core bridging actor that has the greatest influence on the RTBF Twitter network by connecting otherwise isolated clusters. The betweenness scores (see Table 2) show that the top five bridging actors appear at least 22,874 times on the shortest path between two nodes, whereas Account_Q as the most influencing node appears 69,805 times on the shortest path. Considering that the total number of shortest paths in the network is 155,571, the influence of the bridging actors becomes apparent. The function of information hubs and bridging actors is critical for keeping the discussion alive and the network interconnected.
Discussion
Dynamics of latent and flare topics
Research question (RQ) 1 examined the various topics discussed on Twitter concerning the RTBF. One central topic spanned the entire period and prevailed throughout the debate, namely Google’s role in the RTBF. We refer to this as a latent topic, as it was pervasive and dominant. Identifying the latent topic was critical, as the focus on a single topic kept the discussion alive and functioned as a recurring theme in the narrative. While it is true that alternative search engines are less central due to their low market share, they are still key players in many regions of the world. This demonstrates how Google’s monopoly plays out with regards to debates on Twitter; the brand becomes accountable for its practices, as the entire debate has been funnelled towards its operations.
Several sub-topics emerged and flooded the Twitter stream for 3–5 days at a time. We refer to these topics as flare topics because of their sudden and intense appearance followed by a quick fading from the discourse. These flare topics were often linked to news originating in the mainstream media and would spark heated, topic-specific debates. They would quickly reach a peak and then slowly flatten and completely vanish; this is when the latent topic would return into focus and sustain the debate.
We also observed that these flare topics could completely shift the focus of the discussion. For example, when Russia became a key topic, the previously ongoing discussion around Google and France came to a sudden halt. This suggests that discussions evolving around the RTBF were guided primarily by one dominant latent topic, but that they received new impetus for further discussion from multiple flare topics. Furthermore, it seems that there is currently no flare topic that can substitute for the focus on Google and take the position of a latent topic, although the RTBF decision also involved other search engines and equally affected their operations. These findings could be used to anticipate future developments concerning the RTBF ruling in the Twittersphere.
Key players in the network
RQ2 examined the role of influencers in the RTBF debate. Three types were identified:
Companies. These represent corporate interests and are affected by the RTBF decision directly (e.g. Google, which is frequently mentioned by others, but does not participate in the debate);
Experts. These are individuals who are experts (e.g. privacy researchers and lawyers) and contribute with their opinions on and understandings of the decision;
News portals. This includes varied broadcasters, who are interested in spreading news and engaging a wide audience in an issue of broad public interest (e.g. searchengineland.com).
When looking at actors that serve as bridges between clusters, it was observed that four of five influencers comprised personal accounts belonging to experts. These have the greatest influence on the diffusion of RTBF-related information and their role is also to sustain the network over time. This suggests that the majority of influencers in the network continue to be the gatekeepers and agenda-setting organisations that structured and controlled information in the public sphere in the past.
Twitter as a 21st-century public sphere
RQ3 examined the nature of the public sphere on Twitter. The public sphere that emerged around the RTBF debate on Twitter resembles, on the one hand, the Öffentlichkeit as described by Habermas in the 1960s. On the other hand, it differs significantly in terms of the underlying mechanisms and means of information diffusion. We did not observe Colleoni et al.’s (2014) four characteristics of Twitter in the expected way. Twitter was not able to fulfil its function as a virtual public sphere, as information did not diffuse in an unfettered manner, but rather was controlled by a few key influencers. On Twitter, we continue to see this elite being highly involved and central to the RTBF debate. In particular, news outlets – through editors, newsroom staff and broadcasters – continue to have an agenda-setting function, often being the first to tweet on RTBF news. Their role is critical as they drive the dynamics of networked communication by filtering, making sense of and selecting information and topics of interest to the public (Habermas, 2006; Webster, 2006). The mass media’s role in the debate concerning the RTBF is to continuously provide updates and developments.
How agenda setting occurs on Twitter is neither a simple nor a straightforward process, as topics shift quickly and dynamically. We observed that experts directly participate, shape and drive discussions, rather than the mass media serving as an intermediary. In this way, there is much more of a plurality of opinions on Twitter than was ever possible in the traditional public sphere. The position of the traditional mass media has weakened, and social media have emerged as new spaces where experts have the potential to actively participate and steer the discussion (Meraz, 2009).
However, a central question emerges from our study: why are ordinary citizens unengaged in the RTBF debate on Twitter? As mentioned previously, the Internet is often considered as a new public sphere because of its potential to foster political discussions (Brundidge, 2010) and citizen engagement (e.g. Bruns and Highfield, 2016). In the case of the RTBF, however, the results of our study show the opposite, as ordinary citizens do not (actively) participate. There might be several reasons: a space dominated by experts may seem intimidating to ordinary citizens. Twitteracy (a combination of Twitter and literacy) has been described as an acquired skill (Greenhow and Gleason, 2012), and there may be a barrier to participation in a topic that is highly driven by legal institutions and regulatory bodies. Ordinary citizens may be ‘lurkers’ or passive information consumers, that is, their participation consists primarily of following debates, retweeting content and observing developments, rather than voicing their opinions. If this is true, it would mean that Twitter’s features – in particular, hashtags, which allow users to follow a debate anonymously (Colleoni et al., 2014) – are not always helpful in terms of establishing a virtual public sphere and can actually have the opposite effect. In the case of the RTBF, hashtags do not motivate ordinary citizens to participate; rather, they promote a passive audience. The lack of participation may also be due to ordinary citizens’ sense of inexperience or ignorance with regard to the topic of the RTBF (Bishop, 2007). In a space dominated by the elite, ordinary citizens may think that ‘they did not need to post […] and they are being helpful by not posting’ (Bishop, 2007: 1889–1890), as they are then not interrupting the discussions moderated by the experts. The lack of engagement may also be related to features specific to Twitter and the sparsely knit network structure it promotes. The high number of cliques and their internal focus may make it difficult for ordinary citizens, who are at the periphery, to penetrate the debate. Ordinary citizens may not be aware of the ongoing debate, as it may take place within contained information bubbles (Pariser, 2011). We describe the Twittersphere as a dynamic space consisting of a multitude of small clusters that function largely independently and in a decentralised fashion.
Our analysis of influencers shows that important players are individuals who are experts in the area. These key players, who could potentially reach out to ordinary citizens and engage them in the debate, have only a limited influence on Twitter, as they have fewer than 3000 followers (except for @Account_X , which is a university institution and has 40,000 followers). This limits their role as bridges to the wider public and leads to the debates around the RTBF remaining contained between news outlets, elites and organisations. One way to break this kind of information bubble and engage citizens could be to redesign the personalised recommendation algorithm (Garrett and Resnick, 2011). It could recommend RTBF-related tweets to users who are interested in privacy and data protection. Also, celebrity endorsement could help to promote the relevance of the topic. Such personalised recommendations would not only broaden people’s knowledge but also help them engage more meaningfully with current debates that affect them directly.
Study limitations and future work
A total of 30,894 tweets were collected during the 2-month data collection period. Other studies have gathered larger data sets; for example, Conover et al. (2011) scraped over 250,000 relevant tweets within a 6-week period prior to the 2010 US midterm elections. The elapsed time between the RTBF ruling and the start of our data collection, a 1-year gap, may explain the comparatively low numbers. It is also possible that the general public has not as yet joined the debate on Twitter, they may not be fully aware of the RTBF ruling itself, they may not fully comprehend its opaque legal language or they may be unaware of its implications for their own daily web experiences, for example, the possibility of skewed search results (Ambrose, 2012). It is possible that as the debate becomes more widely known among Internet users, greater engagement will occur on Twitter and other social media platforms. Compared to the decision of the ECoJ, the GDPR extends the scope of the RTBF and could spark further debate. Future research could document how the debate shifts and what new sub-topics emerge on Twitter and other platforms. The results of this study could then be useful for anticipating future discussion structures on related topics.
Another limitation of the study is its exclusive focus on Twitter. While the Twittersphere has been deemed an important space for public debate and an extension of more traditional public spheres, studies that exclusively focus on Twitter may have limited generalizability. Discussions in personal circles, social spaces or on other social media platforms such as Facebook are equally valid and important, but were not included in this study. Another limitation comes from our lack of understanding of our sampling frame. Twitter users are not representative of the general population – users have tended to be younger and more tech-savvy – and consequently do not include voices from all strata of society (Quan-Haase and McCay-Peet, forthcoming).
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by a Social Science and Humanities Research Council of Canada (SSRHC) grant given to A. Quan-Haase (R3603A13) and by the Faculty of Economics and Business Administration at the Goethe University Frankfurt. We would like to thank colleagues at the Faculty of Information and Media Studies, Western University, for their input and particularly to Chandell Gosse and David Brown.
